I want to buy a parallella board. I want to run my program which involves lot of matrix multiplication.How do I do it like yaniv showed in the video?
https://www.youtube.com/watch?v=DkctH7_tYSc
Could someone point me to the exact library and example code to do matrix multiplication on host and the parallella board. Right now my algo takes 120 ms and if I run all the matrix multiplications on the t16 core board, how fast will my program execute? Please let me know.