Hello,
I want to know how to tweak the current hello_world example program which can load the program in all cores by creating a group of 4x4 and dump the output of each core to its local memory(using e_write) and then the host sides read the memory of each cores for the output
Thanks