[quote user="Chris Jagielski"]
This is exactly correct. It works very well for my applications, and is so simple.
Also - note that this method makes it easy for multiple cores to communicate too. They can talk to each other "across" the DDR, since all 8 cores on your c6678 share the same DDR memory.
So for my application, I actually have a few cores doing the same thing for one algorithm. So, they all receive the same flag and run the same code, and they all set their own unique flags when they're done. Then, another separate core can read those new flags, and run the next algorithm once the previous set of cores are finished.
I had to use this approach because of the problems I faced getting OpenMP to play nice with the other software.
[/quote]
Chris, thank you for your clear explanation. I think I got it. Your method provides a really very good reference for my project.
I have one more question, with your method you'll need to load the specific application code to each specific core before you sending any data to run, since you have up to 32 cores, theoretically use can accomplish your algorithm using such many cores with very fast computation speed. Will you reload some of the application code in case of that what have been on the DSP cores are not enough for your algorithm?