SharpWeapon
Member level 5
Hi folks,
In my design I have N(could range from 64 to 1024) number of processing elements, each PE taking x samples at a time. I would like to feed all N PEs and trigger their processing at the same time from a single on-chip memory. I tried designing serdes-like module which reads the memory serially and deserialize each x consecutive samples and feed each PES. But I am not happy with my design choice since it has a long delay N*x clock cycles to feed all PEs, which is equal to the same delay as streaming each sample to the PEs. I am wondering if there is any more efficient way of doing this?
Cheers,
In my design I have N(could range from 64 to 1024) number of processing elements, each PE taking x samples at a time. I would like to feed all N PEs and trigger their processing at the same time from a single on-chip memory. I tried designing serdes-like module which reads the memory serially and deserialize each x consecutive samples and feed each PES. But I am not happy with my design choice since it has a long delay N*x clock cycles to feed all PEs, which is equal to the same delay as streaming each sample to the PEs. I am wondering if there is any more efficient way of doing this?
Cheers,