1D FFT — MFLOPS
30,000 25,000 20,000 15,000 10,000 5,000 0
256 512 4K 16K 256K 512K
Figure 1 | A comparison of complex FFT performance using 128-bit SSE versus 256-bit AVX instructions – The code was run on the same processor and clock rate. Data was provided by Intel.
Figure 2 | The Curtiss-Wright CHAMP-AV8 VPX board with 269 GFLOPS and featuring dual Intel Core i7 2715QE CPUs with four Gen2 Serial RapidIO ports and 80 Gbps to the backplane.
but would-be Intel-based DSP military designers were deprived of the option to design systems around Serial RapidIO, the multiprocessor fabric of choice.
IDT’s PCIe-to-Serial RapidIO bridge and new Gen2 Serial RapidIO switches will enable system designers to b uild Intel architecture-based processing engines with much more fabric bandwidth than that offered by any other currently avail- able technology. The upcoming IDT bridge product supports 5 Gbps interfaces on both PCIe and Serial RapidIO ports. With the advantage of smal l size and low power consumption, system design- ers can add bandwidth by using m ultiple PCIe2-to-Serial RapidIO2 bridges con- nected directly to the processors or via a PCIe switch. This performance can scale at the system le vel with the new Gen2 Serial RapidIO: This new generation of systems will deli ver double the back - plane bandwidth provided by the already fast 3.125 Gbps Gen1 Serial RapidIO technology. A 19" rack, OpenVPX pro - cessing system will be able to deplo y 1.2 terabits per second of fabric band- width. The Intel/Serial RapidIO combina- tion is also suited for SwaP-constrained systems, as designers can maximize the power available for actual computing knowing that Serial RapidIO fabric tech- nology provides the best bandwidth/watt.
Serial RapidIO bridges implemented in FPGAs don’t support high-performance
messaging, a feature which directly maps to higher -level software APIs such as MPI. IDT’ s new bridge product will support the two main Serial RapidIO transfer modes, Serial RapidIO messaging, and memory-mapped transfers. Another benefit of the IDT silicon is the inclusion of DMA engines that speed computation while of fload- ing the host processor . Intel pro - cessors typically don’t have DMA engines on-chip, but depend instead on the peripheral chip to move data.
Without a DMA engine, moving data can require a large amount of the host proces - sor’s attention, with the result that a multi- core processor might have one of its cores (and associated power) largely consumed by moving data, which is all the more bur- densome because it has to be done in code.
Another advantage of Serial RapidIO for SWaP-constrained military systems is its ability to support distrib uted switch and centralized switch architectures. Distrib - uted switch systems (an e xample is the VITA 65 BPK6-CEN05-11.2.5-n back- plane profile) can make use of the local
Intel®
SSE Intel®
AVX
Core™ Core™
i7 2715QE NAS i7 2715QE NAS
Serial RapidIO switch and thus a void the need for a separate switch card. For example, if the system were using a ½ ATR Short enclosure (four 1" slots), this capa- bility saves 25 percent of the space and a considerable amount of power. For large systems, centralized switch architectures are often preferred, and Serial RapidIO is equally adept at this approach.
An example of a high-performance DSP engine designed to tak e full advan- tage of the latest offering for Intel’ s Core i7 is the ne w CHAMP-AV8 from Curtiss-Wright Controls Embedded Computing (Figure 2). The CHAMP-AV8 is an Intel Core i7-2715QE-based rug - ged, high-performance OpenVPX DSP engine. Performance of this dual Core i7 DSP engine is rated at up to 269 GFLOPS. It also supports the IDT Gen2 PCIe-to- Serial RapidIO bridge product, effectively tripling the bandwidth of first-generation VPX products with up to 240 Gbps of fabric performance. CS
Ian Stalker is the DSP product manager for Curtiss-Wright Controls Embedded Computing. He has more than 20 years of experience in the embedded industry and holds a degree in Electronics Engineering. Contact him at
ian.stalker@
curtisswright.com.
Alan Baldus, Intel SBC Product Marketing Manager at Curtiss- Wright Embedded Controls, has more than 15 years in the embedded computer industry. He holds a BS in Electrical Engineering. Contact him at
abaldus@curtisswright.com.
Curtiss-Wright Controls Embedded Computing 703-779-7800
www.cwcembedded.com
VX3035 - 2nd Generation Intel®
Core™ Core™ i7 2655LE processor,
Product Spotlight Core™
i7 Single Board Computer
Kontron expands its VPX ecosystem with the 3U VPX CPU board VX3035, based on the 2nd generation Intel®
i7 processor. Integrating the Intel® Intel®
HD graphics,
and features such as Intel® Turbo Boost technology and Intel® Advanced Vector Extensions (Intel®
performance class for SWaP (Size, Weight, and Power) optimized VPX applications. VX3035 supports VXFabric™
AVX), VX3035 defines a new , allowing data flow applications
using IP sockets to enjoy high-bandwidth inter-boards communication at PCIe DMA hardware speed. VX3035 is available standard air-cooled, rugged air-cooled, or rugged conduction-cooled for extreme environmental conditions. For more information –
www.kontron.com/vpx.
+1 888 294 4558 |
www.kontron.com VME and Critical Systems / Spring 2011 29
MFLOPS
Page 1 |
Page 2 |
Page 3 |
Page 4 |
Page 5 |
Page 6 |
Page 7 |
Page 8 |
Page 9 |
Page 10 |
Page 11 |
Page 12 |
Page 13 |
Page 14 |
Page 15 |
Page 16 |
Page 17 |
Page 18 |
Page 19 |
Page 20 |
Page 21 |
Page 22 |
Page 23 |
Page 24 |
Page 25 |
Page 26 |
Page 27 |
Page 28 |
Page 29 |
Page 30 |
Page 31 |
Page 32