vme and critical systems,gpus          Other topics:   OpenVPX, RTOS, multicore, VPX, AdvancedTCA, microcontroller, FPGAs, analog-to-digital
VME and Critical Systems
home
articles & topics
product search
White Papers
newswire
E-letter
E-cast Schedule
articles >
Technology Feature
vme and critical systems,gpus
RSS Link
Industry News:
vme and critical s...
  • NVIDIA Releases Version 2.1 Beta of the CUDA Toolkit and SDK
    1 year ago
  • NVIDIA CUDA Toolkit 2.2 Released
    1 year ago
  • PCI Embedded announces availability of VME Products Jackson, California, September 17, 2008
    1 year ago
  • More Industry News headlines...
Technology Partnerships:
vme and critical s...
  • US Technologies Offers Testing and Repair of VME, MVME, VMEbus Products
    1 year ago
  • Motorola and Hybricon Collaborate to Develop Proof-of-Concept for Conduction-Cooled MicroTCA Platform
    3 years ago
  • Tokyo Tech Builds First Tesla GPU Based Heterogeneous Cluster to Reach Top 500
    1 year ago
  • More Technology Partnerships headlines...
Contracts:
vme and critical s...
  • NVIDIA Tesla GPU Computing Solutions Selected for Flagship Z Workstation from HP
    11 months ago
  • Tundra Semiconductor's Serial RapidIO Switch Selected by VMETRO
    2 years ago
  • Tekmicro supplies signal processing system for NASA
    3 years ago
  • More Contracts headlines...
New Products:
vme and critical s...
  • Mercury Computer Systems Tackles Processing, Exploitation, and Dissemination Challenge with Powerful GPU-Based Rugged Solution
    Last month
  • Elma Bustronic has Over 30 Standard Slot Sizes for 6U and 7U VME64x Backplanes
    10 months ago
  • New VPX Mesh Hybrid Backplane from Bustronic
    1 year ago
  • More New Products headlines...
People:
vme and critical s...
  • BittWare Expands Technical Management Team
    2 years ago
  • USMC 234th Birthday Tribute Video
    8 months ago
  • VMETRO Bolsters Leadership Team
    5 years ago
  • More People headlines...
Mergers and Acquisitions:
vme and critical s...
  • From the Blog: Former Motorola Manager Sounds off on Emerson's Acquisition
    2 years ago
  • Eurotech Acquires Japanese Embedded Systems Company Advanet
    2 years ago
  • Kontron signs contract to acquire Thales Computers
    2 years ago
  • More Mergers and Acquisitions headlines...
Conferences and Awards:
vme and critical s...
  • CUDA Cleans up at Supercomputing Industry Awards
    1 year ago
  • Scientists Explore Astrophysical Problems with NVIDIA GPUs
    2 years ago
  • Diversified Technology, Inc. to Present at the AdvancedTCA Summit
    4 years ago
  • More Conferences and Awards headlines...
Media and Education:
vme and critical s...
  • OpenSystems Publishing Renames VMEbus Systems Magazine to 'VME and Critical Systems' Magazine
    3 years ago
  • Free Jacket Webinar: MATLAB Acceleration for Life Science Applications using CUDA-enabled GPUs
    1 year ago
  • OpenSystems Publishing Launches New VME E-site
    3 years ago
  • More Media and Education headlines...
Standard Certifications and References:
vme and critical s...
  • PICMG adds Ethernet Fabric to Advanced Mezzanine Card
    3 years ago
  • BittWare Commits to Long Term VITA 41 VXS Roadmap
    4 years ago
  • New PXI MultiComputing(tm) Specification Enables Multi-Controller Systems with High Performance Communication
    7 months ago
  • More Standard Certifications and References headlines...
Browse topics
Search Articles
Browse Articles
See Also:
Military Articles
Embedded Computing Articles
CompactPCI Articles
Magazine >

About the Magazine
Editorial Topics
Free Subscription
Reader Service Card
Search Articles
Search Products
Contact Information
Columns

Editor's Foreword
VITA News
VITA Standards
Technology in Europe
Military Technology Insider
Guest Editorial
Defining Standards
Departments

Editor's Choice Products
by Chris A. Ciufo
VMEnow Blog
What is VME?
VME: Then & NOW
Webcasts

Upcoming E-casts
Archived E-casts
Submissions

Submit a Press Release
Submit a New Product
Submit an Abstract for Review
Vendors/Sponsors

Do an E-cast
Preferred Vendors
Upcoming Issue
Advertise
Editorial Calendar
Media Kits










GPUs lend key flexibility in high-performance military computing systems

By
Leslee Schneider
Quantum3D, Inc.

With changing political and even social environments driving demand for the latest “run-faster, jump-higher” processing wares, GPUs – combined with GPPs, DSPs, and/or FPGAs – just might make the “miracle” supercomputing system a reality. Variables to consider include power consumption, compute power, life-cycle extension, and physical environment.

The world of high-performance computing is particularly challenging, as new flavors of sophisticated sensors, complex cameras, and even changing political or social environments drive demand for the latest and greatest “run-faster, jump-higher” processing wares. Advanced computations, which include Software-Defined Radio, cryptography, and other types of arithmetic-intensive algorithms, are valued in R&D endeavors. And great progress has been made by using supercomputers, computer grids, clusters, clouds, gangs, and other forms of networked or otherwise connected compute nodes for such work. This is all well and good, until one wants to use this technology in real-world applications, particularly in military deployed environments such as unmanned vehicles or soldier-worn computers, where power consumption requirements – and certainly the weather – come into play.

Consider, for example, the real-world environment of a helicopter brownout, where billows of dust or sand caused by the helicopter rotors during takeoff and landing obstruct visual cues necessary for operational safely. Real-time synthetic vision systems deployed on the aircraft might assist pilots in these situations, but these systems must be designed to operate within the power budgets for the aircraft. It would be terrific if deployed high-performance computing requirements were simply solved with a one-size-fits-all system that was intrinsically power-efficient for any application and deployable in any environment.

This miracle of a system does not exist, and so the challenges remain: What kind of processing can best empower a military embedded system to provide supercomputing processing requirements that change nearly in real-time over product life cycles that could span decades? While none of them alone serves as the best remedy, modern Graphics Processing Units (GPUs) are providing a viable remedy when combined with other computing “rivals” – the General Purpose Processor (GPP), DSP, and FPGA. Variables of power consumption, environment, computing performance, and life cycle are examined for these four processing technologies. Additionally, an example of a GPU-based system helps exemplify this principle.

Traditional GPPs

GPPs, also known as “traditional CPUs,” feature ever-improving performance, well-understood, and mature software development tools – and are available in many form factors. The downside of using these GPPs in embedded high-performance applications might be limited product lifespan brought about by end-of-life commercial components or changes in platform support, along with latency issues that are always a concern with real-time applications (particularly vehicle systems or soldier-worn equipment). Meanwhile, environmental concerns can result in thermal issues and reduced quality of service in cold temperatures, and power consumption draws can run high.

High-throughput DSPs

One alternative to the GPP is the classic DSP, typically offering lower-power, low-latency components with high-throughput potential: These are all good things, excepting the pain that comes with learning the development tools and the slower relative processing performance, both of which mean that DSPs are not always real-world practical for military applications. Networks of DSPs are a tried-and-true solution for parallel processing requirements but magnify the shortcomings of the technology in general. Long development cycles abound, and deployment and maintenance can be difficult, sometimes limiting the usefulness of the technology. Furthermore, DSPs, like other processors, are becoming more power-hungry as their performance increases, meaning that heat dissipation and power usage must be addressed.

Reconfigurable FPGAs

Another alternative to GPPs is the FPGA, which has found a niche in the high-performance computing world, often utilized as a coprocessing device to massage data. Their inherent parallel architecture and performance – as related to processing power, latency, and throughput – are well-suited to many types of mission-critical signal processing applications. The field-programmable aspect of the FPGA’s processing unit is also highly beneficial, as updates can be implemented in near real time.

Although excessive power consumption can be an issue with FPGAs in embedded applications, power usage can usually be managed according to a given technical requirement. However, FPGAs are not always available in extended-temperature or rugged packaging, limiting their use in systems designed for harsh environments. FPGAs can also have longer product life cycles; if a particular device is end-of-life, functional deployed application firmware can usually be employed on a newer part with little additional effort.

Flexible GPUs

In contrast, GPUs are traditionally tasked with compute-intensive, floating-point graphics functions such as 3D rendering and texture mapping. However, some modern GPUs are structured much like parallel-architecture supercomputers and are being used for numerical, signal processing, physics, general scientific, or even statistical applications – all of which might be viable applications on the battlefield.

Programming tools developed for this purpose, essentially extensions of the ubiquitous high-level C as well as C++ (and recently Fortran programming languages), leverage GPU parallel compute engines to solve complex computational problems. These computations include largely parallelizable problems, which can be solved in significantly shorter timeframes by the GPU – in some cases 100x faster – than by a traditional CPU. This computing paradigm is called General Purpose computing on Graphics Processing Units or GPGPU.

Figure 1 depicts a traditional CPU versus two generations of NVIDIA GPUs, measured in iterations per second. Test 1 and Test 2 are two algorithms from a well-known benchmark suite. They both benefit from porting to the GPGPU, but by differing degrees. Comparison of the two results demonstrates that porting both algorithms to GPGPU benefits both algorithms. One can also conclude that the improvement in performance for well-selected algorithms warrants the effort of porting by the degree of improvement over a pure CPU implementation. A perfect CPU multicore port would multiply the leftmost column by a small integer number relating to the number of CPU cores available, whereas the GPGPU results are several orders of magnitude better, and seem to be increasing per generation of GPU at a greater rate than seems plausible for CPUs.

Figure1
Figure 1: A traditional CPU versus two generations of NVIDIA GPUs, measured in iterations per second, showing increasing superiority of GPGPUs.
(click graphic to zoom by 1.5x)

Additionally, GPUs are available in extended temperature and rugged packages, making them suitable for deployment on airborne or other environmentally challenging platforms. The projected GPU lifespan can be limited, but with careful material planning, this can be managed. As with GPPs, care must also be used with power management and heat dissipation, particularly with small form factor  systems.

Enter a flexible GPU system architecture

So back to our original question: What kind of “miracle” processing system can best provide supercomputing processing requirements that change nearly in real time over product life cycles that could span decades – while taking power consumption and environmental concerns into consideration?

The answer: A system of GPUs combined with traditional CPUs, DSPs, or FPGAs. Such a “miracle” system would allow a developer’s specific signal processing or other algorithms to deploy effectively. These GPU-based systems can be designed to execute many operations (OPS, FLOPs, Teraflops) of usable processing. Moreover, the architecture could comprise a highly customized suite of 6U VPX boards, consisting of one or more compute nodes, an input/output board, an InfiniBand switch, and a management node, all housed in a conduction-cooled chassis and supported by a Linux-based operating system. Estimated system-level performance could then be as high as 1.55 GF/W at theoretical peak/Thermal Design Point (TDP). Fully populated, a GPU-based high-performance computing system could provide a theoretical 1.94 TF of floating-point computation in a physical package of less than 2 cubic feet, which would elicit a power draw of only slightly more than 1 kilowatt.

Such a GPU-based high-performance computing system could additionally include a 6U VPX carrier card, rendering it a configurable SBC with graphics capability. These flexible SBCs could then be installed in an air-cooled chassis, with forced-air cooling, or could be used in a rugged conduction-cooled chassis suitable for harsh environments such as for airborne or naval deployment.

As technology advances, the COTS modules can be upgraded, extending the system’s life cycle. Because the deployed form factor does not need to change upon upgrade, enabling deployed systems to be quickly upgraded – or even downsized – to fit power budgets or other environmental constraints is easier. An example of a GPU-based high-performance system suitable for military deployment is Quantum3D’s Katana system, based on the Tanto compute node.

Flexibility is good

Leveraging GPUs ‚Äì combined with various compute options including GPPs, DSPs, and/or FPGAs ‚Äì on a rugged and flexible hardware platform is a step in the right direction to creating a ‚Äúmiracle‚Äù supercomputing platform for modern military systems. The ability to upgrade boards or modules in a rugged chassis increases the length of the product duty cycle and allows adaptation to environmental factors. As processing capabilities improve and technologies such as GPGPU progress, the ability to design in an already-qualified VPX chassis, such as Quantum3D‚Äôs Tanto VPX system (Figure 2), for example, is an added benefit. CS

Figure2
Figure 2: Quantum3D’s Tanto VPX system

Leslee Schneider is director of embedded product marketing at Quantum3D, Inc. She joined Quantum3D in 2007 with more than 12 years in technical computing experience. Prior to joining Quantum3D, she held management positions at Transtech DSP, Synergy Microsystems, and Curtiss-Wright Controls Embedded Computing. She holds a Bachelor’s degree from the University of California at Berkeley. Leslee can be reached at lschneider@quantum3d.com.

Quantum3D, Inc. 408-600-2595 www.quantum3d.com




©MMIX VME and Critical Systems. An OpenSystems Media, LLC publication.
About this Magazine and Website | Contact Us | VME and Critical Systems Media Kit