Nvidia Tesla, AMD Epyc to Power New Berkeley Supercomputer

Nvidia Tesla, AMD Epyc to Power New Berkeley Supercomputer

Supercomputing wins are a big deal for semiconductor manufacturers. Winning space in top systems is seen as a mark of stability and longevity. It implies vendors at the top of the market and with access to important, long-term government contracts believe your hardware is robust and capable enough to be used for cutting-edge scientific research at one of the United States’ largest installations. The economics of these deals are decidedly more cloudy, but as a matter of public relations, companies tend to adore them. Both Nvidia and AMD have something to crow about today, given a new announcement from Cray and Berkeley National Labs.

The National Energy Research Scientific Computing Center will use a Cray Shasta installation for its next-generation supercomputer, codenamed “Perlmutter.” To be clear, “Shasta” is what Cray calls the supercomputer architecture, while Perlmutter is the name of the specific NERSC system to be constructed. Shasta is designed to be compatible with both ARM and x86 architectures from Intel and AMD with support for a variety of interconnect standards, including Cray’s Slingshot, Intel’s Omnipath, and Mellanox (Infiniband). For those of you who associate Infiniband with Intel, the company bought its own Infiniband technology back in 2012, but Omnipath is a different, Intel-specific technology, which Mellanox’s Infiniband competes with. Shasta is designed to integrate components and accelerators from a variety of companies, including GPUs, FPGAs, and AI-specific accelerators.

Cray’s new Slingshot interconnect is a critical part of the system (and scaling interconnect power downwards is important to how we eventually reach exascale compute capability). A short video about the new interconnect is embedded below:

We don’t have figures yet on which Epyc CPUs or how many cores NERSC will deploy — or how many GPUs it will contain. Nvidia, however, is also crowing about its own inclusion on the project, with one study claiming that 50 percent of the workloads Perlmutter will perform are capable of running on GPUs. This will be the first NERSC supercomputer focused on heterogeneous compute, so picking up the win is a nice feather in Nvidia’s cap and doubtless represents a great deal of long-term effort to ensure workloads will run well. We don’t know which Tesla products NERSC will use, either, but would expect it to be hardware near the top of Nvidia’s Tesla stack.

NERSC GPU kernels
NERSC GPU kernels

NERSC does work in a number of critical fields, including nuclear fusion research, climate modeling, materials science research, and biology research focused on molecular structure and how it relates to drug discovery and vaccine development. While this machine isn’t itself an exascale-class deployment, it’s expected to use some of the same technologies and standards we’ll deploy for exascale computing when the first systems come online (theoretically) in 2021.

Continue reading

Europe Plans 20,000 GPU Supercomputer to Create ‘Digital Twin’ of Earth
Europe Plans 20,000 GPU Supercomputer to Create ‘Digital Twin’ of Earth

The plan to create a digital twin of Earth might end up delayed due to the relative lack of available GPUs, but this isn't going to be an overnight project.

Nvidia Unveils ‘Grace’ Deep-Learning CPU for Supercomputing Applications
Nvidia Unveils ‘Grace’ Deep-Learning CPU for Supercomputing Applications

Nvidia is already capitalizing on its ARM acquisition with a massively powerful new CPU-plus-GPU combination that it claims will speed up the training of large machine-learning models by a factor of 10.

Tesla Built a Supercomputer to Develop Camera-Only Self-Driving Tech
Tesla Built a Supercomputer to Develop Camera-Only Self-Driving Tech

Tesla is talking about what it sees as the next leap in autonomous driving that could do away with lidar and radar, leaving self-driving cars to get around with regular optical cameras only.

Meta is Building a Massive New Supercomputer
Meta is Building a Massive New Supercomputer

It'll be used for real-time speech recognition, neuro-linguistic programming . . . and the metaverse, obviously.