
Tuesday, 17 November 2020 11:58

Nvidia updates HPC/AI range

By Stephen Withers

Nvidia A100 80GB GPU

HPC and AI vendor Nvidia has introduced an upgraded GPU, a new workgroup server, and a next-generation networking technology.

The Nvidia A100 80GB GPU has twice the memory of its predecessor, and with over 2TBps of memory bandwidth provides "unprecedented speed and performance" for AI and HPC applications.

"Achieving state-of-the-art results in HPC and AI research requires building the biggest models, but these demand more memory capacity and bandwidth than ever before," said Nvidia vice president of applied deep learning research Bryan Catanzaro.

"The A100 80GB GPU provides double the memory of its predecessor, which was introduced just six months ago, and breaks the 2TB per second barrier, enabling researchers to tackle the world's most important scientific and big data challenges."

For example, training recommender models such as DLRM can be done three times more quickly. The additional memory also means larger models can be trained on a single server.

Conversely, multi-instance GPU technology means an A100 can be partitioned into up to seven GPU instances, each with 10GB of memory. This provides secure hardware isolation and maximises GPU utilisation for a variety of smaller workloads, the company said.

Performance improvements can also be seen in inferencing. The RNN-T speech recognition model delivers 1.25 times higher inference throughput in production.

HPC applications also benefit. Quantum Espresso, a materials simulation, achieved throughput gains of nearly 2x on a single A100 80GB.

"Speedy and ample memory bandwidth and capacity are vital to realising high performance in supercomputing applications," said Satoshi Matsuoka, director of the RIKEN Center for Computational Science.

"The Nvidia A100 with 80GB of HBM2e GPU memory, providing the world's fastest 2TBps of bandwidth, will help deliver a big boost in application performance."

The Nvidia A100 80GB GPU is available in the Nvidia DGX A100 systems and the new Nvidia DGX Station A100.

Other vendors expected to announce systems integrating four or eight A100 80GB GPUs include Atos, Dell Technologies, Fujitsu, Gigabyte, Hewlett Packard Enterprise, Inspur, Lenovo, Quanta and Supermicro, with delivery in the first half of 2021.

The Nvidia DGX Station A100 is described as "the world's only petascale workgroup server," delivering 2.5 petaflops of AI performance.

Nvidia DGX Station A100, open view

According to Nvidia, it is the only workgroup server with four of the latest Nvidia A100 Tensor Core GPUs fully interconnected with Nvidia NVLink, with up to 320GB of GPU memory.

Nvidia Multi-Instance GPU technology means one DGX Station A100 provides up to 28 separate GPU instances to run parallel jobs and support multiple users without impacting system performance.
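The instance counts quoted above follow directly from the per-GPU figures; a quick sanity check of the arithmetic (plain illustration, using only numbers stated in the article):

```python
# Multi-Instance GPU (MIG) arithmetic implied by the figures above.
A100_MEMORY_GB = 80          # A100 80GB GPU
MIG_SLICE_GB = 10            # memory per GPU instance, per the article
MAX_INSTANCES_PER_GPU = 7    # maximum MIG partitions on one A100

# Seven 10GB instances fit within one GPU's 80GB of memory.
assert MIG_SLICE_GB * MAX_INSTANCES_PER_GPU <= A100_MEMORY_GB

# A DGX Station A100 carries four A100 GPUs, hence up to 28 instances.
GPUS_PER_DGX_STATION = 4
instances = GPUS_PER_DGX_STATION * MAX_INSTANCES_PER_GPU
print(instances)  # 28
```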

"DGX Station A100 brings AI out of the data centre with a server-class system that can plug in anywhere," said Nvidia vice president and general manager of DGX systems Charlie Boyle.

"Teams of data science and AI researchers can accelerate their work using the same software stack as Nvidia DGX A100 systems, enabling them to easily scale from development to deployment."

For data centre workloads, the A100 80GB GPUs will be available in DGX A100 systems giving 640GB per system, allowing the use of larger datasets and models.
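The 640GB per-system figure follows from the eight-GPU configuration of the DGX A100 (a sketch of the arithmetic; the eight-GPU count is the standard DGX A100 configuration rather than a number stated in this article):

```python
GPUS_PER_DGX_A100 = 8        # standard DGX A100 configuration (assumption)
MEMORY_PER_GPU_GB = 80       # upgraded A100 80GB GPU
total_gpu_memory_gb = GPUS_PER_DGX_A100 * MEMORY_PER_GPU_GB
print(total_gpu_memory_gb)   # 640
```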

These DGX A100 640GB systems can also be integrated into the Nvidia DGX SuperPOD Solution for Enterprise.

The first DGX SuperPOD systems with DGX A100 640GB will include the UK's Cambridge-1 supercomputer for healthcare research, and the University of Florida HiPerGator AI supercomputer.

Nvidia DGX Station A100 and Nvidia DGX A100 640GB systems will be available this quarter through Nvidia partner network resellers worldwide. An upgrade option will be provided for Nvidia DGX A100 320GB customers.

Nvidia Mellanox 400G InfiniBand provides "a dramatic leap in performance offered on the world's only fully offloadable, in-network computing platform," company officials said.

The seventh generation of Mellanox InfiniBand provides ultra-low latency and doubles data throughput with NDR 400Gbps and adds Nvidia In-Network Computing engines for additional acceleration.

Vendors including Atos, Dell Technologies, Fujitsu, Inspur, Lenovo and Supermicro plan to add Nvidia Mellanox 400G InfiniBand to their enterprise and HPC products.

"The most important work of our customers is based on AI and increasingly complex applications that demand faster, smarter, more scalable networks," said Nvidia senior vice president of networking Gilad Shainer.

"The Nvidia Mellanox 400G InfiniBand's massive throughput and smart acceleration engines let HPC, AI and hyperscale cloud infrastructures achieve unmatched performance with less cost and complexity."

The Nvidia Mellanox NDR 400G InfiniBand offers three times the switch port density, boosts AI acceleration power 32-fold, and increases aggregated bi-directional switch system throughput five times, to 1.64Pbps.

"Microsoft Azure's partnership with Nvidia Networking stems from our shared passion for helping scientists and researchers drive innovation and creativity through scalable HPC and AI. In HPC, Azure HBv2 VMs are the first to bring HDR InfiniBand to the cloud and achieve supercomputing scale and performance for MPI customer applications with demonstrated scaling to eclipse 80,000 cores for MPI HPC," said Microsoft head of product for Azure HPC Nidhi Chappell.

"In AI, to meet the high-ambition needs of AI innovation, the Azure NDv4 VMs also leverage HDR InfiniBand with 200Gbps per GPU, a massive total of 1.6Tbps of interconnect bandwidth per VM, and scale to thousands of GPUs under the same low-latency InfiniBand fabric to bring AI supercomputing to the masses. Microsoft applauds the continued innovation in Nvidia's Mellanox InfiniBand product line, and we look forward to continuing our strong partnership together."

The third generation of Nvidia Mellanox Sharp technology allows deep learning training operations to be offloaded and accelerated by the InfiniBand network, resulting in 32 times higher AI acceleration power.

Edge switches based on the Mellanox InfiniBand architecture carry an aggregated bi-directional throughput of 51.2Tbps, while modular switches will carry an aggregated bi-directional throughput of 1.64Pbps, five times that of the previous generation.
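Those throughput figures line up as follows (illustrative arithmetic only; the derived 64-port edge-switch count is an implication of the quoted numbers, not a figure stated in the article):

```python
PORT_SPEED_GBPS = 400             # NDR per-port line rate
edge_aggregate_tbps = 51.2        # bi-directional, per the article

# Implied edge port count: aggregate / (port speed x 2 directions)
ports = edge_aggregate_tbps * 1000 / (PORT_SPEED_GBPS * 2)
print(ports)  # 64.0

modular_aggregate_pbps = 1.64     # bi-directional, per the article
# "five times that of the previous generation" implies the prior figure:
previous_gen_tbps = modular_aggregate_pbps * 1000 / 5
print(previous_gen_tbps)  # 328.0
```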

NVIDIA Mellanox 400G InfiniBand

Products based on Nvidia Mellanox NDR 400G InfiniBand are expected to become available in sample form in the second quarter of 2021.

Stephen Withers

Stephen Withers is one of Australia's most experienced IT journalists, having begun his career in the days of 8-bit 'microcomputers'. He covers the gamut from gadgets to enterprise systems. In previous lives he has been an academic, a systems programmer, an IT support manager, and an online services manager. Stephen holds an honours degree in Management Sciences and a PhD in Industrial and Business Studies.
