There's no denying that ChatGPT and generative AI have captured the world's attention. Artificial intelligence has been part of computing since its inception, and a staple of science fiction for longer still. AI has developed over time, of course, and history shows its advances, and the opportunities they open up, are directly tied to the availability of computational power.
Machine learning, for example, brought great advances in AI by flipping problems on their head: instead of trying to teach a computer to recognise an image of a cat based on adjacent pixels and colours, researchers could feed a large set of images to a machine learning model and effectively say, "this is a bunch of cat photos; you work out what's a cat." And machine learning itself became possible thanks to the power, scalability, and capacity of cloud computing.
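To make that "you work out what's a cat" idea concrete, here is a minimal sketch of the labelled-data approach using scikit-learn. The data here is a random stand-in for real photos, and flattened pixels are a deliberately naive feature choice; the point is only the shape of the workflow, not a production classifier.

```python
# A minimal sketch of supervised learning: rather than hand-coding rules
# about pixels, we hand the model labelled examples and let it find the
# pattern itself. Stand-in data keeps the example self-contained.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# 200 fake "images" of 32x32 greyscale pixels, half labelled cat (1) and
# half not-cat (0). In a real system these would be real photographs.
X = rng.random((200, 32 * 32))
y = np.array([1] * 100 + [0] * 100)

X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# "This is a bunch of cat photos; you work out what's a cat."
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
print(f"held-out accuracy: {model.score(X_test, y_test):.2f}")
```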
Yet machine learning, for all its strengths, has been dwarfed by the massive explosion of interest in generative AI.
Generative AI explosion requires greater infrastructure
It scarcely seems only 12 months since OpenAI unveiled ChatGPT to the world and demonstrated the impressive, expressive power generative AI offers. AMD chair, president, and CEO Dr Lisa Su said, "AI is the most transformational technology in 50 years. Maybe the only thing close is the introduction of the Internet, but with AI the adoption has been much, much quicker and we're only at the beginning of the AI era."
She continued, "ChatGPT has sparked a revolution that transformed the technology landscape. AI hasn't simply progressed but exploded. The year has shown us AI isn't simply a cool new thing, but the future of IT."
Generative AI is being used everywhere, she said: healthcare, climate research, AI assistants, robotics, security, and a wealth of tools for content creation.
Yet this takes power. Massive power. Generative AI has become the most demanding data centre workload, requiring infrastructure, and especially GPUs, to train models with billions of parameters, and then demanding similar power again at run time to answer user questions against those models. The amount of infrastructure that can be given to generative AI directly determines how extensive a model can be, and how rapidly and deeply it can answer questions.
Dr Su explained that last year AMD sought to estimate the infrastructure growth generative AI would drive. At the time AMD figured a 50% CAGR (compound annual growth rate), meaning a spend of 30 billion US dollars in 2023 would become over 150 billion US dollars in 2027. "That felt like a big number," Su said, "but as we look at everything that happened in the last 12 months and the rate of change in industry across the world, it's clear demand is growing faster."
Thus, AMD revised its figures and now forecasts a 70% CAGR in data centre AI accelerators, growing from an actual spend of 45 billion US dollars in 2023 to well over 400 billion US dollars in 2027.
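For context, CAGR compounds annually, so both forecasts are a simple exponential. A quick sketch of the arithmetic, using the figures from the article (the formula itself is standard):

```python
# CAGR compounds annually: spend_end = spend_start * (1 + rate) ** years.
def project(spend_start_billions: float, cagr: float, years: int) -> float:
    """Project spend forward at a compound annual growth rate."""
    return spend_start_billions * (1 + cagr) ** years

# AMD's original forecast: US$30B in 2023 at 50% CAGR, 2023 to 2027.
print(f"50% CAGR: ${project(30, 0.50, 4):.0f}B in 2027")  # ~$152B

# Revised forecast: US$45B in 2023 at 70% CAGR, 2023 to 2027.
print(f"70% CAGR: ${project(45, 0.70, 4):.0f}B in 2027")  # ~$376B
```

Note that a flat 70% over four years lands a little under the 400 billion US dollar headline figure, which suggests the rate AMD actually has in mind is somewhat above 70%.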
AMD's AI strategy
Expressing the company's commitment to advancing end-to-end AI infrastructure across cloud, HPC, enterprise, embedded, and PC, Dr Su said, "our AI strategy is centred around three big priorities."
Specifically:
- broad portfolio of training and inference compute engines
- open and proven software capabilities
- AI ecosystem with deep co-innovation
Introducing the AMD MI300X
"The availability and capability of GPU compute is the single biggest driver of AI adoption," Dr Su said, before unveiling the AMD Instinct MI300X. "It is the highest performance accelerator in the world for generative AI," she said.
The MI300X is no small chip; it's a hefty silicon sandwich combining GPU compute dies, I/O dies, and stacked high-bandwidth memory, building upon AMD's previous Instinct accelerators. The MI100, launched in 2020, was the first purpose-built GPU architecture to accelerate FP64 and FP32 HPC workloads. The second generation, the MI200, introduced a denser compute architecture with leading memory capacity and bandwidth. And now the MI300, launched today, brings focused improvements in unified memory, AI data format performance, and in-node networking. It is optimised for performance and power efficiency, and allows generative AI models to be trained for longer, or more models to be trained simultaneously, than ever before.
Spec-wise, the AMD MI300X is a beast. It sports 192GB of HBM3 memory, with a peak theoretical memory bandwidth of 5.3 TB/s, and up to 896 GB/s of AMD Infinity Fabric bandwidth. It stacks 8x XCD compute dies on 4x IODs (I/O dies), with 8x HBM3 stacks, 3.5D packaging, and 256MB of AMD Infinity Cache technology.
AMD says the nearest competitor is the Nvidia H100, and while bandwidth and network performance are roughly equivalent between the two GPU beasts, the MI300X brings 2.4x more memory (192GB versus the H100's 80GB) and 1.3x more compute. In short, with less rack space, lower capex, and lower opex, customers can run more models, or larger models, on the same server; AMD claims twice the number of LLMs for training and inference compared with Nvidia.
AMD already has partners taking up the MI300X, with Microsoft Azure and Oracle Cloud Infrastructure on board from day one. Additionally, OpenAI's Philippe Tillet said, "OpenAI is working with AMD in support of an open ecosystem. We plan to support AMD's GPUs including MI300 in the standard Triton distribution starting with the upcoming 3.0 release."
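Triton is OpenAI's Python-embedded language for writing GPU kernels, and the significance of MI300 support is that the same kernel source can target AMD hardware as well as Nvidia's. To give a flavour, here is a minimal tutorial-style vector-addition kernel using the standard Triton API; this is an illustrative sketch, not AMD's or OpenAI's own code.

```python
# A minimal Triton kernel: element-wise vector addition. The same Python
# source compiles for whichever GPU backend Triton targets, which is the
# point of the MI300 support mentioned above.
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)            # which block this instance handles
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements            # guard the final partial block
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

def add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    n = out.numel()
    grid = (triton.cdiv(n, 1024),)         # one program instance per block
    add_kernel[grid](x, y, out, n, BLOCK_SIZE=1024)
    return out

x = torch.rand(4096, device="cuda")        # ROCm builds of PyTorch also
y = torch.rand(4096, device="cuda")        # expose AMD GPUs via "cuda"
assert torch.allclose(add(x, y), x + y)
```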
Software and co-innovation
AMD also announced ROCm 6, the latest version of its software stack, bringing advanced LLM optimisations and performant AI; combined with the MI300X, AMD says it delivers about 8x the performance of the previous generation. So much so that AMD president Victor Peng said, "this is an inflection point for developers."
"Innovators are advancing the state of AI on AMD GPUs now," but with the MI300X and ROCm 6, "we're empowering innovators to realise the transformational power of generative AI faster."
AMD also announced partnerships with Hugging Face, PyTorch, ONNX, JAX, and others, in addition to those with Azure, Oracle, and OpenAI above.
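In practice, these partnerships mean existing model code runs on AMD hardware largely unchanged: ROCm builds of PyTorch expose AMD GPUs through the familiar torch.cuda interface. A minimal sketch of what that looks like with a Hugging Face pipeline; "gpt2" here is just a small public model chosen for illustration.

```python
# Running a Hugging Face model on an AMD GPU: with a ROCm build of PyTorch
# the device appears through the usual torch.cuda interface, so this code
# is identical to what you'd run on any other GPU.
import torch
from transformers import pipeline

device = 0 if torch.cuda.is_available() else -1  # GPU if present, else CPU
generator = pipeline("text-generation", model="gpt2", device=device)

print(generator("Generative AI is", max_new_tokens=20)[0]["generated_text"])
```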
Further, AMD has partnered with Meta, with Meta AI senior director of engineering Ajit Mathews saying Meta's testing with the MI300X and ROCm 6 has shown significant optimisations and promising performance numbers.
Additionally, Dell president of core business operations, global infrastructure solutions group, Arthur Lewis announced the Dell PowerEdge XE9680 server using the AMD MI300X accelerator, bringing a smaller-footprint, low-latency server with an out-of-the-box LLM experience. "As of today we're open for business, ready to quote, and taking orders," Lewis said.
Additional partners include Supermicro and Lenovo.