iTWire - Cloudera unveils AI inference service with embedded Nvidia NIM microservices to accelerate genAI development and deployment

iTWire

Message

Failed loading XML... Document is empty

Wednesday, 09 October 2024 11:19

Cloudera unveils AI inference service with embedded Nvidia NIM microservices to accelerate genAI development and deployment

By Cloudera

Comments:0 Comments

Cloudera unveils AI inference service with embedded Nvidia NIM microservices to accelerate genAI development and deployment

COMPANY NEWS: Cloudera, the only true hybrid platform for data, analytics, and AI, today launched Cloudera AI Inference powered by Nvidia NIM microservices, part of the Nvidia AI Enterprise platform. As one of the industry’s first AI inference services to provide embedded NIM microservice capability, Cloudera AI Inference uniquely streamlines the deployment and management of large-scale AI models, allowing enterprises to harness their data’s true potential to advance genAI from pilot phases to full production.

Recent data from Deloitte reveals the biggest barriers to genAI adoption for enterprises are compliance risks and governance concerns, yet adoption of genAI is progressing at a rapid pace, with over two-thirds of organisations increasing their genAI budgets in Q3 this year. To mitigate these concerns, businesses must turn to running AI models and applications privately - whether on premises or in public clouds. This shift requires secure and scalable solutions that avoid complex, do-it-yourself approaches.

Cloudera AI Inference protects sensitive data from leaking to non-private, vendor-hosted AI model services by providing secure development and deployment within enterprise control. Powered by Nvidia technology, the service helps to build trusted data for trusted AI with high-performance speeds, enabling the efficient development of AI-driven chatbots, virtual assistants, and agentic applications impacting both productivity and new business growth.

The launch of Cloudera AI Inference comes on the heels of the company’s collaboration with Nvidia, reinforcing Cloudera’s commitment to driving enterprise AI innovation at a critical moment, as industries navigate the complexities of digital transformation and AI integration.

Developers can build, customise, and deploy enterprise-grade LLMs with up to 36x faster performance using Nvidia Tensor Core GPUs and nearly 4x throughput compared with CPUs. The seamless user experience integrates UI and APIs directly with Nvidia NIM microservice containers, eliminating the need for command-line interfaces (CLI) and separate monitoring systems. The service integration with Cloudera’s AI Model Registry also enhances security and governance by managing access controls for both model endpoints and operations. Users benefit from a unified platform where all models—whether LLM deployments or traditional models—are seamlessly managed under a single service.

Additional key features of Cloudera AI Inference include:

Advanced AI Capabilities: Utilise Nvidia NIM microservices to optimise open-source LLMs, including LLama and Mistral, for cutting-edge advancements in natural language processing (NLP), computer vision, and other AI domains.
Hybrid Cloud & Privacy: Run workloads on prem or in the cloud, with VPC deployments for enhanced security and regulatory compliance.
Scalability & Monitoring: Rely on auto-scaling, high availability (HA), and real-time performance tracking to detect and correct issues, and deliver efficient resource management.
Open APIs & CI/CD Integration: Access standards-compliant APIs for model deployment, management, and monitoring for seamless integration with CI/CD pipelines and MLOps workflows.
Enterprise Security: Enforce model access with Service Accounts, Access Control, Lineage, and Auditing features.
Risk-Managed Deployment: Conduct A/B testing and canary rollouts for controlled model updates.

“Enterprises are eager to invest in genAI, but it requires not only scalable data but also secure, compliant, and well-governed data," said industry analyst Sanjeev Mohan. "Productionising AI at scale privately introduces complexity that DIY approaches struggle to address. Cloudera AI Inference bridges this gap by integrating advanced data management with Nvidia's AI expertise, unlocking data's full potential while safeguarding it. With enterprise-grade security features like service accounts, access control, and audit, organisations can confidently protect their data and run workloads on prem or in the cloud, deploying AI models efficiently with the necessary flexibility and governance."

“We are excited to collaborate with Nvidia to bring Cloudera AI Inference to market, providing a single AI/ML platform that supports nearly all models and use cases so enterprises can both create powerful AI apps with our software and then run those performant AI apps in Cloudera as well,” said Cloudera chief product officer Dipto Chakravarty. “With the integration of Nvidia AI, which facilitates smarter decision-making through advanced performance, Cloudera is innovating on behalf of its customers by building trusted AI apps with trusted data at scale.”

"Enterprises today need to seamlessly integrate generative AI with their existing data infrastructure to drive business outcomes," said Nvidia vice president of AI software, models, and services Kari Briski. "By incorporating Nvidia NIM microservices into Cloudera's AI Inference platform, we're empowering developers to easily create trustworthy generative AI applications while fostering a self-sustaining AI data flywheel".

“The responsible deployment of AI is crucial for Australian businesses to build trust and ensure its ethical use. However, establishing trusted large-scale data sets is undeniably complex," said Cloudera regional vice president ANZ Keir Garrett. The Cloudera AI Inference service powered by NVIDIA helps our customers navigate the challenges around compliance and governance by enabling the acceleration of GenAI application development at scale while maintaining optimal performance, flexibility and security.”

Click here to learn more about how these latest updates deepen Cloudera’s commitment, elevating enterprise data from pilot to production with genAI.

About Cloudera
Cloudera is the only true hybrid platform for data, analytics, and AI. With 100x more data under management than other cloud-only vendors, Cloudera empowers global enterprises to transform data of all types, on any public or private cloud, into valuable, trusted insights. Our open data lakehouse delivers scalable and secure data management with portable cloud-native analytics, enabling customers to bring genAI models to their data while maintaining privacy and ensuring responsible, reliable AI deployments. The world's largest brands in financial services, insurance, media, manufacturing, and government rely on Cloudera to use their data to solve what seemed impossible—today and in the future.

To learn more, visit Cloudera.com and follow us on LinkedIn and X. Cloudera and associated marks are trademarks or registered trademarks of Cloudera, Inc. All other company and product names may be trademarks of their respective owners.

Read 986 times

Please join our community here and become a VIP.

Subscribe to ITWIRE UPDATE Newsletter here
JOIN our iTWireTV our YouTube Community here
BACK TO LATEST NEWS here

EXL AI IN ACTION VIRTUAL EVENT 20 MARCH 2025

Industry leaders are looking to transform their businesses and achieve measurable outcomes with AI.

As organisations across APAC navigate the complexities of AI adoption, this must-attend event brings together industry leaders, real-world demonstrations, and visionary panel discussions to bridge the gap between proof-of-concepts and enterprise-wide AI implementation.

Learn how to overcome common challenges in deploying AI at scale.

Unlock cost savings, efficiency, and better customer experiences with AI.

Discover how industry expertise and data intelligence enable practical AI deployment.

Register for the event now!

REGISTER!

PROMOTE YOUR WEBINAR ON ITWIRE
It's all about Webinars.

Marketing budgets are now focused on Webinars combined with Lead Generation.

If you wish to promote a Webinar we recommend at least a 3 to 4 week campaign prior to your event.

The iTWire campaign will include extensive adverts on our News Site itwire.com and prominent Newsletter promotion https://itwire.com/itwire-update.html and Promotional News & Editorial. Plus a video interview of the key speaker on iTWire TV https://www.youtube.com/c/iTWireTV/videos which will be used in Promotional Posts on the iTWire Home Page.

Now we are coming out of Lockdown iTWire will be focussed to assisting with your webinars and campaigns and assistance via part payments and extended terms, a Webinar Business Booster Pack and other supportive programs. We can also create your adverts and written content plus coordinate your video interview.

We look forward to discussing your campaign goals with you. Please click the button below.
MORE INFO HERE!

BACK TO HOME PAGE

Published in Company news

Tagged under

Cloudera

Nvidia

Related items

Lenovo Hybrid AI Advantage with NVIDIA boosts Business Productivity and Efficiency with New Scalable Agentic AI Solutions

SoftBank develops Large Telecom Model using genAI

Nvidia, Cisco, T-Mobile team to develop AI-Native wireless networks for 6G

Hitachi Vantara Introduces Hitachi iQ M Series, a Modular Design with Hybrid Cloud Data Orchestration for GenAI and Industry-Specific Workloads

More in this category: « Genetec Retains World Leader Position In Video Management Software, Analysts Confirm NielsenIQ mid-year consumer outlook shows more buying decisions influenced by AI »
Share News tips for the iTWire Journalists? Your tip will be anonymous

back to top

Subscribe to Newsletter

* Enter the security code shown:

WEBINARS & EVENTS

UiPath to Unveil Latest Agentic Automation Solutions at Agentic AI Summit
UiPath to hold Agentic AI Summit on March 25 to…

ALL WELCOME - 6 DAYS TO GO - AI in Action is your opportunity to gain actionable strategies for deploying scalable, reliable AI solutions that drive measurable business outcomes.
AI in Action As organisations across APAC…

TODAY FREE VIRTUAL EVENT - EXL SERVICE, AI in action Driving the shift to scalable AI
Through a blend of visionary discussions, real-world demos, and expert…

Expert warns: Small businesses missing out on AI cost-saving and growth opportunities
Experienced business marketing and sales strategist, Jennifer Benedek, founder and…

Datadog Opens Registration for Its 2025 DASH Conference
GUEST EVENTS: The annual conference will take place in New York…

CYBERSECURITY

Milestone Systems Expands XProtect with Enhanced CLOUD Integration, Advanced Vehicle Analytics
Milestone Systems, a leading provider of open platform video management…

Fastly Empowers Organisations to Prioritise Security Without Disrupting End-User Experiences
Latest Fastly Bot Management update reduces CAPTCHA reliance, enhances bot…

Enhancing Threat Intelligence and Threat Detection in Australian Central Government Organisations
Enhancing Threat Intelligence and Threat Detection in Australian Central Government…

A Million Phishing-as-a-Service Attacks in Two Months Highlight a Fast-Evolving Threat
In the first two months of 2025, Barracuda detection systems…

Hundreds of ‘malicious’ Google Play-hosted apps bypassed Android 13 Security ‘with ease’
COMPANY NEWS: Bitdefender's security researchers have identified a large-scale ad…

PEOPLE MOVES

Seeing Machines appoints John Noble as technology chief
Advanced computer vision technology company Seeing Machines has appointment John…

Interactive expands ‘South Australian presence’ with key sales appointment
IT service providers Interactive has appointed of Darren Broadbent as…

Pure Storage announces Altay Ayyuce as Area Vice President for Australia & New Zealand
Pure Storage® announced Altay Ayyuce as Area Vice President of Australia…

SYSPRO appoints Josef Al-Sibaie to ‘drive global growth and strategic expansion’
Global software provider for the manufacturing and distribution industries SYSPRO…

Nutanix Appoints Jay Tuseth as New APJ Leader
Jay Tuseth has joined Nutanix as Vice President and General…

GUEST ARTICLES

SOTI and Urovo Partner to Combat Device Downtime with Proactive Battery Management and Remote Support
COMPANY NEWS by SOTI: SOTI a leading provider of Enterprise…

SOTI and Urovo Partner to Combat Device Downtime with Proactive Battery Management and Remote Support
COMPANY NEWS by: SOTI a leading provider of Enterprise Mobility…

Trend Micro customers lower cyber risk scores through proactive security
GUEST RESEARCH: The newly published report harnesses data from Trend’s…

New Splunk Survey Highlights Financial Impact of Downtime for Australian Businesses
GUEST RESEARCH: New report shows unplanned and cyber incident downtime…

Transitioning to a new ERP system is like open heart surgery for business
GUEST OPINION: Transitioning from one enterprise resource planning (ERP) system…

Why you should think twice before using a VPN
GUEST OPINION: Virtual private networks (VPNs) are known for protecting…

Radware Named as a Strong Performer in Analyst Report for Web Application Firewall Solutions
GUEST RESEARCH: Radware (NASDAQ: RDWR), a global leader in application security and…

Pronto Software achieves Australian owned certification
COMPANY NEWS: Home-grown certification reflects the leading ERP provider’s commitment…

4 Ways to streamline your hiring process
GUEST OPINION: Prolonged and inefficient hiring processes have far-reaching consequences…

The Truth About Browser Add-Ons: Are They Safe or a Hidden Threat?
GUEST OPINION: Browser add-ons are a necessary component of Internet…

Guest Opinion

Transitioning to a new ERP system is like open heart surgery for business
GUEST OPINION: Transitioning from one enterprise resource planning (ERP) system…

Why you should think twice before using a VPN
GUEST OPINION: Virtual private networks (VPNs) are known for protecting…

4 Ways to streamline your hiring process
GUEST OPINION: Prolonged and inefficient hiring processes have far-reaching consequences…

The Truth About Browser Add-Ons: Are They Safe or a Hidden Threat?
GUEST OPINION: Browser add-ons are a necessary component of Internet…

Blockchain vs. identity theft: Is this the end of digital fraud as we know it?
GUEST OPINION: In an era where data is the new…

AI in Cybersecurity — Friend or Foe?
GUEST OPINION: The cybersecurity landscape in Australia has evolved dramatically.…

6 Top Tips for Increasing Employee ROI
GUEST OPINION: Your employees are one of your biggest business…

Benefits of Add-ons and Riders for Family Health Insurance Policies
In this evolving world, where people are often preoccupied with…

How to apply for a credit card against your fixed deposit
GUEST OPINION: Traditional Credit Card applications often pose challenges, particularly…

Unpacking the power of personalisation in your tech stack
GUEST OPINION: Personalisation has become the most exciting and important…

ITWIRETV & INTERVIEWS

Amazon CISO CJ Moses gives rare interview
GUEST INTERVIEW: CJ Moses moves out of the shadows to…

Matt Salier explains the Australian Cyber Collaboration Centre's voluntary data classification framework
Cybercriminals never rest and company data is an increasingly valuable…

Qualys CEO Sumedh Thakar explains the Risk Operations Centre (ROC)
iTWireTV: Special guest Qualys CEO Sumedh Thakar tells us about…

iTWire talks to SailPoint about identity management in the Enterprise
GUEST INTERVIEW: We talk to Andrew Moore, VP of Product,…

How Blue Yonder is applying AI and innovation to solve supply chain challenges
iTWireTV interview: We've all felt the effects of supply chain…

RESEARCH & CASE STUDIES

Trend Micro customers lower cyber risk scores through proactive security
GUEST RESEARCH: The newly published report harnesses data from Trend’s…

New Splunk Survey Highlights Financial Impact of Downtime for Australian Businesses
GUEST RESEARCH: New report shows unplanned and cyber incident downtime…

Channel News

AWS announces Generative AI Partner Innovation Alliance
COMPANY NEWS: Amazon Web Services (AWS) has, announced the launch…

Rackspace Technology signs multi-year, strategic collaboration agreement with AWS
AI technology services company Rackspace Technologies has signed a Strategic…

Comments

Home
Latest News
Your IT
Business IT
IT Industry
NEWSLETTER
MAGAZINE
IT People
Government
RSS

Services

Promotional News & Content

Sponsored Announcements

Self Posting

JobZilla IT Jobs

See Newsletter

Our Journalists

Company

About

Contact

Advertising Specs

Advertise NOW

Privacy

Editorial Guidlines& Complaints Handling

Sitemap

Connect

Facebook
Twitter

Cloud Hosting by Digital Pacific