NVIDIA and Azure’s Generative AI Service for Startups Worldwide

NVIDIA introduces an AI foundry service on Microsoft Azure, combining Foundation Models, NeMo framework, and DGX Cloud for tailored generative AI applications, with industry leaders like SAP and Amdocs already leveraging the service.
The service offers curated NVIDIA AI Foundation models such as Nemotron-3 8B, with versions tuned for different use cases, in the Azure AI model catalog. DGX Cloud is also available on Azure Marketplace, giving Azure customers access to scalable AI supercomputing.

NVIDIA introduced an AI foundry service to supercharge the development and tuning of custom generative AI applications for enterprises and startups deploying on Microsoft Azure.

The NVIDIA AI foundry service pulls together three elements — a collection of NVIDIA AI Foundation Models, NVIDIA NeMo™ framework and tools, and NVIDIA DGX™ Cloud AI supercomputing services — that give enterprises an end-to-end solution for creating custom generative AI models. Businesses can then deploy their customized models with NVIDIA AI Enterprise software to power generative AI applications, including intelligent search, summarization and content generation.

Industry leaders SAP SE, Amdocs and Getty Images are among the pioneers building custom models using the service.

“Enterprises need custom models to perform specialized skills trained on the proprietary DNA of their company — their data,” said Jensen Huang, founder and CEO of NVIDIA. “NVIDIA’s AI foundry service combines our generative AI model technologies, LLM training expertise and giant-scale AI factory. We built this in Microsoft Azure so enterprises worldwide can connect their custom model with Microsoft’s world-leading cloud services.”

“Our partnership with NVIDIA spans every layer of the Copilot stack — from silicon to software — as we innovate together for this new age of AI,” said Satya Nadella, chairman and CEO of Microsoft. “With NVIDIA’s generative AI foundry service on Microsoft Azure, we’re providing new capabilities for enterprises and startups to build and deploy AI applications on our cloud.”

Industry Leaders Building Tailored, Timely LLMs

NVIDIA’s AI foundry service can be used to customize models for generative AI-powered applications across industries, including enterprise software, telecommunications and media. Once ready to deploy, enterprises can use a technique called retrieval-augmented generation (RAG) to connect their models with their enterprise data and access new insights.
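
To make the RAG idea concrete, here is a minimal, illustrative sketch in Python. It is not NVIDIA's or Microsoft's implementation and uses none of their APIs. It assumes a toy in-memory document list (`DOCS`), a simple bag-of-words cosine similarity in place of a real embedding model, and hypothetical helper names (`retrieve`, `build_prompt`). The point is only to show the two steps: fetch the most relevant enterprise text, then place it in the prompt sent to the model.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two token-count vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, documents, k=2):
    """Return the k documents most similar to the query.

    A production system would use learned embeddings and a vector index;
    word counts are an assumption made to keep the sketch self-contained.
    """
    q = Counter(tokenize(query))
    ranked = sorted(
        documents,
        key=lambda d: cosine(q, Counter(tokenize(d))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, documents, k=2):
    """Assemble a prompt that grounds the model in the retrieved text."""
    context = "\n".join(f"- {d}" for d in retrieve(query, documents, k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


# Hypothetical enterprise snippets, invented for illustration only.
DOCS = [
    "Invoices are payable within 30 days of the billing date.",
    "Support tickets are answered within one business day.",
    "Employees accrue 1.5 vacation days per month.",
]

if __name__ == "__main__":
    print(build_prompt("When are invoices payable?", DOCS, k=1))
```

The resulting prompt would then be passed to whichever customized model the enterprise has deployed. The model itself is unchanged; only the context it sees at query time differs.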

As the first customer of NVIDIA DGX Cloud on Microsoft Azure, SAP plans to use the service and an optimized RAG workflow with NVIDIA DGX Cloud and NVIDIA AI Enterprise software running on Azure to help customize and deploy Joule®, its new natural language generative AI copilot.

“Joule draws on SAP’s unique position at the nexus of business and technology, and builds on our relevant, reliable and responsible approach to Business AI,” said Christian Klein, CEO and member of the Executive Board of SAP SE. “In partnership with NVIDIA, Joule can help customers unlock the potential of generative AI for their business by automating time-consuming tasks and quickly analyzing data to deliver more intelligent, personalized experiences.”

Amdocs, a leading provider of software and services to communications and media companies, is optimizing models for the Amdocs amAIz framework to speed adoption of generative AI applications and services for telcos globally.

“Generative AI technology presents an incredible opportunity for service providers to reinvent the way they engage with customers,” said Shuky Sheffer, president and CEO at Amdocs. “Leveraging NVIDIA’s and Microsoft’s technology to power the Amdocs amAIz framework will bring new GenAI-powered applications to customers faster and enable them to benefit from the immense potential of generative AI, while also providing enterprise-grade security, reliability and performance.”

Curated, Optimized Models for Custom Generative AI

Customers using the NVIDIA foundry service can choose from several NVIDIA AI Foundation models, including a new family of NVIDIA Nemotron-3 8B models hosted in the Azure AI model catalog. Developers can also access the Nemotron-3 8B models on the NVIDIA NGC™ catalog, as well as community models such as Meta’s Llama 2 models optimized for NVIDIA accelerated computing, which are also coming soon to the Azure AI model catalog.

With 8 billion parameters, the Nemotron-3 8B family includes versions tuned for different use cases and has multilingual capabilities for building custom enterprise generative AI applications.

NVIDIA DGX Cloud Now Available on Microsoft Azure Marketplace

NVIDIA DGX Cloud AI supercomputing is available today on Azure Marketplace. It features instances customers can rent, scaling to thousands of NVIDIA Tensor Core GPUs, and comes with NVIDIA AI Enterprise software, including NeMo, to speed LLM customization.

The addition of DGX Cloud on the Azure Marketplace enables Azure customers to use their existing Microsoft Azure Consumption Commitment credits to speed model development with NVIDIA AI supercomputing and software.

NVIDIA AI Enterprise software is now integrated into Azure Machine Learning, adding NVIDIA’s platform of secure, stable and supported AI and data science software. This brings NeMo and NVIDIA Triton Inference Server™ to Azure’s enterprise-grade AI service.

NVIDIA AI Enterprise is also available on Azure Marketplace, providing businesses worldwide with broad options for production-ready AI development and deployment of custom generative AI applications.

Source: NVIDIA
