Nvidia to supply up to 1 million AI chips to Amazon in major cloud deal

Nvidia is set to supply up to one million AI chips to Amazon’s cloud division, Amazon Web Services (AWS), in a multi-year deal extending through 2027.
The agreement, under which deliveries begin this year, represents one of the largest infrastructure commitments in the AI ecosystem to date. It covers not only GPUs but also a broader mix of Nvidia technologies, such as networking chips and inference-focused hardware.
AWS plans to deploy a mix of Nvidia chips across its data centres, using different processors for training and inference workloads. The approach reflects the increasing complexity of AI infrastructure, where performance gains come from integrated, multi-chip architectures rather than reliance on a single processor type.
The partnership comes amid surging demand for AI compute, driven by large language models, generative AI applications, and enterprise adoption of cloud-based AI services. Hyperscalers like Amazon are scaling infrastructure aggressively to meet these requirements, with chip supply becoming a critical constraint.
For Nvidia, the deal reinforces its central role in powering the AI boom, as cloud providers continue to depend on its hardware for both training and inference workloads. The company is also expanding its portfolio beyond GPUs to include networking and specialized AI chips, reflecting a broader shift toward full-stack infrastructure offerings.
From an AI and martech perspective, the development highlights how compute capacity is emerging as a key competitive differentiator. As enterprises increasingly rely on cloud-based AI tools, the ability of providers like AWS to secure large-scale chip supply will directly influence performance, pricing, and service availability.
The deal underscores a broader industry trend in which partnerships between chipmakers and cloud providers are becoming foundational to scaling AI, shaping the next phase of growth in both infrastructure and enterprise AI adoption.