Breaking news

Nvidia’s $5.5B Hit: US Export Ban On AI Chips To China Shakes Global AI Race

Nvidia just took a $5.5 billion punch to the balance sheet—courtesy of the U.S. government’s latest move to tighten the leash on AI chip exports to China. The company’s most advanced processor available in the Chinese market, the H20, has now fallen under indefinite export restrictions, triggering a 6% slide in Nvidia shares in after-hours trading.

The decision, announced Tuesday, marks a major escalation in the U.S.-China tech standoff and underscores Washington’s growing concern over how AI hardware could fuel China’s supercomputing ambitions. The U.S. Commerce Department has now slapped licensing requirements not only on Nvidia’s H20, but also on AMD’s MI308 and similar chips. AMD shares dropped 7% after the news.

A Commerce Department spokesperson said the move reflects President Biden’s directive to safeguard U.S. national and economic security. Nvidia, meanwhile, confirmed the charges would cover unsold H20 inventory, outstanding purchase commitments, and related reserves.

A Workaround, Now Blocked

Nvidia had designed the H20 chip specifically to navigate around previous U.S. export limits—delivering toned-down performance but retaining high-speed interconnectivity. That design made the H20 attractive for AI inference tasks, an increasingly dominant segment of the market where models provide real-time answers rather than undergoing initial training.

Despite not being as powerful as Nvidia’s top-tier chips sold outside China, the H20 gained traction with major Chinese tech players including Tencent, Alibaba, and ByteDance. Reuters previously reported that demand surged after startups like DeepSeek ramped up development of low-cost AI models.

But that very design—optimized for high-bandwidth memory access and chip-to-chip connectivity—set off alarm bells in Washington. Analysts argue it still carries supercomputing potential, especially if deployed at scale.

“Likely In Violation”

A Washington, D.C.-based think tank, the Institute for Progress, didn’t mince words. In a statement Tuesday, it claimed that Tencent had already installed H20 chips in a facility likely used to train large AI models—potentially breaching U.S. export restrictions already in place. The group added that DeepSeek’s infrastructure, used for its latest V3 model, might also be in violation.

U.S. restrictions on chips used in supercomputing have been in effect since 2022. Now, the H20 is joining that list. Nvidia said it was formally notified on April 9 that the chip would require an export license—and on April 14, that the restriction would be indefinite. Whether the U.S. will issue any such licenses remains unclear.

A Fork In The Road

This latest move throws a wrench into Nvidia’s China strategy, just as demand in the region for generative AI tools is accelerating. It also highlights the growing friction between global innovation and geopolitical control—a tension Nvidia CEO Jensen Huang must now navigate carefully.

The setback comes one day after Nvidia unveiled plans to invest up to $500 billion into U.S.-based AI server infrastructure, working with partners like TSMC to align with American industrial policy.

Now, as Nvidia absorbs the financial blow and recalibrates, one thing is clear: the AI chip race isn’t just about performance anymore. It’s a front line in the broader battle over who controls the future of intelligent computing.

Amazon Launches OpenSearch Upgrade To Support AI Agent Workloads

Cloud infrastructure was largely designed around human activity, such as searching, browsing, streaming and interacting with websites. The rise of AI agents is creating a different type of demand, characterized by rapid bursts of automated activity involving database queries, document searches and API calls. As enterprises deploy more AI-powered systems, cloud providers are adapting infrastructure to support increasingly complex machine-to-machine workloads.

Adapting To The New Age Of Agentic Traffic

Recognizing the fundamental shift in traffic patterns, Amazon Web Services (AWS) has reimagined a foundational element of its cloud offering. On Thursday, AWS launched its next generation of OpenSearch Serverless. This advanced, fully managed search and vector database is engineered specifically for agentic workloads, scaling instantly when task bursts occur and minimizing costs by scaling down to zero during idle periods.

Meeting the Demands Of Machine-Generated Traffic

Industry leaders now understand that infrastructure optimized for human-driven internet is ill-suited for the exponential growth of machine-generated traffic. Cloudflare recently reported that bots accounted for 31% of HTTP traffic over the last six months, with AI crawlers and search assistants driving a significant portion of these requests. As Lai Yi Ohlsen, Senior Product Manager at Cloudflare, noted, “Non-human traffic will exceed human traffic sometime in the first half of 2027.”

AI Agents Move Into Production

Recent announcements across the technology sector indicate that AI agents are moving beyond experimentation and into wider commercial use. At Google I/O, Google introduced tools designed to help users delegate tasks such as research and travel planning to AI systems. Businesses are also deploying internal AI agents to automate workflows, increasing the volume of machine-to-machine interactions across enterprise networks.

Technical Changes To OpenSearch

Tia White said the updated platform separates compute resources from storage, allowing capacity to scale more efficiently as demand changes. According to AWS, the model is intended to help organizations manage unpredictable traffic spikes generated by AI systems while reducing infrastructure costs during idle periods.

Integrations and Industry Implications

At launch, OpenSearch Serverless will integrate natively with AI development platforms such as Vercel and Kiro, enabling developers to deploy robust search and vector backends without the overhead of infrastructure management. This innovation aligns with broader industry trends, as companies such as Databricks, Snowflake, Microsoft, and Cloudflare pivot their services to support AI-driven memory and retrieval for enterprise data. As AI adoption accelerates, the pressure for infrastructures that optimize for machine-generated workloads will only intensify.

eCredo
Uol
Aretilaw firm
The Future Forbes Realty Global Properties

Become a Speaker

Become a Speaker

Become a Partner

Subscribe for our weekly newsletter