Breaking news

OpenAI Releases GDPval Benchmark To Gauge AI Performance Against Human Experts

New Benchmark Sheds Light on AI’s Capabilities

OpenAI has unveiled GDPval, a new benchmark designed to evaluate its AI models against human professionals across a broad spectrum of industries. This initiative represents a critical step in understanding how far today’s AI is from matching or surpassing the work quality of experts in sectors such as healthcare, finance, manufacturing, and government.

Methodology and Industry Scope

The GDPval benchmark focuses on nine major industries contributing to America’s gross domestic product and tests AI performance in 44 distinct occupations—from software engineering to nursing and journalism. In its initial version, GDPval-v0, industry professionals compared reports generated by AI models with those produced by their human counterparts. For instance, investment bankers were tasked with evaluating competitor landscape analyses for the last-mile delivery industry, ensuring that the assessment reflects real-world complexity.

Comparative Performance: AI Advances and Limitations

Results indicate promising progress; OpenAI’s GPT-5-high, an enhanced iteration of its flagship model, achieved a win rate of 40.6% when compared head-to-head with industry veterans. More notably, Anthropic’s Claude Opus 4.1 reached nearly 49% on similar criteria. However, OpenAI acknowledges that these models are not yet positioned to replace human labor entirely, as the current iteration of GDPval covers a narrow slice of actual job responsibilities.

Expert Insights and Future Directions

In a discussion with TechCrunch, OpenAI’s chief economist, Dr. Aaron Chatterji, noted that the benchmark’s favorable outcomes suggest professionals may soon delegate routine tasks to AI. This, he argued, will free up valuable time for focusing on higher-impact work. Industry observer Tejal Patwardhan also expressed optimism, emphasizing the significant performance leap from GPT-4’s 13.7% score to nearly triple that figure with GPT-5.

Benchmarking And The Road To Comprehensive AI Evaluation

While GDPval represents an early milestone, it aligns with a broader effort among Silicon Valley titans to create robust testing frameworks, such as AIME 2025 and GPQA Diamond, that better quantify AI proficiency for real-world applications. OpenAI plans to expand GDPval to encapsulate more industries and interactive workflows, aiming to bolster its claims about AI’s growing economic value.

As the benchmark evolves, GDPval could play an instrumental role in the ongoing debate around artificial general intelligence, highlighting the potential and limitations of AI models poised to reshape the modern workforce.

Trump Discusses Equity Stakes In AI Companies For Public Benefit

Conceptualizing A Public Wealth Initiative

Recent comments by President Donald Trump have drawn attention to discussions around potential government equity stakes in artificial intelligence companies. Speaking about the idea, Trump suggested that such arrangements could allow the American public to benefit from the growth of the AI sector through government-backed ownership structures.

Strategic Conversations With Industry Leaders

Although Trump did not name specific companies, reports have pointed to OpenAI as one of the firms involved in discussions with the administration. CNBC previously reported that the Trump administration had discussed a potential equity stake in OpenAI. The company has also outlined a proposal for a “Public Wealth Fund,” under which a portion of the proceeds could be distributed to citizens.

Government Participation And Broader Political Debate

According to Bloomberg, Trump suggested that Americans could become indirect partners in AI companies through government-backed equity arrangements. The proposal follows previous government interventions in strategic industries, including the acquisition of a 10% stake in Intel. OpenAI CEO Sam Altman has reportedly discussed the possibility of government ownership stakes in major AI companies since early 2025.

Cross-Partisan Interest And Critical Perspectives

The idea has attracted attention from figures across the political spectrum. Senator Bernie Sanders has proposed a one-time 50% stock tax on major AI companies, including OpenAI, Anthropic and xAI, arguing that the economic benefits of AI should be distributed more broadly. Some investors and industry figures, including David Sacks, have expressed cautious support for aspects of the proposal while raising concerns about increasing overlap between government and corporate interests. Additional criticism has come from former Microsoft employee Dare Obasanjo, who argued that certain proposals could resemble government support measures for private companies.

Looking Forward

This emerging dialogue on blending public wealth with private innovation is set against the backdrop of a rapidly evolving AI landscape. As more companies consider public offerings, the debate over how best to harness AI’s economic promise while ensuring broad citizen benefit is likely to intensify, requiring careful regulatory and strategic consideration from both industry leaders and policymakers.

Aretilaw firm
Uol
The Future Forbes Realty Global Properties
eCredo

Become a Speaker

Become a Speaker

Become a Partner

Subscribe for our weekly newsletter