Breaking news

OpenAI Releases GDPval Benchmark To Gauge AI Performance Against Human Experts

New Benchmark Sheds Light on AI’s Capabilities

OpenAI has unveiled GDPval, a new benchmark designed to evaluate its AI models against human professionals across a broad spectrum of industries. This initiative represents a critical step in understanding how far today’s AI is from matching or surpassing the work quality of experts in sectors such as healthcare, finance, manufacturing, and government.

Methodology and Industry Scope

The GDPval benchmark focuses on nine major industries contributing to America’s gross domestic product and tests AI performance in 44 distinct occupations—from software engineering to nursing and journalism. In its initial version, GDPval-v0, industry professionals compared reports generated by AI models with those produced by their human counterparts. For instance, investment bankers were tasked with evaluating competitor landscape analyses for the last-mile delivery industry, ensuring that the assessment reflects real-world complexity.

Comparative Performance: AI Advances and Limitations

Results indicate promising progress; OpenAI’s GPT-5-high, an enhanced iteration of its flagship model, achieved a win rate of 40.6% when compared head-to-head with industry veterans. More notably, Anthropic’s Claude Opus 4.1 reached nearly 49% on similar criteria. However, OpenAI acknowledges that these models are not yet positioned to replace human labor entirely, as the current iteration of GDPval covers a narrow slice of actual job responsibilities.

Expert Insights and Future Directions

In a discussion with TechCrunch, OpenAI’s chief economist, Dr. Aaron Chatterji, noted that the benchmark’s favorable outcomes suggest professionals may soon delegate routine tasks to AI. This, he argued, will free up valuable time for focusing on higher-impact work. Industry observer Tejal Patwardhan also expressed optimism, emphasizing the significant performance leap from GPT-4’s 13.7% score to nearly triple that figure with GPT-5.

Benchmarking And The Road To Comprehensive AI Evaluation

While GDPval represents an early milestone, it aligns with a broader effort among Silicon Valley titans to create robust testing frameworks, such as AIME 2025 and GPQA Diamond, that better quantify AI proficiency for real-world applications. OpenAI plans to expand GDPval to encapsulate more industries and interactive workflows, aiming to bolster its claims about AI’s growing economic value.

As the benchmark evolves, GDPval could play an instrumental role in the ongoing debate around artificial general intelligence, highlighting the potential and limitations of AI models poised to reshape the modern workforce.

EU Moderates Emissions While Sustaining Economic Momentum

The European Union witnessed a modest decline in greenhouse gas emissions in the second quarter of 2025, as reported by Eurostat. Emissions across the EU registered at 772 million tonnes of CO₂-equivalents, marking a 0.4 percent reduction from 775 million tonnes in the same period of 2024. Concurrently, the EU’s gross domestic product rose by 1.3 percent, reinforcing the ongoing decoupling between economic growth and environmental impact.

Sector-By-Sector Performance

Within the broader statistics on emissions by economic activity, the energy sector—specifically electricity, gas, steam, and air conditioning supply—experienced the most significant drop, declining by 2.9 percent. In comparison, the manufacturing sector and transportation and storage both achieved a 0.4 percent reduction. However, household emissions bucked the trend, increasing by 1.0 percent over the same period.

National Highlights And Notable Exceptions

Among EU member states, 12 reported a reduction in emissions, while 14 saw increases, and Estonia’s figures remained static. Notably, Slovenia, the Netherlands, and Finland recorded the most pronounced declines at 8.6 percent, 5.9 percent, and 4.2 percent respectively. Of the 12 countries reducing emissions, three—Finland, Germany, and Luxembourg—also experienced a contraction in GDP growth.

Dual Achievement: Environmental And Economic Goals

In an encouraging development, nine member states, including Cyprus, managed to lower their emissions while maintaining economic expansion. This dual achievement—reducing environmental impact while fostering economic activity—is a trend that has increasingly influenced EU climate policies. Other nations that successfully balanced these outcomes include Austria, Denmark, France, Italy, the Netherlands, Romania, Slovenia, and Sweden.

Conclusion

As the EU continues to navigate its climate commitments, these quarterly insights underscore a gradual yet significant shift toward balancing emissions reductions with robust economic growth. The evolving landscape highlights the critical need for sustainable strategies that not only mitigate environmental risks but also invigorate economic resilience.

The Future Forbes Realty Global Properties

Become a Speaker

Become a Speaker

Become a Partner

Subscribe for our weekly newsletter