As AI agents evolve from simple chatbots into complex, autonomous assistants, the tech industry faces a new challenge: Is there enough computing power to support them? With AI agents poised to become integral in various industries, computational demands are rising rapidly.
A recent Barclays report forecasts that the AI industry can support between 1.5 billion and 22 billion AI agents, potentially revolutionizing white-collar work. However, the increase in AI’s capabilities comes at a cost. AI agents, unlike chatbots, generate significantly more tokens—up to 25 times more per query—requiring far greater computing power.
Follow THE FUTURE on LinkedIn, Facebook, Instagram, X and Telegram
Tokens, the fundamental units of generative AI, represent fragmented parts of language to simplify processing. This increase in token generation is linked to reasoning models, like OpenAI’s o1 and DeepSeek’s R1, which break tasks into smaller, manageable chunks. As AI agents process more complex tasks, the tokens multiply, driving up the demand for AI chips and computational capacity.
Barclays analysts caution that while the current infrastructure can handle a significant volume of agents, the rise of these “super agents” might outpace available resources, requiring additional chips and servers to meet demand. OpenAI’s ChatGPT Pro, for example, generates around 9.4 million tokens annually per subscriber, highlighting just how computationally expensive these reasoning models can be.
In essence, the tech industry is at a critical juncture. While AI agents show immense potential, their expansion could strain the limits of current computing infrastructure. The question is, can the industry keep up with the demand?