AI models lag behind AGI-level reasoning despite recent advances
Apple researchers have found that leading AI models still fall short of humanlike reasoning, casting doubt on claims that artificial general intelligence (AGI) is imminent.
In a June paper titled “The Illusion of Thinking,” Apple’s team evaluated major large reasoning models (LRMs), including those behind ChatGPT and Claude, using custom puzzle games rather than standard coding or math benchmarks.
While recent AI updates show gains on conventional tests, the study found that these benchmarks fail to capture broader reasoning capabilities.
The researchers tested both “thinking” and “non-thinking” model variants and found performance dropped sharply as task complexity increased.
“We found that LRMs have limitations in exact computation: they fail to use explicit algorithms and reason inconsistently across puzzles,” the paper stated.
The researchers also observed that the models tend to overthink, often reaching a correct solution early and then drifting into flawed reasoning as the response continued.
These patterns suggest current models imitate reasoning without internalising it, lacking the kind of generalisable thinking associated with AGI.
The study concluded that existing approaches may be hitting fundamental limits in replicating human reasoning.
This analysis contrasts with optimistic forecasts from figures like OpenAI CEO Sam Altman and Anthropic CEO Dario Amodei.
“We are now confident we know how to build AGI as we have traditionally understood it,” Altman said in January.
Amodei has predicted that AI could exceed human ability by 2026 or 2027.
Apple’s findings suggest such projections may underestimate the complexity of achieving genuine general intelligence in machines.