Bitget App
Trade smarter
Buy cryptoMarketsTradeFuturesBotsEarnCopy
AI models lag behind AGI-level reasoning despite recent advances

AI models lag behind AGI-level reasoning despite recent advances

GrafaGrafa2025/06/10 05:00
By:Mahathir Bayena

Apple researchers have found that leading AI models still fall short of humanlike reasoning, casting doubt on claims that artificial general intelligence (AGI) is imminent.

In a June paper titled The Illusion of Thinking, Apple’s team evaluated major large reasoning models (LRMs), including ChatGPT and Claude, using custom puzzle games rather than standard coding or math benchmarks.

While recent AI updates show gains on conventional tests, the study found that these benchmarks fail to capture broader reasoning capabilities.

The researchers tested both “thinking” and “non-thinking” model variants and found performance dropped sharply as task complexity increased.

“We found that LRMs have limitations in exact computation: they fail to use explicit algorithms and reason inconsistently across puzzles,” the paper stated.

Additionally, they observed that AI has a propensity to overthink, frequently beginning with accurate solutions before deviating into flawed reasoning as the responses developed.

These patterns suggest current models imitate reasoning without internalising it, lacking the kind of generalisable thinking associated with AGI.

The study concluded that existing approaches may be hitting fundamental limits in replicating human reasoning.

This analysis contrasts with optimistic forecasts from figures like OpenAI CEO Sam Altman and Anthropic CEO Dario Amodei.

“We are now confident we know how to build AGI as we have traditionally understood it,” Altman said in January.

Amodei predicted AGI might exceed human ability by 2026 or 2027.

Apple’s findings suggest such projections may underestimate the complexity of achieving genuine general intelligence in machines.

0

Disclaimer: The content of this article solely reflects the author's opinion and does not represent the platform in any capacity. This article is not intended to serve as a reference for making investment decisions.

PoolX: Locked for new tokens.
APR up to 10%. Always on, always get airdrop.
Lock now!

You may also like

Bitcoin Buying Surge Driven by U.S. Investors

Bitcoin sees a strong buying trend among U.S. investors, signaling a healthy recovery pattern after recent corrections.A Healthy Post-Correction RallyWhat This Means for the Crypto Market

Coinomedia2025/06/10 21:00
Bitcoin Buying Surge Driven by U.S. Investors

Here’s Why BlockDAG’s $293M Presale Makes It the Best Crypto to Buy, Bitcoin Holds, and Ondo Plays It Safe

Looking for the best crypto to buy right now? Discover how BlockDAG is disrupting the market with massive growth past $293M raised, while Bitcoin (BTC) holds firm, and Ondo builds trust.BlockDAG Powers Ahead with $293M Raised in Presale!Bitcoin (BTC): The Bedrock of CryptoOndo Finance: Bridging TradFi With Tokenized BondsFinal Thoughts

Coinomedia2025/06/10 21:00
Here’s Why BlockDAG’s $293M Presale Makes It the Best Crypto to Buy, Bitcoin Holds, and Ondo Plays It Safe

Guggenheim Taps XRP Ledger for Digital Debt Expansion

Guggenheim partners with Ripple to bring digital debt products to the XRP Ledger, signaling confidence in blockchain finance.Ripple Partnership Powers Blockchain IntegrationWhat It Means for Crypto and Traditional Finance

Coinomedia2025/06/10 21:00
Guggenheim Taps XRP Ledger for Digital Debt Expansion