Apple is behind in the AI race - and now its researchers say rival technologies 'collapse' and quit easily, too

Dow Jones
06-09

MW Apple is behind in the AI race - and now its researchers say rival technologies 'collapse' and quit easily, too

By Steve Goldstein

Apple is trailing its major rivals in rolling out artificial intelligence-related technologies - but its researchers say the technology may be overhyped anyway.

Apple $(AAPL)$ in a research paper took aim at so-called reasoning models, from the big names in AI - OpenAI, DeepSeek, Anthropic and Alphabet's Google $(GOOGL)$.

With puzzles including the Tower of Hanoi, a classic mathematical puzzle involving stacking disks, Apple tested these models.

"Rather than standard benchmarks (e.g., math problems), we adopt controllable puzzle environments that let us vary complexity systematically - by adjusting puzzle elements while preserving the core logic - and inspect both solutions and internal reasoning," the paper states.

In all of the models, accuracy progressively declines as problem complexity increases, until reaching complete collapse, or zero accuracy.

And not only do the reasoning models fail to get the right answer, they have something of a quitters' mentality. "Near this collapse point, [large reasoning models] begin reducing their reasoning effort (measured by inference-time tokens) as problem complexity increases, despite operating well below generation length limits," the researchers say.

This laziness is most pronounced in the o3-mini variants of OpenAI, and less severe in Anthropic's Claude 3.7 Sonnet.

One other finding was even when Apple told the models the correct algorithm for solving the Tower of Hanoi, their performance didn't improve.

The paper started circulating on social media over the weekend, though there's no release date on it.

An Apple developers conference is due to start on Monday. ChatGPT is used in the Apple Intelligence service that was rolled out, to lackluster reviews, last fall.

-Steve Goldstein

This content was created by MarketWatch, which is operated by Dow Jones & Co. MarketWatch is published independently from Dow Jones Newswires and The Wall Street Journal.

 

(END) Dow Jones Newswires

June 09, 2025 04:26 ET (08:26 GMT)

Copyright (c) 2025 Dow Jones & Company, Inc.

免责声明:投资有风险,本文并非投资建议,以上内容不应被视为任何金融产品的购买或出售要约、建议或邀请,作者或其他用户的任何相关讨论、评论或帖子也不应被视为此类内容。本文仅供一般参考,不考虑您的个人投资目标、财务状况或需求。TTM对信息的准确性和完整性不承担任何责任或保证,投资者应自行研究并在投资前寻求专业建议。

热议股票

  1. 1
     
     
     
     
  2. 2
     
     
     
     
  3. 3
     
     
     
     
  4. 4
     
     
     
     
  5. 5
     
     
     
     
  6. 6
     
     
     
     
  7. 7
     
     
     
     
  8. 8
     
     
     
     
  9. 9
     
     
     
     
  10. 10