By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
inkeinspires.cominkeinspires.cominkeinspires.com
Notification Show More
Font ResizerAa
  • Home
  • Breaking News
    Breaking NewsShow More
    Pro-EU centrist wins Romanian presidential race over hard-right nationalist
    May 18, 2025
    Ukraine’s Zelenskyy meets with U.S. officials, European leaders ahead of Trump-Putin call
    May 18, 2025
    Former US President Biden diagnosed with aggressive prostate cancer | Health News
    May 18, 2025
    Mexican navy cadet identified as victim in fatal Brooklyn Bridge ship collision
    May 18, 2025
    The market just gave investors a gift. Here’s how not to blow it
    May 18, 2025
  • Business
    BusinessShow More
    EOG Resources, Inc. (EOG) Awarded Oil Exploration Concession for UAE Shale Block
    May 18, 2025
    Sean Connery villa in French Riviera hits market for $26.4 million
    May 18, 2025
    Pro-EU centrist wins Romanian presidency in stunning reversal
    May 18, 2025
    Trump’s tariffs may mean Walmart shoppers pay more, his Treasury chief acknowledges
    May 18, 2025
    Costco Wholesale Corporation (COST) Places Limits on Customer Gold Purchases
    May 18, 2025
  • Entertainment
    EntertainmentShow More
    Jack Black Ruined His Role In A Sylvester Stallone Sci-Fi Movie
    May 18, 2025
    Gene Hackman’s ‘Royal Tenenbaums’ Salary Dispute with Wes Anderson
    May 18, 2025
    Kai Cenat Side-Eyes His Mom Over THIS Bold Stream Request
    May 18, 2025
    Ezra Miller Makes Shocking Reappearance After Years Of Scandal
    May 18, 2025
    Hulu’s Only Murders In The Building Could Have Had A Completely Different Vibe
    May 18, 2025
  • Gadgets
    GadgetsShow More
    CES 2025: 41 Products You Can Buy Right Now
    January 13, 2025
    I can’t wait try out these 3 great plant tech gadgets that I saw at CES 2025
    January 13, 2025
    6 on Your Side Consumer Confidence: Kitchen gadgets to upgrade family recipes – ABC 6 News
    January 13, 2025
    35+ Best New Products, Tech and Gadgets
    January 13, 2025
    These gadgets kept me connected and working through a 90-mile backpacking trip
    January 13, 2025
  • Health
    HealthShow More
    Retro Running: Benefits Of Running Backward inkeinspires
    May 18, 2025
    The 80/20 Rule In Running: Train Smarter, Not Harder inkeinspires
    May 18, 2025
    How To Push Through Gym Anxiety inkeinspires
    May 17, 2025
    Tips And Exercises For Grip Strength inkeinspires
    May 17, 2025
    Is The 70/30 Rule Key To Gym Progress? Let’s Explore inkeinspires
    May 17, 2025
  • Sports
    SportsShow More
    Liverpool want to sign £38m star ahead of Man Utd
    May 18, 2025
    Scudetto hangs in the balance after Napoli and Inter Milan held to dramatic draws
    May 18, 2025
    Howe unsure of Isak fitness for final day after Newcastle’s defeat at Arsenal
    May 18, 2025
    Overseas Replacements Heat Up Ahead of May 17 Restart
    May 18, 2025
    LSG vs SRH Head-to-Head Records- IPL 2025, Match 61
    May 18, 2025
  • Technology
    TechnologyShow More
    Watch NVIDIA CEO Jensen Huang deliver the opening keynote today
    May 18, 2025
    Premier League Soccer: Stream Leicester vs. Ipswich Live From Anywhere
    May 18, 2025
    Grok says it’s ‘skeptical’ about Holocaust death toll, then blames ‘programming error’
    May 18, 2025
    Netflix has figured out a way to make ads even worse using AI
    May 18, 2025
    ‘Love Island USA’ Season 7: Release Date and Time on Peacock
    May 18, 2025
  • Posts
    • Post Layouts
    • Gallery Layouts
    • Video Layouts
    • Audio Layouts
    • Post Sidebar
    • Review
      • User Rating
    • Content Features
    • Table of Contents
  • Contact US
  • Pages
    • Blog Index
    • Search Page
    • Customize Interests
    • My Bookmarks
    • 404 Page
Reading: A new, challenging AGI test stumps most AI models
Share
Font ResizerAa
inkeinspires.cominkeinspires.com
  • Entertainment
Search
  • Home
  • Categories
    • Breaking News
    • Business
    • Sports
    • Technology
    • Entertainment
    • Gadgets
    • Health
  • Contact
Have an existing account? Sign In
Follow US
inkeinspires.com > Technology > A new, challenging AGI test stumps most AI models
Technology

A new, challenging AGI test stumps most AI models

MTHANNACH
Last updated: March 25, 2025 12:43 am
MTHANNACH Published March 25, 2025
Share
SHARE

The Arc Prize Foundation, a non -profit organization co -founded by the researcher of the eminent AI François Chollet, announced in a blog On Monday, he created a difficult new test to measure the general intelligence of the main models of AI.

Until now, the new test, called Arc-Agi-2, has perplexed most models.

The “reasoning” models like O1-Pro of Openai and the Deepseek R1 score between 1% and 1.3% on Arc-Agi-2, according to the Arc prices classification. Powerful unreal models, including GPT-4.5, Claude 3.7 Sonnet and Gemini 2.0 Flash score of around 1%.

Arc-agi tests consist of puzzle-type problems where an AI must identify the visual models from a collection of squares of different colors and generate the correct “response” grid. The problems have been designed to force an AI to adapt to new problems that he has never seen before.

The Arc Prize Foundation had more than 400 people taking Arch-Agi-2 to establish a human reference base. On average, the “panels” of these people obtained 60% of the test questions – much better than the models of the models.

An example of an arc-ag-2 question (credit: arc price).

In a PublishChollet affirmed that Carte-Agi-2 is a better measure of the real intelligence of an AI model than the first iteration of the test, Arc-Agi-1. ARC Price Foundation Tests aim to assess whether an AI system can effectively acquire new skills outside the data on which it has been trained.

Chollet said that unlike Arc -Agi -1, the new test prevents AI models from relying on the “brute force” – an extensive calculation power – to find solutions. Chollet previously recognized that it was a major arc-agi-1 defect.

To approach the faults of the first test, Arc-Agi-2 has a new metric: efficiency. It also requires models to interpret the models on the fly instead of counting on memorization.

“Intelligence is not only defined by the ability to solve problems or achieve high scores,” wrote the co-founder of the Arc Foundation blog. “The efficiency with which these capacities are acquired and deployed is a crucial and decisive component. The main question asked is not only ” [the] Competence to solve a task? But also, “to what efficiency or at this cost?” »»

ARC-AGI-1 was undefeated for about five years until December 2024, when Openai published its advanced reasoning model, O3, which outperformed all the other models of AI and has adorned human performance on the evaluation. However, as we noted at the time, O3’s performance gains on Arc-Agi-1 came with a high price.

The version of the O3 model of Openai-O3 (Bas)-which was the first to reach new peaks on Arc-Agi-1, marking 75.7% in the test, obtained a meager 4% on Arc-Agi-2 using $ 200 of computing power per task.

Comparison of the performance of the AI ​​Frontier model on Arc-Agi-1 and Arc-Agi-2 (Credit: ARC price).

The arrival of Arc-Agi-2 occurs because many in the technology industry call for new unsaturated benchmarks to measure the progress of AI. Thomas Wolf, co-founder of Hugging Face, recently told Techcrunch that the AI ​​industry did not have enough tests to measure the key features of artificial artificial intelligence, including creativity.

In addition to the new reference, the Arc Prize Foundation has announced A new 2025 Arc competitionDifficult developers to reach an 85% precision on the Arc-Agi-2 test while spending only $ 0.42 per task.

You Might Also Like

The Best Mushroom Coffee, WIRED Tested and Reviewed (2025)

Today’s NYT Connections: Sports Edition Hints, Answers for April 13 #202

‘I’m going to have him run out of here by inauguration day’: Steve Bannon escalates his war with Elon Musk

These Are the Best Smart Devices for Amazon Alexa in 2025

30% Off Design Within Reach Promo Code | May 2025

Share This Article
Facebook X Email Print
Leave a Comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Subscribe to Our Newsletter
Subscribe to our newsletter to get our newest articles instantly!
loader

Email Address*

Name

Follow US

Find US on Social Medias
FacebookLike
XFollow
YoutubeSubscribe
TelegramFollow

Weekly Newsletter

Subscribe to our newsletter to get our newest articles instantly!
[mc4wp_form]
Popular News
Sports

Chevron Championship: Ariya Jutanugarn duffs chip at final regulation hole before Mao Saigo wins chaotic five-way play-off | Golf News

MTHANNACH MTHANNACH April 28, 2025
Marseille to consider Paul Pogba move in the summer
Donald Trump escalates US-Canada trade war with additional 25% tariff on steel and aluminium
U.S. Treasury yields: tariffs-led sell-off continues
Violence Sweeps Coastal Syria, Sowing Chaos: ‘We Have to Get Out of Here’
- Advertisement -
Ad imageAd image
Global Coronavirus Cases

Confirmed

0

Death

0

More Information:Covid-19 Statistics

Categories

  • Business
  • Breaking News
  • Entertainment
  • Technology
  • Health
  • Sports
  • Gadgets
We influence 20 million users and is the number one business and technology news network on the planet.
Quick Link
  • My Bookmark
  • InterestsNew
  • Contact Us
  • Blog Index
Top Categories
  • Entertainment

Subscribe US

Subscribe to our newsletter to get our newest articles instantly!

 

All Rights Reserved © Inkinspires 2025
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?