CoinRithm

Company

Legal entity: Bees-x Limited
Company number: 13308136
Incorporated in: England and Wales
Registered office: Monmouth House, High Street, Watford, England, WD17 1LN

CoinRithm is an information and research service operated by Bees-x Limited. It is not authorised by the Financial Conduct Authority (FCA) to carry out regulated activities, and nothing on this site constitutes financial advice.

© 2026 CoinRithm. All rights reserved.
By when will AIs perform at least as well as humans on GAIA?


AI, Technology, One-Off, 9y
Manifold Markets, No KYC
Current community forecast
Before 2024-06-01: 0%
Leading among 7 outcomes
Forecasters: 26

Question type: multiple choice

Methodology: Play-money forecasting platform

Source type: Forecast

Market data

Updated 7 days ago

Stale
21 Feb 24, 4:36 to 2 Jan 36, 7:59

Trend

Outcome | 24h | Probability
Before 2024-06-01 | | 0%
Before 2025-01-01 | | 0%

Selected outcome

Before 2027-01-01: 92%

Rules

The GAIA benchmark (https://arxiv.org/abs/2311.12983) aims to test for the next level of capability for AI agents.

  • Quoting from the paper: "GAIA proposes real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency. GAIA questions are conceptually simple for humans yet challenging for most advanced AIs: we show that human respondents obtain 92% vs. 15% for GPT-4 equipped with plugins."
  • This market will resolve based on when an AI system performs as well as or better than humans on all 3 levels of the benchmark.
  • I'll use the numbers from Table 4 in the paper: 93.9% on level 1, 91.8% on level 2, and 87.3% on level 3.
  • (I'm using the conjunction of all 3 levels rather than the average to be somewhat conservative about this level being achieved.)
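
The difference between the conjunction rule and an average-based rule can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the thresholds are the three human scores quoted in the rules above, while the function names, the AI scores, and the unweighted mean (which ignores how many questions each level contains) are hypothetical and not part of the market's actual resolution process.

```python
# Human baseline per GAIA level, as quoted in the market rules (Table 4 of the paper).
HUMAN_BASELINE = {1: 93.9, 2: 91.8, 3: 87.3}


def meets_conjunction(ai_scores):
    """True only if the AI matches or beats the human score on every level."""
    return all(ai_scores[level] >= human for level, human in HUMAN_BASELINE.items())


def meets_average(ai_scores):
    """Looser alternative (not used by the market): compare unweighted means.

    Assumption: a plain mean over the three levels, ignoring question counts.
    """
    human_mean = sum(HUMAN_BASELINE.values()) / len(HUMAN_BASELINE)
    ai_mean = sum(ai_scores[level] for level in HUMAN_BASELINE) / len(HUMAN_BASELINE)
    return ai_mean >= human_mean


# Hypothetical AI scores, invented for illustration only.
hypothetical_ai = {1: 96.0, 2: 93.0, 3: 85.0}

print(meets_conjunction(hypothetical_ai))  # False: level 3 is below 87.3
print(meets_average(hypothetical_ai))      # True: mean is about 91.3 vs. 91.0
```

In this made-up case the system would clear an average-based bar while still failing the market's criterion, which is the sense in which the conjunction is the more conservative choice.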

