CoinRithm

Company

Legal entity: Bees-x Limited
Company number: 13308136
Registered in: England and Wales
Registered office: Monmouth House, High Street, Watford, England, WD17 1LN

CoinRithm is an information and research service of Bees-x Limited. The company is not authorised by the Financial Conduct Authority (FCA) to carry out regulated activities, and nothing on this website constitutes financial advice.

By when will AIs perform at least as well as humans on GAIA?


Tags: AI, Technology, One-Off, 9Y
Source: Manifold Markets (no KYC)

Current community forecast

Before 2024-06-01: 0%
Leading among 7 options

Forecasters: 26
Question type: multiple choice
Methodology: Play-money forecasting platform
Source type: Forecast

Market data

Updated 7 days ago (stale)
21 Feb 24, 4:36 / 2 Jan 36, 7:59

Trends

Outcome / 24h / Probability
Before 2024-06-01: 0%
Before 2025-01-01: 0%

Selected outcome

Before 2027-01-01: 92%

Rules

The GAIA benchmark (https://arxiv.org/abs/2311.12983) aims to test for the next level of capability for AI agents.

  • Quoting from the paper: "GAIA proposes real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency. GAIA questions are conceptually simple for humans yet challenging for most advanced AIs: we show that human respondents obtain 92% vs. 15% for GPT-4 equipped with plugins."
  • This market will resolve based on when an AI system performs as well as or better than humans on all 3 levels of the benchmark.
  • I'll use the numbers from Table 4 in the paper: 93.9% on level 1, 91.8% on level 2, and 87.3% on level 3.
  • (I'm using the conjunction of all 3 levels rather than the average to be somewhat conservative about this level being achieved.)
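
The resolution rule above can be sketched in a few lines of Python. This is only an illustration, not how the market creator actually checks results: the function names and the example AI scores are hypothetical, and the "average" shown is a plain unweighted mean of the three levels, which may differ from how the paper computes its overall score. The thresholds are the Table 4 numbers quoted in the rules.

```python
# Human scores per GAIA level, as quoted in the market rules (Table 4 of the paper).
HUMAN_BASELINE = {1: 93.9, 2: 91.8, 3: 87.3}


def meets_human_level(ai_scores):
    """Conjunction rule: the AI must match or beat humans on every level."""
    return all(ai_scores[level] >= human for level, human in HUMAN_BASELINE.items())


def mean_score(scores):
    """Unweighted mean across levels (an assumption made for this sketch)."""
    return sum(scores.values()) / len(scores)


# Hypothetical AI scores, invented purely for illustration.
example_scores = {1: 96.0, 2: 93.0, 3: 85.0}

beats_average = mean_score(example_scores) >= mean_score(HUMAN_BASELINE)
beats_all_levels = meets_human_level(example_scores)
print(beats_average, beats_all_levels)
```

With these made-up scores the system is ahead on the unweighted mean but still short on level 3, so the market would not resolve under the conjunction rule. That is the sense in which the conjunction is the more conservative criterion.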

Related markets

  • Which company has best AI model end of July? €21,120.1. Anthropic: 82% (Polymarket)
  • Which company has the best Coding AI model end of June? €7,223.2. Anthropic: 95% (Polymarket)
  • Will any AI model reach ___ Overall Arena Score by June 30? €3,746.4. 1510: 28% (Polymarket)
  • GPT 5.6 released by…? €947.9. 11.59pm ET May 31 2026: 0% (Manifold Markets)
  • Will the Technological Singularity occur by January 1st, 2050? €349.5. Yes: 50% (Manifold Markets)
  • Elon's Tesla promises, Q1 26 Prop Bets. €129.9. At least 100 Cybercabs produced in 2026: 99% (Manifold Markets)


Related news

  • Anthropic launches Claude Fable 5 with new safeguards (Crypto News)
  • EU orders Meta to restore WhatsApp access for rival AI chatbots (Crypto News)
  • JPMorgan plans longer-running AI agents for corporate workflows (Crypto News)
  • OpenAI Files for IPO, Targets Valuation Up to $850B (Blockchain.News)
  • OpenAI confidentially files to go public in the US (Cointelegraph)
  • Nvidia expands South Korean AI partnerships across chips, cloud, and robotics (Crypto News)
