• Criptomonedas
  • Mercados de Predicción
  • Noticias
  • Trading Agéntico
  • Artículos
  • Ligas

Buscar Criptomonedas

Criptomonedas de tendencia



CoinRithm

Empresa

Entidad legal
Bees-x Limited
Número de empresa
13308136
Constituida en
England and Wales
Domicilio social
Monmouth House, High Street, Watford, England, WD17 1LN

CoinRithm es un servicio de información e investigación operado por Bees-x Limited. No está autorizado por la Financial Conduct Authority (FCA) para realizar actividades reguladas, y nada en este sitio constituye asesoramiento financiero.

Explorar

CriptomonedasMercados de PredicciónNoticiasArtículosAgent ArenaLigas

Funciones

TableroComercio SimuladoTrading AgénticoPortafolioLista de SeguimientoConfiguraciones

Empresa

Sobre NosotrosMetodologiaTérminos de UsoPolítica de PrivacidadPolítica de CookiesDescargo de Responsabilidad

Soporte

Contacto SoporteFAQKit para desarrolladoresDocumentación MCP

Sociales

X (Twitter)FacebookLinkedInTelegramInstagramTikTokYouTube
© 2026 CoinRithm. Reservados todos los derechos.
Disponible en Google PlayDescargar en App Store
  • Inicio
  • MercadosMercados de Predicción
  • Noticias
  • Tablero
  1. Mercados de Predicción
  2. IA
  3. By when will AIs perform at least as well as humans on GAIA?
By when will AIs perform at least as well as humans on GAIA?

By when will AIs perform at least as well as humans on GAIA?

IATecnologíaOne-Off9a
Manifold MarketsManifold MarketsSin KYC
Pronóstico comunitario actual
Before 2024-06-01
Before 2024-06-01 0%
Líder entre 7 opciones
Pronosticadores

26

Tipo de pregunta

multiple choice

Metodología

Play-money forecasting platform

Tipo de fuente

Pronóstico

Datos de mercado

Actualizado hace 7 días

Desactualizado
21 feb 24, 4:362 ene 36, 7:59

Tendencias

Resultado24hProbabilidad
Before 2024-06-01
Before 2024-06-01
0%
Before 2025-01-01
Before 2025-01-01
0%

Resultado elegido

Before 2027-01-0192%

Reglas

The GAIA benchmark (https://arxiv.org/abs/2311.12983) aims to test for the next level of capability for AI agents.

Manifold Markets
  • Quoting from the paper: "GAIA proposes real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency.
  • GAIA questions are conceptually simple for humans yet challenging for most advanced AIs: we show that human respondents obtain 92% vs. 15% for GPT-4 equipped with plugins."
  • This market will resolve based on when an AI system performs as well or better than humans on all 3 of the different levels of the benchmark.
  • I'll use the numbers from Table 4 in paper: 93.9% on level 1, 91.8% on level 2, and 87.3% on level 3.
  • (I'm using the conjunction of all 3 levels rather than the average to be somewhat conservative about this level being achieved.)

Mercados Relacionados

Which company has best AI model end of June?

Which company has best AI model end of June?

344,5 mil €
Anthropic: 88%PolymarketPOLYMARKET
Which company has top AI model end of June? (Style Control On)

Which company has top AI model end of June? (Style Control On)

45,2 mil €
Anthropic: 90%PolymarketPOLYMARKET
Which company has best AI model end of July?

Which company has best AI model end of July?

21,7 mil €
Anthropic: 81%PolymarketPOLYMARKET
When will a non-SpaceX successfully reusable booster be first launched?

When will a non-SpaceX successfully reusable booster be first launched?

6,1 mil €
By Dec 31, 2025: 74%Manifold MarketsMANIFOLD MARKETS
Manifold Markets

GPT 5.6 released by…?

1,1 mil €
11.59pm ET May 31 2026: 0%Manifold MarketsMANIFOLD MARKETS
Manifold Markets

By when will Google add ads to Gemini?

627,7 €
By Jan 1, 2026: 0%Manifold MarketsMANIFOLD MARKETS

Activos en estos temas

BitcoinBTC$62,748.94+1.97%EthereumETH$1,654.07+1.26%SolanaSOL$65.07+1.17%DogecoinDOGE$0.0847+1.10%BNBBNB$596.08+1.54%XRPXRP$1.11+0.04%

Noticias Relacionadas

Anthropic launches Claude Fable 5 with new safeguardsCrypto NewsEU orders Meta to restore WhatsApp access for rival AI chatbotsCrypto NewsJPMorgan plans longer-running AI agents for corporate workflows Crypto NewsOpenAI Files for IPO, Targets Valuation Up to $850BBlockchain.NewsOpenAI confidentially files to go public in the USCointelegraphNvidia expands South Korean AI partnerships across chips, cloud, and robotics Crypto News

Reglas

The GAIA benchmark (https://arxiv.org/abs/2311.12983) aims to test for the next level of capability for AI agents.

Manifold Markets
  • Quoting from the paper: "GAIA proposes real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency.
  • GAIA questions are conceptually simple for humans yet challenging for most advanced AIs: we show that human respondents obtain 92% vs. 15% for GPT-4 equipped with plugins."
  • This market will resolve based on when an AI system performs as well or better than humans on all 3 of the different levels of the benchmark.
  • I'll use the numbers from Table 4 in paper: 93.9% on level 1, 91.8% on level 2, and 87.3% on level 3.
  • (I'm using the conjunction of all 3 levels rather than the average to be somewhat conservative about this level being achieved.)