• Criptomoedas
  • Mercados de Previsão
  • Notícias
  • Trading Agêntico
  • Artigos
  • Ligas

Pesquisar Criptomoedas

Criptomoedas em tendência



CoinRithm

Empresa

Entidade legal
Bees-x Limited
Número da empresa
13308136
Constituída em
England and Wales
Sede registrada
Monmouth House, High Street, Watford, England, WD17 1LN

CoinRithm é um serviço de informação e pesquisa operado pela Bees-x Limited. Não é autorizado pela Financial Conduct Authority (FCA) a realizar atividades reguladas, e nada neste site constitui aconselhamento financeiro.

Explorar

CriptomoedasMercados de PrevisãoNotíciasArtigosAgent ArenaLigas

Recursos

Painel de ControleComércio SimuladoTrading AgênticoCarteiraLista de ObservaçãoConfigurações

Empresa

Sobre NósMetodologiaTermos de UsoPolítica de PrivacidadePolítica de CookiesAviso Legal

Suporte

Apoio ao ClienteFAQKit para desenvolvedoresDocumentação MCP

Redes Sociais

X (Twitter)FacebookLinkedInTelegramInstagramTikTokYouTube
© 2026 CoinRithm. Direitos reservados.
Disponível no Google PlayBaixar na App Store
  • Início
  • MercadosMercados de Previsão
  • Notícias
  • Painel de Controle
  1. Mercados de Previsão
  2. IA
  3. By when will AIs perform at least as well as humans on GAIA?
By when will AIs perform at least as well as humans on GAIA?

By when will AIs perform at least as well as humans on GAIA?

IATecnologiaOne-Off9a
Manifold MarketsManifold MarketsSem KYC
Previsão da comunidade atual
Before 2024-06-01
Before 2024-06-01 0%
Líder entre 7 opções
Previsores

26

Tipo de pergunta

multiple choice

Metodologia

Play-money forecasting platform

Tipo de fonte

Previsão

Dados do mercado

Atualizado há 7 dias

Desatualizado
21/02/24, 4:362/01/36, 7:59

Tendências

Resultado24hProbabilidade
Before 2024-06-01
Before 2024-06-01
0%
Before 2025-01-01
Before 2025-01-01
0%

Resultado escolhido

Before 2027-01-0192%

Regras

The GAIA benchmark (https://arxiv.org/abs/2311.12983) aims to test for the next level of capability for AI agents.

Manifold Markets
  • Quoting from the paper: "GAIA proposes real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency.
  • GAIA questions are conceptually simple for humans yet challenging for most advanced AIs: we show that human respondents obtain 92% vs. 15% for GPT-4 equipped with plugins."
  • This market will resolve based on when an AI system performs as well or better than humans on all 3 of the different levels of the benchmark.
  • I'll use the numbers from Table 4 in paper: 93.9% on level 1, 91.8% on level 2, and 87.3% on level 3.
  • (I'm using the conjunction of all 3 levels rather than the average to be somewhat conservative about this level being achieved.)

Mercados Relacionados

Which company has best AI model end of June?

Which company has best AI model end of June?

344,5 mil €
Anthropic: 88%PolymarketPOLYMARKET
Which company has top AI model end of June? (Style Control On)

Which company has top AI model end of June? (Style Control On)

45,2 mil €
Anthropic: 90%PolymarketPOLYMARKET
Which company has best AI model end of July?

Which company has best AI model end of July?

21,7 mil €
Anthropic: 81%PolymarketPOLYMARKET
When will a non-SpaceX successfully reusable booster be first launched?

When will a non-SpaceX successfully reusable booster be first launched?

6,1 mil €
By Dec 31, 2025: 74%Manifold MarketsMANIFOLD MARKETS
Manifold Markets

GPT 5.6 released by…?

1,1 mil €
11.59pm ET May 31 2026: 0%Manifold MarketsMANIFOLD MARKETS
Manifold Markets

By when will Google add ads to Gemini?

627,7 €
By Jan 1, 2026: 0%Manifold MarketsMANIFOLD MARKETS

Ativos nestes tópicos

BitcoinBTC$62,748.94+1.97%EthereumETH$1,654.07+1.26%SolanaSOL$65.07+1.17%DogecoinDOGE$0.0847+1.10%BNBBNB$596.08+1.54%XRPXRP$1.11+0.04%

Notícias Relacionadas

Anthropic launches Claude Fable 5 with new safeguardsCrypto NewsEU orders Meta to restore WhatsApp access for rival AI chatbotsCrypto NewsJPMorgan plans longer-running AI agents for corporate workflows Crypto NewsOpenAI Files for IPO, Targets Valuation Up to $850BBlockchain.NewsOpenAI confidentially files to go public in the USCointelegraphNvidia expands South Korean AI partnerships across chips, cloud, and robotics Crypto News

Regras

The GAIA benchmark (https://arxiv.org/abs/2311.12983) aims to test for the next level of capability for AI agents.

Manifold Markets
  • Quoting from the paper: "GAIA proposes real-world questions that require a set of fundamental abilities such as reasoning, multi-modality handling, web browsing, and generally tool-use proficiency.
  • GAIA questions are conceptually simple for humans yet challenging for most advanced AIs: we show that human respondents obtain 92% vs. 15% for GPT-4 equipped with plugins."
  • This market will resolve based on when an AI system performs as well or better than humans on all 3 of the different levels of the benchmark.
  • I'll use the numbers from Table 4 in paper: 93.9% on level 1, 91.8% on level 2, and 87.3% on level 3.
  • (I'm using the conjunction of all 3 levels rather than the average to be somewhat conservative about this level being achieved.)