To know AI capabilities throughout these cognitive skills, we suggest a three-stage analysis protocol that benchmarks system efficiency in relation to human capabilities:
- Consider AI programs throughout a broad suite of cognitive duties protecting every capability, utilizing held-out take a look at units to stop information contamination
- Accumulate human baselines for a similar duties from a demographically consultant pattern of adults
- Map every AI system’s efficiency relative to the distribution of human efficiency in every capability
Going from concept to apply
Defining these cognitive skills is an important first step, however we want greater than a framework to measure progress. To place this concept into apply, we’re launching a brand new Kaggle hackathon — “Measuring progress towards AGI: Cognitive skills”. The hackathon encourages the neighborhood to design evaluations for 5 cognitive skills the place the analysis hole is the biggest: studying, metacognition, consideration, government capabilities and social cognition.
Contributors can use Kaggle’s newly launched Group Benchmarks platform to construct and take a look at their evaluations in opposition to a lineup of frontier fashions.
We’re providing a complete prize pool of $200,000: $10,000 awards for the highest two submissions in every of the 5 tracks, and $25,000 grand prizes for the 4 best possible general submissions. Submissions are open March 17 by means of April 16, and we’ll announce the outcomes June 1. Head over to the Kaggle web site to begin constructing.







