AI and the Future of Skills, Volume 2: Methods for Evaluating AI Capabilities

Bibliographic Details
Corporate Author: Organisation for Economic Co-operation and Development
Format: eBook
Language: English
Published: Paris, OECD Publishing, 2023
Series: Educational Research and Innovation
Subjects: Education
Online Access: https://doi.org/10.1787/a9fe53cb-en
Collection: OECD Books and Papers (for collection details, see MPG.ReNa)
LEADER 02761nmm a2200277 u 4500
001 EB002202551
003 EBX01000000000000001339754
005 00000000000000.0
007 cr|||||||||||||||||||||
008 240412 ||| eng
020 |a 9789264420359 
020 |a 9789264824294 
020 |a 9789264817326 
245 0 0 |a AI and the Future of Skills, Volume 2  |h Electronic resource  |b Methods for Evaluating AI Capabilities  |c Organisation for Economic Co-operation and Development 
260 |a Paris  |b OECD Publishing  |c 2023 
300 |a 180 p.  |c 21 x 28cm 
505 0 |a Assessing AI capabilities on occupational tests -- Eliciting expert knowledge: Methods and challenges -- Foreword -- Executive summary -- AI direct tests: LNE and NIST evaluations -- Project goals, constraints and next steps -- A framework for characterising evaluation instruments of AI performance -- Assessing AI capabilities with education tests -- Overview -- Towards a synthesis of language capability in humans and AI -- Occupational tests 
653 |a Education 
710 2 |a Organisation for Economic Co-operation and Development 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OECD  |a OECD Books and Papers 
490 0 |a Educational Research and Innovation 
024 8 |a 10.1787/a9fe53cb-en 
856 4 0 |a oecd-ilibrary.org  |u https://doi.org/10.1787/a9fe53cb-en  |x Publisher  |3 Full text 
082 0 |a 370 
520 |a As artificial intelligence (AI) expands its scope of applications across society, understanding its impact becomes increasingly critical. The OECD's AI and the Future of Skills (AIFS) project is developing a comprehensive framework for regularly measuring AI capabilities and comparing them to human skills. The resulting AI indicators should help policymakers anticipate AI's impacts on education and work. This volume describes the second phase of the project: exploring three different approaches to assessing AI. First, the project explored the use of education tests for the assessment by asking computer experts to evaluate AI's performance on the OECD's tests in reading, mathematics and science. Second, the project extended the rating of AI capabilities to tests used to certify workers for occupations. These tests present complex practical tasks and are potentially useful for understanding the application of AI in the workplace. Third, the project explored measures from direct AI evaluations. It commissioned experts to develop methods for selecting high-quality direct measures, categorising them according to AI capabilities and systematising them into single indicators. The report discusses the advantages and challenges in using these approaches and describes how they will be integrated into developing indicators of AI capabilities.