What is the role of data in jobs in the United Kingdom, Canada, and the United States? A natural language processing approach

This paper estimates the data intensity of occupations/sectors (i.e. the share of job postings per occupation/sector related to the production of data) using natural language processing (NLP) on job advertisements in the United Kingdom, Canada and the United States. Online job advertisement data col...

Full description

Bibliographic Details
Main Author: Schmidt, Julia
Other Authors: Pilgrim, Graham, Mourougane, Annabelle
Format: eBook
Language:English
Published: Paris OECD Publishing 2023
Series:OECD Statistics Working Papers
Subjects:
Online Access:
Collection: OECD Books and Papers - Collection details see MPG.ReNa
LEADER 02436nma a2200301 u 4500
001 EB002182320
003 EBX01000000000000001319807
005 00000000000000.0
007 cr|||||||||||||||||||||
008 231010 ||| eng
100 1 |a Schmidt, Julia 
245 0 0 |a What is the role of data in jobs in the United Kingdom, Canada, and the United States?  |h Elektronische Ressource  |b A natural language processing approach  |c Julia, Schmidt, Graham, Pilgrim and Annabelle, Mourougane 
260 |a Paris  |b OECD Publishing  |c 2023 
300 |a 45 p.  |c 21 x 28cm 
653 |a Employment 
653 |a United Kingdom 
653 |a Economics 
653 |a United States 
653 |a Canada 
700 1 |a Pilgrim, Graham 
700 1 |a Mourougane, Annabelle 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OECD  |a OECD Books and Papers 
490 0 |a OECD Statistics Working Papers 
028 5 0 |a 10.1787/fa65d29e-en 
856 4 0 |a oecd-ilibrary.org  |u https://doi.org/10.1787/fa65d29e-en  |x Verlag  |3 Volltext 
082 0 |a 330 
520 |a This paper estimates the data intensity of occupations/sectors (i.e. the share of job postings per occupation/sector related to the production of data) using natural language processing (NLP) on job advertisements in the United Kingdom, Canada and the United States. Online job advertisement data collected by Lightcast provide timely and disaggregated insights into labour demand and skill requirements of different professions. The paper makes three major contributions. First, indicators created from the Lightcast data add to the understanding of digital skills in the labour market. Second, the results may advance the measurement of data assets in national account statistics. Third, the NLP methodology can handle up to 66 languages and can be adapted to measure concepts beyond digital skills. Results provide a ranking of data intensity across occupations, with data analytics activities contributing most to aggregate data intensity shares in all three countries. At the sectoral level, the emerging picture is more heterogeneous across countries. Differences in labour demand primarily explain those variations, with low data-intensive professions contributing most to aggregate data intensity in the United Kingdom. Estimates of investment in data, using a sum of costs approach and sectoral intensity shares, point to lower levels in the United Kingdom and Canada than in the United States