Invariance analyses in large-scale studies

The most commonly applied procedures to establish invariance of cognitive and non-cognitive scales across countries in large-scale surveys are developed within the framework of confirmatory factor analysis and item response theory. The criteria that are commonly applied to evaluate the fit of such m...

Full description

Bibliographic Details
Main Author: Van de Vijver, Fons J. R.
Other Authors: Avvisati, Francesco, Davidov, Eldad, Eid, Michael
Format: eBook
Language:English
Published: Paris OECD Publishing 2019
Series:OECD Education Working Papers
Subjects:
Online Access:
Collection: OECD Books and Papers - Collection details see MPG.ReNa
LEADER 02956nma a2200289 u 4500
001 EB002074557
003 EBX01000000000000001214647
005 00000000000000.0
007 cr|||||||||||||||||||||
008 220928 ||| eng
100 1 |a Van de Vijver, Fons J. R. 
245 0 0 |a Invariance analyses in large-scale studies  |h Elektronische Ressource  |c Fons J. R., Van de Vijver ... [et al] 
260 |a Paris  |b OECD Publishing  |c 2019 
300 |a 110 p 
653 |a Education 
700 1 |a Avvisati, Francesco 
700 1 |a Davidov, Eldad 
700 1 |a Eid, Michael 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OECD  |a OECD Books and Papers 
490 0 |a OECD Education Working Papers 
024 8 |a /10.1787/254738dd-en 
856 4 0 |a oecd-ilibrary.org  |u https://doi.org/10.1787/254738dd-en  |x Verlag  |3 Volltext 
082 0 |a 370 
520 |a The most commonly applied procedures to establish invariance of cognitive and non-cognitive scales across countries in large-scale surveys are developed within the framework of confirmatory factor analysis and item response theory. The criteria that are commonly applied to evaluate the fit of such models, such as the decrement of the Comparative Fit Index in confirmatory factor analysis, work normally well in the comparison of a small number of countries or groups, but can perform poorly in large-scale surveys featuring a large number of countries. More specifically, the common criteria often result in the non-rejection of metric invariance; however, the step from metric invariance (i.e. identical factor loadings across countries) to scalar invariance (i.e. identical intercepts, in addition to identical factor loadings) appears to set overly restrictive standards for scalar invariance (i.e. identical intercepts).  
520 |a Large-scale surveys such as the Programme for International Student Assessment (PISA), the Teaching and Learning International Survey (TALIS), and the Programme for the International Assessment of Adult Competences (PIAAC) use advanced statistical models to estimate scores of latent traits from multiple observed responses. The comparison of such estimated scores across different groups of respondents is valid to the extent that the same set of estimated parameters holds in each group surveyed. This issue of invariance of parameter estimates is addressed in model fit indices which gauge the likelihood that one set of parameters can be used across all groups. Therefore, the problem of scale invariance across groups of respondents can typically be framed as the question of how well a single model fits the responses of all groups. However, the procedures used to evaluate the fit of these models pose a series of theoretical and practical problems.  
520 |a This report sets out to identify and apply novel procedures to evaluate model fit across a large number of groups, or novel scaling models that are more likely to pass common model fit criteria