The More the Merrier? A Machine Learning Algorithm for Optimal Pooling of Panel Data

We leverage insights from machine learning to optimize the tradeoff between bias and variance when estimating economic models using pooled datasets. Specifically, we develop a simple algorithm that estimates the similarity of economic structures across countries and selects the optimal pool of count...

Full description

Bibliographic Details
Main Author: Bolhuis, Marijn
Other Authors: Rayner, Brett
Format: eBook
Language:English
Published: Washington, D.C. International Monetary Fund 2020
Series:IMF Working Papers
Subjects:
Online Access:
Collection: International Monetary Fund - Collection details see MPG.ReNa
Description
Summary:We leverage insights from machine learning to optimize the tradeoff between bias and variance when estimating economic models using pooled datasets. Specifically, we develop a simple algorithm that estimates the similarity of economic structures across countries and selects the optimal pool of countries to maximize out-of-sample prediction accuracy of a model. We apply the new alogrithm by nowcasting output growth with a panel of 102 countries and are able to significantly improve forecast accuracy relative to alternative pools. The algortihm improves nowcast performance for advanced economies, as well as emerging market and developing economies, suggesting that machine learning techniques using pooled data could be an important macro tool for many countries
Physical Description:21 pages
ISBN:9781513529974