High-dimensional time series analysis: unit roots, cointegration and forecasting

Etiënne (Josepha Johannes) Wijler

doi:10.26481/dis.20210114ew

Research output: Thesis › Doctoral Thesis › Internal

Abstract

This research develops new methods for forecasting an economic variable from a (very) large collection of potentially relevant variables. For example, it tries to predict unemployment in the Netherlands from the popularity of Google search queries such as “werkloosheidsuitkering” (unemployment benefit) and “vacatures” (vacancies). Traditional economic models consider the effects of only a few variables at a time. Nowadays, much larger datasets are available that may contain new information to explore. A simple approach would be to put all the data into a single model, but that leaves a lot of room for mistakes. In particular, economic variables such as unemployment are known to display strongly trending behaviour over time: if unemployment was very high in January, it will likely be high in February as well. This kind of behaviour requires careful treatment in statistical models. Therefore, different techniques from the statistics literature were combined to create an estimation method that automatically removes irrelevant variables from the model while respecting the unique (trending) characteristics of the variables in the estimated model.

Original language	English
Awarding Institution	Maastricht University
Supervisors/Advisors	Hecq, Alain (Supervisor); Smeekes, Stephan (Supervisor); Urbain, Jean-Pierre (Supervisor)
Award date	14 Jan 2021
Place of Publication	Maastricht
Publisher	Datawyse / Universitaire Pers Maastricht
Print ISBNs	9789463807456
DOIs	https://doi.org/10.26481/dis.20210114ew
Publication status	Published - 2021

Keywords

big data
high-dimensional statistics
time series
forecasting

Access to Document

10.26481/dis.20210114ew

Full Text: Final published version, 3.31 MB
Abstract: Final published version, 140 KB
Propositions: Final published version, 13.2 KB
Cover: Final published version, 14.3 KB
Valorisation: Final published version, 77.5 KB

Cite this

@phdthesis{2dc869235ea1451a900c171ca09920d8,

title = "High-dimensional time series analysis: unit roots, cointegration and forecasting",

keywords = "big data, high-dimensional statistics, time series, forecasting",

author = "Wijler, {Eti{\"e}nne (Josepha Johannes)}",

year = "2021",

doi = "10.26481/dis.20210114ew",

language = "English",

isbn = "9789463807456",

publisher = "Datawyse / Universitaire Pers Maastricht",

address = "Netherlands",

school = "Maastricht University",

}

TY - THES

T1 - High-dimensional time series analysis

T2 - unit roots, cointegration and forecasting

AU - Wijler, Etiënne (Josepha Johannes)

PY - 2021

Y1 - 2021

KW - big data

KW - high-dimensional statistics

KW - time series

KW - forecasting

U2 - 10.26481/dis.20210114ew

DO - 10.26481/dis.20210114ew

M3 - Doctoral Thesis

SN - 9789463807456

PB - Datawyse / Universitaire Pers Maastricht

CY - Maastricht

ER -