使用Google趋势跟踪实时经济活动.docx
《使用Google趋势跟踪实时经济活动.docx》由会员分享,可在线阅读,更多相关《使用Google趋势跟踪实时经济活动.docx(59页珍藏版)》请在淘文阁 - 分享文档赚钱的网站上搜索。
1、Table of contentsTracking activity in real time with Google Trends61. Introduction and Summary62. The COVID-19 crisis called for the use of high-frequency indicators73. Exploiting the full potential of Google Trends94. A neural panel model of GDP growth114.1. From quarterly GDP growth to a weekly tr
2、acker: a bridge model of GDP growth124.2. A non-linear algorithm124.3. To pool or not to pool: a panel nowcasting model for 46 countries145. How well can the OECD Weekly Tracker nowcast the economy?156. Model insights: a dive into the black box206.1. Shapley values: explaining machine learning with
3、game theory206.2. A dive into the model inner workings216.3. From Shapley values to sectoral insights237. The OECD Weekly Tracker237.1. The COVID-19 crisis: a week-by-week analysis247.2. Latest insights from the Weekly Tracker: a stalling recovery below 2019 levels307.3. Consumption volume remains s
4、ubdued while its composition has shifted31References35Annex A. Data pre-processing and data issues40Additional details44Annex B. Additional results47TablesTable 1. Standard indicators were outpaced by the crisis8Forecast performance18FiguresQueries in Google Trends: beyond keyword searches11Figure 1
5、. Nowcasting GDP growth with Google trends (M-1 forecast)15Trackers predictions for Q2 202017Figure 2. The OECD Weekly Tracker and Google Mobility19Most important variables and their contributions to predictions22Figure 3. Partial dependence plots22The neural network can be thought of as an alternat
6、ive to using dynamic factors or principal components as it reduces the dimensionality to a number of intermediate components in the middle layer before making a prediction. The multi-layer structure helps avoid overfitting. As opposed to PCA, it allows for capturing non-linear relationships. Variabl
7、es are pre-processed using normalisation. The main caveat of neural network is their black-box nature, which is addressed using machine learning interpretability techniques in section 6.24. While a vast research has focused on the inclusion of the GT indicator as an explanatory variable in conventio
8、nal autoregressive models, papers have used factor models of multiple GT categories (Vosen and Schmidt, 2011(31); Balakrishnan and Dixit, 2013(32). Other papers used linear shrinkage methods such as Ridge (Ferrara and Simoni, 201M) or Spike-and-Slab (Scott and Varian, 2014(33). Fewer papers have use
9、d non-linear methods: Burdeau and Kinzler (2017(34) experimented with Support Vector Machines (SVMs) and boosting and reported better results from non-linear approaches.25. Neural networks have had attracted little attention from macroeconomists compared to tree-based methods such as Random Forests,
10、 mostly because of the small size of macroeconomic data. By providing variables comparable across countries for a large number of countries and at a high frequency, Google Trends creates opportunities for using a wider array of algorithms and econometric methods, be it for prediction or policy analy
11、sis.Box 1, Training the neural networkAdditional details on the training on the neural network algorithm.Architecture and technical details. The neural network algorithm used in this paper is a standard multi-layer perceptron implemented with most of the default parameters in Python statistical soft
12、ware scikit-learn. It includes two hidden layers of respectively 100 and 20 neurons. Each neuron uses a rehJ activation function. The activation function takes a weighted sum of input signals (the variables values) and yields the linear combination of inputs provided it is higher than a given thresh
13、old. The weights and threshold are optimised using stochastic gradient descent.Standard Scaler. It has become an industry standard to scale the variable values prior to fitting the algorithm. Early experiments proved that Quantile Scalers performed badly especially when it comes to predicting extrem
14、e values in times of crises. Standard Scalers do not treat extreme values as outliers and thus allow better performance around downturns.Ensemble. Neural networks are notoriously sensitive to the initial random parameters. The choice of random parameters proved to have a strong effect on the results
15、. In order to curb the effect of that randomness, the tracker uses an ensemble of five neural networks initialised with random parameters, whose predictions are averaged over.Hyper-parameter optimisation. The use of gridsearch for hyperparameter optimisation was purposely avoided. Even when perfomed
16、 on a training set prior to the forecast simulations, gridsearch can lead to overfitting the validation set1: users may experiment with many parameter grids and simulation settings leading to biased simulation results. In order to prevent that issue, parameter optimisation was performed through a si
17、mple trial-and-error process aiming at finding a good level of fit with reasonable performance rather than at maximising goodness-of-fit. An additional guarantee against overfitting the validation set is provided by the generality of the model over a large number of countries, which reduces the like
18、liness of bias in performance measurement caused by ad hoc hyper-parameter selection.25.3. To pool or not to pool: a panel nowcasting model for 46 countriesThe panel nature of the data raises the question of whether to pool countries together or run country-wise models. Country-specific models seem
19、more intuitive as various levels of internet penetration, habits, culture, demography and institutions could explain possibly large differences in country-specific elasticities, but would involve many more variables (248) than observations (61). Conversely, pooling countries together increases the s
20、ample size and thus estimation accuracy The alternative between the two options can thus be thought of as bias-variance trade-off: introducing some bias by using average elasticities rather than country-specific ones allows to substantially reduce the variance of the estimator and increases overall
21、predictive accuracy.26. The panel nature of the data raises the question of whether to pool countries together or run country-wise models. Country-specific models seem more intuitive as various levels of internet penetration, habits, culture, demography and institutions could explain possibly large
22、differences in country-specific elasticities, but would involve many more variables (248) than observations (61). Conversely, pooling countries together increases the sample size and thus estimation accuracy The alternative between the two options can thus be thought of as bias-variance trade-off: i
23、ntroducing some bias by using average elasticities rather than country-specific ones allows to substantially reduce the variance of the estimator and increases overall predictive accuracy.27. This paper uses a neural panel model, which exploits a large sample of observations from 46 countries while
24、capturing cross-country heterogeneity. Neural networks are able to handle heterogeneity in the data as long as country dummies are included. A neural network whose architecture incudes an intermediate layer with enough neurons (in our case, 100) can flexibly model each possible interaction between G
- 配套讲稿:
如PPT文件的首页显示word图标,表示该PPT已包含配套word讲稿。双击word图标可打开word文档。
- 特殊限制:
部分文档作品中含有的国旗、国徽等图片,仅作为作品整体效果示例展示,禁止商用。设计者仅对作品中独创性部分享有著作权。
- 关 键 词:
- 使用 Google 趋势 跟踪 实时 经济 活动
限制150内