What is it about?

In this paper, we propose a robust methodology to assess the value of microblogging data to forecast stock market variables: returns, volatility and trading volume of diverse indices and portfolios. The methodology uses sentiment and attention indicators extracted from microblogs (a large Twitter dataset is adopted) and survey indices (AAII and II, USMC and Sentix), diverse forms to daily aggregate these indicators, usage of a Kalman Filter to merge microblog and survey sources, a realistic rolling windows evaluation, several Machine Learning methods and the Diebold-Mariano test to validate if the sentiment and attention based predictions are valuable when compared with an autoregressive baseline.

Featured Image

Why is it important?

We found that Twitter sentiment and posting volume were relevant for the forecasting of returns of S&P 500 index, portfolios of lower market capitalization and some industries. Additionally, KF sentiment was informative for the forecasting of returns. Moreover, Twitter and KF sentiment indicators were useful for the prediction of some survey sentiment indicators. These results confirm the usefulness of microblogging data for financial expert systems, allowing to predict stock market behavior and providing a valuable alternative for existing survey measures with advantages (e.g., fast and cheap creation, daily frequency).

Perspectives

This is a robust empirical study that applies state-of-the art sentiment analysis lexicon and machine learning methods to predict a large range of stock market variables: returns, volatility, trading volume and survey sentiment indices (AAII and II). Using Twitter data, interesting results were achieved for the forecasting of returns for some stocks and to predict some survey indices.

Dr Paulo Cortez
University of MInho

Read the Original

This page is a summary of: The impact of microblogging data for stock market prediction: using Twitter to predict returns, volatility, trading volume and survey sentiment indices, Expert Systems with Applications, December 2016, Elsevier,
DOI: 10.1016/j.eswa.2016.12.036.
You can read the full text:

Read

Resources

Contributors

The following have contributed to this page