Abstract
Researchers in finance and adjacent fields have increasingly been working with textual data, a common challenge being analysing the content of a text. Traditionally, this task has been approached through labour- and computation-intensive work with lists of words. In this article we compare word list analysis with an easy-to-implement and computationally efficient alternative called semantic fingerprinting. Using the prediction of stock return correlations as an illustration, we show semantic fingerprinting to produce superior results. We argue that semantic fingerprinting significantly reduces the barrier to entry for research involving textual content analysis, and we provide guidance on implementing this technique.
Original language | American English |
---|---|
Pages (from-to) | 2719-2735 |
Number of pages | 17 |
Journal | Applied Economics |
Volume | 49 |
Issue number | 28 |
DOIs | |
State | Published - 15 Jun 2017 |
Keywords
- Textual analysis
- industries
- semantic fingerprint
- stock returns
All Science Journal Classification (ASJC) codes
- Economics and Econometrics