TY - JOUR
T1 - Hybrid Content Analysis
T2 - Toward a Strategy for the Theory-driven, Computer-assisted Classification of Large Text Corpora
AU - Baden, Christian
AU - Kligler-Vilenchik, Neta
AU - Yarchi, Moran
N1 - Publisher Copyright: © 2020 Taylor & Francis Group, LLC.
PY - 2020/7/2
Y1 - 2020/7/2
N2 - Given the scale of digital communication, researchers face a painful trade-off between powerful, scalable computational strategies, and the theoretical sensitivity offered by small-scale manual analyses. Especially in the study of natural discourse on digital media, the interactive, ever-evolving stream of conversations across multiple platforms regularly defies efforts to obtain well-defined samples of manageable size, while their linguistic variability imposes major limitations upon the accuracy of automated tools. In this paper, we draw upon recent advances in computational text analysis to develop a hybrid approach to the deductive analysis of large-scale digital discourse, which combines the algorithmic extraction of coherent, recurrent patterns with a manual coding of identified patterns. The approach scales up to treat millions of texts at minimal added human effort, while affording researchers close control over the process of theory-guided classification. We demonstrate the power of Hybrid Content Analysis by studying polarization in a quarter of a million contributions from cross-platform interactive social media discourse about a controversial incident.
AB - Given the scale of digital communication, researchers face a painful trade-off between powerful, scalable computational strategies, and the theoretical sensitivity offered by small-scale manual analyses. Especially in the study of natural discourse on digital media, the interactive, ever-evolving stream of conversations across multiple platforms regularly defies efforts to obtain well-defined samples of manageable size, while their linguistic variability imposes major limitations upon the accuracy of automated tools. In this paper, we draw upon recent advances in computational text analysis to develop a hybrid approach to the deductive analysis of large-scale digital discourse, which combines the algorithmic extraction of coherent, recurrent patterns with a manual coding of identified patterns. The approach scales up to treat millions of texts at minimal added human effort, while affording researchers close control over the process of theory-guided classification. We demonstrate the power of Hybrid Content Analysis by studying polarization in a quarter of a million contributions from cross-platform interactive social media discourse about a controversial incident.
UR - http://www.scopus.com/inward/record.url?scp=85089575581&partnerID=8YFLogxK
U2 - https://doi.org/10.1080/19312458.2020.1803247
DO - https://doi.org/10.1080/19312458.2020.1803247
M3 - مقالة
SN - 1931-2458
VL - 14
SP - 165
EP - 183
JO - Communication Methods and Measures
JF - Communication Methods and Measures
IS - 3
ER -