Document details

The Computational Content Analyst: Using Machine Learning to Classify Media Messages

New York; London: Routledge (2025), ix, 133 pp.

Contains index

ISBN 978-1-00-351423-7; 978-1-03-284630-9 (pbk)

"Most digital content, whether it be thousands of news articles or millions of social media posts, is too large for the naked eye alone. Often, the advent of immense datasets requires a more productive approach to labeling media beyond a team of researchers. This book offers practical guidance and Python code to traverse the vast expanses of data--significantly enhancing productivity without compromising scholarly integrity. We'll survey a wide array of computer-based classification approaches, focusing on easy-to-understand methodological explanations and best practices to ensure that your data is being labeled accurately and precisely. By reading this book, you should leave with an understanding of how to select the best computational content analysis methodology to your needs for the data and problem you have. This guide gives researchers the tools they need to amplify their analytical reach through the integration of content analysis with computational classification approaches, including machine learning and the latest advancements in generative artificial intelligence (AI) and large language models (LLMs). It is particularly useful for academic researchers looking to classify media data and advanced scholars in mass communications research, media studies, digital communication, political communication, and journalism." (Publisher description)
1 Unveiling Content Analysis in the Contemporary Media Ecosystem, 1
2 Designing a Computational Content Analysis: An Illustration from “Civic Engagement, Social Capital, and Ideological Extremity”, 16
3 Basic Information Retrieval for Content Analysis, 32
4 Supervised Machine Learning with BERT for Content Analysis, 46
5 Text Classification of News Media Content Categories Using Deep Learning, 67
6 Leveraging Generative AI for Content Analysis, 87
7 Topic Modeling as a Lens for Discovery, 101
8 Extending Deep Learning to Image Content Analysis, 114
Appendix A: Codebook and Conceptual Definitions, 127
Appendix B: Deletion Themes, 128