Data Science in Social Research
-
Uwe Engel
and Lena Dahlhaus
About this book
Society is changing – it is becoming more diverse and digital. Social media today plays a central role in human communication. With a changing society, the way social scientists analyze it is also evolving. In recent years, it has become significantly more difficult to encourage people to participate in surveys. Furthermore, digitization opens up data options that go beyond the survey method. Information published online represents valuable digital behavioral traces and provides social researchers with another important source of data alongside their scientific surveys, which in recent years have increasingly been conducted online.
Computational methods have played a central role in social research from the very beginning. This applies today more than ever to data analysis, but now also to data collection. The increasing attention paid to machine learning methods in the statistical analysis of social science data represents a further remarkable development in the analysis of social science data.
This textbook addresses these developments and familiarizes readers with both elementary and more advanced methods of data analysis. Fundamentals and techniques of data management, programming with R, statistical data analysis, descriptive and causal inference, as well as predictive modeling are covered in depth. All methods are exemplified using real data either from survey research, the social media platform Bluesky or a large digital newspaper archive. Thematically, these data are focused on current sociological topics, particularly those related to human happiness, energy transition and climate policy, AI, political attitudes and the rise in right-wing voting.
Data science encompasses more than the algorithms required for data analysis and statistical learning. No less relevant are the rules by which social research and data analysis are conducted, data quality is ensured, and the results are validated. This textbook aims to provide this overview.
- A broad spectrum of methods described in a concrete and understandable way
- Many fully worked-out examples that use real data from empirical social research practice
- Data selection that addresses socially relevant topics
- A constant, close connection between three components: how to program a data analysis in R, how to interpret the results, and how they are statistically justified
Author / Editor information
Uwe Engel is a Professor emeritus of Sociology at the University of Bremen (Germany). He founded the Social Science Methods Centre of Bremen University and is a founding member of the European Association of Methodology (EAM) and the Bremen International Graduate School of Social Sciences (BIGSSS).
Lena Dahlhaus is a lecturer at the Institute of Social Sciences (IFSOL) at the Carl von Ossietzky University of Oldenburg, where she teaches statistics, social research methods and urban sociology to students of sociology and political science.
Topics
-
Download PDFPublicly Available
Frontmatter
I -
Requires Authentication UnlicensedLicensed
Preface
V -
Download PDFPublicly Available
Contents
VII -
Requires Authentication UnlicensedLicensed
Chapter 1 Data Science in Social Research
1 - Part I: Description, Explanation, Prediction
-
Requires Authentication UnlicensedLicensed
Chapter 2 Descriptive Inference
19 -
Requires Authentication UnlicensedLicensed
Chapter 3 Causal Inference
50 - Part II: Using R
-
Requires Authentication UnlicensedLicensed
Chapter 4 Data Management in R
87 -
Requires Authentication UnlicensedLicensed
Chapter 5 R Graphics
144 - Part III: Statistical Methods for Prediction, Classification, Text Analytics
-
Requires Authentication UnlicensedLicensed
Chapter 6 Linear Models
173 -
Requires Authentication UnlicensedLicensed
Chapter 7 Goodness of Fit and Predictive Power of a Linear Model
194 -
Requires Authentication UnlicensedLicensed
Chapter 8 Regression Diagnostics and Corrective Measures
229 -
Requires Authentication UnlicensedLicensed
Chapter 9 Linear Mixed Models
268 -
Requires Authentication UnlicensedLicensed
Chapter 10 Nonlinear Relations
286 -
Requires Authentication UnlicensedLicensed
Chapter 11 Classification
324 -
Requires Authentication UnlicensedLicensed
Chapter 12 Statistical Learning
359 -
Requires Authentication UnlicensedLicensed
Chapter 13 Natural Language Processing
395 -
Requires Authentication UnlicensedLicensed
References
-
Requires Authentication UnlicensedLicensed
R Libraries Used
-
Requires Authentication UnlicensedLicensed
About the Authors
-
Requires Authentication UnlicensedLicensed
Subject Index
-
Manufacturer information:
Walter de Gruyter GmbH
Genthiner Straße 13
10785 Berlin
productsafety@degruyterbrill.com