Yet Another Statistics-for-Linguistics Book
2022/10/28 DRAFT
James Myers
National Chung Cheng University, Graduate Institute of Linguistics
Special features of this book:
- Teaches you how to use the free and powerful R software,
popular among linguists because they're cheap
- Also teaches you how to use Excel, because, let's face it, R can get kind of intimidating
- Lots of terms are translated into Chinese, in case your Chinese is better than your English
- A healthy mix of messy real data sets and nice and clean fake data sets, involving language
teaching, psycholinguistics, phonetics, corpus analysis, typology, child language, and whatever else
I had handy
- Mixed-effects modeling demystified (and guidance in navigating the continuing debates)
- Some but not much ggplot2 or tidyverse, but most people end up copy/pasting examples from the web anyway
- Weird-but-cool stuff like resampling, generalized additive modeling, and Bayesian statistics
- Advice on how to look up stuff yourself, because you're an adult, right?
- Lots of cleverly hidden typos to challenge your critical thinking skills
- Basic biographical info on statisticians (mostly dead white men)
- An update schedule that's about what you'd expect for something done for free in my spare time
- An irritating "humorous" writing style similar to that used in this list
Chapter 1: Why do linguists need statistics? [2022/2/6]
Chapter 2: Data analysis software: Excel and R [2022/2/10]
Chapter 3: Quantifying some familiar ideas: Averages and variation [2022/2/25]
Chapter 4: Probability and hypotheses [2022/3/1]
Chapter 5: Correlation and modeling [2022/3/22]
Chapter 6: Comparing two continuous variables: t tests and beyond [2022/3/22]
Chapter 7: Comparing category sizes: Chi-squared and related tests [2022/4/2]
Chapter 8: Comparing more than two continuous variables:
Introduction to ANOVA [2022/4/8]
Chapter 9: More ANOVA: Repeated measures [2022/5/14]
Chapter XXX: Data exploration: Cluster analysis and related methods [not written yet]
Chapter 10: Modeling continuous variables: Multiple regression [2022/5/14]
Chapter 11: Modeling categorical variables: Logistic regression [2022/5/10]
Chapter 12: Beyond ANOVA, regression, and chi-squared:
Mixed-effects modeling [2022/5/13]
Chapter 13: The future of statistics: Bayesian modeling [2022/5/14]
Statistics resources (including data files)