You are here

The Sage website, including online ordering services, may be unavailable due to system maintenance on 18th January between 2:00 am and 8:30 pm GMT. If you need assistance please contact our Customer Service team. Thank you for your patience and we apologise for the inconvenience.

Disable VAT on Taiwan

Unfortunately, as of 1 January 2020 SAGE Ltd is no longer able to support sales of electronically supplied services to Taiwan customers that are not Taiwan VAT registered. We apologise for any inconvenience. For more information or to place a print-only order, please contact uk.customerservices@sagepub.co.uk.

Discovering Statistics Using R
Share
Share

Discovering Statistics Using R

Additional resources:


March 2012 | 992 pages | SAGE Publications Ltd

Keeping the uniquely humorous and self-deprecating style that has made students across the world fall in love with Andy Field's books, Discovering Statistics Using R takes students on a journey of statistical discovery using R, a free, flexible and dynamically changing software tool for data analysis that is becoming increasingly popular across the social and behavioural sciences throughout the world.

The journey begins by explaining basic statistical and research concepts before a guided tour of the R software environment. Next you discover the importance of exploring and graphing data, before moving onto statistical tests that are the foundations of the rest of the book (for example correlation and regression). You will then stride confidently into intermediate level analyses such as ANOVA, before ending your journey with advanced techniques such as MANOVA and multilevel models. Although there is enough theory to help you gain the necessary conceptual understanding of what you're doing, the emphasis is on applying what you learn to playful and real-world examples that should make the experience more fun than you might expect.

Like its sister textbooks, Discovering Statistics Using R is written in an irreverent style and follows the same ground-breaking structure and pedagogical approach. The core material is augmented by a cast of characters to help the reader on their way, together with hundreds of examples, self-assessment tests to consolidate knowledge, and additional website material for those wanting to learn more.

Given this book's accessibility, fun spirit, and use of bizarre real-world research it should be essential for anyone wanting to learn about statistics using the freely-available R software.

 
Why Is My Evil Lecturer Forcing Me to Learn Statistics?
What will this chapter tell me?

 
What the hell am I doing here? I don't belong here

 
Initial observation: finding something that needs explaining

 
Generating theories and testing them

 
Data collection 1: what to measure

 
Data collection 2: how to measure

 
Analysing data

 
What have I discovered about statistics?

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Everything You Ever Wanted to Know About Statistics (Well, Sort of)
What will this chapter tell me?

 
Building statistical models

 
Populations and samples

 
Simple statistical models

 
Going beyond the data

 
Using statistical models to test research questions

 
What have I discovered about statistics?

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
The R Environment
What will this chapter tell me?

 
Before you start

 
Getting started

 
Using R

 
Getting data into R

 
Entering data with R Commander

 
Using other software to enter and edit data

 
Saving Data

 
Manipulating Data

 
What have I discovered about statistics?

 
R Packages Used in This Chapter

 
R Functions Used in This Chapter

 
Key terms that I've discovered

 
Smart Alex's Tasks

 
Further reading

 
 
Exploring Data with Graphs
What will this chapter tell me?

 
The art of presenting data

 
Packages used in this chapter

 
Introducing ggplot2

 
Graphing relationships: the scatterplot

 
Histograms: a good way to spot obvious problems

 
Boxplots (box-whisker diagrams)

 
Density plots

 
Graphing means

 
Themes and options

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Exploring Assumptions
What will this chapter tell me?

 
What are assumptions?

 
Assumptions of parametric data

 
Packages used in this chapter

 
The assumption of normality

 
Testing whether a distribution is normal

 
Testing for homogeneity of variance

 
Correcting problems in the data

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
 
Correlation
What will this chapter tell me?

 
Looking at relationships

 
How do we measure relationships?

 
Data entry for correlation analysis

 
Bivariate correlation

 
Partial correlation

 
Comparing correlations

 
Calculating the effect size

 
How to report correlation coefficents

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
 
Regression
What will this chapter tell me?

 
An Introduction to regression

 
Packages used in this chapter

 
General procedure for regression in R

 
Interpreting a simple regression

 
Multiple regression: the basics

 
How accurate is my regression model?

 
How to do multiple regression using R Commander and R

 
Testing the accuracy of your regression model

 
Robust regression: bootstrapping

 
How to report multiple regression

 
Categorical predictors and multiple regression

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Logistic Regression
What will this chapter tell me?

 
Background to logistic regression

 
What are the principles behind logistic regression?

 
Assumptions and things that can go wrong

 
Packages used in this chapter

 
Binary logistic regression: an example that will make you feel eel

 
How to report logistic regression

 
Testing assumptions: another example

 
Predicting several categories: multinomial logistic regression

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Comparing Two Means
What will this chapter tell me?

 
Packages used in this chapter

 
Looking at differences

 
The t-test

 
The independent t-test

 
The dependent t-test

 
Between groups or repeated measures?

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Comparing Several Means: ANOVA (GLM 1)
What will this chapter tell me?

 
The theory behind ANOVA

 
Assumptions of ANOVA

 
Planned contrasts

 
Post hoc procedures

 
One-way ANOVA using R

 
Calculating the effect size

 
Reporting results from one-way independent ANOVA

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Analysis of Covariance, ANCOVA (GLM 2)
What will this chapter tell me?

 
What is ANCOVA?

 
Assumptions and issues in ANCOVA

 
ANCOVA using R

 
Robust ANCOVA

 
Calculating the effect size

 
Reporting results

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Factorial ANOVA (GLM 3)
What will this chapter tell me?

 
Theory of factorial ANOVA (independant design)

 
Factorial ANOVA as regression

 
Two-Way ANOVA: Behind the scenes

 
Factorial ANOVA using R

 
Interpreting interaction graphs

 
Robust factorial ANOVA

 
Calculating effect sizes

 
Reporting the results of two-way ANOVA

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Repeated-Measures Designs (GLM 4)
What will this chapter tell me?

 
Introduction to repeated-measures designs

 
Theory of one-way repeated-measures ANOVA

 
One-way repeated measures designs using R

 
Effect sizes for repeated measures designs

 
Reporting one-way repeated measures designs

 
Factorisal repeated measures designs

 
Effect Sizes for factorial repeated measures designs

 
Reporting the results from factorial repeated measures designs

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Mixed Designs (GLM 5)
What will this chapter tell me?

 
Mixed designs

 
What do men and women look for in a partner?

 
Entering and exploring your data

 
Mixed ANOVA

 
Mixed designs as a GLM

 
Calculating effect sizes

 
Reporting the results of mixed ANOVA

 
Robust analysis for mixed designs

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Non-Parametric Tests
What will this chapter tell me?

 
When to use non-parametric tests

 
Packages used in this chapter

 
Comparing two independent conditions: the Wilcoxon rank-sum test

 
Comparing two related conditions: the Wilcoxon signed-rank test

 
Differences between several independent groups: the Kruskal-Wallis test

 
Differences between several related groups: Friedman's ANOVA

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Multivariate Analysis of Variance (MANOVA)
What will this chapter tell me?

 
When to use MANOVA

 
Introduction: similarities and differences to ANOVA

 
Theory of MANOVA

 
Practical issues when conducting MANOVA

 
MANOVA using R

 
Robust MANOVA

 
Reporting results from MANOVA

 
Following up MANOVA with discriminant analysis

 
Reporting results from discriminant analysis

 
Some final remarks

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Exploratory Factor Analysis
What will this chapter tell me?

 
When to use factor analysis

 
Factors

 
Research example

 
Running the analysis with R Commander

 
Running the analysis with R

 
Factor scores

 
How to report factor analysis

 
Reliability analysis

 
Reporting reliability analysis

 
What have I discovered about statistics?

 
R Packages Used in This Chapter

 
R Functions Used in This Chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Categorical Data
What will this chapter tell me?

 
Packages used in this chapter

 
Analysing categorical data

 
Theory of Analysing Categorical Data

 
Assumptions of the chi-square test

 
Doing the chi-square test using R

 
Several categorical variables: loglinear analysis

 
Assumptions in loglinear analysis

 
Loglinear analysis using R

 
Following up loglinear analysis

 
Effect sizes in loglinear analysis

 
Reporting the results of loglinear analysis

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Multilevel Linear Models
What will this chapter tell me?

 
Hierarchical data

 
Theory of multilevel linear models

 
The multilevel model

 
Some practical issues

 
Multilevel modelling on R

 
Growth models

 
How to report a multilevel model

 
What have I discovered about statistics?

 
R packages used in this chapter

 
R functions used in this chapter

 
Key terms that I've discovered

 
Smart Alex's tasks

 
Further reading

 
Interesting real research

 
 
Epilogue: Life After Discovering Statistics
 
Troubleshooting R
 
Glossary
Appendix

 
Table of the standard normal distribution

 
Critical Values of the t-Distribution

 
Critical Values of the F-Distribution

 
Critical Values of the chi-square Distribution

 
 
References

Supplements

Click for online resources

Companion Website to accompany Discovering Statistics Using R

I used the book for a class on socialization research. For most of my students it was the first time that they used R for their statistical analyses. "Discovering Statistics Using R" washed away any fears of formula and generated appetite for discovering all the great possibilities that R offers to students of social phenomena.

Dr Tina Freyburg
Department of Political and Social Sci, European University Institute
June 8, 2012

I will recommend this book to my students who need R or want to use R rather than SPSS - it's as phantastic as the SPSS version in my eyes!

Miss Anna Schewe
Faculty of Psychology & Sport Science, Bielefeld University
June 6, 2012

Very good book!

Mr Martin Krippl
Please select your department, Please select your institution
June 1, 2012

Since I saw the previous edition of this book (SPSS-based) I was hoping and then waiting for the R-based version. And finally it's here having all the great qualities of the previous version.
What I noticed previously is that the kind of narrative presented in these books is very positively received by the students making them read beyond the required chapters only. It's definitely the first book that we will recommend our students to refer in case of R-based data-analysis assignments.

Mr Stanislaw Zabramski
Communications , Uppsala University
May 30, 2012

I think this text is going to make R very much more accessible than before for students. I will publicise it in the MSc student sessions of my course and to MSc dissertation students, and ensure that my colleagues who run the R courses for postgrads are aware of it. I'm indicating the book as supplemental for my own course as this course does not involve any hands-on stats.

I won't be recommending this book to undergraduates at present as they are taught stats entirely via SPSS, but this might change when methodology teaching is reviewed at some time in the future; the existence of this student-friendly text certainly makes this much more of a possibility than previously.

Ms Elizabeth Austin
Dept of Psychology, Edinburgh University
May 11, 2012

I am still reading the book, but I can see I will most definetely adapt it as an essential course. I have never laugh so much reading a stats course. As one of my students said of the SPSS version, "there cannot possibly be anything wrong with a book that quotes Radiohead". I don't feel I am yet ready for providing a full assessment, but there are many things that I like about the book. There is simply no better alternative in the market for introducing fearful social science students into the intimidating environment of R.

Dr JJ Medina
School of Law, Manchester University
May 2, 2012

"Discovering statistics using R" is in my opinion as much a success as it's pendant for SPSS. This book is as well a wonderful instructions for beginners in R (introduciton to workspace, loading data, cran, handling of packages etc.) as a guide for advanced learners. I have read other R-introductions, which could not explain R's essentials as simple as this book does.
Great is the application to the R commander, which is for R-beginners (with SPSS experience) a real help.
On the downside I was disappointed that the additional software RStudio was not recommended, as it improves (in my opinion) the handling of R immensely.
The explanation of statistical procedures are as in it's SPSS pendant very illustrative and it really invites to explore statistics, as a student of my course wrote on a feedback form (this time with R).
A definite bonus is the application of robust methods (which is not given in it's SPSS pendandt), which is a real advantage for more advanced students and researchers.
Eventhough I have the book for 2-3 weeks, I am confident to find more upsides than downsides in this book while using it in my lectures and for my research.

Mr Roscoe Araujo
Psychologische Methodenlehre und experimentelle Psychologie, University of Cologne
May 2, 2012

In short, the book is outstanding: Based on an unique overall funny undertone, the book serves very well as an introduction to R and to the world of statistics in general.

Mr Thomas Schübel
Faculty of Politics and Management, University of Konstanz
May 2, 2012

Whilst the book is well written, readers unfamiliar with R will struggle quite a bit. I'd recommend using it in conjunction with introductory R books, but not on its own.

Dr Sebastian Teicher
Psychology, Nottingham Trent University
May 2, 2012

This is an excellent book and the integration of R into the 'Discovering Statistics' series is a long-awaited move. I have already recommended this book to a number of colleagues and will continue to recommend it to MA/PhD students in the future.

The only downside of this edition is that the book isn't as visually appealing as the SPSS version (there's less colour), but in no way does this detract from the overall quality of the content.

Overall, this book is up-to-date, accessible, and comprehensive. An ideal introductory statistics textbook that also serves as an accessible introduction to a powerful and rapidly-growing software package.

Mr Sam Kirkham
School of English, Sheffield University
April 30, 2012

Sample Materials & Chapters

Chapter One