Using statistical software rather than a spreadsheet
(The following information is based on (Dunn 2025).)
Many people use spreadsheets (such as Microsoft Excel) rather than statistical software. But using spreadsheets requires extreme care; many extremely expensive and dangerous errors have been made through using spreadsheets (AlTarawneh and Thorne 2017), including problems when reporting the 2020 COVID-19 pandemic.
These problems emerge for different reasons:
- Spreadsheets often change the entered data (for example, reformatting entries as dates if the spreadsheet thinks the data should be a date) in the interest of being ‘user-friendly’, even when not appropriate. This has had dire consequences (Ziemann, Eren, and El-Osta 2016).
- Many spreadsheets have errors in formulas (R. R. Panko and Sprague Jr 1998), but these are incredible difficult to locate and hence fix (Galletta et al. (1996), R. Panko (2016)).
- Spreadsheets do not leave a record of how the data have been analysed; for example, formulas can be very difficult to understand and parse. Keeping a record of the analysis, new variables that have been created, and other operations with the data is called reproducible research. Reproducibility ensures, among other advantages, that the results can be checked by others.
- Excel has bugs (Keeling and Pavur (2004), Mélard (2014)) even in very basic operations (Berger (2007), Hargreaves and McWilliams (2010)). After trying to fix these bugs, sometimes they are made even worse (McCullough and Wilson 2002).
Spreadsheets can be used for research and analysis… but you must be very careful! Many of the problems are due to human error, and some emerge because Excel is being used for purposes it is not really designed for (i.e. scientific analysis).
jamovi
We strongly recommend installing the most recent stable, desktop (not online) version of jamovi.
SPSS
All recent versions of SPSS work in similar ways. SPSS is installed in computer labs and available using USC Anywhere.