Data analysis bootcamp is for all of you who have to work with data and work with data sets. This is a set of practical, hands-on training where you will learn how to work with shell, git, bash scripts, R, Rstudio, dplyr and ggplot2. After working through this set of tutorials, you will gain confidence to work with your own data sets. If you feel stuck, please come back to revisit this page and ask questions.
We will cover the following topics:
- Basic shell commands
- How to use git for version control
- How to use OpenRefine for data cleaning and data preprocessing
- Working with Rstudio for data analysis using tidydata, dplyr, and ggplot2 for data analysis and graphing
- How to use lavaan for structural equation modelling
- How to write your research using markdown/latex/overleaf/authorea
- How to conduct meta analyses and systematic reviews
Basic Shell commands
Shell is a command line interface where you can issue instructions to your computer in plain text and you can get yoru work done. Depending on the kind of operating system you use, you will need to either install or learn to use a shell already bundled with your OS. In order to work with this tutorial (this will be our first tutorial), you will need to install `bash` on your computer.
- For windows OS, use powershell or if you install git for windows you can use the gitbash that gets installed on your machine
- For Linux and Mac OS, you get bash
You can learn more about shell and practice if you follow the data/software carpentry exercise here:
How will we navigate directories and files in shell?
You will see a `$` sign in the shell. This is known as a command prompt. Run the following prompts and see what it returns. I have provided you the notes relevant to each of the codes that you will run