5 Data Science
Figure 5.1 shows the data science pipeline depicted in the "Whole game" section of R4DS (2e), as of 2024-01-23. In this chapter, we demonstrate scripts that implement this pipeline.
The named steps in this pipeline are:
- Import: Bring data into R/RStudio
- Tidy: Restructure the data for analysis
- Transform: Prepare data for downstream analysis
- Visualize: Create graphics
- Model: Build a statistical model
- Communicate: Disseminate your results
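
To make the steps above concrete, here is a minimal sketch that touches each one in order. It is an illustration only, not the scripts developed in this chapter: the data are made up, the column names (`id`, `height_2023`, `height_2024`) are hypothetical, and it deliberately uses base R so that it runs without installing packages. In practice the tidyverse packages discussed in R4DS would be a natural choice for the same steps.

```r
# Import: read a small, made-up CSV (inline text stands in for a file on disk)
csv_text <- "id,height_2023,height_2024
a,10,12
b,20,23
c,30,35"
raw <- read.csv(text = csv_text, stringsAsFactors = FALSE)

# Tidy: one row per (id, year) observation instead of one column per year
tidy <- reshape(
  raw,
  direction = "long",
  varying   = c("height_2023", "height_2024"),
  v.names   = "height",
  timevar   = "year",
  times     = c(2023, 2024),
  idvar     = "id"
)
rownames(tidy) <- NULL

# Transform: derive a variable that is convenient for modelling
tidy$years_since_start <- tidy$year - min(tidy$year)

# Visualize: a quick scatter plot of the tidy data
plot(height ~ years_since_start, data = tidy,
     xlab = "Years since first measurement", ylab = "Height")

# Model: a simple linear model of height against elapsed time
fit <- lm(height ~ years_since_start, data = tidy)

# Communicate: report the estimated coefficients
print(round(coef(fit), 2))
```

Each block maps to one named step, so any step can be swapped for a richer implementation (for example, reading a real file in the import step) without disturbing the others.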