Tidy

Author

Jarad Niemi

To tidy data means to convert it from whatever its current format is to a format where it can be analyzed. This analysis may be in the form of summary statistics, visualizations, modeling, and predictions. To perform these analyses, the data generally needs to be in a format where each row is a single observation and each column is a variable. In this part, we discuss many of the operations performed while tidying the data including reading files, joining data sets, and transforming data.