This week, we will learn some techniques for dealing with messy data. Most of the time, the raw data you get is in the wrong format and variables are not properly coded, so you will need to “clean” the data before starting any analysis. Other times the data will be split into two or more data sets, and you’ll need to “join” them together into a single data frame. The readings below discuss some clever techniques for how to address these situations.


EMSE 4197 (CRN 78916): Exploratory Data Analysis - Spring 2020
George Washington University | School of Engineering & Applied Science
Dr. John Paul Helveston | jph@gwu.edu | Wednesdays | 12:45–3:15 PM | District House B205 | |
This work is licensed under a Creative Commons ShareAlike 4.0 International License.
See the licensing page for more details about copyright information.
Content 2020 John Paul Helveston