Department: Engineering Management and Systems Engineering @ GWU

Credits: 3


This course provides students with a foundation in exploring data using the R programming language. Students will learn how to source, manage, transform, and explore a wide variety of data types. Students will also master the fundamental concepts for visualizing and communicating information contained in raw data, including the human psychology of visual information processing. All analyses will be conducted to support reproducibility from raw data to results using RMarkdown. Teaching will involve interactive lectures with plenty of class time spent working on examples and coding. Students will be assessed through quizzes and exams. Throughout the semester, students will work on a research project of their own design to demonstrate mastery of the course’s topics. At the end of the semester, students will submit a final, reproducible report of their project and will give a 5-minute presentation of their findings.

Learning Objectives:

Having successfully completed this course, students will be able to:

  • Import, manipulate, visualize, and export data in R.
  • Conduct a systematic exploratory data analysis (EDA) of different types of data.
  • Apply fundamental principles of visualizing information for exploratory analysis and communication.
  • Wrangle data from its original format into a fit-for-purpose format.
  • Get data off the web and expose data, code, results on the web.
  • Generate fully reproducible reports that contain code, equations, visualizations, and narrative text.


Students should have taken Programming for Analytics or have experience with at least one programming language. If you’re not sure whether you have the necessary prerequisite skills, you can try and get up to speed by completing Assignment 0 before classes start. Once classes start, it may be difficult to keep up without this background, and it may be more beneficial to wait and take this course next year after taking Programming for Analytics in the coming Fall.

EMSE 4197 (CRN 78916): Exploratory Data Analysis - Spring 2020
George Washington University | School of Engineering & Applied Science
Dr. John Paul Helveston | | Wednesdays | 12:45–3:15 PM | District House B205 | |
This work is licensed under a Creative Commons ShareAlike 4.0 International License.
See the licensing page for more details about copyright information.
Content 2020 John Paul Helveston