Cleaning Data Helps Clean the Air

Kelley Donalds
Abstract: In this project, students use a complicated, real-world database and experience firsthand the consequences of inadequate data modeling. The database used in this project was the result of a multimillion-dollar data collection effort undertaken by the U.S. Environmental Protection Agency to set limits on emissions of air pollutants from electric power plants. First, students explore the database to identify design limitations from the perspective of a data analyst. Second, students create a new database design that overcomes the identified problems. In this case study, students develop the skill of inferring usage implications from the design of an existing database. We believe that this skill is valuable and that it differs from the skill of designing a database from scratch.

Keywords: database design, systems analysis and design

