The Michigan Alzheimer’s Disease Center’s Data Core will be leading a workshop on “Scrubbing and Cleaning Sensitive Data” at the 2020 Michigan Institute for Data Science (MIDAS) annual symposium this November 10 at 2:45pm.
Registration is required here.
Our team will be presenting the following project:
Before analysis, data must be retrieved, scrubbed of identifiable information, cleaned (e.g., addressed missing data, reshaped appropriately), and delivered. Using biomedical and transportation datasets as examples of how this generalizable process works, this workshop will walk attendees through a real-world pipeline used to process and deliver datasets. Documentation and code will be made available through GitLab to allow for coding along with the demonstration. As a result of this workshop, attendees will leave with a practical template for implementing their own a data science pipeline.