This course will focus on preparing me for collecting and cleaning data for downstream analysis and sharing.
One of the major components of a data scientist’s job is to collect and clean data. Whether at a small organization or a major enterprise, the first step in using data is getting, cleaning and understanding the data. In this course I will focus on R packages and a few outside tools that can be used to collect data from a variety of sources, from Excel files to databases like MySQL. Also a variety of formats including JSON, XML, and flat files (.csv, .txt) will be covered.
Here you can find my code during my course.