Organizations need to extract and analyze data from all available sources in order to make informed decisions. This course provides an overview of the practice of data science, with an emphasis on data cleaning, the data life cycle and data processing. Real-world and fragmented data will be examined. You will learn how to use statistical methods and the R programming language to analyze data sets. Experiential learning activities combined with hands-on practice sessions will give you practical skills in computer programming and data cleaning.
Upon successful completion of this course, you will be able to:
- Define and describe the concepts of data science
- Explain the role of a data scientist in the real-world environment
- Clean, transform and analyze data
- Clean data and improve data quality for reporting and analytics
- Differentiate between supervised and unsupervised approaches to statistical learning
- Demonstrate the use of linear regression
- Apply statistical methods such as cross-validation and bootstrap
- Apply model selection and regularization to data sets
- Program using the R programming language
Additional Requirements
To be successful in this program, it is recommended that you have an undergraduate degree or college diploma, a recent statistics course and basic experience or understanding of a programming language.
Applies Towards the Following Certificates
- Data Science Certificate: Required Courses
*Course details are subject to change.