Credits: 4
Tags: Data science, Python
This course is an introduction to data science for engineers. The data science workflow: acquisition and cleansing, exploration and modeling, prediction and decision making, visualization and presentation. Tools for data science including numerical optimization, the Discrete Fourier Transform, Principal Component Analysis, and probability with a focus on statistical inference and correlation methods. Techniques for different steps in the workflow including outlier detection, filtering, regression, classification, and techniques for avoiding overfitting. Methods for combining domain-agnostic data analysis tools with the types of domain-specific knowledge that are common in engineering. Ethical considerations. Optional topics include classification via neural networks, outlier detection, and Markov chains. Programming projects are in Python.
Prerequisites: MATH 1920 and either CS 1110 or CS 1112. Corequisite: MATH 2940
Key Topics: Numerican Optimization, Fourier Transforms, Principal Component Analysis, Image Processing, Probability, Markov Chains
Semester(s): Fall, Spring
Difficulty: 4/5
Rating: 3/5
Assignments: Weekly problem sets, and biweekly labs and pre-lab assignments.
Exams: Two prelims and one final exam.
Semester(s): Spring
Difficulty: 2.5/5
Rating: 4.5/5
Assignments: Problem sets (involve math and Python programming for simulating)
Exams: 10 minute quiz at the beginning of almost every lecture, one take-home final