STAT 5870 - Big Data Analysis Using Python


This course has three main goals: (i) students learn basic statistical programming in Python; (ii) students learn to use Python as a tool to effectively store, manipulate, and gain insight from data; (iii) students learn parallel and distributed computing with Python. The usefulness of Python for data science stems mainly from the large and active ecosystem of third-party packages. Therefore, students will also learn how to use popular packages: NumPy, Pandas, Scipy, Matplotlib, and Scikit-Learn.

Note: Open to upper-level undergraduate and graduate students.

Prerequisites/Corequisites: Prerequisite: STAT 5850 or CS 5821, with a grade of "B" or better or instructor approval and a suitable laptop

Credits: 3 hours



Print-Friendly Page (opens a new window)