Curriculum

Degree requirements for the MS in Data Science can be found at http://gsas.nyu.edu/bulletin/data-science/ms-data-science.html.

The MS in Data Science (MSDS) curriculum totals 36 credits. A key feature of the curriculum is a capstone project that puts the theoretical knowledge you gain in the program to work in realistic settings. During the project, you will go through the entire process of solving a real-world problem: collecting and processing data, designing the best method to solve the problem, and implementing a solution. The problems and datasets you’ll engage with will come from real-world settings like those you might encounter in industry, academia, or government.

The MSDS also offers students the opportunity to pursue a track. See below for more information.

Required Course Information (For students starting the program in Fall 2017 or later)

Course descriptions can be found in NYU’s Albert Course Search. Recent course pages are linked below.

  1. DS-GA 1001 Introduction to Data Science
  2. DS-GA 1002 Probability and Statistics for Data Science
  3. DS-GA 1003 Machine Learning
  4. DS-GA 1004 Big Data
  5. DS-GA 1006 Capstone Project and Presentation
  6. One Data Science Elective (choose 1 from the list below).
    1. DS-GA 1005 Inference and Representation
    2. DS-GA 1008 Deep Learning
    3. DS-GA 1011 Natural Language Processing with Representation Learning
    4. DS-GA 1012 Natural Language Understanding and Computational Semantics
    5. DS-GA 1013 Optimization-based Data Analysis
    6. Optimization and Computational Linear Algebra

Pre-approved Elective Information

For information on pre-approved electives for the MSDS degree, visit https://cds.nyu.edu/academics/ms-in-data-science/ms-courses/.

Tracks (For students starting the program in Fall 2017 or later)

Data Science Track

In the Data Science track, students take six required courses and six electives chosen from a diverse list of courses.

Data Science Big Data Track

The Data Science Big Data track focuses on the methods and techniques required to acquire, manage, analyze, and visualize large volumes of data. Students will acquire a deep understanding of algorithms and their complexity and gain hands-on experience building end-to-end solutions to computational problems.

Ordinarily, students pursuing this track take 3 of the following courses:

  • DS-GA 1012 Natural Language Understanding and Computational Semantics
  • CS-GY 6313 Information Visualization
  • CS-GY 6323 Large-Scale Visual Analytics
  • One of the following:
    • CS-GY 6083 Principles of Database Systems (Engineering School) or CSCI-GA 2433 Database Systems (Courant Computer Science)
    • CS-GY 6093 Advanced Database Systems (Engineering School) or CSCI-GA 2434 Advanced Database Systems (Courant Computer Science)

Data Science Mathematics and Data Track

The Data Science Mathematics and Data track provides the mathematical background needed to understand and analyze modern data-analysis methods in areas such as deep learning, compressed sensing, high-dimensional statistics, and graph signal processing. In addition, the track provides exposure to fundamental research problems inspired by newly developed data-science techniques.

Ordinarily, students pursuing this track take 2 of the following courses:

  • DS-GA 1013 Optimization-based Data Analysis
  • DS-GA 1005 Inference and Representation
  • CSCI-GA 2945/MATH-GA 2012 Convex and Nonsmooth Optimization

Data Science Natural Language Processing Track

The Data Science Natural Language Processing Track will give students the skills to build machine learning models that can understand, manipulate, or produce data expressed in natural language text.

Ordinarily, students pursuing this track take 2 of the 3 core courses below in any order. However, students are encouraged to complete all 3 core courses.

  • DS-GA 1011 Natural Language Processing with Representation Learning
  • DS-GA 1012 Natural Language Understanding and Computational Semantics
  • CSCI-GA 3033 Statistical NLP

Recommended courses:

  • DS-GA 1005 Inference and Representation
  • DS-GA 1008 Deep Learning
  • DS-GA 3001 Text as Data
  • CSCI-GA 2590 Natural Language Processing
  • Advanced linguistics courses with consent of instructor. Contact Sam Bowman (bowman@nyu.edu) for advice.

Data Science Physics Track

The Data Science Physics Track provides the same solid foundation in data science and further develops modeling and inference skills in the context of compelling, data-intensive physics research topics. This track is ideal for applicants who have some physics background, are interested in transitioning into a career in data science, and wish to leverage those skills for a competitive advantage.

Ordinarily, students pursuing this track take the following courses:

  • DS-GA 1005 Inference and Representation
  • Physics Research (select 2 of the following):
    • PHYS-GA 2091 Experimental Physics Research
    • PHYS-GA 2093 Theoretical Physics Research
    • PHYS-GA 2095 Research Reading
  • Physics Electives (select 2 of the following):
    • PHYS-GA 2000 Computational Physics
    • PHYS-GA 2002 Statistical Physics
    • PHYS-GA 2022 Biophysics
    • PHYS-GA 2059 Special Topics
    • PHYS-GA 2061 Special Topics
    • PHYS-GA 2053 Special Topics in Astrophysics
    • PHYS-GA 2054 Special Topics in Astrophysics
    • PHYS-GA 2017 Phase Transitions and Critical Phenomena

Data Science Biology Track (This track will be available for incoming Fall 2018 students.)

Large datasets are revolutionizing our understanding of basic biology as well as of human health and disease. The Data Science Biology track is for students who want to further develop their computational skills and apply them to the biomedical sciences.

The capstone project for this track will be biology-based and completed with a biology or biotech mentor. Ordinarily, students pursuing this track take the following courses:

  • One CDS elective (DS-GA course)
  • 4 Biology electives

4-Semester Course Plan

This plan is only a guide to help you schedule courses for the degree. The actual course plan may vary depending on when courses are offered.

Year 1 – Fall

Course      Title                                                  Credits
DS-GA 1001  Intro to Data Science                                  3
DS-GA 1002  Statistical and Mathematical Methods for Data Science  3
            General Elective or Track Course                       3
            TOTAL CREDITS                                          9

Year 1 – Spring

Course      Title                                          Credits
DS-GA 1003  Machine Learning and Computational Statistics  3
DS-GA 1004  Big Data                                       3
            General Elective or Track Course               3
            TOTAL CREDITS                                  9

Year 2 – Fall

Course      Title                                                         Credits
            Data Science Elective (see list in required courses section)  3
DS-GA 1006  Capstone Project in Data Science                              3
            General Elective or Track Course                              3
            TOTAL CREDITS                                                 9

Year 2 – Spring

Course      Title                             Credits
            General Elective or Track Course  3
            General Elective or Track Course  3
            General Elective or Track Course  3
            TOTAL CREDITS                     9