Data Science Courses and Modules Overview

data science n.w
1 / 9
Embed
Share

Explore data science courses at Bryn Mawr College including a 300-level computer science elective, a 200-level data visualization course, and topics such as statistical methods, machine learning, topological data analysis, and network analysis. Engage in project-oriented learning and work with real-life data sets. Dive into machine learning modules focused on the process of doing machine learning effectively, using tools like iPython notebook and Scikit-learn.

  • Data Science
  • Courses
  • Bryn Mawr College
  • Machine Learning
  • Data Visualization

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. Data Science Dianna Xu Bryn Mawr College 1

  2. The Course 300-level Computer Science elective CS majors and minors Pre-reqs: CS1, CS2 (Data Structures), Discrete Math and Linear Algebra Unstructured data and explorative data analysis Iterative processes that require programming skills and and knowledge of algorithms 2

  3. 200-level Data Visualization Course Taught Spring 2014 at Haverford Stats basics, linear regression Clustering Baby network analysis PageRank Visualization with D3 50% overlap of students 3

  4. Topics Statistical methods (2-3 weeks) basics, regression analysis, Bayesian methods Machine learning background (2-3 weeks) multivariate regression and logistic regression Dimensionality reduction (3 weeks) PCA, SVD, Kernel PCA, other non-linear methods Topological data analysis (2-3 weeks) manifold learning, intro to TDA Network analysis (2-3 weeks) collaborative filtering, community detection 4

  5. Project-Oriented Students will team up for a semester-long project on data analysis for local "data clients" Faculty members who have "real life" data sets and research questions Library, registrar and institutional research 5

  6. Data Sets Digital Du Chemin repertory of polyphonic songs from 16th-century France Dark Reactions chemical experiments with associated reactants and results Maine athletes Anil's bio data? 6

  7. Machine Learning Modules Focused on the process of doing (good) machine learning i.e. (step 1) Pose a problem in the language of machine learning (step 2) Gather data (step 3) Choose a potential method for solving the problem (step 4) Setup an experiment to properly evaluate your method (step 5) Evaluate experiment and possibly go to step 2 or 3 Possible toolsets: iPython notebook Scikit-learn 7

  8. iPython Notebook Modules http://occam.olin.edu/sites/default/files/DataScienceMaterials/machine_learning_le cture_1/Machine%20Learning%20Lecture%201.html http://occam.olin.edu/sites/default/files/DataScienceMaterials/machine_le arning_lecture_2/Machine%20Learning%20Lecture%202.html 8

  9. Open-source, python-based package for machine learning. Principal strength is a consistent API and enforcement of "good" machine learning practices 9

Related


More Related Content