Matt Taddy
Matt Taddy

Teaching

BUS41201 Data Mining

Spring 2011 Syllabus

Final: project, jester sample, jester test, jester jokes

Midterm: grades, exam solutions, TraderJoes.R, TraderJoes.csv, TJcenters.csv, boozebin.csv, boozemulti.csv

Lectures: 01Data, 02Regression, 03Models, 04Classification, 05Clustering, 06Networks, 07Factors, 08Trees, 09Text

Code: 01Data, 02Regression, 03Models (fdr), 04Classification (roc), 05Clustering, 06Networks, 07Factors, 08Trees

Data: Prostate Biopsy, CA housing, TV pilots (demographics, show details), gasoline, FXmonthly (Solutions, SP500 monthly returns), rollcall votes (legislators), Web Browsing (basket), Firenze, California Search (California Nodes), IMDB network (movie names, example analysis), Light Beer (Solutions, R-code), lastfm, protein, AHS Mortgages (description, clean script), email spam (description, solutions, code), semiconductor, orange juice (description), credit risk (description), Ben and Jerry's, Odwalla (HomeScan Data Dictionary)

Computing: R project site, R video tutorials, NY Times article about R


Regression Review Material
Session 1: class1.pdf, class1.R, examples1.pdf
Session 2: class2.pdf, class2.R, examples2.pdf
Session 3: class3.pdf, class3.R, examples3.pdf
[ exercises.pdf ]
Data: pickup.csv, rent.csv, MBA-hgt.csv, mktmodel.csv, newspaper.csv, crime.csv, OJ.csv, Election2008byState.csv, Income-ObamaVoteShare.csv, amazon.csv, WSPTrafficStops.csv, winequality.csv


Booth Homepage | Booth Portal | UC Homepage Copyright © 2008 University of Chicago Booth School of Business