|
|
BUS41201 Data Mining
Spring 2011 Syllabus
Final:
project,
jester sample,
jester test,
jester jokes
Midterm:
grades,
exam solutions,
TraderJoes.R,
TraderJoes.csv,
TJcenters.csv,
boozebin.csv,
boozemulti.csv
Lectures:
01Data,
02Regression,
03Models,
04Classification,
05Clustering,
06Networks,
07Factors,
08Trees,
09Text
Code: 01Data, 02Regression,
03Models (fdr),
04Classification (roc),
05Clustering,
06Networks,
07Factors,
08Trees
Data:
Prostate Biopsy,
CA housing,
TV pilots
(demographics,
show details),
gasoline,
FXmonthly
(Solutions, SP500 monthly returns),
rollcall votes
(legislators),
Web Browsing
(basket),
Firenze,
California Search
(California Nodes),
IMDB network
(movie names,
example analysis),
Light Beer
(Solutions,
R-code),
lastfm,
protein,
AHS Mortgages
(description,
clean script),
email spam
(description, solutions, code),
semiconductor,
orange juice
(description),
credit risk
(description),
Ben and Jerry's,
Odwalla
(HomeScan Data Dictionary)
Computing:
R project site,
R video tutorials,
NY Times article about R
Regression Review Material
Session 1:
class1.pdf,
class1.R,
examples1.pdf
Session 2:
class2.pdf,
class2.R,
examples2.pdf
Session 3:
class3.pdf,
class3.R,
examples3.pdf
[ exercises.pdf ]
Data:
pickup.csv,
rent.csv,
MBA-hgt.csv,
mktmodel.csv,
newspaper.csv,
crime.csv,
OJ.csv,
Election2008byState.csv,
Income-ObamaVoteShare.csv,
amazon.csv,
WSPTrafficStops.csv,
winequality.csv
|