DataMiningCamp

Saved by hoffman.tricia@gmail.com
on November 8, 2010 at 6:12:27 pm
 

Home Page for the Beginning Data Mining/Machine Learning Class

 

Install R 

To get started with R, download it from http://cran.r-project.org/

 

Complete instructions for downloading R and setting it up in Eclipse are here: To run R on Eclipse

 

 

Reading Material 

To understand the mathematics behind linear regression check out these slides:  MultivariateLinearRegression.pdf
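As a quick companion to the slides, here is a minimal sketch of multivariate linear regression in base R. It is not taken from the slides: the data are simulated, and the variable names (x1, x2, beta_normal, beta_lm) are made up for illustration. It solves the normal equations directly and checks the result against R's built-in lm().

```r
# Minimal sketch on simulated data (not course data).
set.seed(1)                      # make the simulated data reproducible
n  <- 100
x1 <- rnorm(n)
x2 <- rnorm(n)
y  <- 2 + 3 * x1 - 1.5 * x2 + rnorm(n, sd = 0.5)

# Design matrix with a leading column of ones for the intercept.
X <- cbind(Intercept = 1, x1 = x1, x2 = x2)

# Normal equations: solve (X'X) beta = X'y for beta.
beta_normal <- drop(solve(crossprod(X), crossprod(X, y)))

# The same fit using R's built-in lm().
fit     <- lm(y ~ x1 + x2)
beta_lm <- coef(fit)

# The two rows should agree up to rounding error.
print(rbind(normal_equations = beta_normal, lm = beta_lm))
```

In practice lm() is the better choice because it uses a QR decomposition, which is numerically more stable than forming X'X; the explicit solve is shown only to mirror the algebra.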

 

To understand why you need to normalize the data before the regression: RidgeBiasDiscussion.txt
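The point can also be checked numerically. The sketch below is an illustration on simulated data, not the content of RidgeBiasDiscussion.txt. The helper ridge_fitted() is a hypothetical function written for this example: it fits ridge regression in closed form on centered predictors, leaving the intercept unpenalized. Changing the units of one predictor changes the ridge predictions but leaves the least squares predictions alone, and standardizing the columns first removes that dependence.

```r
# Minimal sketch on simulated data (not course data).
set.seed(2)
n  <- 50
x1 <- rnorm(n)
x2 <- rnorm(n)
y  <- 1 + 2 * x1 + 2 * x2 + rnorm(n)

# Hypothetical helper: closed-form ridge fit on centered predictors.
# The intercept is handled by centering, so it is not penalized.
# Returns the fitted values on the training data.
ridge_fitted <- function(X, y, lambda) {
  Xc   <- scale(X, center = TRUE, scale = FALSE)
  yc   <- y - mean(y)
  beta <- solve(crossprod(Xc) + lambda * diag(ncol(Xc)), crossprod(Xc, yc))
  drop(mean(y) + Xc %*% beta)
}

X_raw      <- cbind(x1, x2)
X_rescaled <- cbind(x1, x2 * 1000)   # same information, x2 in different units

lambda <- 10

# Ridge: predictions change when a column is merely rescaled.
ridge_diff <- max(abs(ridge_fitted(X_raw, y, lambda) -
                      ridge_fitted(X_rescaled, y, lambda)))

# Least squares (lambda = 0): predictions do not depend on the units.
ols_diff <- max(abs(ridge_fitted(X_raw, y, 0) -
                    ridge_fitted(X_rescaled, y, 0)))

# Ridge on standardized columns: the dependence on units disappears.
std_diff <- max(abs(ridge_fitted(scale(X_raw), y, lambda) -
                    ridge_fitted(scale(X_rescaled), y, lambda)))

print(c(ridge = ridge_diff, ols = ols_diff, ridge_standardized = std_diff))
```

The penalty is applied to the coefficients themselves, so a predictor measured in small units (and therefore carrying a small coefficient) is barely penalized. Standardizing puts every predictor on the same footing before the penalty is applied.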

 

PredictionErrorvsModelComplexity: this chart is from the textbook "The Elements of Statistical Learning".

 

Ridge regression is covered in Section 3.4.1 (page 61) of:

"The Elements of Statistical Learning - Data Mining, Inference, and Prediction" by Trevor Hastie, Robert Tibshirani, and Jerome Friedman. http://www-stat.stanford.edu/~tibs/ElemStatLearn

 

 

Regression(stats315a).pdf

PracticalRegression_R.pdf

 

 

 

Examples to Run 

Here are some examples you might find interesting:

RegressionExample.R (problem statement): a deliberately simple example

sonar_regress_crossval.R: an example using cross-validation

RidgeRegSonarData.R: an example of ridge regression
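If you want to see the idea behind cross-validation before opening the scripts above, here is a minimal sketch in base R. It is not the content of sonar_regress_crossval.R: the data are simulated, and cv_mse() is a hypothetical helper written for this example. It estimates out-of-sample mean squared error with k-fold cross-validation and compares a straight-line model with a much more flexible degree-10 polynomial.

```r
# Minimal sketch on simulated data (not the sonar data).
set.seed(3)
n   <- 120
dat <- data.frame(x = runif(n, -2, 2))
dat$y <- 1 + 2 * dat$x + rnorm(n, sd = 0.5)   # the true relationship is linear

# Hypothetical helper: k-fold cross-validated mean squared prediction error
# for a model described by an lm() formula.
cv_mse <- function(formula, data, k = 5) {
  # Assign each row to one of k folds at random.
  folds    <- sample(rep(seq_len(k), length.out = nrow(data)))
  response <- all.vars(formula)[1]            # name of the response column
  fold_mse <- sapply(seq_len(k), function(i) {
    train <- data[folds != i, , drop = FALSE] # fit on k - 1 folds
    test  <- data[folds == i, , drop = FALSE] # evaluate on the held-out fold
    fit   <- lm(formula, data = train)
    mean((test[[response]] - predict(fit, newdata = test))^2)
  })
  mean(fold_mse)
}

mse_linear <- cv_mse(y ~ x, dat)
mse_poly   <- cv_mse(y ~ poly(x, 10), dat)

print(c(linear = mse_linear, degree10 = mse_poly))
```

Each observation is predicted by a model that never saw it during fitting, so the averaged error is an estimate of performance on new data rather than of training fit. That is the quantity plotted against model complexity in the chart listed under Reading Material.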

 

 

Data to Download

Data required for the example programs

sonar_test.csv

sonar_train.csv

Data for Camp Example

winequality-red.csv

 
