Sprint Challenge: Linear Models
In this sprint challenge, you will demonstrate your understanding of linear regression, ridge regression, and logistic regression. You'll apply what you've learned to solve real-world prediction problems.
Challenge Overview
This sprint challenge consists of several key tasks:
- Import training data
- (stretch goal): Create scatter plot
- Split data into feature matrix and target vector
- Split data into training and validation sets
- Establish baseline mean absolute error
- Build and train linear regression model
- Build and train ridge regression model
- Calculate mean absolute error for training and validation sets
- Calculate for validation set
- Make predictions based on test set
- (stretch goal): Get below 18,000 for test set MAE
- (stretch goal): Plot model coefficients
Assessment Criteria
Your submission will be evaluated based on the following:
- Technical Proficiency (50%): Correct implementation of linear models, appropriate feature engineering, proper model evaluation.
- Data Analysis (20%): Quality of exploratory data analysis, insights derived from the data.
- Model Interpretation (20%): Accurate interpretation of model coefficients, clear explanation of model predictions.
- Code Quality (10%): Well-documented, readable code, efficient implementation.
Sprint Challenge Repository
Click the link below to access the sprint challenge repository. Follow the instructions in the README file to complete the challenge.