Professional Documents
Culture Documents
220514_Report
220514_Report
1. Methodology
1.1 Data Preprocessing Steps:
1. Reading data: The code reads training and test data from CSV files using
Pandas.
1
1.5 Normalization, Standardization, or Transformation:
No normalization or standardization is applied explicitly. However, one-hot
encoding is performed for categorical variables using OneHotEncoder.
• Model Deployment: Finally, the best model selected from grid search is
used to make predictions on the test data, and the predictions are saved
to a CSV file for submission.
2. Experiment Details
2
3
4
5
3. Results
Public F1 score is 0.24841.
Private F1 score is 0.24647.
Public Leaderboard Rank: 69.
Private Leaderboard Rank: 65.
4. References
Pandas
Intro to Machine Learning
Model Selection and Boosting - Scikit-learn
Classification using SVC
Category Encoders Examples
Pipelines
Seaborn
Matplotlib
openai
5. Github repository
CS253 ML Assignment