Professional Documents
Culture Documents
Car Insurance Fraud Proposal
Car Insurance Fraud Proposal
Project Proposal
[4] Jesus M Perez, Javier Muguerza, Olatz Arbelaitz, Ibai Gurrutxaga, and
We plan to divide our project into three main parts: the first part Jose I Martın. Consolidated tree classifier learning in a car insurance
will be about exploratory analytics – to visualize the data and fraud detection domain with class imbalance. In International
Conference on Pattern Recognition and Image Analysis, pages 381–389.
better understand features; the second will consist in using simple Springer, 2005.
machine learning models (Logistic Regression, Regularization,
[5] Yibo Wang and Wei Xu. Leveraging deep learning with lda-based text
Feature Selection, Classification Trees) to understand which analytics to detect automobile insurance fraud. Decision Support
features might be more influential for predicting and their Systems, 105:87–95, 2018.
relationship with the target variable. Eventually, we will develop [6] Meryem Yankol-Schalck. The value of cross-data set analysis for
our final model, exploiting Random Forest or Gradient Boosted automobile insurance fraud detection. Research in International
Business and Finance, 63:101769, 2022.
Trees, to make the predictions and evaluate the best fit.
Moreover, as we are dealing with a dataset with skewed class
distribution – there are far more observations labelled as not