Professional Documents
Culture Documents
One Class Classification Approaches For Detecting Automobile Insurance Fraud
One Class Classification Approaches For Detecting Automobile Insurance Fraud
Submitted by:
Atul Kumar Agrawal
Registration no. : 1602040031
I. INTRODUCTION:
Automobile insurance fraud has been a major issue to the insurance companies and has caused
several crores of losses due to the fraudulent and false claims. It is a serious crime in most parts of
the world and the scammners may be sentenced to atleast 1 year of jail and upto 20 years. The
farudsters involve fake patients, fake doctors, fake lawyers together. Various machine learning and
deep learning techniques have been developed to detect these kind of fraud and research is been
done to get the best suitable method that can detect new patterns of fraudsters over time. As the
number of frauds is low in comparison to the legitimate transactions,we use one class clasification
for the minority class to get better results so as to minimise the classifiation of non-fraudulent data
as fraud i.e. minimising the false positive alarm rates.
II. MOTIVATION :
The Insurance Fraud Bureau in the UK estimated there were more than 20,000 fake collisions and
false insurance claims across the UK from 1999 to 2006. One tactic fraudsters use is to drive to a
busy junction or roundabout and brake sharply causing a motorist to drive into the back of them.
They claim the other motorist was at fault because they were driving too fast or too close behind
them, and make a false and inflated claim to the motorist's insurer for injury and damage, which can
pay the fraudsters up to 30 Lakhs .In the Insurance Fraud Bureau's first year or operation, the usage
of data mining initiatives exposed insurance fraud networks and led to 74 arrests and a five-to-one
return on investment. The Insurance Research Council estimated that in 1996, 21 to 36 percent of
auto-insurance claims contained elements of suspected fraud.There is a wide variety of schemes
used to defraud automobile insurance providers.
According to data released by Beijing bureau of China, 10% insurance claims of the total claims
were fraud.The Coalition Against Insurance Fraud estimates that in 2006 a total of about $80 billion
was lost in the United States due to insurance fraud.According to estimates by the Insurance
Information Institute, insurance fraud accounts for about 10 percent of the property/casualty
insurance industry's incurred losses and loss adjustment expenses. Indiaforensic Center of Studies
estimates that Insurance frauds in India costs about $6.25 billion annually.
1. Soft auto-insurance fraud: Examples of soft auto-insurance fraud include filing more than one
claim for a single injury, filing claims for injuries not related to an automobile accident,
misreporting wage losses due to injuries, and reporting higher costs for car repairs than those that
were actually paid.
2. Hard auto-insurance fraud: It includes activities such as staging,automobile collision, filing
claims when the claimant was not actually involved in the accident, submitting claims for medical
treatments that were not received, or inventing injuries and false stolen reports.
Various deep learning, Machine learning and data mining techniques have been implemented in the
case of automobile insurance fraud detection. These are:
Various attributes that are used frequently for automobile insurance fraud detection are:
occurance date and report date, claim open date and claim report date, count of customer
communication, policy effective date and claim occurance date, claim on same vehicle check.
Proposal:
Flow Chart:
V. Conclusion:
Thus, the complete survey was done to get the various techniques used in automobile insurance
fraud detection. This shows that there is still scope as minority class classifiers can be oversampled
to further increase the accuracy and will be helpful in preventing losses of the insurance companies
to a great extent.
The proposed system shall be implemented on different types of algorithms which are used to
compare and test the accuracies to get the best possible method for higher accuracy and low false
positive alarm rates.
VII. References:
2. N.V. Chawla,K.W. Bowyer., L.O. Hall, and W.P. Kegelmeyer,“SMOTE: Synthetic Minority
oversampling Technique”, Journal of Artificial Intelligence Research, vol. 16(1), pp. 321-357, 2002.
3. M. Vasu, and V. Ravi, “ A hybrid undersampling approach for mining unbalanced datasets:
Application to Banking and insurance”, International Journal of Data Mining Modeling and
Management, Vol. 3(1), pp. 75-105, 2011.
4. M.A.H. Farquad, V. Ravi and S. Bapi Raju,“Analytical CRM in banking and finance using SVM:
a modified active learning-based rule extraction approach”, International Journal of Electronic
Customer Relationship Management, vol. 6(1), pp 48-73, 2011.
7. D. C. Li, C. W. Liu, S. C. Hu, “A learning method for the class imbalance problem with medical
datasets”, Computers in Biology and Medicine, Vol. 40(5), pp. 509-518, 2010.
8. L. Peng, H. Zhang, B. Yang, andY. Chen, “A new approach for imbalanced data classification
based on data gravitation”, Information Sciences, Vol. 288, pp. 347-373, 2014.
9. C. F. Tsai, andY. H. Lu, “ Customer churn prediction by hybrid neural networks”, Expert Systems
with Applications, Vol. 36 (10), pp. 12547-12553, 2009.
10. Makki, Sara, Zainab Assaghir, Yehia Taher, Rafiqul Haque, Mohand-Saïd Hacid, and Hassan
Zeineddine. "An Experimental Study With Imbalanced Classification Approaches for Credit Card
Fraud Detection." IEEE Access 7 (2019): 93010-93022.
11. Kowshalya, G., and M. Nandhini. "Predicting Fraudulent Claims in Automobile Insurance." In
2018 Second International Conference on Inventive Communication and Computational
Technologies (ICICCT), pp. 1338-1343. IEEE, 2018.
12. Yan, Chun, and Yaqi Li. "The Identification Algorithm and Model Construction of Automobile
Insurance Fraud Based on Data Mining." In 2015 Fifth International Conference on
Instrumentation and Measurement, Computer, Communication and Control (IMCCC), pp. 1922-
1928. IEEE, 2015.
13. Xu, Wei, Shengnan Wang, Dailing Zhang, and Bo Yang. "Random rough subspace based neural
network ensemble for insurance fraud detection." In 2011 Fourth International Joint Conference on
Computational Sciences and Optimization, pp. 1276-1280. IEEE, 2011.
14. K. Nian, H. Zhang, A. Tayal, T. Coleman, and Y. Li, “Auto insurance fraud detection using
unsupervised spectral ranking for anomaly,” The Journal of Finance and Data Science, vol. 2, no. 1,
pp. 58–75, 2016.
15. C. Phua, D. Alahakoon, and V. Lee, “Minority report in fraud detection: classification of
skewed data,” Acm sigkdd explorations newsletter, vol. 6, no. 1, pp. 50–59, 2004.