Professional Documents
Culture Documents
Report PDF
Report PDF
35 Pages 3.4MB
Summary
ABSTRACT
INTRODUCTION
Car dents and scratches are common occurrences that can make the
appearance of a vehicle and diminish its resale value. Whether it is a minor parking
lot mishap or a more severe collision, these imperfections can be frustrating and
40
costly for car owners to repair. Fortunately, advances in technology have led to the
development of innovative solutions for detecting and analyzing car dents and
scratches accurately and efficiently. Car dents are depressions or deformities in the
vehicle's outer body caused by impacts with objects, shopping carts, hail, or other
vehicles. On the other hand, scratches are shallow cuts or abrasions on the car's
paintwork, often resulting from contact with rough surfaces, tree branches, or
vandalism shown in Figure 1.
Dents and scratches can make routine maintenance tasks, such as washing
and waxing, more challenging. The irregular surface may trap dirt and grime,
making it harder to clean the affected areas properly. Additionally, waxing the car
may be less effective due to the damaged paint layer, which can lead to further
deterioration of the vehicle's appearance. When car owners decide to claim
insurance for dent or scratch repairs, they may face the drawback of higher
premiums in the future. Additionally, insurance policies often come with
36
deductibles, meaning that the car owner must pay a certain amount out of pocket
before the insurance coverage kicks in. For minor damages, it may not be worth
making a claim due to the deductible costs. Regular maintenance and careful
61
driving can help minimize the risk of dents and scratches, ensuring that the car
remains in top-notch condition for an extended period.
In the past, detecting car dents and scratches was primarily done through
manual inspections by automotive technicians. This process involved visually
examining the entire vehicle surface under specific lighting conditions to identify
imperfections. However, this method was time-consuming, subjective, and prone to
human error. Automated dent and scratch detection systems offer the promise of
31
addressing these limitations. By leveraging the power of computer vision, these
systems can process vast amounts of data rapidly and identify imperfections with
greater accuracy and consistency.
The need of artificial intelligence (AI) and computer vision to improve car
maintaining and appearance is one of the industry's newest developments. The
creation of an automated system that can recognize dents and scratches on vehicles
is the project's main objective. In order to evaluate photographs or videos of
automobiles and pinpoint flaws with high precision, computer vision techniques
like image processing and machine learning are used. Real-time picture and video
processing is part of the project's scope, giving users access to immediate
feedback. For speedy assessments in situations like vehicle inspections, insurance
claims, and pre-owned vehicle evaluations, this real-time capacity is crucial. The
study may go beyond simple detection by developing algorithms to determine how
severe dents and scrapes are. Users would have knowledge of the damage's scope,
which would help them decide on the most important repairs to make first and how
50
much they should cost. The AI system should be able to continuously adapt to and
learn from fresh data. To improve detection accuracy over time, this entails
integrating methods for model updates and improvements.
The system may be designed as part of the scope so that it can be built up to
manage many vehicles, making it appropriate for usage in dealerships, auto body
shops, and automobile rental agencies. The project's potential advantages and uses
range widely, from automated detection and severity evaluation to real-time
processing and integration with mobile apps. The incorporation of AI-driven
solutions for car maintenance and aesthetics holds the promise of increased
efficiency, decreased costs, and improved customer experiences as the automotive
industry continues to develop.
LITERATURE SURVEY
Phyu and Kuntpong (2020) examined several deep learning methods and
49
used transfer learning in pre-trained VGG models, to identify the damaged section
59
of an automobile and determine its position and severity. With the help of transfer
11
learning and L2 regularisation, they were able to achieve accuracy of 95.22% of
VGG19 and 94.56% of VGG16 in damaged detection, 76.48% of VGG19 and
74.39% of VGG16 in damage localisation, and 58.48% of VGG19 and 54.8% of
VGG16 in damage severity. They concluded that VGG19 performed better than
VGG16. Their models addressed overfitting issues.
Najmeddine et al., (2019) looked at insurance firms and they found that
12
claim leakages cost them a significant amount of money. Inception-Resnet
pre-trained model as features extractor, followed by fully connected neural
networks to identify and categorise the damages, was their suggested approach.
The damage site was marked using the R-CNN mask. They concluded that the
12
suggested pre-trained model is more effective in localising damage, with an
accuracy of 86.8% as opposed to 75.4%. They discovered that Inception-ResnetV2
model outperforms VGG16 with 80% precision compared to 69% for damage
severity classification. As a result, they discovered a solution demonstrating a
crucial resource for the insurance sector to combat problems with claims leakage
using the Inception-ResnetV2 model.
Mahboub & Majid (2022) have proposed a model using two-route CNN
model that is able to extract features from the image. For each route, they
employed the new mReLU and inception blocks, which oversee extracting
high-level visual information. In order to reduce the model's effectiveness at
segmenting pieces, they employed a pre-trained ResNet-50 model. To address the
overfitting, a 3 3 mReLU block and an inception block were proposed. They
discovered that using a pretrained CNN model to each route produced superior
segmentation results compared to other models including FFNN, YOLOv3, HTC,
and VGG. They used the recommended methodology to a personal database of an
automobile collection. Their conclusion was 93% accurate. The front doors and
trunk lid have the highest levels of accuracy, while the rear bumper has the lowest
levels. The recommended two-route model outperformed all other models in
comparison, demonstrating its great efficacy in achieving the required goals.
Yonathan Wijaya Gustian et al., (2023) has developed a model using YOLO
Deep Learning Algorithm. To detect the defects, they used convolution Neural
Network which helps in feature extraction. Precision analyzes how well the model
foresees the proper things, whereas recall assesses how many of the recognized
42
items are correctly identified given the total number of objects. In this case, the
average accuracy is 61.55%. Recall rates are, on average, 46.05%. The term
"mAP" refers to the average accuracy value across all recognized object classes.
62
The mean mAP of 0.441 in this instance demonstrates that the model performs
adequately in spotting defects in car pictures in general. The performance of this
model can still be improved by tweaking the parameters and techniques utilized
16
during the model training and testing process. A larger and more diverse dataset
can help improve the efficiency of damage identification on vehicles when using
deep learning algorithms like YOLO.
Automotive Industry: The dent and scratch detecting system has been a
game-changing innovation in the automobile sector, transforming several facets of
vehicle maintenance, quality assurance, and customer happiness. With the help of
this system, auto repair companies have significantly increased their productivity
and accuracy. It automates the detection of exterior car damage throughout
estimation and repair operations, saving time and money while ensuring a complete
analysis of all problems. In addition to streamlining procedures, this improves
customer satisfaction by giving more accurate repair estimates and lowering the
possibility of error.
The technology also serves as a strong fraud detection tool. When legitimate
claims are accurately identified and suspect ones are flagged, insurance fraud and
related expenses are ultimately decreased. In essence, incorporating computer
vision systems into insurance workflows is a game-changer since it brings about
efficiency, accuracy, fairness, and fraud protection, all of which are ultimately
advantageous to both insurers and policyholders in the claims-handling process.
A. Image acquisition
B. Pre-processing
Before using images for purposes like training machine learning models, an
image dataset must be cleaned, improved, and organized. To increase the
effectiveness and performance of the model, this may involve scaling, normalizing
the pixel values, eliminating noise, and enhancing the data.
D. Training
• Initializing the YOLO model with the updated settings and, if available,
pre-trained weights.
58
• Employing a loss function that includes classification loss and
localization loss.
• Utilising labelled pictures and comments to train the model using the
training set.
29
• To avoid overfitting, keep an eye on the loss and other metrics on the
validation set.
E. Evaluation
30
Metrics like mean Average Precision (mAP), precision, recall, and F1-score
are frequently used to evaluate YOLO. A general indicator of detection
performance across different confidence threshold levels is provided by mAP.
2
Model Size mAPval Speed CPU Speed A100 Params FLOPs
(pixels) 50-95 ONNX (ms) TensorRT (ms) (M) (B)
YOLOv8n 640 37.3 80.4 0.99 3.2 8.7
YOLOv8s 640 44.9 128.4 1.20 11.2 28.6
YOLOv8m 640 50.2 234.7 1.83 25.9 78.9
YOLOv8l 640 52.9 375.2 2.39 43.7 165.2
YOLOv8x 640 53.9 479.1 3.53 68.2 257.8
Table 1: Trade-off between different YOLO Model
60
In our model, we used performance metrics like precision, recall, accuracy,
F1 score, map50 and map50-95 to evaluate the model. The brief explanation of
these metrics are given below. The predictive effectiveness of an algorithm on a
45
dataset is represented by a confusion matrix. A confusion matrix for a binary class
dataset (which, hypothetically, consists of "positive" and "negative" classes) must
include the following four elements:
55
True Positives (TP): The quantity of samples that were accurately classified
as "positive."
10
False Positives (FP) are samples that were incorrectly classified as
"positive."
Precision = TP / (TP+FP)
Recall evaluates how well you remember all the advantages. For instance,
80% of the potential positive situations can be found in our most accurate K
predictions.
Recall = TP / (TP+FN)
8
The F1 score, a machine learning evaluation metric, rates a model's
precision. It incorporates a model's recall and precision ratings. The accuracy
statistic tells us how often a model predicts properly over the entire dataset. Only
when the data set is category-balanced, or when each category has an equal number
of samples, can this measure be relied upon.
48
F1 Score = (2 x Precision x Recall) / ( Precision + Recall)
28
Recall indicates the percentage of real positive cases that the algorithm
correctly detects, while precision represents the precision of the model's positive
predictions. Precision and recall are frequently trade-offs; for instance, raising the
66
amount of identified objects (greater recall) may lead to an increase in false
1
positives (lower precision). The precision-recall curve, which plots precision
versus recall for different confidence thresholds, is incorporated into the AP
measure to account for this trade-off. The Precision-Accuracy (PA) curves show
64 39
how Precision and Recall interact at various confidence levels. The Mean Average
Precision (mAP), which is obtained as the mean of the AP values for every
category, is generated from the region under the prediction and recall curves.
These metrics are useful tools for evaluating the effectiveness of concepts
and systems, allowing decisions based on data and ongoing development across a
range of applications. These measurements direct efforts at hyperparameter
tweaking, model selection, and optimization, resulting in the creation of reliable
and efficient algorithms. Additionally, they make it easier to compare various
models or approaches, help debug issues, and provide information for
decision-making procedures, ultimately improving technology and the use of data
across a variety of sectors.
F. Recognition
G. Web Interface
Our concept prioritizes accessibility for end users, even those with limited
technical expertise, by offering a user-friendly interface that abstracts complex
technological components. To achieve this, we've chosen to develop a web
interface using ReactJS and Flask. ReactJS, known for its interactive and intuitive
front-end capabilities, provides an engaging and responsive user experience.
Meanwhile, Flask, a lightweight Python web framework, enables us to build a
robust back-end for handling data processing and communication seamlessly. This
combination ensures that our users can easily interact with our technology, making
it accessible, efficient, and user-centric, ultimately enhancing their overall
experience and utility.
Figure 5. UI design
H. Deployment
Due to its adaptability and wide range of libraries, like OpenCV and
TensorFlow, Python is a well-known language. TensorFlow, Keras, PyTorch, and
many other machine learning libraries and frameworks are available in Python, and
they provide tools for activities including preprocessing data, training models, and
evaluation. Python speeds up the development cycle and promotes code
collaboration because of its clear and legible syntax, which is accessible to both
novice and seasoned programmers. The huge and vibrant Python community
makes a multitude of instructions, record-keeping, and open-source initiatives
available, facilitating knowledge-sharing and problem-solving.
Car dent and Scratch are identified by training our model using YOLOv8.
The organization that created YOLOv5, Ultralytics, launched YOLOv8 in January
17
2023. YOLOv8 supports a variety of vision tasks, such as object detection,
1
segmentation, posture estimation, tracking, and classification. YOLOv8 had five
scaled variations: YOLOv8n (nano), YOLOv8s (small), YOLOv8m (medium),
YOLOv8l (large), and YOLOv8x (extra enormous). YOLOv8 utilizes an
architecture that is exactly the same as that of YOLOv5, as seen in Figure 6, with
1
significant changes to the CSPLayer, now known as the C2f module.
YOLOv8 can be run through the command line interface (CLI) or installed
as a PIP package. Additionally, it features a number of integrations for deployment,
training, and labeling. A crucial feature of YOLOv8 is its extensibility. Its
extensibility is arguably one of YOLOv8's most appealing features. It acts as a
foundation to preserve compatibility with earlier YOLO versions. This enables
customers to take advantage of the most recent YOLO improvements while
maintaining the capabilities of their existing YOLO-based applications. customers
may quickly switch between different YOLO versions and evaluate their
performance. Since YOLOv8's backward compatibility guarantees a seamless
upgrade to the most recent YOLO technology, it is a great option for both people
and businesses looking for cutting-edge object detection abilities without upsetting
their current workflows. Because of this, YOLOv8 is the best option for people
who want to benefit from the most recent YOLO technology while keeping their
old YOLO devices' functionality.
Dent and scratch detection is an challenging task and it helps the inspection
process by automation. We have developed a model using YOLOv8.
For a certain YOLOv8 model, recall and precision confidence curves can be
used to determine where there is a trade-off between the two. Using this
knowledge, one can select the ideal confidence level for a particular application.
For instance, a lower confidence level can be selected if it is crucial to find all
dents and scratches, even if doing so results in finding some false positives. A
higher confidence threshold might be used if it is crucial to reduce the amount of
false positives, even if doing so implies missing some actual positives.
35
Figure 10. Recall-Confidence Curve
Figure 11. Precision-Confidence Curve
CONCLUSIONS
we have developed a model that detects dent and scratch present in the exterior
region of our vehicles. Yolov8 has eliminated grid sensitivity, improved training
32
strategies, compute, and balance losses. Biggest advantage of using YOLO v8 is its
speed. It can process 155 frames per second. The process of recognizing and
finding automotive dents and scratches has been improved to an unparalleled
degree of precision by utilizing the YOLOv8 model's excellent object
identification skills. As a result, there will be fewer possibilities for human
mistakes, shorter inspection times, and ultimately higher customer satisfaction.
TOP SOURCES
The sources with the highest number of matches within the submission. Overlapping sources will not be
displayed.
arxiv.org
1 2%
Internet
thesai.org
3 1%
Internet
Sources overview
Similarity Report
researchgate.net
12 <1%
Internet
Sources overview
Similarity Report
hindawi.com
22 <1%
Internet
mdpi.com
30 <1%
Internet
Sources overview
Similarity Report
Chuanzhu Sun, Bin Li, Chaoxing Fu. "Study of Resin Zipper Defect Dete...
42 <1%
Crossref
Sources overview
Similarity Report
biomedical-engineering-online.biomedcentral.com
48 <1%
Internet
Alvin Sarraga Alon, Enrique D. Festijo, Drandreb Earl O. Juanico. "Tree ...
51 <1%
Crossref
Sources overview
Similarity Report
Phyu Mar Kyu, Kuntpong Woraratpanya. "Car Damage Detection and Cl...
59 <1%
Crossref
Syed Asif Ahmad Qadri, Nen-Fu Huang, Taiba Majid Wani, Showkat Ah...
60 <1%
Crossref
THEI on 2023-05-08
61 <1%
Submitted works
bannariamman on 2023-10-07
65 <1%
Submitted works
Sources overview