Welcome to Scribd!

Project Synopsis On Breast Cancer Detection Using Data Mining

Uploaded by

0% found this document useful (0 votes)

291 views3 pages

This project aims to improve cancer detection from the Wisconsin Diagnostic Breast Cancer dataset using data mining techniques. Various machine learning algorithms will be trained on features from the dataset to classify cancers. The goal is to increase the accuracy of cancer prediction compared to previous studies. Data cleaning, integration, selection, transformation, mining, and evaluation steps will be followed. Classification and prediction methods like decision trees and neural networks will be applied for modeling. The system requirements include a Core i3 processor, 2GB RAM, 200GB hard disk, and Windows 7 with Python/Jupyter Notebook/Anaconda for implementation.

Original Description:

this reportshow the details of the project in research of breast cancer

Original Title

Project Synopsis on breast cancer detection using data mining

Copyright

Available Formats

DOCX, PDF, TXT or read online from Scribd

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Report this Document

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

0% found this document useful (0 votes)

291 views3 pages

Project Synopsis On Breast Cancer Detection Using Data Mining

Uploaded by

rishabh kumar

Copyright:

Available Formats

Download as DOCX, PDF, TXT or read online from Scribd

Flag for inappropriate content

Download as docx, pdf, or txt

Jump to Page

You are on page 1of 3

Search inside document

Project Synopsis

Cancer Detection Using Data Mining

Group Members
Team Leader Name –Rishabh Kumar
Contact-8756916748 Team Member 1-RohitKumar
E-mail-rishabhkumar7847@gmail.com Team Member 2-NavneetPrasad
Team Member 3- Manjeet Singh

Introduction
It is a research project on studies of disease related to cancer. Breast cancer is
one of the most common cancers among women worldwide and cancer related
deaths making it a significant public health problem in today’s society. Breast
cancer is one of the most common cancer along with lung and bronchus cancer,
prostate cancer, colon cancer, and pancreatic cancer among others. it is a topic of
research with great value.
Risk factors for breast cancer
Age, drinking alcohol, Having dense breast tissue, Gender genes, Giving birth at an
older age, never being pregnant these are included in risk factors for breast
cancer

The Dataset

The machine learning algorithms were trained to detect breast cancer using the
Wisconsin Diagnostic Breast Cancer (WDBC) dataset. The dataset consists of
features which were computed from a digitized image of a fine needle aspirate
(FNA) of a breast mass.

Objective

In this project we are trying to improve the past result gained by different expert
on WBDC data. We are focusing on increase in accuracy of the cancer data set so
that the model can predict more accurately.
Methodology

Data Mining techniques are used in this project. Data Mining also known as
Knowledge Discovery in Databases or KDD for short, refers to the wide route of
finding knowledge in data, and emphasizes the "high-level" function of particular
data mining methods. The combined aim of the KDD process is to dig up
knowledge from the large repositories. Data mining methods to pull out the
information according to the condition of actions, using a database along with any
vital preprocessing, sampling, and transformations of that database.

The list of steps involved in the knowledge discovery process:

 Data Cleaning - The noise and inconsistent data is removed.

 Data Integration - many data sources are combined.

 Data Selection - Data relevant to the analysis task are retrieved from the
database.

 Data Transformation - Data is transformed or consolidated into forms

appropriate for mining by performing summary or aggregation operations.
Data Mining - smart methods are applied in order to pull out data patterns.

 Pattern Evaluation - Data patterns are evaluated.

 Knowledge Presentation - Knowledge is represented.

Classification and Prediction

Classification is the route of finding a model that describes the data classes. The
reason is to be able to use this model to guess the class of objects whose class
label is unidentified. This obtained model is based on the analysis of sets of
training data. The obtained model can be presented in the following forms:

 Decision Trees

 Mathematical Formulae

 Neural Networks
Classification - It predicts the class of objects whose class label is unidentified. Its
purpose is to find an obtained model that describes and distinguishes data classes
or concepts. The Derived Model is based on the study set of training data i.e. the
data object whose class label is well known.

Prediction - It is used to predict missing or engaged numerical data values rather

than class labels. Regression study is usually used for prediction. Prediction can
also be used for recognition of distribution trends based on available data.

Requirement of Software and Hardware Specification

Hardware Components:

1. Processor- Core i3
2. Hard Disk – 200GB
3. Memory – 2GB
4. Monitor

Software Components:

1. Windows 7 or Higher
2. Python with Jupyter Notebook
3. Anaconda IDE

Abey Resume Template
Document1 page
Abey Resume Template
929-Vijay Kumar
No ratings yet
Aqi To Print
Document63 pages
Aqi To Print
Sujan Bohara
No ratings yet
Exam Registration System
Document12 pages
Exam Registration System
Deepak John
61% (18)
Hostel Management System
Document25 pages
Hostel Management System
Rakesh Yadav
No ratings yet
Crime Prediction in Nigeria's Higer Institutions
Document13 pages
Crime Prediction in Nigeria's Higer Institutions
Benjamin Bala
No ratings yet
Air Quality Prediction
Document21 pages
Air Quality Prediction
Sathay Adventure
No ratings yet
Seminar Report Machine Learning
Document20 pages
Seminar Report Machine Learning
Havish SR
No ratings yet
Online Blood Bank Management
Document10 pages
Online Blood Bank Management
Divya
100% (1)
Colour Detection
Document6 pages
Colour Detection
AR LAP
No ratings yet
Statistics For BCA
Document6 pages
Statistics For BCA
sebastian cyriac
100% (1)
Final Breast Cancer
Document23 pages
Final Breast Cancer
Prashanth Kumar
100% (1)
Heart Attack Predictions Using Machine Learning
Document8 pages
Heart Attack Predictions Using Machine Learning
IJRASETPublications
No ratings yet
Convolutional Neural Networks
Document17 pages
Convolutional Neural Networks
Manish Man Shrestha
No ratings yet
Lung Disease Prediction - Edited
Document35 pages
Lung Disease Prediction - Edited
Mohammad Farhan
No ratings yet
Final Year Project Thesis
Document25 pages
Final Year Project Thesis
AKANKSHA DHOTE
No ratings yet
Breast Cancer Classification
Document18 pages
Breast Cancer Classification
Satwik Sridhar Reddy
No ratings yet
Machine Learning Approach For Diabetes Prediction
Document7 pages
Machine Learning Approach For Diabetes Prediction
IJRASETPublications
No ratings yet
Data Mining Lab Manual
Document2 pages
Data Mining Lab Manual
akbar2694
No ratings yet
Heart Disease Prediction Final Report
Document31 pages
Heart Disease Prediction Final Report
MUSIC BY LOST
100% (1)
Campus Placement Analyzer: Using Supervised Machine Learning Algorithms
Document5 pages
Campus Placement Analyzer: Using Supervised Machine Learning Algorithms
ATS
No ratings yet
Key Data Mining Tasks: 1. Descriptive Analytics
Document10 pages
Key Data Mining Tasks: 1. Descriptive Analytics
sandeep
No ratings yet
Analysis of Heart Diseases Using Machine Learning: September 6, 2018
Document4 pages
Analysis of Heart Diseases Using Machine Learning: September 6, 2018
Ashish Ranjan
No ratings yet
Introduction To Data Analytics and Visualization Question Paper
Document2 pages
Introduction To Data Analytics and Visualization Question Paper
Abhay Gupta
100% (1)
Laptop Price Prediction Using Machine Learning: International Journal of Computer Science and Mobile Computing
Document5 pages
Laptop Price Prediction Using Machine Learning: International Journal of Computer Science and Mobile Computing
Aaneill Crest
100% (1)
Alzheimer's Disease PHD PPT 2019
Document14 pages
Alzheimer's Disease PHD PPT 2019
RammurtiRawat
100% (1)
NLP Mini Project Report
Document27 pages
NLP Mini Project Report
Rajshree Borkar
No ratings yet
Message Spam Classification Using Machine Learning Report
Document28 pages
Message Spam Classification Using Machine Learning Report
Dhanusri Ramesh
No ratings yet
Pattern Recognition
Document19 pages
Pattern Recognition
Priyansh Kumar
No ratings yet
Heart Disease Prediction Synopsis
Document36 pages
Heart Disease Prediction Synopsis
MUSIC BY LOST
No ratings yet
Ss Project With Python
Document9 pages
Ss Project With Python
Bhola Kamble
No ratings yet
Detection of Tomato Leaf Disease Locations Using Deep Learning
Document9 pages
Detection of Tomato Leaf Disease Locations Using Deep Learning
birbro
No ratings yet
Project Report Hate
Document24 pages
Project Report Hate
Machine Learning
100% (1)
Project Synopsis
Document4 pages
Project Synopsis
Mohammad Javed
0% (1)
Machine Learning - Customer Segment Project. Approved by UDACITY
Document19 pages
Machine Learning - Customer Segment Project. Approved by UDACITY
Carlos Pimentel
100% (1)
Advanced Cluster Analysis: Clustering High-Dimensional Data
Document49 pages
Advanced Cluster Analysis: Clustering High-Dimensional Data
Priyanka Bhardwaj
No ratings yet
Smart Parking System Using Yolov3 Deep Learning Model: Major Project Report
Document26 pages
Smart Parking System Using Yolov3 Deep Learning Model: Major Project Report
Ishant Sadhawani
No ratings yet
IT6006 Data Analytics
Document12 pages
IT6006 Data Analytics
balakiruba
No ratings yet
Internship Report File
Document35 pages
Internship Report File
Manoj Kumar
No ratings yet
Stock Market Prediction
Document20 pages
Stock Market Prediction
Roopa Ram Bose
No ratings yet
BI Lab Manual
Document9 pages
BI Lab Manual
iamhafsa
0% (1)
Cat and Dog Classification Using CNN Fin
Document34 pages
Cat and Dog Classification Using CNN Fin
Kunal Garg
No ratings yet
Synopsis - Image Steganography
Document6 pages
Synopsis - Image Steganography
Akash Singh
100% (3)
Intelligent Surveillance System Using Deep Learning 1
Document9 pages
Intelligent Surveillance System Using Deep Learning 1
Varun Mittala
No ratings yet
18csl66-Ss and Os Lab Manual
Document55 pages
18csl66-Ss and Os Lab Manual
Akansha Singh
No ratings yet
Car Price Prediction Using Machine Learning
Document15 pages
Car Price Prediction Using Machine Learning
Shania Jone
100% (1)
Customer Segmentation Report
Document31 pages
Customer Segmentation Report
OXEN Enterprises
No ratings yet
Lab Assignment 1 Title: Data Wrangling I: Problem Statement
Document12 pages
Lab Assignment 1 Title: Data Wrangling I: Problem Statement
Mr. Legendperson
No ratings yet
DSBDA LAB - MANUAL (Autosaved) - Sd1-Converted-1-2
Document256 pages
DSBDA LAB - MANUAL (Autosaved) - Sd1-Converted-1-2
Vishal Navale
100% (1)
Heart Disease Detection Using Machine Learning
Document12 pages
Heart Disease Detection Using Machine Learning
Abhishek Deshmukh
No ratings yet
Heart
Document28 pages
Heart
Aanchal Gupta
No ratings yet
Prediction of Heart Disease Using Machine Learning Algorithms
Document10 pages
Prediction of Heart Disease Using Machine Learning Algorithms
IJRASETPublications
100% (1)
The Problem of Overfitting: Overfitting With Linear Regression
Document32 pages
The Problem of Overfitting: Overfitting With Linear Regression
Pallab Puri
No ratings yet
DFD
Document14 pages
DFD
api-3739854
100% (8)
Machine Learning Based Heart Disease Prediction System
Document11 pages
Machine Learning Based Heart Disease Prediction System
IJRASETPublications
No ratings yet
Question Bank - Unit I
Document2 pages
Question Bank - Unit I
Rama
No ratings yet
Heart Disease Prediction Using Machine Learning
Document7 pages
Heart Disease Prediction Using Machine Learning
IJRASETPublications
No ratings yet
RGPV Notes
Document90 pages
RGPV Notes
Islamic India
100% (1)
Title: Smart Heath Prediction Using Machine Learning
Document20 pages
Title: Smart Heath Prediction Using Machine Learning
Rohit Raj
No ratings yet
Prediction of Heart Disease Using Logistic Regression Algorithm
Document6 pages
Prediction of Heart Disease Using Logistic Regression Algorithm
IJRASETPublications
No ratings yet
AKTU Syllabus CS 3rd Yr
Document1 page
AKTU Syllabus CS 3rd Yr
PPDC NAGAUR
No ratings yet
Forest Fires Data Set Analysis Using Machine Learning: Name: 1.pawan Jakke (111815018) 2.utkarsh Dubey (111815047)
Document8 pages
Forest Fires Data Set Analysis Using Machine Learning: Name: 1.pawan Jakke (111815018) 2.utkarsh Dubey (111815047)
Aman Dubey
No ratings yet
Supervised Learning (Classification and Regression)
Document14 pages
Supervised Learning (Classification and Regression)
shreya sarkar
No ratings yet
Mastering Parallel Programming with R
From Everand
Mastering Parallel Programming with R
Simon R. Chapple
No ratings yet
SARP Redcap
Document23 pages
SARP Redcap
Bhavana Alapati
No ratings yet
The Role of Artificial Intelligence in The Development of Accounting Systems: A Review
Document2 pages
The Role of Artificial Intelligence in The Development of Accounting Systems: A Review
Ria Gayle
No ratings yet
MSBI Interview Questions
Document7 pages
MSBI Interview Questions
cap.rohit550
No ratings yet
Ieee 2011 Free Projects With Source Code
Document8 pages
Ieee 2011 Free Projects With Source Code
vikasys
No ratings yet
BI Developer
Document8 pages
BI Developer
Rahul Chinta
No ratings yet
Graph Database-Based Network Security
Document12 pages
Graph Database-Based Network Security
smanjuravi
No ratings yet
01 - Introduction To PME v8 v1.0
Document48 pages
01 - Introduction To PME v8 v1.0
jorge_carrera
No ratings yet
Zeiss Piwebshopfloorqualitydatamanagementindustry4 170929035627
Document44 pages
Zeiss Piwebshopfloorqualitydatamanagementindustry4 170929035627
Satriyo Utomo
No ratings yet
Aspen Petroleum Supply Chain: Installation Guide
Document90 pages
Aspen Petroleum Supply Chain: Installation Guide
Francisco Ariza
No ratings yet
How To Access Online Service System
Document8 pages
How To Access Online Service System
muraliala
0% (1)
Employee Management System: A Project Submitted in Partial Fulfilment of The Requirements For The Degree of
Document21 pages
Employee Management System: A Project Submitted in Partial Fulfilment of The Requirements For The Degree of
Prints Bindings
No ratings yet
Symantec DLP 14.5 Upgrade Guide Win
Document71 pages
Symantec DLP 14.5 Upgrade Guide Win
Uditha Priyanga Ranaweera
No ratings yet
Computer Synopsis Class12 30-Jul-2021
Document21 pages
Computer Synopsis Class12 30-Jul-2021
Aaqib
No ratings yet
Core Tables HR
Document19 pages
Core Tables HR
shivram2511
No ratings yet
Report - Online Learning System
Document42 pages
Report - Online Learning System
Bhargav Jani
No ratings yet
MongoDB Developer Training Program Datasheet
Document13 pages
MongoDB Developer Training Program Datasheet
ottmar mendoza juarez
No ratings yet
Active Database
Document30 pages
Active Database
Namrata91
No ratings yet
7 Network Programmability Concepts
Document48 pages
7 Network Programmability Concepts
Kv142 Kv
No ratings yet
How To Hack Facebook
Document2 pages
How To Hack Facebook
Coder Coder
No ratings yet
Replication
Document27 pages
Replication
anon-862077
100% (1)
Proficy HMI/SCADA - iFIX: U SQL
Document66 pages
Proficy HMI/SCADA - iFIX: U SQL
David Da Silva Borges
No ratings yet
AGOPP - Research Process Strategy Model: The Five Stages of AGOPP - Strategies, Suggestions and Examples
Document20 pages
AGOPP - Research Process Strategy Model: The Five Stages of AGOPP - Strategies, Suggestions and Examples
Michael Brocato
No ratings yet
Actus Adwatch - Automatic Content Detection & TV Ads Tracking
Document4 pages
Actus Adwatch - Automatic Content Detection & TV Ads Tracking
Bijjar Baloch
No ratings yet
Installation of SAP Systems
Document25 pages
Installation of SAP Systems
oleg.g1
No ratings yet
Upgrade To: Reduce Costs and Accelerate Innovation
Document1 page
Upgrade To: Reduce Costs and Accelerate Innovation
bujjii777
No ratings yet
Adding Points To Cimplicity
Document1 page
Adding Points To Cimplicity
Mohamed Amine
100% (1)
HL7 Recommendation: Using XML As A Supplementary Messaging Syntax For HL7 Version 2.3.1
Document25 pages
HL7 Recommendation: Using XML As A Supplementary Messaging Syntax For HL7 Version 2.3.1
David Uribe Katz
No ratings yet