Professional Documents
Culture Documents
Analysis and Identification of Cancerous Factors
Analysis and Identification of Cancerous Factors
Analysis and Identification of Cancerous Factors
Abstract - Data mining techniques have been generally utilized as a part of medical decision support systems for forecast
and finding of different diseases with great accuracy. Prediction of cancer at an early stage is a crucial task. In study
conducted by researchers was proven that patient affected by cancer consumes food which contains cancer causing
substances. It includes processed meat, processed sugar, pastry, poor diet, poor intake of fish and vegetables, and so on can
also stimulate the appearance of this dangerous disease like cancer. The proposed study centered on the application of data
mining techniques using rule based algorithm for predicting cancer at an early stage. The aim of the thesis paper is to give an
alert to the user which will save the time and cost of the treatment.
Data mining techniques are used to develop a system Tanvi Sharma, Anand Sharma (2016) focused on the
to predict cancer at an early stage. Rule based data different data mining classification techniques by
using WEKA tool and Rapid miner on the public K-means algorithm to separate data relevant to the
health care dataset to analyze. Based on highest skin cancer. Finally they implemented a prediction
accuracy, the best technique for particular data set is system of skin cancer using Lotus Notes.Shweta
chosen. They analyzed performance of data mining Kharya (2012) focused on different types of current
classification technique for health care system. research using data mining techniques to improve the
breast cancer prognosis and diagnosis. Ada et. al.
Neelam Singh and Santosh Kumar Singh Bhadauria (2013) proposed a method of segmentation which
(2016) have introduced cancer prediction system contains chest position, size and hidden portion of the
using data mining. They proposed an approach for the lung area. They used feature extraction, classification
extraction of significant pattern from data warehouse etc. technique of data mining to detect lung cancer.
for efficient prediction of cancer. By using java they Ronak Sumbaly et. al. (2014) proposed data model
implemented the proposed method which can using decision tree of data mining technique to
efficiently and successfully predict the risk level of predict breast cancer at an early stage. They also
cancer. B. Muthazhagan et. al. (2016) explored the discussed different data mining approaches for
recent research on early prediction of lung cancer prediction of breast cancer.
using data mining and image processing. They
observed various data mining techniques such as III. PROPOSED WORK
classification, clustering, prediction etc. These
systems provide most accurate values of prediction of R. Agarwal [1] has introduced association rule
cancer. learning of data mining technique. Association rules
analysis is a technique to uncover how items are
Kumar Anita (2015) has expressed cancer prediction associated to each other. The algorithm is
using four data mining techniques. They used four implemented using rule set as given:
classification algorithms such as Naïve Bayes,
Logical Model Tree, Random forecast, Classification IF A & B THEN C
and Regression Tree. The result observed that
Random forecast classification method performs Where A and B is the conjunction of conditions and
better than the others. Peter Adebayo Idowu et. al. C is the Prediction class. There is no limit on the
(2015) used data mining techniques to predict breast number of conjunction of conditions in the rules, but
cancer. To understand the risk factors of breast cancer there is a constraint on the number of predicted
they studied number of case studies. They compared dimension. Association rules are constructed by
the data with two different methods like Naïve Bayes identifying data for frequent if/then patterns and
and J48 decision tree. The result showed that J48 identification of the most important relationship by
decision tree is best model to predict the risk of breast using the criteria support and confidence.
cancer.
Cancer risk factors and its domain:
Tasnuba Jesmin et.al (2013) collected 150 people
data and preprocessed, and then clustered the relevant (Table No.1: Risk Factors)
and non relevant data for brain cancer using K-means
algorithm. They developed a tool for brain cancer
detection using data mining technique which saves
time reduce the cost.
Attribute and Score Values: Step 3: Compute the Risk Prediction by using the
rule set (Table No. 2)
(Table No.2: Score Values)
Step 4: Compute association rule using IF ‘A1
…An’ THEN R rule. Here ‘A1…An’ are
conjunctions of conditions that may be satisfied or
unsatisfied set of predicted dimensions R. The rule
based classification technique accelerates to four
statuses TP, TN, FP, and FN as defined below:
TP: - indicates patient has cancer and it is correctly
predicted.
FP: - indicates patient has cancer and it is incorrectly
predicted.
TN: -indicates patient does not have cancer and it is
correctly predicted.
FN: -indicates patient does not have cancer and it is
incorrectly predicted.
Cover=
CONCLUSION
prediction of cancer for particular organ of the body [11] Shweta Kharya “Using Data Mining Techniques for
Diagnosis and Prognosis of Cancer Disease” International
using classification techniques.
Journal of Computer Science, Engineering and Information
Technology (IJCSEIT), Vol.2, No.2, April 2012.
REFERENCES [12] http://naturalon.com/10-of-the-most-cancer-causing-foods/
[13] Peter Adebayo Idowu, Kehinde Oladipo Williams, Jeremiah
[1] R. Agrawal, T. Imielinski, and A. Swami. Mining association Ademola Balogun and Adeniran Ishola Oluwaranti “Breast
rules between sets of items in large databases. In the Proc. of Cancer Risk Prediction Using Data Mining Classification
the ACM SIGMOD Int’l Conf. on Management of Data Techniques”, Transactions on Networks and
(ACM SIGMOD ‘93), Washington, USA, May 1993. Communications, Volume 3 No 2, April (2015); pp: 1-11.
[2] https://en.wikipedia.org/wiki/Data_mining. [14] Tasnuba Jesmin, Kawsar Ahmed, Md. Zamilur Rahman, Md.
[3] Dr. P. Indra Muthu Meena, Dr. Vani Perumal “Performance Badrul Alam Miah “Brain Cancer Risk Prediction Tool Using
of C4.5 and Naïve Bayes Algorithm to Predict Stomach Data Mining” International Journal of Computer Applications
Cancer - An analysis” International Journal of Advanced (0975 – 8887) Volume 61– No.12, January 2013.
Research in Computer and Communication Engineering ISO [15] Er. Tapas Ranjan Baitharu, Dr.Subhendu Kumar Pani “A
3297:2007 Certified Vol. 5, Issue 11, November 2016. Comparative Study of Data Mining Classification Techniques
[4] B. Muthazhagan, T. Ravi “an early diagnosis of lung cancer using Lung Cancer Data” International Journal of Computer
disease using data mining and medical image processing Trends and Technology (IJCTT) – volume 22 Number 2–
methods: A survey” Middle-East Journal of Scientific April 2015.
Research 24(10): 3263-3267, 2016. [16] Tanupriya Choudhury, Prof.Dr. Vivek Kumar, Dr. Darshika
[5] Neelam Singh and Santosh Kumar Singh Bhadauria “Early Nigam “Intelligent Classification & Clustering Of Lung &
Detection of Cancer Using Data Mining” International Oral Cancer through Decision Tree & Genetic Algorithm”
Journal of Applied Mathematical Sciences ISSN 0973-0176 International Journal of Advanced Research in Computer
Volume 9, Number 1 (2016), pp. 47-52. Science and Software Engineering , Volume 5, Issue 12,
[6] Dr. Vani Perumal, Shibu Samuel, Dr. P. Indra Muthu Meena December 2015 ISSN: 2277 128X.
“Application of Training Dataset using Naïve Bayes [17] Jaimini Majali, Rishikesh Niranjan, Vinamra Phatak, Omkar
Classifier for Prediction of Stomach Cancer in Female Tadakhe “Data Mining Techniques For Diagnosis And
Population” International Journal of Scientific Engineering Prognosis Of Cancer” International Journal of Advanced
and Technology Research, ISSN 2319-8885 Vol.05,Issue.45 Research in Computer and Communication Engineering Vol.
November-2016. 4, Issue 3, March 2015.
[7] Dr. T. Christopher, J. Jamera banu “Study of Classification [18] Ada, Rajneet Kaur “A Study of Detection of Lung Cancer
Algorithm for Lung Cancer Prediction” IJISET - Using Data Mining Classification Techniques” , International
International Journal of Innovative Science, Engineering & Journal of Advanced Research in Computer Science and
Technology, Vol. 3 Issue 2, February 2016. ISSN 2348 – Software Engineering, Volume 3, Issue 3, March 2013 ISSN:
7968. 2277 128X.
[8] Kawsar Ahmed, Tasnuba Jesmin, Md. Zamilur Rahman [19] Ronak Sumbaly, N. Vishnusri, S. Jeyalatha “Diagnosis of
“Early Prevention and Detection of Skin Cancer Risk using Breast Cancer using Decision Tree Data Mining Technique”
Data Mining” International Journal of Computer Applications International Journal of Computer Applications (0975 –
(0975 – 8887) Volume 62– No.4, January 2013. 8887) Volume 98– No.10, July 2014.
[9] V.Krishnaiah “Diagnosis of Lung Cancer Prediction System [20] Kumar Anita “A Study on Cancer Perpetuation Using the
Using Data Mining Classification Techniques” International Classification Algorithms” International Journal of Advance
Journal of Computer Science and Information Technologies, Research in Computer and Communication, 2015.
Vol. 4 (1) 2013, 39 – 45 www.ijcsit.Com ISSN: 0975-9646. [21] Williams, Kehinde, et al. "Breast cancer risk prediction using
[10] Cancer Prevention and Control Retrieved data mining classification techniques." Transactions on
fromhttp://www.cdc.gov/cancer/dcpc/resources / features/ Networks and Communications3.2, 2015.
worldcancerday/ Retrieved on: 15 November 2013.