Professional Documents
Culture Documents
1.abstract:: Lung Cancer Desies Identification Using Ann
1.abstract:: Lung Cancer Desies Identification Using Ann
1.Abstract:
2.INTRODUCTION
2.Introduction:
MATLAB :
Matlab is a multi-paradigm numerical computing environment and
proprietary programming language developed by MathWorks. MATLAB allows
matrix manipulations, plotting of functions and data, implementation of
algorithms. Matlab can be written in C, C++, java languages.
ANN(Artifical Neural Network):
ANN MODEL:
Histogram Equivalization:
Histogram Equivalization is a computer image processing technique used to
improve contrast in image. It can be done in 3 steps:
1. Compute the histogram of the image
2. Calculate the normalized sum of histogram
3. Transfer input image to output image
Segmentation:
Segmentation , it will divides image into pixels.
Filtering:
Is a technique of modifying a image.
Dilation :
Dilation is a basic operation in mathematical morphology.First it has been
expanded to gray scale image and to complete lattics.
Entropy:
Entropy is said as corresponding states of intencysity level in which individual
pixels can adopt.
3.OBJECTIVES
3.PROJECT OBJECTIVE
Present LCP research results in the treatment & prevention of Lung Cancer.
4.REQUIREMENT SPECIFICATION
4.1.SYSTEM SPECIFICATION
A System Requirements Specification (SRS) (also known as a Software Requirements
Specification) is a document or set of documentation that describes the features and
behaviour of a system or software application. It includes a variety of elements (see below)
that attempts to define the intended functionality required by the customer to satisfy their
different users.
4.1.1.HARDWARE REQUIREMENTS:
O Processor:Intel®Pentium®CPU4200@1.10Ghz
O RAM : 1GB
O HDD : 50GB and above
4.1.2.SOFTWARE REQUIREMENTS:
Few of the key concepts that are utilized in the project are discussed below.
After incorporate extraction, a classifier is depended upon to pick the correct class of
the characters. Subsequently, there are different classifiers which depend upon
machine learning methods, for example, support vector machines (SVM), artificial
neural network (ANN), significant conviction network (DBN), convolution neural
networks (CNN), secured Markov models (Well), and artificial neural network
(ANN) have been utilized as a bit of the character acknowledgment process.
2. Costfunction:Whenbuildingalinearmodelit’ssaidthatwearetrying to
minimize the error an algorithm does making predictions, and wegot
that by choosing a function to help us measure the error also called cost
function.
It’ssometimesdifficulttoseehowthismathematicalexplanation
translatesintoapracticalsetting,soit’shelpfultolookatanexample. The
canonical example when explaining gradient descent is linear reg
Lines that fit our data better (where better is defined by our error
function) will result in lower error values. If we minimize this function,
we will get the best line for our data. Since our error function consists of
two parameters (m and b) we can visualize it as a two-dimensional
surface. This is what it looks like for our dataset:
We can then update the initial approximation to reach to the minimal and
thereby obtain the values of thecoefficients
4. Logisticregression:
Input data is fed into the first layer, activating each input neuron to some extend.
Based on the weights and an activation function the network determines which
K.L.E’s Smt C.I.MUNAVALLI POLYTECHNIC HUBBALLI Page 16
LUNG CANCER DESIES IDENTIFICATION USING ANN
neurons from the next layer to activate and how strong the activation is going to
be. This so called feedforward process is continued until the output neurons are
activated. The architecture of a neural network has a huge influence on which
data it can work with and its performance.
Back propagation is a sort of oversaw learning for multi-layer nets, generally called
the summed up delta run the show. Screw up data at the yield layer is back induced
to before ones, empowering moving toward weights to these layers to be revived. It
is frequently used as planning calculation in current neural system applications. The
back propagation algorithm An early time of Neural Network was made by Warren
McCulloch and Walter Pitts in 1943 which was a computational model in setting of
Science and calculation. This model made orchestrated research which was pivoted
around the utilization of Neural Networks in Fake care. Made neural framework is
on an astoundingly crucial level a work of wide number of interconnected cells. The
procedure of cells are with the honest to goodness focus on that each cell gets an
information and drives a yield for happening cells. Each cell has a pre-described
The outline underneath is a square chart that depicts the structure and work stream
of a made simulated Neural Network. The neurons are interconnected with each
other in a serial way. The system include different covered layers depending on the
assurance of examination of commitments with the dataset.
Before depicting the convolution neural system, I begin with a delineation of the
fundamental, or support forward, neural system. A more thorough treatment of
this material is given in. Consider a managed learning circumstance where we
are given a course of action of checked data {(x(i), y(i))}. Here, x(i) and y(i)
mean the features and name, independently of the ith planning delineation. At an
irregular state by then, neural systems give a strategy for addressing a brain
boggling, nonlinear limit hW(x) of our data variable x. The farthest point hW(x)
is parameterized by a weights organize W that we can tune to fit our information.
The capacity to consequently extricate area particular highlights from the basic
information gives a feasible contrasting option to the technique for hand-building
that has been the more conventional approach in machine learning and PC
vision. As I show in this proposition, the utilization of these educated highlights
in conjunction with the illustrative energy of a convolution neural network
empowers us to design a high-performing and capable model with in every way
that really matters no hand-tuning. Thusly, this work speaks to a takeoff from a
portion of the more regular techniques that have been connected to the issues of
text detection and recognition.
5.ANALYSIS PHASE
The Boston Scientific SCOPE Team researched bronchoscopy tools and procedures and
worked with physicians to develop areas of opportunity for Boston Scientific’s pulmonary
development team. Boston Scientific asked the team to investigate the problem space of
pulmonary lesion identification and diagnosis and develop a solution space.
Training the system: In this module, we train the system by using the
labeled dataset. We restrict the hand gesture symbols to 5 different
styles. We can assume each style to be a class and thus we have 5
classes.Thenameofeveryclass(thestylename)servesusasthelabelto all the
images corresponding to that class, which serve as data. Later the
features are extracted from every image in each of the classes. We then
train the dataset using a multi-class classifier such as SVM, Naive Bays
oraConvolutionNeuralNetworktoobtainaclassificationmodel.This
classification model is then used fortesting
Testing the system: In this module, we supply the test image to the
trainedmodel.Thetrainedmodelreturnstheprobabilityofthetestimage
belonging to each of the trained classes. We then design a hypothesis to
decide the class of the test image. This determines the gesture that is
present in the image. This activity is done on a real time system.
The main benefit of screening is a lower chance of dying from lung cancer, which accounts for
many deaths in current and former smokers. Still, it’s important to be aware that, as with any
type of screening, not everyone who gets screened will benefit. Screening with LDCT will not
find all lung cancers, and not all of the cancers that are found will be found early.
Even if a cancer is found by screening, you may still die from lung cancer. Also, LDCT often
finds things that turn out not to be cancer, but have to be checked out with more tests to know
what they are. You might need more CT scans, or invasive tests such as a lung biopsy, in which
a piece of lung tissue is removed with a needle or during surgery.
If you are at a higher risk, your doctor can explain your risk and how the ACS lung cancer
screening guideline applies to you. Your doctor can also talk with you about what happens
during screening and the best places to get the yearly screening test. Lung cancer screening is
covered by Medicare and by many private health insurance plans. Your health care team can
help you find out if your insurance will provide coverage.
Screening should only be done at facilities that have the right type of CT scanner and that have
experience in LDCT scans for lung cancer screening. The facility should also have a team of
specialists that can give patients the appropriate care and follow-up if there are abnormal results
on the scans. You might not have the right kind of facility nearby, so you may need to travel
some distance to be screened.
If you are at higher risk and should be screened, you should get a LDCT every year until you
reach the age of 74, as long as you are still in good health.
We use every single image in the dataset for training. We propose to improve
the quality of the images in the dataset using the image processing and
computer vision techniques such as morphological operations, metric
correction etc. We later associate every image with a class label. This
ensures the dataset to contain the images along with the labels. Now we
send this to a learning algorithm and obtain a trained model.
The test image undergoes the same process of quality improvement and then
it is sent to the trained model to obtain the probability of the trained model
belonging to a particular digit. We then use it to decide the class of the
gesture via a designed hypothesis. This gives us the information about the
gesture present in theimage.
DESIGN DOCUMENT
7.PROGRAM CODE
7.1 Coding:
Computer programming (often shortened to programming) is a process that leads from original
formulations of computing problem to executable programs. It involves activities such as
analysis, understanding and generically solving such problems resulting in an algorithm,
verification of requirements of the algorithms including its correctness and its resource
consumption, implementation(commonly referred to as coding) of the algorithm in a target
programming language, testing, debugging, and maintaining the source code, implementation of
the build system and the management of derived artefacts such as machine code of computer
programs. The algorithm is often only represented in human-pars able from reasoned about
using logic.
Computer programmers are those who write computer software. These jobs usually involve :
Coding
K.L.E’s Smt C.I.MUNAVALLI POLYTECHNIC HUBBALLI Page 31
LUNG CANCER DESIES IDENTIFICATION USING ANN
Debugging
Documentation
Integration
Maintenance
Requirements Analysis
Software Architecture
Software Testing.
7.2 Code:
7.2.1 Cancer.m
function varargout = cancer(varargin)
% CANCER M-file for cancer.fig
% CANCER, by itself, creates a new CANCER or raises the existing
% singleton*.
%
% H = CANCER returns the handle to a new CANCER or the handle to
% the existing singleton*.
%
% CANCER('CALLBACK',hObject,eventData,handles,...) calls the local
% function named CALLBACK in CANCER.M with the given input arguments.
%
% CANCER('Property','Value',...) creates a new CANCER or raises the
% existing singleton*. Starting from the left, property value pairs are
% applied to the GUI before cancer_OpeningFcn gets called. An
% unrecognized property name or invalid value makes property application
% stop. All inputs are passed to cancer_OpeningFcn via varargin.
%
% *See GUI Options on GUIDE's Tools menu. Choose "GUI allows only one
% instance to run (singleton)".
%
% See also: GUIDE, GUIDATA, GUIHANDLES
if nargout
[varargout{1:nargout}] = gui_mainfcn(gui_State, varargin{:});
else
gui_mainfcn(gui_State, varargin{:});
end
% End initialization code - DO NOT EDIT
% --- Outputs from this function are returned to the command line.
function varargout = cancer_OutputFcn(hObject, eventdata, handles)
% varargout cell array for returning output args (see VARARGOUT);
% hObject handle to figure
% eventdata reserved - to be defined in a future version of MATLAB
% handles structure with handles and user data (see GUIDATA)
axes(handles.axes2);
imshow(he);
title 'Histogram Equalization'
% --- Executes on button press in pushbutton3.
function pushbutton3_Callback(hObject, eventdata, handles)
% hObject handle to pushbutton3 (see GCBO)
% eventdata reserved - to be defined in a future version of MATLAB
% handles structure with handles and user data (see GUIDATA)
global I
%% Histogram Equalization
t=rgb2gray(I);
he=histeq(t);
%% Segmentation by thresholding
threshold = graythresh(he);
bw = im2bw(he,threshold);
axes(handles.axes3);
imshow(bw)
title 'Segmentation by Thresholding'
hx = hy';
Iy = imfilter(double(bw), hy, 'replicate');
axes(handles.axes4);
imshow(Iy,[]),
title('Filtered Image')
global I
%% Histogram Equalization
t=rgb2gray(I);
he=histeq(t);
%% Segmentation by thresholding
threshold = graythresh(he);
bw = im2bw(he,threshold);
%% Filter
hy = fspecial('sobel');
hx = hy';
Iy = imfilter(double(bw), hy, 'replicate');
se = strel('line',11,90);
bw2 = imdilate(Iy,se);
BW5 = imfill(bw2,'holes');
axes(handles.axes6);
imshow(BW5)
global I
% figure,imshow(I);
% title 'Input Lung Image'
%% Histogram Equalization
t=rgb2gray(I);
he=histeq(t);
% figure,imshow(he);
% title 'Histogram Equalization'
%% Segmentation by thresholding
threshold = graythresh(he);
bw = im2bw(he,threshold);
% figure,imshow(bw)
% title 'Segmentation by Thresholding'
%% Filter
hy = fspecial('sobel');
hx = hy';
Iy = imfilter(double(bw), hy, 'replicate');
se = strel('line',11,90);
bw2 = imdilate(Iy,se);
BW5 = imfill(bw2,'holes');
%% Features Extraction
%% Contrast
disp ('Contrast Value');
cmap = contrast(BW5)
%% Entropy
disp ('Entropy Value');
E = entropy(BW5)
set(handles.edit2,'string',num2str(E));
%% GLCM Values
glcm = graycomatrix(BW5,'Offset',[2 0])
%% mean
c1=mean(C);
%% variance
c2=var(double(C));
%% contrast
d=C;
c9=[c6 c7 c8];
%% network
net = network
net.numInputs = 6
net.numLayers = 1
P = size(double(c1));
Cidx = strcmp('Cancer',c9);
T = size(double(c2));
net = newff(P,T,25);
[net,tr] = train(net,P,T);
testInputs = P(:,tr.testInd);
testTargets = T(:,tr.testInd);
out = round(sim(net,testInputs));
diff = [testTargets - 2*out];
detections = length(find(diff==-1))
false_positives = length(find(diff==1))
true_positives = length(find(diff==0))
false_alarms = length(find(diff==-2))
Nt = size(testInputs,2);
fprintf('Total testing samples: %d\n', Nt);
cm = [detections false_positives; false_alarms true_positives]
cm_p = (cm ./ Nt) .* 100 ;
%%
sim_out = round(sim(net,testInputs));
if ((max(c24.Contrast))>2)
set(handles.edit1,'string','Lung Cancer Affected Image');
else
set(handles.edit1,'string','Normal Image');
end
set(hObject,'BackgroundColor','white');
end
8.TESTING
8.1 TESTING
The goal of utilizing numerous testing methodologies in the development process is
to make sure the software can successfully operate in multiple environments and across
different platforms. These can typically be broken down between functional and non-
functional testing. Functional testing involves testing the application against the
business requirements. It incorporates all test types designed to guarantee each part of a
piece of software behaves as expected by using uses cases provided by the design team
or business analyst. These testing methods are usually conducted in order and include:
Unit testing
Integration testing
System testing
Acceptance testing
Non-functional testing methods incorporate all test types focused on the operational
aspects of a piece of software. These include:
Performance testing
Security testing
Usability testing
Compatibility testing
Unit testing is the first level of testing and is often performed by the developers
themselves. It is the process of ensuring individual components of a piece of software
at the code level are functional and work as they were designed to. Developers in a
test-driven environment will typically write and run the tests prior to the software or
feature being passed over to the test team. Unit testing can be conducted manually, but
automating the process will speed up delivery cycles and expand test coverage. Unit
testing will also make debugging easier because finding issues earlier means they take
less time to fix than if they were discovered later in the testing process. TestLeft is a
tool that allows advanced testers and developers to shift left with the fastest test
automation tool embedded in any IDE.
After each unit is thoroughly tested, it is integrated with other units to create modules
or components that are designed to perform specific tasks or activities. These are then
tested as group through integration testing to ensure whole segments of an application
behave as expected (i.e, the interactions between units are seamless). These tests are
often framed by user scenarios, such as logging into an application or opening files.
Integrated tests can be conducted by either developers or independent testers and are
usually comprised of a combination of automated functional and manual tests.
separate testing team than the development team before the product is pushed into
production.
amounts of time. Ideally, this will help you understand what will happen when the load
is suddenly and drastically increased.
1. Integrity
2. Confidentiality
3. Authentication
4. Authorization
5. Availability
6. Non-repudiation
8.1.8 CompatibilityTesting
Compatibility testing is used to gauge how an application or piece of software will
work in different environments. It is used to check that your product is compatible with
multiple operating systems, platforms, browsers, or resolution configurations. The goal
is to ensure that your software’s functionality is consistently supported across any
environment you expect your end users to be using.
TestComplete is our robust automated GUI testing tool that excels in compatibility
and integration testing. It helps QA teams create and run tests across desktop, mobile,
and web applications – enabling testing professionals to speed up delivery cycles and
improve software quality. Testcomplete comes with built-in support for various test
environments, integrations to performance testing tools, as well as support for
developer friendly SCMs, allowing you to seamlessness integrate it into your
development process. Using TestComplete will enable you to build a robust testing
framework that utilizes the broad spectrum of available software testing
methodologies.
INPUT :
10.1Future Scope:
This research mainly deals with Computer Aided Diagnosis System (CAD) for early detection
of lung cancer nodules from the Chest Computer Tomography (CT) image using FPCM,
MFPCM and SVM and MFPCM and ELM approaches. These techniques are very much useful
for the detection of 135 lung cancer. The proposed CAD system with the ELM approach
provides the best result among the three proposed techniques. In order to improve the
performance of the proposed approaches, some future enhancements would be necessary in the
present research work. The main aim of the future enhancements would be to increase the
sensitivity and specificity of the system. The future option for this research is to use Enhanced
Extreme Learning Machine in the CAD system. The use of Enhanced ELM (Guang-Bin Huang
et al 2008) would provide even better results. Moreover the training time and the accuracy of
the system would also be better by using Enhanced ELM. The second future enhancement of
this research would be to use the new clustering algorithm which can provide better results than
the proposed clustering algorithms. The other future enhancements would be to incorporate
latest technologies like nanotechnology (Medley et al 2008), genetic algorithm (Serhat 2007)
etc., into the CAD system for the better performance of the system.
10.2 DISADVANTAGES:
The patient must be completely immobile
Dental amalgam, metallic structure,and prostheses greatly degrade the images
10.3 Conclusion:
Lung cancer is the leading cause of death over the world and the only chance of
cure for patients affected from this kind of cancer is surgical resection.
This is mainly due to the fact that several factors are involved in lung cancer
develoment and progression and to date the diagnostic methods available for an
early and efficient detection are not sufficient.
Although lung cancer research data have accumulated dramatically during the
past several years, there is no database specifically focusing on lung cancer
molecular biology available yet.
In this short review we have summarized some of the factors contributing to lung
cancer, but it is just an overview of the most important proteins involved in lung
cancer disease, which are often mutated or present an unusual pattern of
expression compared to the healthy tissue.
All together these data are important for understanding the nature behind this type
of cancer and also to understand that basic research on proteins, miRNAs and all
the other epigenetic modifications could be used to develop more powerful
diagnostic tools, as well as prognostic or predictive markers or even for the
development of new targeting therapies for lung cancer.
References:
1. Jemal A, Siegel R, Ward E, Murray T, Xu J, Thun MJ. Cancer statistics, 2007. CA Cancer J.
Clin. 2007;57(1):43–66. [PubMed] [Google Scholar]
2. Travis WD, Travis LB, Devesa SS. Lung cancer. Cancer. 1995;75(1 Suppl):191–202. [PubMed] [Google
Scholar]
3. Schreiber G, McCrory DC. Performance characteristics of different modalities for diagnosis of suspected
lung cancer: summary of published evidence. Chest. 2003;123(1 Suppl):115S–128S. [PubMed] [Google
Scholar]
4. Henschke C I, Yankelevitz D F. CT screening for lung cancer: update 2007. Oncologist. 2008;13(1):65–
78. [PubMed] [Google Scholar]
5. Spira A, Beane J, Shah V, Liu G, Schembri F, Yang X, Palma J, Brody JS. Effects of cigarette smoke on
the human airway epithelial cell transcriptome. Proc. Natl. Acad. Sci. USA. 2004;101(27):10143–
10148. [PMC free article] [PubMed] [Google Scholar]
6. Sandler A, Gray R, Perry MC, Brahmer J, Schiller JH, Dowlati A, Lilenbaum R, Johnson DH. Paclitaxel-
carboplatin alone or with bevacizumab for non-small-cell lung cancer. N. Engl. J. Med. 2006;355(24):2542–
2550. [PubMed] [Google Scholar]
7. Sone S, Li F, Yang ZG, Honda T, Maruyama Y, Takashima S, Hasegawa M, Kawakami S, Kubo K,
Haniuda M, Yamanda T. Results of three-year mass screening programme for lung cancer using mobile low-
dose spiral computed tomography scanner. Br. J. Cancer. 2001;84(1):25–32. [PMC free
article] [PubMed] [Google Scholar]
8. Fossella FV, DeVore R, Kerr RN, Crawford J, Natale RR, Dunphy F, Kalman L, Miller V, Lee JS, Moore
M, Gandara D, Karp D, Vokes E, Kris M, Kim Y, Gamza F, Hammershaimb L. Randomized phase III trial
of docetaxel versus vinorelbine or ifosfamide in patients with advanced non-small-cell lung cancer
previously treated with platinum-containing chemotherapy regimens. The TAX 320 Non-Small Cell Lung
Cancer Study Group. J. Clin. Oncol. 2000;18(12):2354–2362. [PubMed] [Google Scholar]
9. Wang L, Xiong Y, Sun Y, Fang Z, Li L, Ji H, Shi T. HLungDB: an integrated database of human lung
cancer research. Nucleic Acids Res. 2010;38(Database issue):D665–9. [PMC free
article] [PubMed] [Google Scholar]
10. Fong KM, Sekido Y, Gazdar AF, Minna JD. Lung cancer. 9: Molecular biology of lung cancer: clinical
implications. Thorax. 2003;58(10):892–900. [PMC free article] [PubMed] [Google Scholar]
11. Herbst RS, Heymach JV, Lippman SM. Lung cancer. N. Engl. J. Med. 2008;359(13):1367–
1380. [PubMed] [Google Scholar]
12. Aviel-Ronen S, Blackhall FH, Shepherd FA, Tsao MS. Kras mutations in non-small-cell lung
carcinoma: a review. Clin. Lung Cancer. 2006;8(1):30–38. [PubMed] [Google Scholar]
13. Reissmann PT, Koga H, Figlin RA, Holmes EC, Slamon DJ. Amplification and overexpression of the
cyclin D1 and epidermal growth factor receptor genes in non-small-cell lung cancer. Lung Cancer Study
Group. J. Cancer Res. Clin. Oncol. 1999;125(2):61–70. [PubMed] [Google Scholar]
14. Salgia R, Skarin AT. Molecular abnormalities in lung cancer. J. Clin. Oncol. 1998;16(3):1207–
1217. [PubMed] [Google Scholar]
15. Shigematsu H, Takahashi T, Nomura M, Majmudar K, Suzuki M, Lee H, Wistuba II, Fong KM,
Toyooka S, Shimizu N, Fujisawa T, Minna JD, Gazdar AF. Somatic mutations of the HER2 kinase domain
in lung adenocarcinomas. Cancer Res. 2005;65(5):1642–1646. [PubMed] [Google Scholar]
16. Buttitta F, Barassi F, Fresu G, Felicioni L, Chella A, Paolizzi D, Lattanzio G, Salvatore S, Camplese PP,
Rosini S, Iarussi T, Mucilli F, Sacco R, Mezzetti A, Marchetti A. Mutational analysis of the HER2 gene in
lung tumors from Caucasian patients: mutations are mainly present in adenocarcinomas with
bronchioloalveolar features. Int. J. Cancer. 2006;119:2586–2591. [PubMed] [Google Scholar]
17. Naoki K, Chen TH, Richards WG, Sugarbaker DJ, Meyerson M. Missense mutations of the BRAF gene
in human lung adenocarcinoma. Cancer Res. 2002;62(23):7001–7003. [PubMed] [Google Scholar]
18. Brose MS, Volpe P, Feldman M, Kumar M, Rishi I, Gerrero R, Einhorn E, Herlyn M, Minna J,
Nicholson A, Roth JA, Albelda SM, Davies H, Cox C, Brignell G, Stephens P, Futreal PA, Wooster R,
Stratton MR, Weber BL. BRAF and RAS mutations in human lung cancer and melanoma. Cancer
Res. 2002;62(23):6997–7000. [PubMed] [Google Scholar]
19. Lynch TJ, Bell DW, Sordella R, Gurubhagavatula S, Okimoto RA, Brannigan BW, Harris PL, Haserlat
SM, Supko JG, Haluska FG, Louis DN, Christiani DC, Settleman J, Haber DA. Activating mutations in the
epidermal growth factor receptor underlying responsiveness of non-small-cell lung cancer to gefitinib. N.
Engl. J. Med. 2004;350:2129–2139. [PubMed] [Google Scholar]
20. Paez JG, Jänne PA, Lee JC, Tracy S, Greulich H, Gabriel S, Herman P, Kaye FJ, Lindeman N, Boggon
TJ, Naoki K, Sasaki H, Fujii Y, Eck MJ, Sellers WR, Johnson BE, Meyerson M. EGFRmutations in lung
cancer: correlation with clinical response to gefitinib therapy. Science. 2004;304:1497–
1500. [PubMed] [Google Scholar]