Download as pdf or txt
Download as pdf or txt
You are on page 1of 5

See

discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/291495907

An Effective Two Way Classification of Breast


Cancer Images

Article in International Journal of Applied Engineering Research · December 2015

CITATIONS READS

0 41

1 author:

Balakrishnan S
Sri Krishna College of Engineering and Technology
32 PUBLICATIONS 9 CITATIONS

SEE PROFILE

Some of the authors of this publication are also working on these related projects:

Conference Papers View project

NOT based project View project

All content following this page was uploaded by Balakrishnan S on 23 January 2016.

The user has requested enhancement of the downloaded file.


International Journal of Applied Engineering Research ISSN 0973-4562 Volume 10, Number 21 (2015) pp 42472-42475
© Research India Publications. http://www.ripublication.com

An Effective Two Way Classification of Breast Cancer Images


P. Palanikumar

Assistant Professor, Rajiv Gandhi College of Engg, Chennai, palani. lect@gmail. com.

S. Geofrin Shirly

Assistant Professor, Department of Computer Science and Engineering, Vels University, Chennai geofrin55@yahoo. co. in

S.Balakrishnan

Associate Professor, Department of Computer Science and Engineering,


Sri Venkateswara College of Engg & Tech, Chittoor, AP.

Abstract recent years, the incidence rate of breast cancer has


Breast cancer is a highly heterogeneous disease and very considerably increased [1]. Simultaneously, breast cancer
common among western women. Mammogram is an survival rate has also improved over the past few years with
examination of a woman’s breasts using X-rays to check for the development of more effective diagnostic techniques and
cancer. Mammography is one of the first diagnostic tests to improvements in treatment methodologies.
prescreen breast cancer. Early detection of breast cancer has BREAST cancer screening has primarily taken an anatomic
been known to improve recovery rates to a great extent. In approach through X-ray mammography, which requires the
most medical centers, experienced radiologists are given the breast tumor to have developed to a stage where it is
responsibility of analyzing mammograms. But, there is always significantly denser than healthy tissue. As a consequence,
a possibility of human error. Errors can frequently occur as a mammography misses 5%-15% of nonpalpable breast lesions
result of fatigue of the observer, resulting in interobserver and that are not sufficiently denser than healthy tissue [2], [3]. In
intraobserver variations. The sensitivity of mammographic addition, increased density is not always tied to the presence
screening also varies with image quality. To offset different of cancer: dense lesions of tissue that are further investigated
kinds of variability and to standardize diagnostic procedures, via biopsy are often found to be benign [2]. Instead of relying
efforts are being made to develop automated techniques for on density changes, cancer can also be detected by using early
diagnosis and grading of breast cancer images. This paper molecular signatures.
presents a two way classification algorithm for the The American Cancer Society had predicted that about 230,
classification of breast cancer images into benign (tumour 480 new cases of invasive breast cancer and about 57, 650
growing, but not dangerous) and malignant (cannot be new cases of noninvasive breast cancer would be diagnosed in
controlled, it causes death) classes. Because of the sparse the United States in 2011 and around 39, 520 women would
distribution of abnormal mammograms, the two-way die from breast cancer [11]. The most popular diagnostic
classification data mining algorithms are used. First technique called mammography uses low-dose X-rays, high-
classification algorithm is k-means algorithm which is used to contrast and high-resolution detectors, and an X-ray system
partition a given dataset into a user specified number of designed specifically to image the breasts. Mammography has
clusters. Second classification algorithm is Support Vector found its application in both screening and diagnosis of breast
Machine (SVM) is used to find the best classification function cancer. There are two types of mammography systems: screen
to distinguish between members of the two classes in the film mammography (SFM), wherein the end recording device
training data. is a film screen; and full-field digital mammography (FFDM),
which uses digital detectors as the recording media. The
Keywords: Mammogram, Breast cancer, k-means, Support digital images provided by FFDM offer many advantages over
Vector Machine. its film counterpart in terms of ease of image processing and
enhancement [4]-[7].
Digital Mammogram is one of the important methods to
Introduction identify the Breast Cancer at an early stage at some extend.
CANCER refers to the uncontrolled multiplication of a group The advantages of digital mammography include the lack of
of cells in a particular location of the body. Cancer is a serious ionizing radiation, its non-invasiveness, the relatively compact
disease in which growths of cells, also called cancers, form in instrumentation, and its cost-effectiveness. While
the body and kill normal body cells. A group of rapidly mammography has been proven to be the most effective and
dividing cells may form a lump or mass of extra tissue. These reliable method for the early detection of breast cancer, as
masses are usually referred to as tumors. Cancer cells are indicated by Siddiqui et al. [29], the large number of
termed malignant tumors. Breast cancer is any form of mammograms, generated by population screening, must be
malignant tumor that develops from breast cells. Important interpreted and diagnosed by a relatively small number of
signs to look for in the case of breast cancer are clusters of radiologists. In addition, when observing a mammographic
microcalcifications, masses, and architectural distortions. In image, abnormalities are often embedded in and camouflaged

42472
International Journal of Applied Engineering Research ISSN 0973-4562 Volume 10, Number 21 (2015) pp 42472-42475
© Research India Publications. http://www.ripublication.com

by varying densities of breast tissue structures, resulting in Then the algorithm iterates between two steps till
high rates of missed breast cancer cases as mentioned by convergence:
Wroblewska et al. [30]. Step 1: Data Assignment
Each data point is assigned to its closest centroid, with ties
broken arbitrarily. This results in a partitioning of the data.
Related Work Step 2: Relocation of “means”
Abou-Chadi et al. [23] used a neural network approach for Each cluster representative is relocated to the center (mean) of
detecting candidate circumscribed lesions in digitized all data points assigned to it. If the data points come with a
mammograms. The neural network trained using back probability measure (weights), then the relocation is to the
propagation algorithms. The procedure depends mainly on the expectations (weighted mean) of the data partitions.
major difference between the histogram of the normal tissue The default measure of closeness is the Euclidean distance, in
and that of the cancerous tissue. which case one can readily show that the non-negative cost
Brake et al. [22] studied single and multiscale detection of function,
masses in digital mammograms. Scale is an important issue in
the automated detection of masses in mammograms, due to
the range of possible sizes masses can have. In this work, it
was examined if detection of masses can be done at a single
scale, or whether it is more appropriate to use the output of the
detection method at different scales in a multiscale scheme.
Chan et al. [24] investigated a computer-based method for the B. Support Vector Machines
detection of microcalcification in digital mammograms. The In today’s machine learning applications, support vector
method is based on a difference image technique in which a machines (SVM) [10] are considered a must try—it offers one
signal suppressed image is subtracted from a signal enhanced of the most robust and accurate methods among all well-
image to remove structured background in the mammogram. known algorithms. It has a sound theoretical foundation,
Global and local thresholding techniques are then used to requires only a dozen examples for training, and is insensitive
extract potential microcalcification signals. to the number of dimensions. In addition, efficient methods
Karssemeijer [25] developed a statistical method for detection for training SVM are also being developed at a fast pace.
of microcalcifications in digital mammograms. The method is Support Vector Machines uses data from two classes to
based on the use of statistical models and the general determine a maximum margin hyperplane between the two
framework of Bayesian image analysis. classes. The hyperplane is determined in such a way that the
Nakayama et al. [26] used a filter bank for the detection of distance from this hyperplane to the nearest data points on
nodular and linear patterns. The filter bank is designed so that each side, called support vectors, is maximal. Support vector
the subimages generated the elements of a Hessian matrix at machine (SVM) classifiers can be extended to nonlinearly
each resolution level. By calculating the small and large separable data with the help of kernel function application on
eigenvalues, a new filter bank has the following three the data to make them linearly separable [8]. In this paper, we
properties. (a) Nodular patterns of various sizes can be used the linear kernel, polynomial kernel of orders 1, 2, and 3,
enhanced. (b) Both nodular and linear patterns of various sizes and the radial basis function kernel. An approach with wavelet
can be enhanced. (c) The original image can be reconstructed SVM was discussed in [9], which uses similar kernel
with these patterns removed. The filter bank is applied to techniques.
enhance microcalcifications in mammograms.
Yu et al. [27] proposed a CAD system for the automatic
detection of clustered microcalcifications through two steps. Proposed System Architecture
The first one is to segment potential microcalcification pixels Mammogram result is taken as an input and given to
by using wavelet and gray level statistical features and to preprocessing phase for filtering the data. Pre-processing is an
connect them into potential individual microcalcification important issue in low-level image processing. Using filtering
objects. The second step is to check these potential objects by it is possible to filter out the noise present in image. A high
using 31 statistical features. Neural network classifiers were pass filter passes the frequent changes in the gray level and a
used. Results are satisfactory but not highly guaranteed low pass filter reduces the frequent changes in the gray level
because the training set was used in the testing set. of an image. That is; the low pass filter smoothes and often
removes the sharp edges. A special type of low pass filter is
the Median filter. The Median filter takes an area of image (3
Algorithms Used x 3, 5 x 5, 7 x 7, etc), observes all pixel values in that area and
A. K-means algorithm puts it into the array called element array. Then, the element
The algorithm operates on a set of d-dimensional vectors, D = array is sorted and the median value of the element array is
{xi | i = 1,..., N }, where xi denotes the ith data point. The found out. We have achieved this by sorting the element array
algorithm is initialized by picking k points the initial k cluster in the ascending order using bubble sort and returning the
representatives or “centroids”. Techniques for selecting these middle elements of the sorted array as the median value. The
initial seeds include sampling at random from the dataset, output image array is the set of all the median values of the
setting them as the solution of clustering a small subset of the element arrays obtained for all the pixels [28]. Median filter
data or perturbing the global mean of the data k times. goes into a series of loops which cover the entire image array.

42473
International Journal of Applied Engineering Research ISSN 0973-4562 Volume 10, Number 21 (2015) pp 42472-42475
© Research India Publications. http://www.ripublication.com

After end of preprocessing phase, the processed data is fed References


into first classification algorithm (i. e. k-means algorithm).
With the help of k-means algorithm, processed data can be [1] K. Polat and S. Genes, “Breast cancer diagnosis
converted into specified clustered data. Then clustered data is using least square support vector machine, ” Digit.
given as input in SVM algorithm and produces best classified Signal Process., vol. 17, no. 4, pp. 694-701, Jul.
data. 2007.
[2] R. Manoharan et al., “Raman spectroscopy and
fluorescence photon migration for breast cancer
diagnosis and imaging, ” Photochem. Photobio., vol.
67, no. 1, pp. 15-22, Jan. 1998.
[3] R. T. Osteen, J. L. Connolly, M. E. Costanza, J. R.
Harris, and D. F. Hayes, “Cancer of the breast, ” in
Cancer Manual, 9th ed. New York: Am. Cancer Soc.,
1996, pp. 320-339.
[4] E. D. Pisano, C. Gatsonis, R. E. Hendrick, M. J.
Yaffe, J. K. Baum, S. Acharyya, and J. B. Cormack,
“Diagnostic performance of digital versus film
mammography for breast-cancer screening, ” New
England J. Med., vol. 353, no. 17, pp. 1773-1783,
Oct. 2005.
[5] E. D. Pisano, R. E. Hendrick, M. J. Yaffe, J. K.
Baum, S. Acharyya, and J. B. Cormack, “Diagnostic
accuracy of digital versus film mammography:
Exploratory analysis of selected population
subgroups in DMIST, ” Radiology, vol. 246, no. 2,
pp. 376-383, Feb. 2008.
[6] A. Mencattini, M. Salmeri, R. Lojacono, M. Frigerio,
and F. Caselli, “Mammographic images enhancement
and denoising for breast cancer detection using
dyadic wavelet processing, ” IEEE Trans. Instrum.
Meas., vol. 57, no. 7, pp. 1422-1430, Jul. 2008.
[7] R. Schulz-Wendtland, M. Fuchsjigerb, T. Wackerc,
and K. P. Hermannd, “Digital mammography: An
update, ” Eur. J. Radiol., vol. 72, no. 2, pp. 258-265,
Nov. 2009.
[8] K. R. Muller, S. Mika, G. Ratsch, K. Tsuda, and B.
Scholkopf, “An introduction to kernel based learning
algorithms, ” IEEE Trans. Neural Netw., vol. 12, no.
2, pp. 181-201, Mar. 2001.
Fig. 1. System architecture [9] M. Shen, L. Lin, J. Chen, and C. Q. Chang, “A
prediction approach for multichannel EEG signals
modeling using local wavelet SVM, ” IEEE Trans.
Conclusion Instrum. Meas., vol. 59, no. 5, pp. 1485-1492, May
A novel framework for two-way classification methodology in 2010.
mammographic image analysis has been proposed. From a [10] Vapnik V (1995) The nature of statistical learning
study of the available literature, we find that the application of theory. Springer, New York.
two way classification to the problem of mammographic [11] D. Tax and R. Duin, “Uniform object generation for
image analysis is rare. We strongly believe that the optimizing oneclassclassiers, ” J. Mach. Learn. Res.,
performance of the proposed system can be further enhanced vol. 2, pp. 155-173, Mar. 2001.
by framing new functionals that are more adaptable to [12] U. R. Acharya, E. Y. K. Ng, Y. Hong, Y. Jie, and G.
mammograms. J. L. Kaw, “Automatic identification of breast cancer
This article presents a very general overview of two way using mammogram, ” J. Med. Syst., vol. 32, no. 6,
classification architecture. It demonstrates how an abstract pp. 499-507, 2008.
structure allows us to discover effective classification of [13] T. C. S. S. Andre and R. M. Rangayyan,
breast cancer images. “Classification of tumors and masses in
This algorithm has been implemented in future for further mammograms using neural networks with shape and
development because of its simplicity and it also encourages texture features, ” in Proc. 25th Ann. Int. Conf. IEEE
results that will motivate real-time breast cancer diagnosis EMBS, vol. 3. Sep. 2003, pp. 2261-2264.
system. [14] F. Dehghan, H. Abrishami-Moghaddam, and M. Giti,
“Automatic detection of clustered microcalcifications

42474
International Journal of Applied Engineering Research ISSN 0973-4562 Volume 10, Number 21 (2015) pp 42472-42475
© Research India Publications. http://www.ripublication.com

in digital mammograms: Study on applying adaboost Nodular Patterns and Linear Patterns in Medical
with SVM-based component classifiers, ” in Proc. Images” Systems and Computers in Japan, Vol. 36,
30th Annu. Int. Conf. IEEE EMBS, Aug. 2008, pp. No. 13, 2005.
4789-4792. [27] Songyang Yu and Ling Guan, "A CAD system for
[15] R. M. Rangayyan, N. M. El-Faramawy, J. E. L. the automatic detection of clustered
Desautels, and O. A. Alim, “Measures of acutance microcalcifications in digitized mammogram 3films,
and shape for classification of breast tumours, ” " IEEE Trans. Med. Imag., vol. 19, pp. 115-126,
IEEE Trans. Med. Imag., vol. 16, no. 6, pp. 799-810, February 2000.
Dec. 1997. [28] R. C. Gonzalez, R. E. Woods, “Digital Image
[16] C. E. Priebe, R. A. Lorey, D. J. Marchette, J. L. processing”, Pretice Hall. 2007.
Solka, and D. W. Rogers, “Nonparametric spatio- [29] M. Siddiqui, M. Anand, P. Mehrotra, R. Sarangi, N.
temporal change point analysis for early detection in Mathur, Biomonitoring of organochlorines in women
mammography, ” in Proc. 2nd Int. Workshop Digit. with benign and malignant breast disease,
Mammography, Jul. 1994, pp. 111-120. Environmental Research 98 (2) (2005) 250-257.
[17] L. Wei, Y. Yang, R. M. Nishikawa, and Y. Jiang, “A [30] A. Wroblewska, P. Boninski, A. Przelaskowski, M.
study on several machine-learning methods for Kazubek, Segmentation and feature extraction for
classification of malignant and benign clustered reliable classification of microcalcifications in digital
microcalcifications, ” IEEE Trans. Med. Imag., vol. mammograms, Opto-Electronics Review 11 (3)
24, no. 3, pp. 371-380, Mar. 2005. (2003) 227-235.
[18] B. Verma and J. Zakos, “A computer-aided diagnosis
system for digital mammograms based on fuzzy-
neural and feature extraction techniques, ” IEEE
Trans. Inf. Technol. Biomed., vol. 5, no. 1, pp. 46-
54, Mar. 2001.
[19] S. K. Kinoshita, P. M. A. Marques, A. F. F. Slates,
H. R. C. Marana, R. J. Ferrari, and R. L. Villela,
“Detection and characterization of mammographic
masses by artificial neural network, ” in Proc. 4th Int.
Workshop Digit. Mammography, Jun. 1998, pp. 489-
490.
[20] Y. Chitre, A. P. Dhawan, and M. Moskowitz,
“Artificial neural network based classification of
mammographic microcalcifications using image
structure features, ” Int. J. Pattern Recognit. Artif.
Intell., vol. 7, no. 12, pp. 1377-1402, 1993.
[21] K. Ganesan, R. Acharya, K. C. Chua, L. C. Min, and
K. T. Abraham, “Decision support system for breast
cancer detection using mammograms, ” Proc. Inst.
Mech. Eng., Part H, J. Eng. Med., vol. 227, no. 7, pp.
721-732, Jul. 2013.
[22] Guido M. Brake and Nico Karssemeijer “Single and
Multiscale Detection of Masses in Digital
Mammograms” IEEE transactions on medical
imaging, vol. 18, no. 7, July 1999.
[23] NohaYoussry, Fatma E. Z. Abou-Chadi, Alaa M. El-
Sayad, "A neural network approach for mass
detection in digitized mammograms, " ACBME,
2002.
[24] H. P. Chan, K. Doi, C. J. Vyborny, K. L. Lam, and
R. A. Schmidt, “Computer-aided detection of
microcalcifications in mammograms methodology
and preliminary clinical study, ” Investigative
Radiol., vol. 23, pp. 664-671, 1988.
[25] N. Karssemeijer, “Recognition of clustered
microcalcifications using a random field mode,
biomedical image processing and biomedical
visualization, ” Proc. SPIE, vol. 1905, pp. 776-786,
1993.
[26] Ryohei Nakayama and Yoshikazu Uchiyama
“Development of New Filter Bank for Detection of

42475

View publication stats

You might also like