Professional Documents
Culture Documents
Textbook Information and Decision Sciences Proceedings of The 6Th International Conference On Ficta Suresh Chandra Satapathy Ebook All Chapter PDF
Textbook Information and Decision Sciences Proceedings of The 6Th International Conference On Ficta Suresh Chandra Satapathy Ebook All Chapter PDF
https://textbookfull.com/product/proceedings-of-the-
international-congress-on-information-and-communication-
technology-icict-2015-volume-1-1st-edition-suresh-chandra-
satapathy/
https://textbookfull.com/product/proceedings-of-international-
conference-on-ict-for-sustainable-development-
ict4sd-2015-volume-1-1st-edition-suresh-chandra-satapathy/
https://textbookfull.com/product/proceedings-of-international-
conference-on-ict-for-sustainable-development-
ict4sd-2015-volume-2-1st-edition-suresh-chandra-satapathy/
Proceedings of the Second International Conference on
Computer and Communication Technologies IC3T 2015
Volume 2 1st Edition Suresh Chandra Satapathy
https://textbookfull.com/product/proceedings-of-the-second-
international-conference-on-computer-and-communication-
technologies-ic3t-2015-volume-2-1st-edition-suresh-chandra-
satapathy/
https://textbookfull.com/product/proceedings-of-the-second-
international-conference-on-computer-and-communication-
technologies-ic3t-2015-volume-3-1st-edition-suresh-chandra-
satapathy/
https://textbookfull.com/product/proceedings-of-the-
international-conference-on-data-engineering-and-communication-
technology-icdect-2016-volume-1-1st-edition-suresh-chandra-
satapathy/
https://textbookfull.com/product/smart-computing-and-informatics-
proceedings-of-the-first-international-conference-on-
sci-2016-volume-2-1st-edition-suresh-chandra-satapathy/
Information
and Decision
Sciences
Proceedings of the 6th International
Conference on FICTA
Advances in Intelligent Systems and Computing
Volume 701
Series editor
Janusz Kacprzyk, Polish Academy of Sciences, Warsaw, Poland
e-mail: kacprzyk@ibspan.waw.pl
The series “Advances in Intelligent Systems and Computing” contains publications on theory, applications,
and design methods of Intelligent Systems and Intelligent Computing. Virtually all disciplines such as
engineering, natural sciences, computer and information science, ICT, economics, business, e-commerce,
environment, healthcare, life science are covered. The list of topics spans all the areas of modern intelligent
systems and computing such as: computational intelligence, soft computing including neural networks,
fuzzy systems, evolutionary computing and the fusion of these paradigms, social intelligence, ambient
intelligence, computational neuroscience, artificial life, virtual worlds and society, cognitive science and
systems, Perception and Vision, DNA and immune based systems, self-organizing and adaptive systems,
e-Learning and teaching, human-centered and human-centric computing, recommender systems, intelligent
control, robotics and mechatronics including human-machine teaming, knowledge-based paradigms,
learning paradigms, machine ethics, intelligent data analysis, knowledge management, intelligent agents,
intelligent decision making and support, intelligent network security, trust management, interactive
entertainment, Web intelligence and multimedia.
The publications within “Advances in Intelligent Systems and Computing” are primarily proceedings of
important conferences, symposia and congresses. They cover significant recent developments in the field, both
of a foundational and applicable character. An important characteristic feature of the series is the short
publication time and world-wide distribution. This permits a rapid and broad dissemination of research results.
Advisory Board
Chairman
Nikhil R. Pal, Indian Statistical Institute, Kolkata, India
e-mail: nikhil@isical.ac.in
Members
Rafael Bello Perez, Universidad Central “Marta Abreu” de Las Villas, Santa Clara, Cuba
e-mail: rbellop@uclv.edu.cu
Emilio S. Corchado, University of Salamanca, Salamanca, Spain
e-mail: escorchado@usal.es
Hani Hagras, University of Essex, Colchester, UK
e-mail: hani@essex.ac.uk
László T. Kóczy, Széchenyi István University, Győr, Hungary
e-mail: koczy@sze.hu
Vladik Kreinovich, University of Texas at El Paso, El Paso, USA
e-mail: vladik@utep.edu
Chin-Teng Lin, National Chiao Tung University, Hsinchu, Taiwan
e-mail: ctlin@mail.nctu.edu.tw
Jie Lu, University of Technology, Sydney, Australia
e-mail: Jie.Lu@uts.edu.au
Patricia Melin, Tijuana Institute of Technology, Tijuana, Mexico
e-mail: epmelin@hafsamx.org
Nadia Nedjah, State University of Rio de Janeiro, Rio de Janeiro, Brazil
e-mail: nadia@eng.uerj.br
Ngoc Thanh Nguyen, Wroclaw University of Technology, Wroclaw, Poland
e-mail: Ngoc-Thanh.Nguyen@pwr.edu.pl
Jun Wang, The Chinese University of Hong Kong, Shatin, Hong Kong
e-mail: jwang@mae.cuhk.edu.hk
123
Editors
Suresh Chandra Satapathy Vikrant Bhateja
Department of Computer Science Department of Electronics
and Engineering and Communication Engineering
PVP Siddhartha Institute of Technology SRMGPC
Vijayawada, Andhra Pradesh Lucknow, Uttar Pradesh
India India
This Springer imprint is published by the registered company Springer Nature Singapore Pte Ltd.
part of Springer Nature
The registered company address is: 152 Beach Road, #21-01/04 Gateway East, Singapore 189721,
Singapore
Preface
v
vi Preface
publication under the proceedings. This volume consists of 59 papers from diverse
areas of information and decision sciences.
The conference featured many distinguished keynote addresses by eminent
speakers like Dr. Siba K. Udgata (University of Hyderabad, Telangana, India) on
Intelligent and Soft Sensor for Environment Monitoring; Dr. Goutam Sanyal (NIT
Durgapur, WB, India) on Vision-based Biometric Features; Dr. Kamiya Khatter
(Sr. Editorial Assistant, Springer Nature, India) on Author Services and Tools.
These keynote lectures embraced a huge toll of an audience of students, faculties,
budding researchers, as well as delegates.
We thank the General Chairs: Prof. Samaresh Mishra, Prof. Veena Goswami,
KIIT University, Bhubaneswar, India, and Prof. Suresh Chandra Satapathy,
PVPSIT, Vijayawada, India, for providing valuable guidelines and inspiration to
overcome various difficulties in the process of organizing this conference.
We extend our heartfelt thanks to the Honorary Chairs of this conference:
Dr. B. K. Panigrahi, IIT Delhi, and Dr. Swagatam Das, ISI, Kolkata, for being with
us from the beginning to the end of this conference and without their support this
conference could never have been successful.
We would also like to thank School of Computer Applications and Computer
Engineering, KIIT University, Bhubaneswar, who jointly came forward to support
us to organize sixth edition of this conference series. We are amazed to note the
enthusiasm of all faculty, staff, and students of KIIT to organize the conference in
such a professional way. Involvements of faculty coordinators and student volunteer
are praiseworthy in every respect. We are confident that in the future too we would
like to organize many more international-level conferences in this beautiful campus.
We would also like to thank our sponsors for providing all the support and financial
assistance.
We take this opportunity to thank authors of all submitted papers for their hard
work, adherence to the deadlines, and patience with the review process. The quality
of a refereed volume depends mainly on the expertise and dedication of the
reviewers. We are indebted to the program committee members and external
reviewers who not only produced excellent reviews but also did these in short time
frames. We would also like to thank the participants of this conference, who have
participated in the conference above all hardships. Finally, we would like to thank all
the volunteers who spent tireless efforts in meeting the deadlines and arranging every
detail to make sure that the conference can run smoothly. All the efforts are worth and
would please us all, if the readers of this proceedings and participants of this con-
ference found the papers and conference inspiring and enjoyable. Our sincere thanks
to all press print and electronic media for their excellent coverage of this conference.
We take this opportunity to thank all Keynote Speakers, Track and Special
Session Chairs for their excellent support to make FICTA-2017 a grand success.
Chief Patron
Achyuta Samanta, KISS and KIIT University
Patron
H. Mohanty, KIIT University
Advisory Committee
Sasmita Samanta, KIIT University
Ganga Bishnu Mund, KIIT University
Samaresh Mishra, KIIT University
Honorary Chairs
Swagatam Das, ISI, Kolkata
B. K. Panigrahi, IIT Delhi
General Chairs
Veena Goswami, KIIT University
Suresh Chandra Satapathy, PVPSIT, Vijayawada
Convener
Sachi Nandan Mohanty, KIIT University
Satya Ranjan Dash, KIIT University
Organizing Chairs
Sidharth Swarup Routaray, KIIT University
Manas Mukul, KIIT University
vii
viii Organization
Publication Chair
Vikrant Bhateja, SRMGPC, Lucknow
Steering Committee
Suresh Chandra Satapathy, PVPSIT, Vijayawada
Vikrant Bhateja, SRMGPC, Lucknow
Siba K. Udgata, UoH, Hyderabad
Manas Kumar Sanyal, University of Kalyani
Nilanjan Dey, TICT, Kolkata
B. N. Biswal, BEC, Bhubaneswar
Editorial Board
Suresh Chandra Satapathy, PVPSIT, Vijayawada, India
Vikrant Bhateja, SRMGPC, Lucknow (UP), India
Dr. J. R. Mohanty, KIIT University
Prasant Kumar Pattnaik, KIIT University, Bhubaneswar, India
Joao Manuel R. S. Tavares, Universidade do Porto (FEUP), Porto, Portugal
Carlos Artemio Coello Coello, CINVESTAV-IPN, Mexico City, Mexico
Registration Chairs
Ajaya Jena, KIIT University
Utpal Dey, KIIT University
P. S. Pattanayak, KIIT University
Prachi Viyajeeta, KIIT University
Publicity Chairs
J. K. Mandal, University of Kalyani, Kolkata
Himanshu Das, KIIT University
R. K. Barik, KIIT University
Organization ix
Workshop Chairs
Manoj Mishra, KIIT University
Manas Kumar Rath, KIIT University
Track Chairs
Machine Learning Applications: Steven L. Fernandez, The University of Alabama,
Birmingham, USA
Image Processing and Pattern Recognition: V. N. Manjunath Aradhya, SJCE,
Mysore, India
Signals, Communication, and Microelectronics: A. K. Pandey, MIET, Meerut (UP),
India
Data Engineering: M. Ramakrishna Murty, ANITS, Visakhapatnam, India
xiii
xiv Contents
Joao Manuel R. S. Tavares earned his Ph.D. in Electrical and Computer Engi-
neering in 2001 and his postdoctoral degree in Mechanical Engineering in 2015. He
is a Senior Researcher and Project Coordinator at the Instituto de Ciência e Ino-
vação em Engenharia Mecânica e Engenharia Industrial (INEGI), Portugal, and an
Associate Professor at the Faculdade de Engenharia da Universidade do Porto
(FEUP), Portugal. He is the co-editor of more than 35 books, co-author of more
than 550 articles in international and national journals and conferences, and holder
of 3 international and 2 national patents. He is the co-founder and co-editor of the
book series “Lecture Notes in Computational Vision and Biomechanics,” founder
and editor-in-chief of the journal “Computer Methods in Biomechanics and
Biomedical Engineering: Imaging and Visualization,” and co-founder and co-chair
of the international conference series: CompIMAGE, ECCOMAS VipIMAGE,
ICCEBS, and BioDental.
xix
xx About the Editors
1 Introduction
The web is growing constantly with a huge amount of text mainly through blogs,
twitter, reviews, and social media. Most of this text is written by various authors in
different contexts. The availability of text challenges the researchers and informa-
tion analysts to develop automated tools for information analysis. Authorship
analysis is one such area attracted by several researchers to extract information from
the anonymous text. Authorship analysis is a procedure of finding the authorship of
a text by inspecting its characteristics. Authorship analysis is classified into two
categories such as authorship identification and author profiling [1].
Authorship identification finds the authorship of a document. It is categorized
into two classes namely, authorship attribution and authorship verification.
Authorship attribution predicts the author of a given anonymous document by
analyzing the documents of multiple authors [2]. Authorship verification finds
whether the given document is written by a particular author or not by analyzing the
documents of a single author [3]. Author profiling is a type of text classification
task, which predicts profiling characteristics such as gender, age, occupation, native
language, location, education, and personality traits of the authors by analyzing
their writing styles [4].
Authorship attribution is used in several applications such as forensic analysis,
security, and literary research. In forensic analysis, the suicide notes and property
wills are analyzed whether the note or will is written by a correct person or not by
analyzing the writing styles of suspected authors. The terrorist organizations send
threatening mails; authorship attribution techniques are used to identify which
terrorist organization send the mail or to confirm the mail whether it came from
correct source or not. In literary research, some researchers try to claim the inno-
vations of others without proper acknowledgement. The authorship attribution is
used to identify author of a document by analyzing the writing styles of the various
authors.
The authorship attribution approaches are divided into two categories such as
profile-based approaches and instance-based approaches [1]. In this work, an
instance-based approach is discussed. In the profile-based approach, the available
training texts per author were concatenated to get a single text file. This single big
file is used to extract the different properties of the author’s style. In the
instance-based approaches, the known authorship text sample is considered as an
instance and every text sample as a unit.
This paper is planned as follows. Section 2 explains the related work done by the
researchers in authorship attribution area. The existing model used by several
researchers to represent a document is described in Sect. 3. In Sect. 4, the proposed
model is explained with a term weight measure and document weight measure. The
A New Approach for Authorship Attribution 3
experimental results are analyzed in Sect. 5. Section 6 concludes this work and
suggests the future work in authorship attribution.
2 Existing Work
3 Existing Approach
Figure 1 shows the model of the BOW approach. In this approach, first collect the
corpus. Then, preprocessing techniques are applied to the collected dataset. Extract
Training Corpus
Preprocessing Techniques
Extracting Features
Bag of Words
(F1 ,F2 ,…………………….,F n )
Classification Model
the most frequent terms or features that are important to discriminate the writing
styles of the authors from the modified dataset. Consider these terms or features as
BOW. Every document in the dataset is represented by this BOW. Each value in the
document vector is the weights of the BOW. Finally, the document vectors are used
to generate classification model.
TRAINING CORPUS
<D1,D2,………………..Dm>
PREPROCESSING
TECHNIQUES
UNKNOWN
DOCUMENT ( Du )
PREPROCESSING
TECHNIQUES
AUTHOR OF A
CLASSIFICATION MODEL UNKNOWN
PREDICTS DOCUMENT
term weights in each author group of documents using term weight measures.
Document weights are determined for each author group by aggregating the weights
of the terms in a document using document weight measure. Generate document
vectors with document weights to build a classification model. Finally, the classi-
fication model is used to predict the author of an unknown document.
In this model, {A1,A2, …Aq} is a set of author groups, {D1, D2, …Dm} is a list of
documents in the corpus, and {T1, T2, …Tn} is a list of vocabulary terms. TWAxy is
the weight of the term Ty weight in the author group Ax, and DWApq is the
document Dq weight in the author group Ap. The profiles of an anonymous doc-
ument are predicted using classification model. In this approach, identification of
suitable weight measures for calculating term weight and document weight is
important. Sections. 4.1 and 4.2 discuss about the weight measures used in this
approach.
In text classification, the text document is represented by the vector space model
and predefined classes are assigned to documents through machine learning algo-
rithms. In vector space model, the documents are represented as numerical feature
vector, which consists of the weights of the terms that are extracted from the
document. The basic problem in text classification is identification of appropriate
term weight measure to calculate the weight of the terms that directly affects the
classification accuracy.
In this work, the Discriminative Feature Selection Term Weight (DFSTW)
measure [10] was identified from text categorization domain to select the best
features to categorize the text. This measure is tested on our corpus to predict the
characteristics of the authors. DFS measure assigns more weight to the terms that
are having high average term frequency in class cj and the terms with high
occurrence rate in most of the documents of cj. DFS measure assigns less weight to
the terms that occurred in most of the documents in both cj and cj . The DFSTW
measure is represented in Eq. (1).
where A = {A1, A2, ….Aq} is the set of author groups, {D1, D2,…., Dm}is a
collection of documents in the corpus, and V = {t1,t2, ….,tn} is a collection of
vocabulary terms for analysis. aij is the number of documents of class cj which
contains the term ti, bij is the number of documents of class cj which does not
contain term ti, cij is the number of documents which contains term ti and does not
A New Approach for Authorship Attribution 7
belong to class cj, and dij is the number of documents which does not contain term ti
and does not belong to class cj.
The proposed document weight measure as in Eq. (2) is used to calculate the weight
of a document or corpus of each author group. This measure used the combination
of term weights that are specific to document and specific to author group. The
Term Frequency (TF) is used to compute term weights specific to a document, and
term weight measures are used to determine the term weights specific to author
group. The document weight computation considers the correlation between the
terms in that document.
where Wdkj is the weight of document dk in the author group Aj and Wtij is the
weight of a term ti in the corpus of author group Aj.
5 Experimental Results
In this work, the experimentation is carried out with most frequent terms with BOW
model and proposed ADW model. The proposed model achieved an accuracy of
97.89% for author prediction when Naïve Bayes Multinomial classifier is used.
The BOW approach obtained an accuracy of 82.16% for author prediction when Naïve
Bayes Multinomial classifier is used. In both models, the most frequent 8000 terms
obtained highest accuracy. In BOW approach, the terms are independently partici-
pated in the classification process but in proposed ADW model the terms are col-
laboratively in the form of document weight participated in the classification process.
This is the main reason for obtaining good accuracies in the proposed model.
In our future work, it is planned to consider the domain characteristics and cate-
gorical features while computing a document weight. It is also planned to usage of
semantic and syntactic structure of the language while assigning weights to the
document.
A New Approach for Authorship Attribution 9
References
1 Introduction
Online assessments are gaining increased acceptance in higher education as the pre-
ferred appraisal scheme, gradually replacing the manual paper-and-pen assessment
• You pay a royalty fee of 20% of the gross profits you derive from
the use of Project Gutenberg™ works calculated using the
method you already use to calculate your applicable taxes. The
fee is owed to the owner of the Project Gutenberg™ trademark,
but he has agreed to donate royalties under this paragraph to
the Project Gutenberg Literary Archive Foundation. Royalty
payments must be paid within 60 days following each date on
which you prepare (or are legally required to prepare) your
periodic tax returns. Royalty payments should be clearly marked
as such and sent to the Project Gutenberg Literary Archive
Foundation at the address specified in Section 4, “Information
about donations to the Project Gutenberg Literary Archive
Foundation.”
• You comply with all other terms of this agreement for free
distribution of Project Gutenberg™ works.
1.F.
1.F.4. Except for the limited right of replacement or refund set forth in
paragraph 1.F.3, this work is provided to you ‘AS-IS’, WITH NO
OTHER WARRANTIES OF ANY KIND, EXPRESS OR IMPLIED,
INCLUDING BUT NOT LIMITED TO WARRANTIES OF
MERCHANTABILITY OR FITNESS FOR ANY PURPOSE.
Please check the Project Gutenberg web pages for current donation
methods and addresses. Donations are accepted in a number of
other ways including checks, online payments and credit card
donations. To donate, please visit: www.gutenberg.org/donate.
Most people start at our website which has the main PG search
facility: www.gutenberg.org.