
See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/358261374

Survey on the Design and Development of Indian Language Chatbots

Article · February 2022


DOI: 10.1109/ICCISc52257.2021.9484891


5 authors, including:

Dhanya L K
Cochin University of Science and Technology

All content following this page was uploaded by Dhanya L K on 01 February 2022.



Survey on the Design and Development of
Indian Language Chatbots
2021 International Conference on Communication, Control and Information Sciences (ICCISc) | 978-1-6654-0295-8/21/$31.00 ©2021 IEEE | DOI: 10.1109/ICCISc52257.2021.9484891

Athira Susan George¹, Gowri Muralikrishnan², Leya Rachel Ninan³, Pranav S. Varrier⁴, Dhanya L. K.⁵
¹⁻⁵ Dept. of Computer Science and Engineering, Mar Baselios College of Engineering and Technology, Kerala, India
Email: athirageorge288@gmail.com¹, gowrimuralikrishnan@gmail.com², leyarachelninan@gmail.com³, pranavsvarrier@gmail.com⁴, dhanya.lk@mbcet.ac.in⁵

Abstract—A chatbot is an intelligent system which can hold a conversation in real time with a human being using natural language. This paper describes a multitude of chatbots that communicate in Indian languages. Extensive research has been done on chatbots in English, but there is little study of chatbots in Indian languages. This survey was conducted as a part of our project on an Intelligent Chatbot using Deep Reinforcement Learning. The paper is based on the firm belief that chatbots in Indian languages can be used to bridge the communication gap amongst Indian users. For a chatbot to work resourcefully, it needs to be provided with vast logical resources. The peculiarities of natural language require special care, especially in the processing of synonyms, word forms, context, etc. This survey paper studies in depth the methodologies and behaviours of various Indian chatbots, and a comparison is drawn based on their accuracy.

Index Terms—Artificial Intelligence, Chatbot, NLP, NLTK, Virtual Assistant

I. INTRODUCTION

India is on the verge of phenomenal progress, with digital interventions transforming even conventional government services. Multilingual, knowledge-rich, intelligent chatbots are a decisive next step in this direction. Natural Language Processing (NLP) can broaden internet access to a wider part of India's population. High-performance GPU machines, the widespread availability of high-speed internet, and the increased use of mobile devices have aided the development of NLP technology.

English speakers constitute only ten percent of Indians; the remaining ninety percent speak languages such as Hindi, Marathi, Gujarati, Bengali, Kannada, Telugu, and Tamil, to name just a few of the major languages spoken in India.

Fig. 1. Source: Indian Languages – Defining India's Internet, a report by KPMG in India and Google, 2017

NLP, a segment of AI, is critical in comprehending and handling human language. Knowing words, phrases, definitions, syntactic structures, and how to link those concepts in a comprehensible manner are all part of understanding a language. This requires in-depth data about the languages and, therefore, the ability to interpret them. Part-of-speech tagging, lemmatization, phrase extraction, text categorization, entity extraction, and topic extraction are some of the valuable NLP functionalities.

Here, a survey is presented based on twelve Indian language chatbots. Comparisons are carried out, and conclusions are drawn.

II. SURVEY ON CHATBOTS IN INDIAN LANGUAGES

A. Bengali

1) Golpo: Golpo is a retrieval-based, closed-domain, data-driven chatbot that can converse in Bengali. The bot is primarily based on a pattern matching algorithm that improves its accuracy by learning from interactions. A manually created Bangla corpus enables it to converse in basic Bangla. It uses a chat interface backed by its knowledge base to respond to user queries efficiently, in real time. A language-independent NLP library with a learning mechanism is the backbone of this chatbot. Initially, the input is matched with the entries in the database, then a confidence score is calculated for the matched sentences, and finally, the one with the highest confidence score is selected as the response to the input. A Best Match adapter is employed to return a response based on the best known match to a given statement. The Jaccard similarity function then uses the Jaccard index to compare the known and input statements; the threshold for this index is kept at 50 per cent. The chatbot achieves better accuracy than the Neural Conversational Model (NCM), a generative model, and outperforms the state-of-the-art model Cleverbot [14].

2) Doly: Doly was created as a Bengali chatbot that replies to user queries. It was created for use in the educational sector. Machine Learning (ML) algorithms and Bengali Natural Language Processing (BNLP) are the mainstays of this AI-powered chatbot. The chatbot uses an


Authorized licensed use limited to: COCHIN UNIVERSITY OF SCIENCE AND TECHNOLOGY. Downloaded on February 01,2022 at 04:58:34 UTC from IEEE Xplore. Restrictions apply.
ML algorithm like a search algorithm to check the corpus for similar findings. The appropriate response is then produced from the list using Naïve Bayes. The chatbot's dialogue corpus is derived from a basic Bengali language corpus and is saved in .yml format. Before training, the whole data set is pre-processed. The pre-processing methods used are anaphora resolution, stop-word removal, and lemmatization. The bot employs Hobbs' anaphora resolution algorithm. The two novel lemmatization methods, based on data structures and hash tables, are Dictionary Based Search by Removing Affix (DBSRA) and Trie. The accuracy of the system is 88%. The creators of Doly want this chatbot to be an effective education tool for all native Bengali-speaking students [1].

3) Bengali Informative Chatbot (BIC): BIC delivers relevant data by using NLP. Trie and Dictionary Based Search by Removing Affix (DBSRA), as well as Edit Distance, are the algorithms used to lemmatize Bengali words. To simplify these tasks, the bot uses a Bengali Language Toolkit (BLTK) and Bangla Regular Expression (BRE). Bengali anaphora resolution using Hobbs' algorithm is applied, along with Term Frequency-Inverse Document Frequency (TF-IDF) and cosine similarity, to retrieve the right information from the documents. In addition, the creators compiled a Bengali root word corpus, a synonym corpus, a stop-word corpus, and 700 question-answer pairs. The accuracy of the system is 97.22% [2].

4) Disha: Bangla Healthcare Chatbot: Disha works in the field of healthcare. It is a closed-domain, machine learning-based Bangla chatbot, aimed at simulating conversation with human users as a virtual medical assistant. Disha, a text-based chatbot, converses with the user with the support of its knowledge base. It aids in diagnosing potential diseases based on the symptoms, keeps a log of the patient's health status, and warns the patient against health hazards. Initially, the user enters basic information such as name, age, and blood group into the chatbot. The chatbot asks many interactive questions to collect the user's information. A customized Bangla dataset for disease classification was built. The name of each disease, along with its speciality, is mapped to a type of doctor. Once the disease is classified, relevant suggestions regarding the type of doctor the user should consult are displayed. While all of the classifiers performed well and had high accuracy, SVM was selected as the core classifier due to its highly efficient performance. Furthermore, it had the highest accuracy, 98.39 per cent [15].

B. Urdu

1) Nubot: This bot deals with the generation of accurate replies from unstructured data in Roman Urdu. The corpus is used in the RASA NLU module to classify intent. A knowledge graph is used in conjunction with the RASA framework to keep track of the dialogue history for a semantic-based natural language paradigm. Intent generation has a minimum accuracy of 64%, while answer generation has a minimum accuracy of 82.1% and a maximum accuracy gain of 96.7% [3].

2) Umair: UMAIR is an Urdu customer service representative chatbot. Its architecture includes a novel engine, a scripting language, and the WOW (Word Order Wizard) string similarity algorithm, which are combined to combat the unique challenges posed by Urdu. The controller is at the core of the chatbot; it directs and manages the conversation and ensures that the conversation goal is achieved. The conversation manager (CM) works with the Path Manager (PM) to ensure that the user stays true to the intention of the topic. The PM loads a path that uses decision trees within the architecture and leads the discussion towards the desired leaf node, where the goal lies. The scripting language allows responses to be conveyed in the form of maps, images, and forms. Another addition is the AllowYesNo rule in the scripting language, which makes Umair seem more helpful and intelligent to the user by answering queries with a "yes" or "no" response. The combination of the WOW algorithm and the PM overcomes the obstacles posed by the Urdu language. An utterance cleanser removes special characters from the user input, thereby normalizing the user's utterance. The accuracy of the bot was predicted to be around 83% [17].

3) Neural Encoder-Decoder Based Urdu Chatbot: The Neural Encoder-Decoder based chatbot is an Urdu-language chatbot that can respond to user queries regarding the general history and current affairs of Pakistan. It uses a seq2seq model with an attention-driven bi-directional encoder mechanism. The attention mechanism enables the chatbot to give more importance to the relevant parts of the user query while generating the response. The encoder converts the query of p words, x1, x2, ..., xp, into a series of vectors, c1, c2, ..., cp. The decoder then uses these vectors to predict the response of q words, y1, y2, ..., yq. The dataset for the model was built by collecting data from 300 students of the National University of Computer and Emerging Sciences, who entered questions into the database via a web portal designed especially for this crowd-sourcing exercise. The model achieved a BLEU (Bilingual Evaluation Understudy) score of 58, which is the highest score obtained by any Urdu-based conversational agent [16].

C. Odia

1) Chatbot that implements Odia POS Tagging in IoT devices: Part-of-speech tagging (POST) is one of the most challenging and advanced research areas within NLP. This chatbot works in IoT-enabled devices with the functionality of part-of-speech tagging. The importance of text processing in understanding the point of view of IoT-based smart devices, which are linked through the internet and sensors, is highlighted here. POST's

function in text processing is to assist human users in simulating IoT-enabled devices. The chatbot receives text from a human in any natural language and processes it; once the text is correctly understood, it takes action and hits the target as directed, provided that all devices are linked through a local area network or a wide area network. For the chatbot to interact with IoT devices, both a training corpus and a test corpus must be tagged. Once the system is trained, increasing the size of the tagged corpus increases its accuracy during testing. If a sentence has ten terms and two of them are not in the database, the system cannot tag them. To address this problem, a large data set can be used to determine the accuracy of the POS tagger. Advanced dataset methods can be used to improve the system's accuracy [4].

D. Marathi

1) Query-Based Chatbot System in Marathi Language: This system is designed as a query-based system in the Marathi language. The user asks the system a question, and the system automatically replies with answers through the chatbot. All the query-related information is added to this system, and all the data necessary for answering the questions is available on the portal. It is a computer program that simulates communication. Python is used to create the chatbot. The chat follows a pattern that is known to the user and can be easily understood. A chat dialogue box appears to start a conversation. Text recognition is used for large volumes of data or files; it is simple to extract text data from such large files using OCR technology. The text detection, localization and tracking modules are closely related and constitute the most vital and challenging part of the extraction process. Tesseract OCR is used with Python to extract the text from images, PDFs or other files [5].

2) AMBER Chatbot and Detection of Paraphrases: AMBER stands for A Marathi Based Eliza Chatterbot. AMBER uses sentence generation and a paraphrase detection library. It uses frameworks for determining semantic similarity, such as DKPro Similarity. The alignment amongst idioms and phrasal chunks is calculated using a word alignment algorithm. AMBER executes various Natural Language Processing tasks such as transliteration, tokenization, and part-of-speech tagging. RDRPOSTagger is used for obtaining POS tags. It uses phonetic and stripping algorithms to find suffixes [6].

E. Hindi

1) Query-Based Chatbot in Hindi: This chatbot is designed as a query-based system in the Hindi language. The user asks the system a question, and the system automatically replies with answers through the chatbot. OCR is used to fetch data from PDFs or images in text format. All the query-related information is added to this system. Although no notable work has yet been carried out on Hindi-language chatbots, many researchers and organizations have started building chatbot systems in this field [7].

2) AskDISHA: AskDISHA is an AI chatbot jointly developed by IRCTC and CoRover, which uses technologies like artificial intelligence (AI), machine learning (ML), and natural language processing (NLP) to answer passenger queries. It supports user queries in English and Hindi and provides responses in both text and audio formats. AskDISHA was introduced as a solution to the difficulties faced by citizens in obtaining information pertaining to railway reservations. Often, passengers experience long waiting periods, even for small queries. This chatbot can produce immediate responses to user queries on ticket reservation, refund status, cancellations, PNR search, and other travel packages and offerings of IRCTC. When the user starts typing a query, the AskDISHA agent begins mining relevant questions from IRCTC Tourism's administration to provide user-friendly and flexible responses. AskDISHA has been observed to answer passenger queries with an accuracy of 90%. In 2019, it received the Asia Leadership Award for Innovation Using Technology for being the first chatbot for any government organization in India [18].

F. Malayalam

1) An Interactive AI chatbot in Malayalam: This is a retrieval-based Malayalam chatbot with user preference elicitation using NLP. The system introduces a chatbot with embedded knowledge that identifies sentences and decides by itself on a response to a question. Here, the response principle is to match the input sentence from the user. The people of Kerala can go through the different Social Security Schemes of the Kerala Government using this chatbot. The implementation is based on NLP and AI. NLTK (Natural Language Toolkit) for Python is used to analyse text. The data set consists of about 837 Kerala government policies, from which 495 tokens were extracted. The system uses word tokenization, stop-word removal, Enchant and Jaccard similarity. The word tokenize method of NLTK splits a paragraph into individual words. Stop-word removal is used for better tokenization. Enchant is a Python module that checks a word's spelling and provides suggestions for correcting it. The Jaccard similarity is a measure of how close finite sample sets are. The accuracy of the chatbot was found to be 60%, apart from the limitations of the Jaccard algorithm [8].

2) Malayalam Chatbot based on Chatterbot Python Library: The system is a retrieval-based, domain-specific chatbot that converses in the Malayalam language. The chatbot is built on an NLP library, a Python package based on Machine Learning (ML) mechanisms. The chatbot matches the queries with the entries in the database and then calculates a confidence value for the sentences that matched. It finally selects the appropriate response based on the highest confidence value among them. The responses are retrieved from the knowledge base based on the context. The input is given to a web interface, and then it is processed. Sentences that match the input text are selected. The known responses to the selected matches are returned. Then, the confidence value is calculated for each response. The appropriate response with

the highest confidence value is returned and then displayed. ChatterBot includes training tools that simplify the training process, including the loading of dialogues into the ChatterBot database. Several training classes are built into the ChatterBot package. Here, the bot is trained on the dialogue corpus using ChatterBot's corpus trainer class. A Malayalam tourism corpus was manually created and stored in YAML format, and a trainer was then set up for this Malayalam dialogue corpus.

ChatterBot has a built-in storage adapter that connects the chatbot to different databases. Best Match logic adapters are used to find the closest match. Here, a low-confidence adapter is used, with a threshold value of 0.5. The Best Match adapter calculates the similarity between the input text and the known responses. Comparison using the Jaccard index generates accurate answers, as it shows a response only when a correct word match is found. The bot interacts with users better when Levenshtein distance is used for comparison: it finds a response even for unknown instances, as it makes a character-level comparison. Even though the bot sometimes gives faulty responses, the conversation keeps going [9].

G. Tamil

1) Poongkuzhali - An Intelligent Tamil Chatbot: Poongkuzhali is a chatbot that simulates human conversation in Tamil through Artificial Intelligence. The system's job is to produce a suitable answer based on the input query. This input query can be based on a pre-existing conversation topic in Tamil. It is possible to enter data in transliterated form. The bot understands the user's input (query or statement) in its most basic sense. The user can submit the question in any way; the chatbot will decode and understand it using a collection of decomposition rules, regardless of its grammatical format. The answer is then created by applying a collection of reassembly rules from the knowledge base. After that, the answer is reframed to fit the user's query. There are currently two domains open, with the ability to add more in the future. There is a separate interface for adding domains and enhancing the knowledge base.

The system will initiate a dialogue when the user is at a loss. Although each user-input query is processed separately, history data is maintained in order to manage references to previous questions. In addition, Poongkuzhali will respond in the same voice (active or passive) as the user's query. The chatbot has a time complexity of the order of n, where n is defined as the number of terms in the input [10].

2) Tamil Chatbot using Intent Recognition: This chatbot applies Natural Language Processing (NLP) along with the Dialogflow API to respond to queries in Tamil. Intent detection, conversation management routines, slot fulfillment, and dialogue design are all essential modules in this chatbot. The Flask micro-framework and Pusher Channels are also used here.

The chatbot creates a communication bridge between parents and the educational institution. It asks users to fill in the necessary student information. The chatbot identifies the roll number from the required input after the user enters it. Intents, entities and agents are used to accomplish these tasks. Agents are NLP modules that decipher the meaning of the user's queries and carry out the required action. The agents translate the user's request into machine-readable data. The agents have intents that represent the service that the customer expects from them. Useful user input data are extracted using entities, which act as keywords within the requests. The request is then forwarded to an external system that maintains the students' information. The request to the external system is made through a mechanism known as fulfillment, which passes the request from the bot to the external source via a webhook, obtains a response, and passes it back to the user. The matched intent is passed to a web service using a webhook to get results from it. Fulfillment returns the result to the user, along with the required database information. The application was found to load with a CPU time of less than 13 milliseconds, and the performance time was reported to be less than 40 milliseconds [11].

H. Multi-Lingual Chatbots

1) Aham: Aham provides information on COVID-19 in 7 Indian languages. It uses Dhee.ai as its platform. Aham provides the user with factual information curated by the Ministry of Health and Family Welfare and the World Health Organisation [19].

2) COVID-19 Self Help Chatbot: The bot was created by iBihar.org. It aims to provide factual information on COVID-19 based on the guidelines of the WHO and the Indian Government. This chatbot can converse in four languages: English, Hindi, Maithili and Urdu. This web-based chatbot is one of the few multilingual chatbots that spread information about the pandemic to their users [20].

3) Niki: Niki is a one-of-a-kind, completely automated AI-powered chatbot developed by Niki.ai. It includes a unique plug-and-play technology that can be integrated into operating systems, messaging platforms, apps, and websites. The bot gives personalised recommendations for services and products with no manual intervention. Besides Hindi and English, support for multiple Indian languages such as Marathi, Gujarati, Bangla, Tamil, Telugu, and Kannada is under development [21].

4) Recharge Bot by Payjo: Payjo created India's first messenger bot that lets people recharge their phones, sets reminders for recharges, and shows suitable phone plans. The bot supports smart balance reminders for prepaid phones. The chatbot currently supports Hindi, Tamil, Telugu, and Kannada [22].
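Several of the retrieval-based chatbots surveyed in this section (Golpo and the two Malayalam chatbots) pick a response by scoring the user input against stored statements and returning the best match above a confidence threshold. The following is a minimal Python sketch of that idea, not the code of any surveyed system: the small English knowledge base, the function names, and the fallback reply are hypothetical placeholders, and only the 0.5 threshold comes from the descriptions above. It contrasts a word-level Jaccard index with a character-level Levenshtein ratio.

```python
def jaccard(a: str, b: str) -> float:
    """Word-level Jaccard index: shared tokens divided by all distinct tokens."""
    set_a, set_b = set(a.lower().split()), set(b.lower().split())
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)


def levenshtein(a: str, b: str) -> int:
    """Character-level edit distance (insertions, deletions, substitutions)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def levenshtein_ratio(a: str, b: str) -> float:
    """Edit distance rescaled to a similarity between 0 and 1."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a.lower(), b.lower()) / longest


def best_match(query, knowledge_base, similarity, threshold=0.5,
               fallback="Sorry, I did not understand that."):
    """Return the stored response of the most similar known statement."""
    statement = max(knowledge_base, key=lambda known: similarity(query, known))
    if similarity(query, statement) >= threshold:
        return knowledge_base[statement]
    return fallback


# Hypothetical knowledge base: known statement -> stored response.
KB = {
    "where is the railway station": "It is two kilometres from the bus stand.",
    "what is the ticket price": "The ticket costs fifty rupees.",
}

print(best_match("where is the station", KB, jaccard))
print(best_match("wher is the railwey staton", KB, jaccard))
print(best_match("wher is the railwey staton", KB, levenshtein_ratio))
```

In this toy setting the misspelled query falls below the threshold under the Jaccard index, because no misspelled word matches exactly, while the Levenshtein ratio still finds the intended statement. This mirrors the trade-off reported for the ChatterBot-based Malayalam chatbot: word-level matching is stricter, character-level matching is more forgiving but can also return wrong answers.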

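The Bengali Informative Chatbot described in Section II-A is reported to combine TF-IDF weighting with cosine similarity to find the document that best answers a question. The sketch below illustrates that retrieval step in plain Python under stated assumptions: the English sample documents, the whitespace tokenizer, and the smoothed IDF formula are illustrative choices, not details taken from the BIC paper, and the Bengali-specific preprocessing (lemmatization, anaphora resolution) is omitted.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization stands in for real preprocessing.
    return text.lower().split()


def build_idf(documents):
    """Smoothed inverse document frequency for every term in the corpus."""
    n_docs = len(documents)
    doc_freq = Counter(term for doc in documents for term in set(tokenize(doc)))
    return {term: math.log((1 + n_docs) / (1 + df)) + 1.0
            for term, df in doc_freq.items()}


def tfidf_vector(text, idf):
    """Sparse TF-IDF vector as a dict; terms unseen in the corpus are ignored."""
    counts = Counter(tokenize(text))
    total = sum(counts.values())
    return {t: (c / total) * idf[t] for t, c in counts.items() if t in idf}


def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def retrieve(question, documents):
    """Return the document most similar to the question, with its score."""
    idf = build_idf(documents)
    q_vec = tfidf_vector(question, idf)
    scores = [cosine(q_vec, tfidf_vector(doc, idf)) for doc in documents]
    best = max(range(len(documents)), key=scores.__getitem__)
    return documents[best], scores[best]


# Hypothetical document collection.
DOCS = [
    "the university library opens at nine in the morning",
    "admission forms are available at the registrar office",
    "the cafeteria serves lunch at noon",
]

answer, score = retrieve("when does the library open", DOCS)
print(answer, round(score, 3))
```

The rare word "library" receives a higher IDF weight than the common word "the", so the first document scores highest even though all three share "the" with the question. Note that "open" does not match "opens" here, which is the kind of gap that the lemmatization step in the surveyed system is meant to close.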
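Poongkuzhali is described in Section II-G as decoding the user input with decomposition rules and building the answer with reassembly rules, and AMBER is likewise ELIZA-based. The following Python sketch shows how such rule pairs can work in principle. The English patterns, templates, and pronoun table are hypothetical placeholders chosen for readability; they are not the Tamil or Marathi rules of the surveyed systems.

```python
import re

# Hypothetical rules: (decomposition pattern, reassembly template).
RULES = [
    (re.compile(r"\bi need (.+)", re.IGNORECASE), "Why do you need {0}?"),
    (re.compile(r"\bi am (.+)", re.IGNORECASE), "How long have you been {0}?"),
    (re.compile(r"\bmother\b", re.IGNORECASE), "Tell me more about your family."),
]

# Pronoun swaps applied to the captured fragment before reassembly.
REFLECTIONS = {"my": "your", "i": "you", "me": "you",
               "am": "are", "your": "my", "you": "me"}


def reflect(fragment):
    """Swap first- and second-person words so the reply addresses the user."""
    words = fragment.lower().rstrip(".!?").split()
    return " ".join(REFLECTIONS.get(word, word) for word in words)


def respond(utterance, default="Please go on."):
    """Apply the first matching decomposition rule and reassemble a reply."""
    for pattern, template in RULES:
        match = pattern.search(utterance)
        if match:
            fragments = [reflect(group) for group in match.groups()]
            return template.format(*fragments)
    return default


print(respond("I need my books"))
print(respond("The weather is nice"))
```

Each rule is tried in order and the input is scanned once per rule, so with a fixed rule set the work grows roughly linearly with the length of the input. When no rule matches, the default reply keeps the dialogue going, which is one simple way to continue a conversation when the system has no specific answer.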
III. PLATFORMS THAT SUPPORT MULTILINGUAL CHATBOTS

A. Vernacular.ai

Initially, Vernacular.ai was introduced as a chatbot that aimed to solve India's linguistic barrier problem for business owners. It has since grown into an AI-based platform that delivers two products: VASR and VIVA. At first, the AI-based chatbot used natural language processing (NLP) technology for Indian languages. Now, the platform supports 10 languages and 160+ dialects. It is also available 24x7. It helps companies cut expenses by 40-60%. Lately, voice and conversational intelligence have also been integrated into chatbots through an independent platform that is constantly learning [23].

B. Dhee.ai

DheeYantra is building a multilingual AI/NLP platform, dhee.ai. While traditional chatbots understand only English, dhee.ai understands multiple languages. Its patent-pending NLP algorithms for Indian languages enable businesses to engage and communicate with native language speakers in their language. It supports real-time semantic understanding, continuous learning, a dashboard for real-time analytics, and language filters. Dhee.ai also responds to queries on Facebook Messenger. A supporting mobile app can be used to delegate chats to human agents, who handle the questions that Dhee.ai cannot answer. It supports Indian English, Indic languages like Hindi, Tamil, Marathi, Gujarati, Bangla, Telugu, Kannada, and Malayalam, and foreign languages such as Australian English, US English, UK English, and Spanish. Support for languages like German, Portuguese, French, Arabic, and Swahili is in progress [24].

C. Haptik

Haptik is an artificial intelligence platform that develops conversational assistants for companies to improve user interactions and boost their productivity. Apart from AI bots that converse in English, Haptik provides a platform (Linguist Pro) that allows organizations to build multilingual chatbots that support conversations in vernacular languages. Named Entity Recognition (NER) is a key component of the algorithm that runs behind Haptik's bots. NER is a sub-process of information extraction that attempts to identify and classify entities in the text into predefined categories such as name, location, organization, and so on. An NER system for a chatbot is heuristic-based and uses several NLP techniques to extract the desired entities from the chat interface. Various entities need to be identified, and entities of different types need to be handled separately, as different entities use different detection logic. Haptik's existing NER system, developed for English chat, was upgraded to support five languages: Hindi, Marathi, Gujarati, Tamil, and Bengali [25].

D. Flow.ai

Flow.ai is an AI-driven platform that can be used to create multilingual chatbots. Currently, it provides support in over 135 languages. It is highly flexible: the user can adjust the NLP accuracy required for a match, and the AI engine adapts to this change. It offers a drag-and-drop feature, which makes it very popular [26].

IV. CHALLENGES

The following are the challenges encountered while creating a chatbot in Indian languages.
• Even with significant advancements in the field, translating idioms and slang remains an obstacle.
• Indian languages carry ambiguity around their words. The NLP algorithms must resolve such ambiguities and complexities.
• The data sets currently available for most Indian languages are small when compared to western languages.
• NLP features such as Intent Recognition, Entity Identification, or Grammatical Phrasing are not available for all Indic languages.
• There is a lack of resources regarding the literature and grammar of Indian languages.
• Some chat platforms have erratic features.
• English sentences follow subject-verb-object word order, whereas most Indic languages use subject-object-verb order.

V. INFERENCE

There are two approaches to building a chatbot in an Indian language: native NLU development and the translation-based approach. In the native NLU approach, the different NLU components that allow chatbots to work end-to-end are constructed and trained using training data for the specific Indic language. In the translation-based approach, the user input to the chatbot in an Indic language is translated to English using a translation API, and NLU components built for English are then used. Both approaches are still inconsistent with regard to creating an Indian-language chatbot. Work on both approaches is still in progress.

Hindi remains the Indian language in which most NLP experiments are taking place, followed by Bengali. Other Indian languages have had less work done on them. Unsupervised and supervised language processing is advancing rapidly and holds great scope for NLP work in low-resource languages.

VI. CONCLUSION

In this paper, the focus is on the design, development, and performance of various Indian language chatbots. The rise of chatbots has made life simpler for both their providers and their users, as both parties benefit from them. However, the bots are still in their initial phase of learning. Their accuracy will increase as more data is provided to them, improving system efficiency. Different methodologies and tools can be used to develop chatbots depending on the end use and expected results. The paper discusses multiple existing approaches and platforms that are used to create chatbots in Indic languages. Organizations have reduced costs, increased sales, improved customer satisfaction, and had immense growth

due to the implementation of chatbots. This massive rise in multilingual chatbots should be looked upon positively, and many Indian organizations should invest in this technology. Even though most Indic languages have low resources, there will be increased work in the field of Indic language NLP in the future, despite the challenges.

REFERENCES

[1] Kowsher, Md Sharmin Tithi, Farhana Alam, M Huda, Mohammad Moheuddin, Mir Md Rosul, Md. (2019). Doly: Bengali Chatbot for Bengali Education. 1-6. 10.1109/ICASERT.2019.8934592.
[2] Kowsher, Md Alam, M Uddin, Md Islam, Md Pias, Nuruzzaman Saifullah, Abu. (2019). Bengali Informative Chatbot. 1-7. 10.1109/IC4ME247184.2019.9036585.
[3] J. Shabbir, M. U. Arshad, W. Shahzad, “NUBOT: Embedded Knowledge Graph With RASA Framework for Generating Semantic Intents Responses in Roman Urdu,” arXiv preprint arXiv:2102.10410, 2021.
[4] R. Sahoo, B. R. Das and B. Kishore Mishra, “Analysis and Implementation of Odia Part of Speech Tagger in recent IoT based devices through Chatbot: A review,” 2020 International Conference on Computer Science, Engineering and Applications (ICCSEA), Gunupur, India, 2020, pp. 1-4, doi: 10.1109/ICCSEA49143.2020.9132940.
[5] Navalakha, Darshan et al. “Review of Chatbot System in Marathi Language.” (2019).
[6] Shah Manthan, Jigneshkumar, Arnav Mediratta, Akshay Kundale, Shubham Nangare, “AMBER Chatbot and Detection of Paraphrases for Devnagari,” Vishwakarma Journal of Engineering Research, (4 Dec 2017).
[7] Varad Bhagwat, Mrunal Nagarkar, Pooja Paramane, Shrikant Jindam, Prof. N.G. Kharate, “Review of Chatbot System in Hindi Language,” 2019.
[8] Amalchandran, J Shibu V, “An Interactive Malayalam Chatbot Using AI,” Volume 8, Issue 9, 2020.
[9] S. Sandhini, R. Binu, R.R Rajeev, M.M Reshma, “A Proposal of Chatbot for Malayalam,” 2018.
[10] T. Kalaiyarasi, Ranjani Parthasarathi, T.V. Geetha, “POONGKUZHALI - An Intelligent Tamil Chatterbot,” 2003.
[11] U. Shanmugam, S. Mani, S. Sivakumar and P. Rajeswari, “Human-Computer text conversation through NLP in Tamil using Intent Recognition,” 2019 International Conference on Vision Towards Emerging Trends in Communication and Networking (ViTECoN), Vellore, India, 2019.
[12] Ayush Kumar, Pooja Rajesh Chellani, “A Survey on Chatbots, A Brief Analysis of Chatbots,” International Journal of Engineering Research & Technology (IJERT), 2017.
[13] Sameera A. Abdul-Kader, Dr. John Woods, “Survey on Chatbot Design Techniques in Speech Conversation Systems,” (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 6, No. 7, 2015.
[14] Tasnim Dewan Orin, Dr. Mahbub Alam Majumdar, “Implementation of a Bangla chatbot,” April 2017, BRAC University.
[15] M. M. Rahman, R. Amin, M. N. Khan Liton and N. Hossain, “Disha: An Implementation of Machine Learning Based Bangla Healthcare Chatbot,” 2019 22nd International Conference on Computer and Information Technology (ICCIT), Dhaka, Bangladesh, 2019, pp. 1-6.
[16] M. Alam, “Neural Encoder-Decoder based Urdu Conversational Agent,” 2018 9th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), New York, NY, USA, 2018.
[17] Kaleem, M. et al. “Development of UMAIR the Urdu Conversational Agent for Customer Service.” (2014).
[18] S. Dey and D. Shukla, “Analytical study on use of AI techniques in tourism sector for smarter customer experience management,” 2020 International Conference on Computer Science, Engineering and Applications (ICCSEA), 2020, pp. 1-5, doi: 10.1109/ICCSEA49143.2020.9132925.
[19] “Meet aham! Your new virtual assistant providing information on COVID-19 in 7 Indian languages,” Outlookindia.com, 21-Mar-2020. [Online]. Available: https://www.outlookindia.com/newsscroll/meet-aham-your-new-virtual-assistant-providing-information-on-covid19-in-7-indian-languages/1785923. [Accessed: 02-Apr-2021].
[20] H. T. Correspondent, “Covid-19 app, chatbot launched for people in Bihar,” 02-Apr-2020. [Online]. Available: https://tech.hindustantimes.com/tech/news/covid-19-app-chatbot-launched-for-people-in-bihar-story-291GpnvSHza27YcqqCuw5K.html. [Accessed: 02-Apr-2021].
[21] “AI-based chatbot Niki to add seven more Indian languages,” Sap.io, 14-Oct-2019. [Online]. Available: https://sap.io/ai-based-chatbot-niki-to-add-seven-more-indian-languages/. [Accessed: 02-Apr-2021].
[22] M. Rao, “Why chatbots suit India better than apps,” Techinasia.com, 06-Jun-2016. [Online]. Available: https://www.techinasia.com/chatbots-for-the-next-billion. [Accessed: 02-Apr-2021].
[23] S. Nair, “Multilingual chatbots making conversational AI vernacular,” Haptik.ai, 24-Apr-2019. [Online]. Available: https://www.haptik.ai/blog/building-multilingual-chatbots/. [Accessed: 02-Apr-2021].
[24] D. Misal, “7 chatbot building platforms from India that are revolutionising businesses,” Analyticsindiamag.com, 21-Oct-2018. [Online]. Available: https://analyticsindiamag.com/7-chatbot-building-platforms-from-india-that-are-revolutionising-businesses/. [Accessed: 03-Apr-2021].
[25] H. T. Correspondent, “Reliance Jio Haptik-powered coronavirus chatbot arrives on WhatsApp,” 22-Mar-2020. [Online]. Available: https://tech.hindustantimes.com/tech/news/reliance-jio-haptik-powered-coronavirus-chatbot-arrives-on-whatsapp-story-KvS0RDxGNQRYungitkycSO.html. [Accessed: 03-Apr-2021].
[26] “Multi-language design,” Flow.ai. [Online]. Available: https://flow.ai/docs/learn/multi-language-design. [Accessed: 04-May-2021].
