Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

Summary on “Chitty-Chitty-Chat Bot”: Deep

Learning for Conversational AI

Shaniul Islam Shanto


Department of Computer Science and Engineering
University of Dhaka
Dhaka, Bangladesh
shaniulshanto.2331.csedu@gmail.com

Abstract—This paper [1] shows different aspects of non-task amount of data available all over the internet, a retrieval-based
oriented conversation AI. It describes various methods of deep conversational system is on high demand. In this method,
learning for data collecting , giving responses, handling context, system extracts matching metrics, then responds to queries
adding utterances in response, handling multiple answers for a
single question of a conversation AI. with candidate responses. Wang et al. [2013] [2] proposed to
Index Terms—Conversation AI, Deep learning, context, re- construct a two-step retrieval method to deliver appropriate
sponse, utterance responses from the massive data repository. Best retrieval
systems extract underlying abstract features of data with
I. I NTRODUCTION DNN. Generation-based conversational AI, another method to
With the increasing demands of interactions between hu- respond queries, generates responses from a model with the
mans and electronic brains, many branches of Artificial in- application of deep-learning. We can effortlessly say that both
teligence are highly calling for contributions from conversa- systems have pros and cons considering varities of queries and
tional studies. the ceasless development of web internet i.e. responses. Although, Retrieval-based system can create natural
web 2.0 keeps facilitating many data-driven mathods. Task and fluent sentences, so mainstream ’real-world’ applications
oriented and non-task-oriented: two types of conversation AI using this method largely, it fails to create ’new’ appropriate
are designed to serve people for many purposes. Task oriented candidates for a given utterance. generation-based systems can
such as reminder or scheduling and booking train or bus solve the issues of creating new candidate responses but in
tickets are mainly established in vertical domains. on the a very limited dataset. Furthermore, it may over-generate or
other hand, non-task-oriented is established for the purpose under-generate unnatural and not so fluent sentences. Finding
of random or social chatbots in open domains. These aim to candidate responses from both systems and merging them in
helps humans for their emotional companionship or as a source order to output the best solutions is the most feasible solution.
of entertainments. III. C ONTEXT BASED C ONVERSATION AI
Here in this paper, we discuss about non-task oriented con-
versation AI named as Chatbot.This survey paper is partially A proposal for A neural responding machine was raised
based on our continuous efforts on building conversational for single-turn conversations. [Shang et al.,2015] [3] The
models with deep learning approaches for chatbots. The paper model was then extended to handle multi-turn utterances in
summarizes the problem formulation and data collection for coversation. As studies indicate that context-aware conversa-
chatbots, and give an overview of state-of-the-art methods for tional AI are inevitable, various types of context modeling are
open domain chit- chats from several aspects. constructed as they all utilize utterances occurred before the
query utterance. Li et al. [2015] [4] suggested that words and
II. DATA C OLLECTION AND RESPONSE sentences ought to be prototyped on different hierarchies for
Conversational AI system functions based on ’query- representation learning. Context learning is then extended to
response’ method. This method may also deal with contexts hierarchical representations with the action of distinguishing
given by the queries hanging on previous utterances. Col- word-level and utterance-level representations.
lecting a huge amount of data from web resources such as IV. P ROACTIVE C ONVERSATION AI
social media or forums in the form of human-to-human con-
versation for interfacing between human-driven conversational There can be multiple responses for a single query. Fre-
systems and data-driven conversational systems. As massive quency of generic and meaningless responses gets higher in
data-driven methodology. So it becomes necessary to optimize
Identify applicable funding agency here. If none, delete this. for the likelihood of candidates. Conversational AI should also
be proactive as it is not an ideal assumption that only human [5] Xiang Li, Lili Mou, Rui Yan, and Ming Zhang. Stalematebreaker:
will initiate the conversation and only task of computer is A proactive content-introducing approach to automatic human-computer
conversation. ArXiv, abs/1604.04358, 2016.
to respond. So Conversational AI should also be proactive [6] Ryan Lowe, Michael Noseworthy, Iulian Vlad Serban, Nicolas Angelard-
and introduce new contents to start off conversation. Li et al. Gontier, Yoshua Bengio, and Joelle Pineau. Towards an automatic Turing
[2016] [5] proposed a proactive conversational system named test: Learning to evaluate dialogue responses. In Proceedings of the
55th Annual Meeting of the Association for Computational Linguistics
StalemateBreaker. The system detects when there occurs a (Volume 1: Long Papers), pages 1116–1126, Vancouver, Canada, July
stalemate, and determines what “intriguing” contents to in- 2017. Association for Computational Linguistics.
troduce by re-ranking candidate responses. Diversity can be [7] Lili Mou, Rui Yan, Ge Li, Lu Zhang, and Zhi Jin. Backward and
forward language modeling for constrained sentence generation. arXiv:
achieved by MMR or DPP. Mou et al. [2015] [6] proposed Computation and Language, 2015.
to introduce new contents, new method such as backward- [8] Chongyang Tao, Lili Mou, Dongyan Zhao, and Rui Yan. Ruber: An
and-forward algorithm is needed which starts the sentence unsupervised method for automatic evaluation of open-domain dialog
systems. Proceedings of the AAAI Conference on Artificial Intelligence,
generation from the designated content words. 32, 01 2017.

V. E VALUATION OF C ONVERSATION AI
There are some metrices like over, ground truth etc for
measuring generated based AI. Automatic evaluation metrics
can be divided into non learnable and learnable approaches.
Lowe et al. [2017] [7] proposed a neural network-based
learning metric for conversation evaluations. A neural network
based automatic evaluation metric predicts scores of ’query-
response’ method as well as the ground-truth response. Later,
Both referenced and unreferenced metrics blended evaluation
routine to render open-domain conversational systems was
proposed and this model doesn’t need any human scoring [Tao
et al., 2018b] [8].

VI. C ONCLUSION
The paper summarizes the problem formulation and data
collection for chatbots, and give an overview of state-of-the-
art methods for open domain chit- chats from several aspects.
It describes various method of data collecting , showing
response, handling context, adding utterance in response , han-
dling multiple answer for a single question of a conversation
AI. The different method of evaluation and evaluation matrices
are also discussed here. This is a well informed paper. I think
the paper is very useful for anyone who wants to work on
conversation AI.

R EFERENCES
[1] Rui Yan. ”chitty-chitty-chat bot”: Deep learning for conversational ai.
In Proceedings of the Twenty-Seventh International Joint Conference on
Artificial Intelligence, IJCAI-18, pages 5520–5526. International Joint
Conferences on Artificial Intelligence Organization, 7 2018.
[2] Hao Wang, Zhengdong Lu, Hang Li, and Enhong Chen. A dataset
for research on short-text conversations. In Proceedings of the 2013
Conference on Empirical Methods in Natural Language Processing,
pages 935–945, Seattle, Washington, USA, October 2013. Association
for Computational Linguistics.
[3] Lifeng Shang, Zhengdong Lu, and Hang Li. Neural responding machine
for short-text conversation. In Proceedings of the 53rd Annual Meeting of
the Association for Computational Linguistics and the 7th International
Joint Conference on Natural Language Processing (Volume 1: Long
Papers), pages 1577–1586, Beijing, China, July 2015. Association for
Computational Linguistics.
[4] Jiwei Li, Thang Luong, and Dan Jurafsky. A hierarchical neural
autoencoder for paragraphs and documents. In Proceedings of the 53rd
Annual Meeting of the Association for Computational Linguistics and
the 7th International Joint Conference on Natural Language Processing
(Volume 1: Long Papers), pages 1106–1115, Beijing, China, July 2015.
Association for Computational Linguistics.

You might also like