Professional Documents
Culture Documents
Detect Sarcastic
Detect Sarcastic
ANAND R (192071005)
BHARATH R (192071011)
• Most of the people express their ideas, views and opinions over
products in online networks such as Amazon, Flipkart etc.
• Provided unique ID for every customer, with those ID’s they can
submit their reviews about the purchased products. Based on those
reviews sarcastic information are gathered and optimized.
Objectives
1 Effective Text Data Saurav Pradha ,et.al(2019) • Preprocessing technique- Stemming, Lemmatization
Preprocessing IEEE and Spelling Correction
Technique for • Stemming technique performed best in terms of
Sentiment Analysis in computational speed.
Social Media Data
2 Data Preprocessing for ShreyasWankhede,et.al(2018) • Use N-gram method and Hidden Markov Model for
Efficient Sentimental International Conference on Spell-Checking and Correction of tweets
Analysis Inventive Communication and • Emoji Sentiment Ranking method used to evaluate
Computational Technologies, pg sentiment mapping of emojis by using sentiment
723-727 polarity
3. Feature Extraction and Sai Chandra Rachiraju & • Used IMDB review datesets
Classification of Movie Madamala Revanth (2020) • Word2vec and SVM gives better accuracy 83.4
Reviews using International Conference on
Advanced Machine Intelligent Computing and
Learning Models Control Systems pg 814-817
8
Literature review (Cont..)
S.No Title Author/Journal/Year/ Remarks
Volume /Page No
4 Deep LSTM-RNN with Sayed Saniya • Logistic regression used to detect sarcasm
Word Embedding for Salim,et.al(2020) • 15000 tweet dataset used
Sarcasm Detection on International Conference for • Long Short Term Memory (LSTM) and Word
Twitter Emerging Technology Embeddings can make the sarcasm detection
efficiently-
5 Sentiment Analysis D.Deepa and A.Tamilarasi • Detect the polarity of words from twitter using feature
using Feature (2019) IEEE, Third extraction and dictionary-based methods
Extraction and International Conference on I- • CountVectorizer gives 81% accuracy
Dictionary-Based SMAC (IoT in Social, Mobile,
Approaches Analytics and Cloud)
6 Sentiment Classification Yongxia • Text documents are obtained through LDA method
of Online Reviews JING,et.al(2019),IEEE, • k-Nearest Neighbors (KNN) algorithm is used for
Based on LDA and International Symposium on executing the sentiment classification of online reviews.
Semantic Analysis of Computational Intelligence
Sentimental Words and Design (ISCID)
9
Problem Description
• Their approach didn’t consider dialogues and cannot extract out the
words having multiple meaning and will determine their sense to
deal with low sarcasm detection.
Disadvantages
• The central issue in sarcasm detection is the data set availability and
their analysis.
• These studies are critical to the business holders as they can take
business choices as indicated by examination after effects of user's
assessments about their items.
• Sarcasm detection can be applied to product reviews but can also
be applied to stock markets, news articles, or political debates.
• In past few years in the field of SA remarkable work has been done
but sarcasm detection in textual sentences is not up to the mark.
Hardware Requirements
• Processor : i3,i5,i7
• RAM : 2GB
• Hard disk : 500 GB
• Compact Disk : 650 Mb
• Keyboard : Standard keyboard
• Mouse : Logitech mouse
• Monitor : 15 inch color monitor
Software Requirements
Review ID
Customer
Amazon Review Data Set User Review Data Set
Preprocessing Preprocessing
Classification Classification
Matching
Satisfied Customer
Dataflow Diagram
Level 1
Storage
Level 2
Storage
Level 3
Storage
Level 4
Positive Reviews
Sarcastic
Prediction
Negative
Reviews
Modules
• Fake Reviews Analysis •Real time dataset annotation by
Modeler creating e-commerce website.
• E-Commerce Website
– Review ID Generation •Preprocessing
System
•Feature Extraction
– Review ID Issue
• Dataset Annotation •Classification
– Training Phase
• Amazon Review Dataset •Prediction
– Testing Phase
•Recommendation
•Performance Analysis
Module Description
• No one can be self-sufficient. And this brings out the need for
demand and supply of goods and services.
• Transactions have been going on all over the world for centuries,
locally, and across locations.
• However, also bear in mind that with the whole world going
online, data privacy laws have become increasingly stringent.
Review ID Generation System
• To post a review for the purchased product, each customer has been
provided with an unique ID while purchasing.
• With those ID the customer can post their reviews regarding the
purchased products.
Review ID Issue
• So that in future new customers can gain actual and exact reviews for
the products that are in the E-Commerce sites such as Amazon,
Flipkart, Snapdeal etc.
Data Set Annotation
Training Phase
Testing Phase
It is done by the data and reviews that are attained from the testing
phase.
Preprocessing
As a result more prominently trained and tested data are preprocessed
which extracts a sequence of sarcastic reviews that are provided by the
customers those who purchased products.
References
• R. Xia, F. Xu, C. Zong, Q. Li, Y. Qi and T. Li. ”Dual Sentiment Analysis:
Considering Two Sides of One Review”, IEEE Transactions on Knowledge
and Data Engineering, 2015.
• R. Xia, T. Wang, X. Hu, S. Li and C. Zong. ”Dual Training and Dual
Prediction for Polarity Classification”, Proceedings of the Annual Meeting
of the Association for Computational Linguistics (ACL), 2013.
• S. Ahuja and G. Dubey. ”Clustering Sentiment Analysis on Twitter Data”,
Second International Conference on Telecommunications and Networks,
IEEE, 2017
• Rathan M., V. Hulipalled, Murugeshwari P. and Sushmitha M. ”Every Post
Matters: A Survey on Applications of Sentiment Analysis in Social Media”,
International Conference on Smart Technology for Smart Nation, IEEE,
2017.
• D. Ikeda, H. Takamura, L. Ratinov and M. Okumura. ”Learning to Shift the
Polarity of Words for Sentiment Classification”, Proceedings of the Third
International Joint Conference on Natural Language Processing: Volume-I,
2008.