
Comparing Selective Masking Methods for Depression Detection in Social Media

Abstract
Identifying individuals at risk for depression is a crucial problem, and social media provides an excellent platform for examining the linguistic patterns of depressed individuals. A significant challenge in depression classification is ensuring that the prediction model is not overly dependent on keywords, such that it fails when keywords are unavailable. One promising approach is masking: by selectively masking important words and asking the model to predict them, the model is forced to learn the context rather than the keywords alone. This study evaluates seven masking techniques, including random masking, the log-odds ratio, and the use of attention scores. We also examined whether the masked words should be predicted during the pre-training or the fine-tuning phase. Finally, six class imbalance ratios were compared to determine the robustness of the masking selection methods. Key findings demonstrate that selective masking generally outperforms random masking in classification accuracy, and we identify the most accurate and robust models. Our results also indicate that reconstructing the masked words during the pre-training phase is more advantageous than during the fine-tuning phase. Further discussion and implications are provided. This is the first study to comprehensively compare masking selection methods, with broad implications for depression classification and for NLP in general.
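
To make the masking idea concrete, the minimal sketch below replaces lexicon keywords in a post with [MASK] tokens and builds MLM labels for them. The keyword set, tokenizer choice, and helper name are illustrative assumptions, not this project's exact pipeline.

```python
# Minimal sketch of selective masking (illustrative; not the repo's exact code).
# Keywords found in a post are replaced with [MASK]; the model must then
# reconstruct them from context, discouraging keyword shortcuts.
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

# Hypothetical keyword set, e.g. drawn from a depression lexicon or log-odds scores.
keywords = {"depressed", "hopeless", "therapy"}

def selectively_mask(text: str) -> dict:
    """Tokenize `text` and mask every token that matches a keyword.
    Special tokens ([CLS]/[SEP]) are omitted here for brevity."""
    tokens = tokenizer.tokenize(text)
    labels = [-100] * len(tokens)  # -100 positions are ignored by the MLM loss
    for i, tok in enumerate(tokens):
        if tok in keywords:
            labels[i] = tokenizer.convert_tokens_to_ids(tok)  # target = original word
            tokens[i] = tokenizer.mask_token                  # input  = [MASK]
    return {"input_ids": tokenizer.convert_tokens_to_ids(tokens), "labels": labels}

example = selectively_mask("i have felt hopeless since i stopped therapy")
print(tokenizer.convert_ids_to_tokens(example["input_ids"]))
```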

Dataset
 Reddit Self-reported Depression Diagnosis (RSDD) and Time-RSDD datasets (https://georgetown-ir-lab.github.io/emnlp17-depression/)

The datasets should be placed in the OP_datasets folder; a hedged loading sketch follows.
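
The sketch below reads one RSDD split from OP_datasets. The JSON-lines layout and the split file names are assumptions based on the dataset's public description; adjust them to the actual release files.

```python
# Hedged sketch of loading an RSDD split from the OP_datasets folder.
# Assumes one JSON record per line (user-level records); the exact field
# names depend on the dataset release and may differ.
import json
from pathlib import Path

def load_rsdd(split: str = "training") -> list[dict]:
    """Return the list of user records from OP_datasets/<split>."""
    users = []
    with open(Path("OP_datasets") / split, encoding="utf-8") as f:
        for line in f:
            users.append(json.loads(line))
    return users

if __name__ == "__main__":
    users = load_rsdd("training")
    print(f"loaded {len(users)} user records")
```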

Training Approaches
 BERT further pre-training + fine-tuning: FURTHER-01-MLM.py and FURTHER-02-classi.py (adapted from https://github.com/GU-DataLab/stance-detection-KE-MLM and https://github.com/thunlp/SelectiveMasking); a sketch of the further pre-training step follows this list
 BERT fine-tuning with a reconstruction objective: MASKER.py (adapted from https://github.com/alinlab/MASKER)
 Standard BERT fine-tuning: BASE-classi.py
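
As mentioned in the first item, the sketch below illustrates what an MLM further pre-training step like FURTHER-01-MLM.py might look like. The hyperparameters, output path, and the assumption that the dataset already contains selectively masked, fixed-length examples are illustrative, not the script's actual implementation.

```python
# Illustrative sketch of MLM further pre-training on pre-masked inputs
# (hedged: the real script's structure and hyperparameters may differ).
import torch
from torch.utils.data import DataLoader
from transformers import BertForMaskedLM

def further_pretrain(dataset, epochs: int = 1, lr: float = 2e-5):
    """`dataset` yields dicts of equal-length input_ids / attention_mask /
    labels tensors, e.g. produced by a selective-masking step."""
    model = BertForMaskedLM.from_pretrained("bert-base-uncased")
    optimizer = torch.optim.AdamW(model.parameters(), lr=lr)
    loader = DataLoader(dataset, batch_size=16, shuffle=True)
    model.train()
    for _ in range(epochs):
        for batch in loader:
            loss = model(**batch).loss  # cross-entropy over masked positions only
            loss.backward()
            optimizer.step()
            optimizer.zero_grad()
    model.save_pretrained("further-pretrained-bert")  # then fine-tune a classifier on top
    return model
```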

Selective Masking Methods

1. Random masking: random
2. Depression Lexicon: deplex (lexicon.txt from https://github.com/gamallo/depression_classification/tree/master/lexicons)
3. Log-odds-ratio: logodds (from https://github.com/kornosk/log-odds-ratio); see the sketch after this list
4. TF-IDF: tfidf (adapted from https://github.com/alinlab/MASKER)
5. Sum attention: sumatt (adapted from https://github.com/alinlab/MASKER)
6. Top attention: prop
7. Neural Network: NN (adapted from https://github.com/thunlp/SelectiveMasking)
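
As noted for item 3, the log-odds ratio with an informative Dirichlet prior (Monroe et al., 2008) scores how strongly each word is associated with one corpus (e.g., posts by depressed users) versus another. The sketch below implements that standard formula in plain Python; it is not the linked repository's code.

```python
# Log-odds ratio with informative Dirichlet prior (Monroe et al., 2008).
# Words with large positive z-scores are strongly associated with corpus A
# (e.g., depressed users' posts) and become masking candidates.
import math
from collections import Counter

def log_odds_keywords(corpus_a, corpus_b, top_k=50):
    """corpus_a / corpus_b: lists of token lists. Returns top_k (word, z) pairs."""
    counts_a, counts_b = Counter(), Counter()
    for doc in corpus_a:
        counts_a.update(doc)
    for doc in corpus_b:
        counts_b.update(doc)
    prior = counts_a + counts_b  # combined counts serve as the informative prior
    n_a, n_b = sum(counts_a.values()), sum(counts_b.values())
    a0 = sum(prior.values())
    scores = {}
    for w, a_w in prior.items():
        y_a, y_b = counts_a[w], counts_b[w]
        # log-odds difference between the two corpora, smoothed by the prior
        delta = (math.log((y_a + a_w) / (n_a + a0 - y_a - a_w))
                 - math.log((y_b + a_w) / (n_b + a0 - y_b - a_w)))
        variance = 1.0 / (y_a + a_w) + 1.0 / (y_b + a_w)
        scores[w] = delta / math.sqrt(variance)  # z-score of the log-odds ratio
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
```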

get_datasets contains Python scripts and .ipynb files for extracting, preprocessing, and creating the dataset objects for training
keyword contains .ipynb files for obtaining the keywords and the resulting keywords in .txt format
src contains the source code for creating a masked dataset and the training & evaluation loop
