Comparing Selective Masking Methods For Depression Detection in Social Media
Dataset
Reddit Self-reported Depression Diagnosis (RSDD) dataset and Time-RSDD
dataset (https://georgetown-ir-lab.github.io/emnlp17-depression/)
Training Approaches
BERT further pre-train + fine-tune: FURTHER-01-MLM.py and FURTHER-02-classi.py
(adapted from https://github.com/GU-DataLab/stance-detection-KE-MLM and
https://github.com/thunlp/SelectiveMasking)
BERT fine-tune with reconstruction objective: MASKER.py (adapted
from https://github.com/alinlab/MASKER)
Standard BERT fine-tune: BASE-classi.py
get_datasets contains Python script and .ipynb files for extracting and preprocessing
the data and creating the dataset objects for training
keyword contains .ipynb files for obtaining the keywords and the resulting keywords
in .txt format
src contains the source code for creating a masked dataset and for the training and
evaluation loop
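
To illustrate what creating a masked dataset from a keyword list can look like, here is a minimal sketch of keyword-first selective masking. It is not the code in src: the function names `load_keywords` and `selective_mask`, the one-keyword-per-line file layout, whitespace tokenization, and the 15% masking budget are all assumptions made for illustration.

```python
import random

MASK = "[MASK]"

def load_keywords(path):
    """Read keywords from a .txt file (assumed layout: one keyword per line)."""
    with open(path, encoding="utf-8") as fh:
        return {line.strip().lower() for line in fh if line.strip()}

def selective_mask(tokens, keywords, mask_prob=0.15, rng=None):
    """Mask keyword tokens first, then spend any leftover budget on random tokens.

    Returns (masked_tokens, labels). labels holds the original token at each
    masked position and None everywhere else.
    """
    rng = rng or random.Random(0)
    # Number of tokens to mask; at least one per sequence.
    budget = max(1, round(len(tokens) * mask_prob))
    keyword_pos = [i for i, t in enumerate(tokens) if t.lower() in keywords]
    other_pos = [i for i, t in enumerate(tokens) if t.lower() not in keywords]
    rng.shuffle(keyword_pos)
    rng.shuffle(other_pos)
    # Keyword positions come first, so they are chosen before any other token.
    chosen = set((keyword_pos + other_pos)[:budget])
    masked = [MASK if i in chosen else t for i, t in enumerate(tokens)]
    labels = [t if i in chosen else None for i, t in enumerate(tokens)]
    return masked, labels

# Hypothetical keywords and post, for illustration only.
keywords = {"depressed", "hopeless"}
tokens = "i feel depressed and hopeless most days".split()
masked, labels = selective_mask(tokens, keywords, mask_prob=0.3)
print(masked)  # ['i', 'feel', '[MASK]', 'and', '[MASK]', 'most', 'days']
```

With a budget of two tokens, both keywords are masked before any other token is considered. A real pipeline would apply the same idea to subword token IDs from the BERT tokenizer instead of whitespace-split words.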