Professional Documents
Culture Documents
006 NLP-pipelineSLides
006 NLP-pipelineSLides
006 NLP-pipelineSLides
Complete
Natural
Language
Processing
NLP
Masterclass
Pipeline
This section would allow you to
conceptualise the overall process from
using raw text to the point where it can
be fed to a machine learning model.
For instance; take Amazon Text
Reviews to a point where we can
create a machine learning model can
tell us if it is a positive or negative
review.
Test Evaluate
Model Model
Text Complete
Natural
Language
Pre-
Processing
Masterclass
processing
Most of the time data in the real world,
not limited to NLP but for data science
in general - data can be messy. In our
case, text data can be highly
unstructured. While learning NLP, most
times the datasets require
preprocessing - but not to the extent to
if you were actually creating your own
dataset from scratch. Nevertheless, I
will ensure that you get good practice
with cleaning some very dirty data.
processing
Processing
Masterclass
Tweets
Extract hashtags
Clean URLS
Mentions
Emojis
Smileys
Remove digits
Punctuations
Stop words - dont add much
meaning - a, an, in, this, it, at, the
Normalization Processing
Masterclass
Embeddings Processing
Masterclass
Learning Processing
Masterclass
Evaluate Processing
Masterclass