Lecture 11: Transfer Learning 1
Farzaneh Abdollahi
Winter 2024
Motivation
Definition [3]
▶ Domain: D = {χ, P(X)}:
▶ a feature space χ
▶ a marginal probability distribution P(X)
X = {x_i | x_i ∈ χ, i = 1, ..., N} is a dataset that contains N instances.
▶ Different domains are defined by different feature spaces or different marginal probability distributions.
▶ For fault detection and isolation (FDI), different working conditions (WCs), locations, and machines can be regarded as different domains.
▶ Task: T = {Y, f(.)}, given a specific domain D:
▶ a label space Y; Y = {y_i | y_i ∈ Y, i = 1, ..., N} is the label set for the corresponding instances in D.
▶ a mapping function f(.): a non-linear, implicit function that bridges the input instance and the predicted decision, and is expected to be learned from the given datasets.
▶ Different fault classes and types can be regarded as different tasks.
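As a toy illustration of the definitions above, two domains can share the same feature space χ while differing in their marginal distributions P(X), e.g., the same sensor features collected under two working conditions. A minimal sketch (all data here are synthetic, not from a real machine):

```python
import numpy as np

rng = np.random.default_rng(0)

# Source domain: features collected under one working condition (WC).
X_source = rng.normal(loc=0.0, scale=1.0, size=(500, 2))

# Target domain: same 2-D feature space χ, but a shifted marginal
# distribution P(X), e.g., a different load changes the signal statistics.
X_target = rng.normal(loc=1.5, scale=1.3, size=(500, 2))

# The feature spaces match, yet the sample means reveal the domain shift.
print(X_source.mean(axis=0), X_target.mean(axis=0))
```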
Transfer Learning
▶ Given a source domain D^S = {χ^S, P^S(X^S)} with the source task T^S = {Y^S, f^S(.)}, and a target domain D^T = {χ^T, P^T(X^T)} with the target task T^T = {Y^T, f^T(.)}.
▶ The objective is to learn a better mapping function f^T(.) for the target task T^T with the transferable knowledge gained from the source domain D^S and task T^S.
▶ During the transfer, the domain and/or the task may differ between the source and the target scenarios (D^S ≠ D^T and/or T^S ≠ T^T).
▶ Instance-based:
▶ The labeled instances (data) in the target domain are too limited to train a satisfactory diagnosis model.
▶ On the other hand, some labeled instances in the source domain are significantly different from those in the target domain.
▶ ∴ Directly merging the source data into the target data might deteriorate the performance of the target deep model due to negative transfer during model training.
▶ In instance-based DTL approaches, the objective is to single out the instances in the source domain that are beneficial for target model training and to augment the target data by adopting instance-weighting strategies.
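One common instance-weighting strategy is importance weighting: each source instance receives a weight proportional to the density ratio p_target(x)/p_source(x), so source instances resembling the target domain dominate the training loss. A minimal 1-D sketch using Gaussian density estimates (all data and parameters are synthetic assumptions, not from the lecture):

```python
import numpy as np

rng = np.random.default_rng(1)

# 1-D toy features: the source domain spans a wide range, while the
# target domain concentrates around 2.0 (a different working condition).
x_src = rng.normal(0.0, 2.0, 1000)
x_tgt = rng.normal(2.0, 0.5, 1000)

def gauss_pdf(x, mu, sigma):
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))

# Importance weight w(x) ~ p_target(x) / p_source(x): source instances that
# resemble the target domain get large weights, dissimilar ones are
# down-weighted, which mitigates negative transfer when the data are merged.
w = gauss_pdf(x_src, x_tgt.mean(), x_tgt.std()) / gauss_pdf(x_src, x_src.mean(), x_src.std())
w /= w.mean()  # normalize so a weighted loss keeps its original scale
```

These weights would then multiply each source instance's loss term when training the diagnosis model on the merged data.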
Instance-based DTL
▶ Model-based:
▶ The tasks in the source and the target domains share some common knowledge at the model level.
▶ The transferable knowledge is well embedded in a pretrained source deep model whose structure and parameters are general and helpful for learning a powerful target model.
▶ Assuming some labeled instances are available in the target domain, the goal is to identify which parts of the deep model pretrained in the source domain can help improve the model learning process for the target domain.
▶ Based on the way of training of the target deep model, model-based
DTL can be divided into two subcategories:
▶ Sequential training
▶ Joint training
Model-Based DTL
▶ Sequential training: typically contains two stages:
▶ In the first stage, i.e., pretraining on auxiliary domains, a well-trained source model is learned from the source data, which is richer and larger.
▶ In the second stage, i.e., fine-tuning on the target domain, the target deep model is obtained either by freezing some components of the well-trained source model and fine-tuning the remaining components with the target domain data, or by reusing all the parameters of the well-trained source model to initialize the target deep model and retraining the whole model with the target domain data.
▶ In a deep learning model, the higher-level layers tend to learn task-specific representations while the lower-level layers capture general representations.
▶ ∴ In a classical fine-tuning strategy, the lower-level layers learned from auxiliary domains are frozen and the higher-level layers are retrained with the limited target domain data.
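The two-stage sequential strategy can be sketched with a tiny two-layer network: pretrain both layers on abundant source data, then freeze the lower (general) layer and retrain only the higher (task-specific) layer on scarce target data. Everything below, the network size, learning rate, and synthetic regression task, is a hypothetical illustration, not the lecture's model:

```python
import numpy as np

rng = np.random.default_rng(2)

def forward(x, W1, W2):
    h = np.maximum(0.0, x @ W1)  # lower layer: general representations
    return h @ W2, h             # higher layer: task-specific head

def train(x, y, W1, W2, lr=0.02, steps=500, freeze_lower=False):
    for _ in range(steps):
        pred, h = forward(x, W1, W2)
        err = pred - y                   # gradient of squared error w.r.t. pred
        if not freeze_lower:             # classical fine-tuning freezes W1
            gh = (err @ W2.T) * (h > 0)  # backprop through the ReLU
            W1 = W1 - lr * x.T @ gh / len(x)
        W2 = W2 - lr * h.T @ err / len(x)
    return W1, W2

# Stage 1: pretraining on the richer, larger source-domain data.
x_src = rng.normal(size=(800, 4))
y_src = np.sin(x_src.sum(axis=1, keepdims=True))
W1 = rng.normal(scale=0.5, size=(4, 16))
W2 = rng.normal(scale=0.5, size=(16, 1))
W1, W2 = train(x_src, y_src, W1, W2)

# Stage 2: fine-tuning on limited target-domain data with a shifted
# input distribution; only the higher-level head W2 is retrained.
x_tgt = rng.normal(loc=0.5, size=(40, 4))
y_tgt = np.sin(x_tgt.sum(axis=1, keepdims=True))
W1, W2 = train(x_tgt, y_tgt, W1, W2, freeze_lower=True)
```

The alternative strategy, reusing all pretrained parameters as initialization and retraining everything, corresponds to the second `train` call with `freeze_lower=False`.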
Farzaneh Abdollahi Intelligent Fault Diagnosis Lecture 10 12/27
Outline Motivation and Definition Different Type of Transfer Learning Applications of DTL for IFD
Model-Based DTL
▶ Feature-based: the transferable knowledge lies in domain-invariant feature representations; the features h_S and h_T extracted from the source and target domains are aligned by minimizing a distribution discrepancy metric such as the maximum mean discrepancy (MMD):
MMD(h_S, h_T) = ∥ (1/N_S) Σ_{i=1}^{N_S} φ(h_S^i) − (1/N_T) Σ_{i=1}^{N_T} φ(h_T^i) ∥²_H
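With a linear feature map φ(h) = h, the MMD reduces to the squared distance between the two domain means in feature space. A minimal sketch on synthetic features (the data and dimensions are illustrative assumptions):

```python
import numpy as np

def mmd_linear(h_s, h_t):
    """Squared MMD with a linear kernel: ||mean(h_s) - mean(h_t)||^2."""
    return float(np.sum((h_s.mean(axis=0) - h_t.mean(axis=0)) ** 2))

rng = np.random.default_rng(3)
h_same = rng.normal(0.0, 1.0, (1000, 8))
h_shift = rng.normal(0.8, 1.0, (1000, 8))

# Matched feature distributions give a small MMD; a domain shift inflates it.
print(mmd_linear(h_same, rng.normal(0.0, 1.0, (1000, 8))))  # near 0
print(mmd_linear(h_same, h_shift))                          # clearly larger
```

In a feature-based DTL model, a term like this would be added to the training loss so the network learns features that shrink it.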
▶ Alternatively, adversarial training in the style of a GAN can be used: a domain discriminator is trained to tell source features from target features, while the feature extractor learns representations that fool it [6], [3].
1. The learned deep models are not robust and generalizable in the face
of changeable WCs and diversified data.
▶ It is difficult to deal with the uncertainty caused by the varying environment while machines are working.
▶ Working conditions (WCs) of machines vary during long-term operation, and the health status also declines as crucial components degrade.
2. The deep models should be updated when the manufacturing products are upgraded.
▶ It is hard to collect and annotate training data from scratch for every new product.
▶ Reusing the labeled historical data collected and accumulated from the old products is relatively easy.
References