
Computer Vision

Lab 10: Introduction to Transformers & Transfer Learning

Based on the Lecture


Prepared by
Amjad Dife
2023 / 2024

Motivation

(motivational figures from the lecture slides)
Outline
❑ Transformers
➢ Introduction to Transformers as a neural network
➢ The need for the Word2Vec network
▪ The main usage
▪ Measuring similarities
▪ Different operations on vectors
▪ Drawbacks of Word2Vec, and the solution
➢ The main block in Transformers
▪ Self-attention block
▪ Multi-headed attention
❑ Transfer Learning
Introduction to Transformers

▪ The Transformer is a neural network that was mainly developed for NLP tasks and then applied to computer vision.
▪ It requires a huge amount of data to be trained.
▪ It began with the paper "Attention Is All You Need", by Google.
Word2Vec

▪ The main idea is to embed each word as a vector (512 numbers, for example) such that similar words lie nearest to each other in the vector space.

▪ How do we measure the similarity between two words' vectors? Using the dot product of the normalized vectors, i.e. the cosine similarity. A small sketch follows below.
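A minimal sketch of this similarity measure, using hand-made toy vectors (hypothetical; real Word2Vec embeddings are learned from text and have hundreds of dimensions):

```python
import numpy as np

def cosine_similarity(u, v):
    """Cosine of the angle between two vectors: 1.0 means same direction."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))

# Toy 4-dimensional vectors (hypothetical; real embeddings have e.g. 512 dims).
cat = np.array([0.9, 0.8, 0.1, 0.0])
dog = np.array([0.8, 0.9, 0.2, 0.1])
car = np.array([0.1, 0.0, 0.9, 0.8])

print(cosine_similarity(cat, dog))  # high (~0.99): "cat" and "dog" are related
print(cosine_similarity(cat, car))  # low (~0.12): "cat" and "car" are unrelated
```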
Word2Vec

▪ Note that the higher the dot product, the nearer the vectors (the more related the words).
Word2Vec

▪ Different operations on the vectors:
➢ Paris – France + Italy ≈ Rome (see the sketch below)
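A sketch of this analogy arithmetic, again with hypothetical toy vectors; with real embeddings one searches the whole vocabulary for the word nearest to the result vector:

```python
import numpy as np

# Hypothetical toy embeddings; real ones come from a trained Word2Vec model.
vectors = {
    "Paris":  np.array([0.9, 0.1, 0.8]),
    "France": np.array([0.1, 0.9, 0.8]),
    "Italy":  np.array([0.1, 0.9, 0.2]),
    "Rome":   np.array([0.9, 0.1, 0.2]),
}

# capital - country + country ~= the other capital
query = vectors["Paris"] - vectors["France"] + vectors["Italy"]

def cos(u, v):
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

# Find the word whose vector is nearest (by cosine similarity) to the query.
best = max(vectors, key=lambda w: cos(vectors[w], query))
print(best)  # "Rome" with these toy vectors
```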
Word2Vec

▪ The drawback of the Word2Vec network is that the vector of each word is fixed: it doesn't change based on the context.
▪ Note that in natural languages, the same word can be used in different contexts and have different meanings based on the context (the surrounding words).
▪ The solution is a representation that depends on the surrounding words, which is exactly what the Transformer's self-attention block provides.
Self Attention

▪ The main block in the Transformer is the "self-attention block".

▪ The self-attention block has three main steps:
➢ Find the alignment scores: by calculating the dot product (a matrix multiplication) between the embeddings.
➢ Get the weights: by normalizing the scores (from step 1) with softmax.
➢ Reweight the original embeddings: using the weights (from step 2). A minimal sketch of all three steps follows this list.
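A minimal NumPy sketch of the three steps, assuming the simplest variant where the embeddings themselves serve as queries, keys, and values (a full Transformer additionally applies learned projections and scales the scores before softmax):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))  # subtract max for stability
    return e / e.sum(axis=axis, keepdims=True)

# Toy "sentence" of 4 tokens, each embedded as a 6-dimensional vector.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 6))              # (num_tokens, embedding_dim)

# Step 1: alignment scores = dot product between every pair of embeddings.
scores = X @ X.T                         # (4, 4) alignment map

# Step 2: weights = the scores normalized row-wise with softmax.
weights = softmax(scores, axis=-1)       # each row sums to 1

# Step 3: reweight the original embeddings with the weights.
out = weights @ X                        # (4, 6) context-aware embeddings
print(out.shape)
```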
Self Attention | The Big Picture

(diagram of the full self-attention pipeline)
Self Attention | Step 1

1. Find the alignment scores: taking the dot product of every pair of word embeddings gives the alignment map.
Self Attention | Step 2

2. Get the weights: normalize the scores (from step 1) with softmax.
Self Attention | Step 3

3. Reweight the original embeddings using the weights (from step 2).
Multi-headed Attention

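The slide's diagram is not reproduced here. The idea: several self-attention "heads" run in parallel on (projections of) the same input and their outputs are concatenated, so different heads can attend to different relationships. A hedged sketch, reusing the self-attention steps above and random projections in place of the learned ones a real Transformer uses:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X):
    """The three steps from the previous section on one input matrix."""
    weights = softmax(X @ X.T, axis=-1)  # steps 1 and 2
    return weights @ X                   # step 3

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))              # 4 tokens, 8-dim embeddings
num_heads, head_dim = 2, 4

# Each head sees its own projection of the input (random here, learned in
# practice), so each head can capture a different kind of relationship.
heads = []
for _ in range(num_heads):
    W = rng.normal(size=(8, head_dim))
    heads.append(self_attention(X @ W))  # (4, head_dim) per head

out = np.concatenate(heads, axis=-1)     # (4, 8) concatenated head outputs
print(out.shape)
```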
Transfer Learning
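The figures on these slides are not reproduced here. The standard recipe: take a network pretrained on a large dataset (e.g. ImageNet), freeze its feature-extraction layers, and replace and retrain only the final classification layer for the new task. A minimal sketch, assuming PyTorch and torchvision are available (the 10-class head is a placeholder):

```python
import torch.nn as nn
from torchvision import models

# Load a ResNet-18 pretrained on ImageNet.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)

# Freeze the pretrained feature extractor.
for param in model.parameters():
    param.requires_grad = False

# Replace the final layer with a new head for our task (10 classes assumed).
num_classes = 10
model.fc = nn.Linear(model.fc.in_features, num_classes)

# Only the new head's parameters will be updated during training.
trainable = [name for name, p in model.named_parameters() if p.requires_grad]
print(trainable)  # ['fc.weight', 'fc.bias']
```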
Thank You

The image was generated by a neural network ☺
