VA Lecture 28
Versha - 2021UCD2156
What do you mean by Video Segmentation?
Video segmentation is the process of dividing a video into meaningful regions. These regions can be defined by characteristics such as:
● Object boundaries
● Motion
● Color
● Texture
● Other visual features
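As one illustration of segmenting by motion, the sketch below marks pixels whose intensity changes between two consecutive frames (simple frame differencing). The function name `motion_mask`, the threshold of 20, and the tiny 4x4 frames are assumptions made for this example, not part of the lecture.

```python
def motion_mask(prev_frame, curr_frame, threshold=20):
    """Return a binary mask: 1 where intensity changed by more than threshold."""
    return [
        [1 if abs(c - p) > threshold else 0 for p, c in zip(prev_row, curr_row)]
        for prev_row, curr_row in zip(prev_frame, curr_frame)
    ]

# Two tiny grayscale frames: a bright 2x2 block moves one pixel to the right.
frame_a = [
    [0, 0, 0, 0],
    [0, 200, 200, 0],
    [0, 200, 200, 0],
    [0, 0, 0, 0],
]
frame_b = [
    [0, 0, 0, 0],
    [0, 0, 200, 200],
    [0, 0, 200, 200],
    [0, 0, 0, 0],
]

mask = motion_mask(frame_a, frame_b)
for row in mask:
    print(row)
```

Only the pixels the block left or entered are marked; the overlapping column stays 0 because its intensity did not change. This is a known limitation of plain frame differencing.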
Types of Video Segmentation
Video Object Segmentation (VOS)
a. Automatic (Unsupervised)
b. Semi-automatic (Semi-supervised)
c. Interactive
d. Language-guided
Video Semantic Segmentation (VSS)
a. (Instance-Agnostic) VSS
b. Video Instance Segmentation
c. Video Panoptic Segmentation
Video Object Segmentation
It focuses on tracking objects within a video and is used in applications such as surveillance and autonomous vehicles.
Methodology:
● Object initialization - identifying the object in the first frame of the video
● Object tracking - tracking its movement throughout the rest of the video
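The two steps above can be sketched with a toy template tracker: the object is initialized by a bounding box in the first frame, and each later frame is searched for the position that best matches that template (smallest sum of absolute differences). This is a deliberately simple stand-in, not a method named in the lecture; `crop`, `sad`, `track`, and `make_frame` are hypothetical helpers.

```python
def crop(frame, top, left, height, width):
    """Cut a height x width patch out of a 2D frame."""
    return [row[left:left + width] for row in frame[top:top + height]]

def sad(patch_a, patch_b):
    """Sum of absolute differences between two equally sized patches."""
    return sum(abs(a - b) for ra, rb in zip(patch_a, patch_b) for a, b in zip(ra, rb))

def track(frames, box):
    """box = (top, left, height, width) of the object in the first frame."""
    top, left, h, w = box
    template = crop(frames[0], top, left, h, w)   # object initialization
    positions = [(top, left)]
    for frame in frames[1:]:                      # object tracking
        best = None
        for t in range(len(frame) - h + 1):
            for l in range(len(frame[0]) - w + 1):
                cost = sad(template, crop(frame, t, l, h, w))
                if best is None or cost < best[0]:
                    best = (cost, t, l)
        positions.append((best[1], best[2]))
    return positions

def make_frame(top, left, size=5):
    """Synthetic frame: a 2x2 textured object on a black background."""
    frame = [[0] * size for _ in range(size)]
    pattern = [[90, 80], [70, 60]]
    for dy in range(2):
        for dx in range(2):
            frame[top + dy][left + dx] = pattern[dy][dx]
    return frame

frames = [make_frame(1, 1), make_frame(1, 2), make_frame(2, 3)]
positions = track(frames, box=(1, 1, 2, 2))
print(positions)
```

A fixed template fails when the object changes appearance or scale, which is one reason learned VOS models are used in practice.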
Approaches
1. Unsupervised VOS
- aims to segment objects in a video without using any labeled data.
- e.g. Focus on Foreground Network (F2Net).
2. Semi-supervised VOS
- uses a small amount of labeled data to guide the segmentation process, with unsupervised methods refining the segmentation results.
- useful in cases where obtaining labeled data is difficult or expensive.
- additionally, the unsupervised methods used in semi-supervised video object segmentation can help to improve the robustness and generalization of the segmentation results.
- e.g. Sparse Spatiotemporal Transformers (SST).
3. Interactive VOS
- the user specifies the initial location of an object in the first frame of the video, or draws a bounding box around the object.
4. Language-guided VOS
- uses natural language input to guide the segmentation and tracking of
objects within a video.
Video Semantic Segmentation
Methodology
● Feature extraction using a convolutional neural network (CNN)
● Each pixel is classified from these features using a fully convolutional network (FCN)
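The per-pixel classification step can be sketched as follows, assuming the feature extractor has already produced a feature vector for every pixel. A linear classifier applied independently at each pixel (equivalent to a 1x1 convolution) scores each class, and the highest score gives the label. The function `classify_pixels` and the hand-picked features and weights are illustrative assumptions; a real FCN learns them from data.

```python
def classify_pixels(features, weights, biases):
    """features: H x W x C nested lists; weights: K x C; biases: K.
    Returns an H x W map of class indices (argmax over K class scores)."""
    label_map = []
    for row in features:
        label_row = []
        for pixel in row:
            scores = [
                sum(w * f for w, f in zip(class_w, pixel)) + b
                for class_w, b in zip(weights, biases)
            ]
            label_row.append(scores.index(max(scores)))
        label_map.append(label_row)
    return label_map

# Hypothetical 2x2 feature map with 2 channels per pixel.
features = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[0.9, 0.2], [0.1, 0.8]],
]
weights = [[1.0, 0.0], [0.0, 1.0]]  # class 0 responds to channel 0, class 1 to channel 1
biases = [0.0, 0.0]

labels = classify_pixels(features, weights, biases)
print(labels)
```

For video, the same classifier would be applied to every frame; the result is one label map per frame.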
Approaches
1. (Instance-Agnostic) Video Semantic Segmentation
- It identifies and segments objects in a video sequence by class, without distinguishing the individual instances of the objects.
- In contrast, instance-aware segmentation tracks and segments each individual instance of an object within a video. Skipping that step makes the instance-agnostic approach less computationally demanding.
2. Video Instance Segmentation
- It identifies and segments individual instances of objects within a video sequence.
- It is in contrast to instance-agnostic semantic segmentation, which only identifies and segments objects within a video without considering individual instances.
3. Video Panoptic Segmentation
- assigns every pixel in a video sequence a class label and, for countable objects, an instance identity, in a single step. This approach combines the strengths of both instance-agnostic semantic segmentation and video instance segmentation.
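A minimal sketch of how the two outputs can be combined for one frame: each pixel receives a (class, instance id) pair, where the instance id is kept only for countable classes. The class numbering, the `THING_CLASSES` set, and the `to_panoptic` function are assumptions made for illustration.

```python
# Hypothetical labels: class 0 = "road" (uncountable), class 1 = "car" (countable).
THING_CLASSES = {1}

def to_panoptic(semantic, instance):
    """Merge a per-pixel class map and a per-pixel instance-id map
    into a map of (class, instance_id) pairs."""
    panoptic = []
    for sem_row, inst_row in zip(semantic, instance):
        panoptic.append([
            (cls, inst if cls in THING_CLASSES else 0)
            for cls, inst in zip(sem_row, inst_row)
        ])
    return panoptic

semantic = [
    [1, 0, 1],
    [1, 0, 1],
]
instance = [
    [1, 0, 2],   # two separate cars: ids 1 and 2
    [1, 0, 2],
]

panoptic = to_panoptic(semantic, instance)
for row in panoptic:
    print(row)
```

In video, the instance ids would additionally need to stay consistent for the same object from frame to frame.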
Challenges and Limitations of Video Segmentation
4. Edge
● Identify sharp changes in intensity between pixels (edges).
● Significant variations in edge patterns between frames could suggest a shot change.
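The edge-based idea can be sketched as below: build a crude edge map per frame from neighbouring-pixel intensity differences, then flag a shot change when a large fraction of edge pixels differs between consecutive frames. The thresholds (50 and 0.5) and all function names are assumptions for this example; a practical system would use a proper edge detector and tuned thresholds.

```python
def edge_map(frame, threshold=50):
    """Mark a pixel as an edge if it differs sharply from its right or lower neighbour."""
    h, w = len(frame), len(frame[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            right = abs(frame[y][x] - frame[y][x + 1]) if x + 1 < w else 0
            down = abs(frame[y][x] - frame[y + 1][x]) if y + 1 < h else 0
            if max(right, down) > threshold:
                edges[y][x] = 1
    return edges

def edge_change_fraction(frame_a, frame_b, threshold=50):
    """Fraction of edge pixels (in either frame) that are not shared by both frames."""
    ea, eb = edge_map(frame_a, threshold), edge_map(frame_b, threshold)
    changed = sum(a != b for ra, rb in zip(ea, eb) for a, b in zip(ra, rb))
    total = sum(a or b for ra, rb in zip(ea, eb) for a, b in zip(ra, rb))
    return changed / total if total else 0.0

def detect_shot_changes(frames, cut_threshold=0.5):
    """Return indices i where frame i appears to start a new shot."""
    return [
        i for i in range(1, len(frames))
        if edge_change_fraction(frames[i - 1], frames[i]) > cut_threshold
    ]

# Scene A has a vertical edge, scene B a horizontal one.
scene_a = [[0, 0, 200, 200] for _ in range(4)]
scene_b = [[0] * 4, [0] * 4, [200] * 4, [200] * 4]

cuts = detect_shot_changes([scene_a, scene_a, scene_b, scene_b])
print(cuts)
```

Comparing edge patterns rather than raw intensities makes the check less sensitive to global brightness changes, although fast motion within a shot can still trigger false detections.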
Strengths and Limitations