Professional Documents
Culture Documents
Multimedia Databases: Yonsei University 2 Semester, 2009 Sanghyun Park
Multimedia Databases: Yonsei University 2 Semester, 2009 Sanghyun Park
Multimedia Databases
Yonsei University
2nd Semester, 2009
Sanghyun Park
Contents
Introduction to MMDBMS
Motivation
Content-based retrieval
Generic MMDBMS structure
Image Retrieval
Information Sciences, 178(22), pp.4301~4313,
November, 2008, Elsevier
Conclusion
Motivation (1/2)
Multimedia is a much more powerful communication tool
than traditional data in our daily life
Image showcase, graphic design, TV commercial, speech,
movie, hand phone multimedia message, etc
PC 1.5
laptop 1.4
mouse 1.2
monitor 1.1
Mapping
keywords Search
results
… … … …
PC 1.5
Collect keywords
… … … …
… … … …
laptop 1.4
mouse 1.2 and confidences … …
… …
monitor 1.1 …
…
…
…
… …
… …
CD 0.6
tree 0.4 …
…
…
… … …
keyboard 0.1 … … …
…
…
…
…… …
keyword-based
image retrieval
Bench
Search
results
Initial retrieval results
…
bench 4.0 leather 2.5 chair 4.0 grass 2.0 ship 3.0 candy 2.1 table 3.0
bag 3.0 bench 3.2 bench 2.1 bench 1.8 bench 1.2 animal 1.8 bench 0.5
tree 1.2 leaf 0.5 black 3.2 sun 0.5 dove 2.4 bench 0.8 laptop 2.7
Initial retrieval results
…
bench 4.0 leather 2.5 chair 4.0 grass 2.0 ship 3.0 candy 2.1 table 3.0
bag 3.0 bench 3.2 bench 2.1 bench 1.8 bench 1.2 animal 1.8 bench 0.5
tree 1.2 leaf 0.5 black 3.2 sun 0.5 dove 2.4 bench 0.8 laptop 2.7
positive positive
• Rearrangement of images
– Rearrangement order should be based on visual feature
– What kind of visual feature plays a critical role in
distinguishing positive and negative images
• Discrimination power
– Ex) a query keyword ‘forest’
• The user is likely to focus on the color than the shape or pattern
when submitting user’s feedback
Visual feature 1
…
positive positive
Visual feature 2
…
positive positive
Visual feature 3
…
positive positive
• Discrimination Power of VFj (jth visual feature)
Np: # of positive images
Po j Ne j
DPj Nn: # of negative images
N p Nn Poj: # of positive images among the top Npth images
Nej: # of negative images among the bottom N nth images
• Weight of VFj
DPj
wj n
DP
k 1
k
VF1 VF2 SUM
Image Iavg
0.3 ×0.2 0.8 ×1.0 0.86
positive
0.3 ×0.2 0.5 ×1.0 0.56
positive positive
Confidence
modification
…
bench 4.5 grass 2.0 chair 4.0 leather 2.5 ship 3.0 table 3.0 candy 2.1
bag 3.0 bench 2.5 bench 2.3 bench 3.2 bench 1.2 animal 1.8
tree 1.2 sun 0.5 black 3.2 leaf 0.5 dove 2.4 laptop 2.7 bench 0.3
• Training set
– 360 images (about 4% of the total number of image)
• Test set
– 8,921 images
• Parameter decision
– ThresholdSize: # of additional images
• Growth rate of recall
RE ( ExtendedFeedback) RE ( NaiveFeedback )
Growth rate of recall 100
RE ( NaiveFeedback )
• Growth rate of precision
PR( ExtendedFeedback ) PR( NaiveFeedback )
Growth rate of precision 100
PR ( NaiveFeedback )
Conclusion
Multimedia is a powerful tool for communication