Professional Documents
Culture Documents
Online Handwritten Script Recognition
Online Handwritten Script Recognition
Online Handwritten Script Recognition
RECOGNITION
Presentation By:
Priya Ahuja
CSE 6C
10-CSU-110
CONTENTS
Online Recognition
Why Handwriting Recognition?
Why is Handwriting Recognition difficult?
Properties of Scripts
Features of Handwritten Script
Steps in Handwritten Script Recognition
Future Scope
References
Online Recognition
On-line handwriting recognition involves the automatic conversion of
text as it is written on a special digitizer or PDA, where a sensor picks
up the pen-tip movements as well as pen-up/pen-down switching. That
kind of data is known as digital ink and can be regarded as a dynamic
representation of handwriting. The obtained signal is converted into
letter codes which are usable within computer and text-processing
applications.
The elements of an on-line handwriting recognition interface typically
include:
1) a pen or stylus for the user to write with.
2) a touch sensitive surface, which may be integrated with, or adjacent
to, an output display.
3) a software application which interprets the movements of the stylus
across the writing surface, translating the resulting strokes into digital
text.
Devices that accept on-line handwritten data: From the top left, Pocket
PC, CrossPad, Ink Link, Cell Phone, Smart Board, Tablet with display,
Anoto pen, Wacom Tablet, Tablet PC
Properties of scripts
Arabic : Arabic is written from right to left within a line and the lines
are written from top to bottom. A typical Arabic character contains a
relatively long main stroke which is drawn from right to left, along with
one to three dots. The character set contains three long vowels.
Short markings(diacritics) may be added to the main character to
indicate short vowels. Due to these diacritical marks and the dots in
the script, the length of the strokes vary considerably.
Cyrillic: Cyrillic script looks very similar to the cursive Roman script. The
most distinctive features of Cyrillic script, compared to Roman script are:
1) individual characters, connected together in a word, form one long
stroke,
2) the absence of delayed strokes .Delayed strokes cause movement of the
pen in the direction opposite to the regular writing direction.
Han: Characters of Han script are composed of multiple short strokes. The
strokes are usually drawn from top to bottom and left to right within a character.
The direction of writing of words in a line is either left to right or top to bottom.
Hebrew: Words in a line of Hebrew script are written from right to left and,
hence, the script is temporally similar to Arabic. The most distinguishing factor
of Hebrew from Arabic is that the strokes are more uniform in length in the
former.
Roman: Roman script has the same writing direction as Cyrillic, Devnagari,
and Han scripts. In addition, the length of the strokes tends to fall between that
of Devnagari and Cyrillic scripts.
where Xstart(.) denotes the x coordinate of the pen-down position of the stroke,
n is the number of strokes in the pattern,
and r is set to 3 to reduce errors due to abrupt changes in direction between
successive strokes.
The value of HID falls in the range [r n , n r].
Where H(r,denotes the number of votes in the (r,th bin in the twodimensional Hough transform space. The Hough transform can be
computed efficiently for dynamic data by considering only the sample
points. The numerator is the sum of the bins corresponding to line
orientations between -10o and 10o and the denominator is the sum of
all the bins in the Hough transform space. The value of Shirorekha
Strength is a real number which falls in the range [0.0, 1.0].
Stroke Density: This is the number of strokes per unit length (x-axis) of the
pattern. Note that the Han script is written using short strokes, while
Roman and Cyrillic are written using longer strokes.
where n is the number of strokes in the pattern. The value of Stroke
Density is a real number and can vary within the range (0.0,R 1), where R1
is a positive real number.
Aspect Ratio: This is the ratio of the width to the height of a pattern. The
value of Aspect Ratio is a real number and can vary within the range (0.0,
R2), where R2 is a positive real number.
Reverse Distance: This is the distance by which the pen moves in the
direction opposite to the normal writing direction. The normal writing
direction is different for different scripts. The value of Reverse Distance is a
nonnegative integer and its observed values were in the range [0,1200].
where Y pen-down(.)and Y pen-up(.) are the y-coordinates of the pen-down and pen-up
positions, respectively .For an n-stroke pattern, the Average Vertical Stroke
Direction is computed as the average of the VD values of its component strokes.
The value of Average Vertical Stroke Direction falls in the range [-1.0,1.0].
_
Y(s) is the average of the y-coordinates of the stroke points and n is the
number of strokes in the pattern. The value of VID is an integer and falls
in the range (1 -n, n -1).
Variance of Stroke Length: This is the variance in sample lengths of
individual strokes within a pattern. The value is of Variance of Stroke
Length is a nonnegative integer.
Feature Extraction
Basic Idea: Describe shape of pen trajectory locally
Typical Features:
Slope angle of local trajectory(represented as sin and cos :
continuous variation)
Binary pen-up vs. pen-down feature
Hat feature for describing delayed strokes(strokes that spatially
correspond to removed delayed strokes are marked)
Feature Dynamics: In all applications of HMMs dynamic
features greatly enhance performance.
Discrete time derivative of features
Here: Differences between successive slope angles
CLASSIFICATION
The last big step is classification. In this
step various models are used to map
the extracted features to different
classes and thus identifying the
characters or words the features
represent.
Markov property
A Markov System
Has N states, called s1, s2 .. sN
s2
s1
N=3
t=0
s3
A Markov System
Has N states, called s , s .. s
1
s2
s1
N=3
t=0
qt=q0=s3
s3
Current State
A Markov System
Has N states, called s , s .. s
Current State
s2
s1
N=3
t=1
qt=q1=s2
s3
P(qt+1=s1|qt=s2) = 1/2
P(qt+1=s2|qt=s2) = 1/2
P(qt+1=s3|qt=s2) = 0
P(qt+1=s1|qt=s1) = 0
P(qt+1=s2|qt=s1) = 0
s2
P(qt+1=s3|qt=s1) = 1
s1
qt=q1=s2
s3
N=3
t=1
P(qt+1=s1|qt=s3) = 1/3
P(qt+1=s2|qt=s3) = 2/3
P(qt+1=s3|qt=s3) = 0
P(qt+1=s1|qt=s2) = 1/2
P(qt+1=s2|qt=s2) = 1/2
Has N states, called s1, s2 .. sN
P(qt+1=s3|qt=s2) = 0
P(qt+1=s1|qt=s1) = 0
s2
P(qt+1=s2|qt=s1) = 0
P(qt+1=s3|qt=s1) = 1
Note: qt {s1, s2 .. sN }
2/3
1/2
s1
N=3
t=1
qt=q1=s2
1/3
s3
P(qt+1=s1|qt=s3) = 1/3
P(qt+1=s2|qt=s3) = 2/3
P(qt+1=s3|qt=s3) = 0
Often notated with arcs
between states
P(qt+1=s1|qt=s2) = 1/2
P(qt+1=s2|qt=s2) = 1/2
Markov Property
P(qt+1=s3|qt=s2) = 0
P(qt+1=s1|qt=s1) = 0
s2
P(qt+1=s2|qt=s1) = 0
P(qt+1=s3|qt=s1) = 1
In other words:
2/3
1/2
s1
N=3
t=1
qt=q1=s2
1/3
s3
P(qt+1 = sj |qt = si ) =
P(qt+1 = sj |qt = si ,any earlier history)
Notation:
P(qt+1=s1|qt=s3) = 1/3
P(qt+1=s2|qt=s3) = 2/3
P(qt+1=s3|qt=s3) = 0
aij P (qt 1 si | q s j )
i P (q1 si )
Basic Calculations-1
Example: What is the probability that the
weather for eight consecutive days is
sun-sun-sun-rain-rain-sun-cloudy-sun?
Solution:
O = sun sun sun rain rain sun cloudy sun
3
3 3 1 1 3
2
3
a12 a1N
a21
a22 a2N
:
aN1
:
:
aN2 aNN
P(qt+1=Sj | qt=Si)=aij
b1(1)
b1(2)
b1(M)
b2(1)
b2(2)
b2(M)
P(Ot=k | qt=Si)=bi(k)
The coins represent the states; these are hidden because you do not
know which coin was tossed each time
The outcome of each toss represents an observation
A likely sequence of coins may be inferred from the observations, but
this state sequence will not be unique
1, 2 or 3 coins?
Which of these models is best?
Since
Future Scope
Over the past three decades, many different methods
have been explored by a large number of scientists to
recognize characters. A variety of approaches have
been proposed and tested by researchers in different
parts of the world to improve the experience of
usability.
References
ieeexplore.ieee.org/iel5/34/28182/01261096.pdf
F. Coulmas, Writing Systems: An Introduction to Their Linguistic Analysis:
Cambridge University Press, 2003.
R. Plamondon and S. N. Srihari, "On-line and off-line handwriting
recognition: A comprehensive survey," IEEE Transactions on Pattern
Analysis and Machine Intelligence,vol. 22, pp. 63-84, 2000.
A. L. Spitz, "Determination of the script and language content of
document images," IEEE Transactions on Pattern Analysis and Machine
Intelligence, vol. 19, pp. 235-245,1997
G. X. Tan, C. Viard-Gaudin, and A. Kot, "Automatic Writer Identification
Framework for Online HandwrittenDocuments Using Character
Prototypes," Pattern Recogn.,2009.
THANK YOU !