Professional Documents
Culture Documents
A Review of Skew Detection Techniques For Document
A Review of Skew Detection Techniques For Document
A Review of Skew Detection Techniques For Document
317
Authorized licensed use limited to: TU Wien Bibliothek. Downloaded on September 03,2023 at 19:11:56 UTC from IEEE Xplore. Restrictions apply.
the number of intersected line that sinusoidal curves have at In general, the Hough Transform method has a high
various ρ and θ values the accumulator array is forming. For accuracy in detecting the angle of skewed. On other hand,
each point of ρ, a peak gives the angles where straight lines the complex document images which contain figure, title,
can be fitted out of the original pixels. The skew angle of columns and foot will make the process of finding the
the documents can be calculated by averaging the value of θ highest values of the accumulator into a difficult task and
for highest accumulator peaks which represented in fig. 3 by will not calculate the correct skew angle for the documents.
the lines. Fig. 4 shows the points in Hough Transform space Besides, the presence of noise will slow this method.
with the highest values of accumulator. Furthermore, it consumes a large memory.
318
Authorized licensed use limited to: TU Wien Bibliothek. Downloaded on September 03,2023 at 19:11:56 UTC from IEEE Xplore. Restrictions apply.
TABLE I. A COMPREHENSIVE COMPARATIVE ANALYSIS OF SKEW DETECTION TECHNIQUES.
Method Advantages Main Drawbacks Time Iterative Content of the document / Mixed document
Nature
This is very
straightforward solution
to find the skew If the image contains diagrams or graphs, this
PP Sensitive to noise. Very slow Iterative
angle. Easy to code and will reduce the accuracy of detecting the angle.
it is simple to
understand.
High accuracy in The presence of noise
The complex document images which contain
detecting the angle of will slow this method.
Faster than figure, title, columns and foot will make the
HT skewed. Also it requires a large Non-iterative
PP method process of finding the highest values of the
memory.
accumulator difficult.
Robust , reliable, and This method faces Can be used to detect the skew angle from the
NN can be used to any range problem in degraded and Fast Non-iterative document contains different types of font size,
of skew angles historical documents. scripts and layouts.
319
Authorized licensed use limited to: TU Wien Bibliothek. Downloaded on September 03,2023 at 19:11:56 UTC from IEEE Xplore. Restrictions apply.
TABLE III. THE CORRECT SKEWED ANGLE AND ALSO
SKEWED ANGLE FOR PP, HT AND NN METHODS
Methods
Figure 6. (a) And (b) are sample images used in the experiment.
From the experiments, we conclude that the projection
To compare the performance between the existing profile performed better than other methods and achieves
methods, we compare on two aspects which are the speed the highest accuracy. However, PP does not really
and accuracy. successful when it is given input images that include figures
or the skewed angle is more than 15°. The Hough transform
method unable to give high accuracy hence the sample
A. Speed images contain figure, title, columns and foot, which will
We compare the speed between the methods using make the process of finding the highest values of the
MATLAB clock function. Table II reported the accumulator is a difficult task. The nearest neighbor method
computing time for each method. performed poorly for the reason that it is designed for a
special script.
320
Authorized licensed use limited to: TU Wien Bibliothek. Downloaded on September 03,2023 at 19:11:56 UTC from IEEE Xplore. Restrictions apply.
REFERENCES
321
Authorized licensed use limited to: TU Wien Bibliothek. Downloaded on September 03,2023 at 19:11:56 UTC from IEEE Xplore. Restrictions apply.