Professional Documents
Culture Documents
Model For A Gradual Objects Recognition
Model For A Gradual Objects Recognition
Abstract: The objects appearance in the images is modified because of several external factors (i.e.:
illumination, rotate, pose) and the application of the standard comparison algorithms does not answer
suitably. For landing this problem, we present here an architecture based on a gradual recognition:
The object is represented on a scale of categories, and the task of the algorithms of recognition is to
concentrate on the determination of the category the most detailed according to information extracted from
the image. It is not necessary any more to pass by several intermediate phases before starting the process of
recognition. On this principle, we propose a model for the internal representation of a system of vision which
tries to generalize the recognition of objects by taking into account the categorization.
2
The hybridization here consists of the use of two
modes of data processing. The treatment steered by This diagram clarifies the organization of the models
the data is determined by the information-stimulus of objects according to three levels of categorization.
coming from the outside (bottom / up), while the We introduced besides, the knowledge than can
treatment steered by concepts is driven by the contained in categories: rules of correspondence,
information already stored in memory, what implies deductions procedures, etc. We insist here on the
a preliminary experience and already acquired visual aspect, the characteristics can be introduced
concepts (top / down) [26]. by parameters illustrated here in the left party. Every
characteristic points towards a set of parameters
3.2 Description of the approach according to its position (SUB_CATEGORIE,
BASE_CATEGORIE or SUR_CATEGORIE)
3.2.1 The model
Considering the objects particularity of our 3.2.2 The general architecture
application domain (aerial images), it is necessary to Our architecture is based on the idea which consider
enrich the internal representation by other aspects as the recognition as a gradual process ending in an
the color and texture, and to endow the system with identification or in a categorization. It possesses
an inferences mechanism for landing the gaps of the three big modules (Fig. 3a): a library containing the
prototypes model. In this work we consider that the tools of images treatment, a module concerning the
categorization is an effective means to organize a segmentation multicriterion (Fig. 3b) and a module
vision system. The internal representation uses for the categorization tasks (Fig. 3c).
abstraction levels to supply a gradual recognition. The recognition process proposed in this architecture
For this, we suppose that the superior level is an follows a simple procedure:
organizing and generative level including knowledge 1- Start with the segmentation multicriterion for the
on the validity of a category. We organize the most complete possible extraction of the indicators
characteristics hierarchically in three levels: (characteristics of texture, shape and color)
- The first level will concern information such as the 2- Selection of categories according to the discovery
color, the texture, the shape, etc. of the indications.
- The second level will be concentrated on the object 3- Elaboration of the segmentation by combination
parts. The recognition of a part can select all the of criteria and reduction of the categories number by
prototypes containing this part. contribution of precision.
- The third level will be interested in constraints 4- Definitive Decision: identification or
intra-objects and inter-objects. A constraint can categorization
suggest a set of prototypes validating it. The essential character of this architecture is to
To illustrate our idea, we are going to be interested organize the recognition by allowing the process to
to the first level of processing and to take into be started since the low level of the images treatment
account the color, the texture and the flat forms until the highest level. We tempt then, to resolve two
(2D). The modeling takes into account three problems: the extraction of objects via the
abstraction levels: meta-category, basic-category and segmentation and the recognition of the extracted
sub-category according to the following scheme: objects.
An aerial image, in our case, presents several
objects, it would be very difficult to extract them
SUR_CATEGORIE directly by the segmentation module. Indeed, an
KNOWLEDGE BASE House Zones,… object could be the result of several zones supplied
R1: House is never black by the segmentation multicriterion. For that purpose,
several approaches putting in front of the automatic
Classes of textures BASE_CATEGORIE modeling by the learning were advanced. We cite
Particular texture City
some from them: 1) Modelling by the color: object
is represented by a histogram of colors [27]. 2)
Classes of colors Approaches of EigenImages numerous researchers
Particular color SUB_CATEGORIE [28] uses the Karhunen-Loeve transformation ( or
Classes of shapes Rabat
Principal Components Analysis: PCA ) for the
calculation of appropriate images. 3) Probability
Particular shape
Density: Objects are represented by a probability
Fig. 1
3
density of objects characteristics[29]. The learning Image Entry
and the objects localization are formulated as
problems of parameters estimation. TOOLS FOR
Segmentation Categorization
The automatic objects modeling by distributions of IMAGE
PROCESSING Module tasks
probability is used efficiently in many applications.
It is adopted here for illustration.
We treat at first the information concerning the
General Arichitecture
color, the texture and the shape. We combine then
Fig. 3a
these three characteristics to model objects by 3
distributions of probability.
Pretreatment, grading, corrections,…
This diagram (Fig. 3c) suggests a functioning in two
modes: learning and Recognition. Multicriteria Segmentation
To illustrate these processes, we use the framework color Texture
of the Beyesian theory. The allocation of an element
x in one of the models {Z1, Z2, …, Zk} is based on
the following formula (formula of Bayes): Combination of criteria of segmentation
p(Zi|x)= p(x|Zi). p(Zi)/ p(x)
p(Zi|x): the a posteriori probability. Segmentation multicriterion
p(Zi): represent the probability that an element taken Fig. 2b
at random belongs to the model Segmentation by
textures Texture Base
p(x): is the law modeling the global histogram of the of
prototypes
observed image. We often take a uniform law.
p(x|Zi): the a priori probability colour
4.1 Segmentation:
4
We chose 3 criteria of segmentation for this z1 = t1-c1-f1,
example: borders, color and texture: z2 = t2-c2-f2,
z3 = t3-c3-f2
- Segmentation by the Detection of the borders: z4 = t3-c3-f3
The detection of the borders is based on the
techniques of derivation. A point on the border
Between two regions possess an important grey level Fig. 6
jump. The module of the pressure gradient in this
point must be maximal according to the direction 4.2 Categorization Task
In this phase, the categorization role is efficient, the
If g (x, y) represents the grey level at the pixel (x, y),
system has to supply objects prototypes
the gradient is (wg/wx, wg/wy) and the direction is
corresponding to zones. At this level, the only
(wg/wy)/(wg/wx). In practice, we use derivation filters characteristics allowing to select the list of the
like Sobel, Perwitt, Kirch, etc. prototypes are the texture, the color and the shape.
To improve the detection of outlines we adopted the Let us take for example the zone z3 (after its
algorithm of Canny-Deriche [19]. extraction of the origin image).
- Segmentation by Colors:
We used an effective algorithm proposed and
described by D. Comaniciu in [30] based on "Shift Forest Texture
5 Conclusion
S1 S2 S3
The introduction of the phase of categorization in a
S1 : 3 different textures: t1, t2 and t3 vision system thus allows to facilitate the task in the
S2 : 3 different colours: c1, c2 and c3 recognition of objects. This domain covers different
S3 : 2 outlines separating 3 different zones: f1, f2 and f3 applications such as the cartography, the agriculture,
Fig. 5 the oceanography etc., and the automation of the
treatment of images constitutes the inescapable tool
In this segmentation, we notice that zones of the practitioners.
determined by outlines are not exactly the same that In our part, we proposed architecture including the
those exits of the other segmentations (textures and levels. The architecture based on he recommended
colors). On the other hand these last ones confused model allowed us to organize the tasks of the
zones f3 with t3 and c3 respectively. The segmentation and the selection of prototypes in this
Combination of the criteria in the process of phase of the categorization. The competition
segmentation allows to correct this defect and to between several prototypes allows to deliver quickly
determine four zones:
5
those who are considered relevant in the phase of the [13] R. Hu and M. M. Fahmy, “Texture Segmentation
recognition. based on a Hierarchical Markov Random Field Models”,
The part of the parallelism is important, what Signal Processing, vol. 26, pp. 285-305, 1992.
reduces considerably the time of answer. We shall [14] Breuel T. M. « Geometrique Aspects of Visual
Object Recognition » PhD Thesis, MIT. (1992)
complete this work by the introduction Multi-agents
[15] Wallis, G. and Rolls, E. (1997). A model of
system to benefit the cooperation and the invariant object recognition in the visual system. Prog.
competitiveness. Neurobiol. 51, 167--194.
References : [16] Biederman, I. Recognition-by-components: A theory
[1] M. Sadgal, A. El Fazziki , “Un Modèle de of human image understanding. Psych. Rev. 94, 115-147.
catégorisation dans les Systèmes de Vision” 1987
TELECOM’2001 & 2èmes JFMMA, Casablanca, 17-18- [17] D. Marr. “Vision”. W.H. Freeman, 1982.
19 Octobre 2001. [18] Tarr, M., Williams, P., Hayward, W., and Gauthier,
[2] Sadgal M. & El Fazziki A. “Une Architecture Mult- I. Three-dimensional object recognition is
Agents Basée Sur la Catégorisation dans le Traitement viewpoint-dependent. Nat. eurosci. 1, 275—277, 1998
d’images”, International Conference on Image an Signal [19] J. F. Canny. “A computational approach to edge
Processing, 608-617, Agadir, May 3-5 2001 detection”. IEEE Trans. Pattern Analysis and Machine
[3] El Kharki O. Ait Belaid M. Ait Ouahman A. Sadgal Intelligence, 8(6):769--768, November 1986.
M. Bijaber N. “ L’application de la télédétection et des [20] R. Deriche. Optimal edge detection using recursive
systèmes d’information géographique à l’élaboration de la filters. In Proc. International Conference on Computer
carte d’occupation du sol dans la province d’Oujda au Vision (ICCV' 87), London, United Kingdom, 8--12 June
Maroc”, Journal Maghrebien de physique, volume 2 N° 1 1987.
2002 [21] B. Wrobel and O. Monga. Segmentation d' images
[4] O. El Kharki, M. Ait Belaid, A. Ait Ouahman, M. naturelles : coopération entre un détecteur contour et un
Sadgal, N. Bijaber, "Classification of multispectral image détecteur région. In Actes du Onzième colloque GRETSI,
using ISODATA algorithm" AMSE Journal, paper N° Nice, France, Juin 1987.
01315 (1B), 2002. [22] M. Berthod, Z. Kato, S. Yu et J. Zerubia. Bayesian
[5] O. El Kharki, M. Sadgal, A. Ait Ouahman, A. El image classification using Markov random fields. Image
Himdy, M. Ait Belaid “ Automatic Cartography From and Vision Computing, 14(4) :285–293, 1996.
Aerial Images (Site of Salé,aMorocco)” International [23] B. Widrow and M. A. Lehr; 30 years of adaptive
Archive of Photogrammetry And Remote Sensing, Vol. neural networks: Perceptron, madaline and back-
XXXI0II, Part B2, pages 179 – 183, 2000. propagation, In Processin of the Institute of Electrical and
[6] O. El Kharki, N. Moumkine, M. Sadgal, A. Ait Electronic Engineers, volume 78, pages 1415-1442, 1990.
Ouahman “Study and realisation of an image coder / [24] Rosch E. « Principles of categorization » in Rosch
decoder” MS2000, 11-14 Avril 2000, Cure, Egypt. E & Lioyd B editors, Cognition and categorization pp27-
[7] O. El Kharki, M. Sadgal, A. Ait Ouahman “Quelques 48. Erlbaum, Hillsdale NJ. (1978).
techniques Numériques de prétraitement [25] Erickson, M.A., & Kruschke, J.K. Rules and
d’images” Optique ’2000, Meknés, 27-28 Avril 2000. exemplars in category learning. Journal of Experimental
[8] O. El kharki, M. Sadgal, A. Ait Ouahman « La Psychology: General, 127, 107-140. 1998
Cartographie automatique à partir d’images aériennes [26] Ulric Neisser , “Cognitive Psychology”, 1967
(site de Rabat –Salé, Maroc) » première rencontre sur les [27] Swain M. J. & Ballard D. H. (1991), "Color
systèmes électroniques et informatiques SEI’99, organisée indexing", Int. J. Comput. Vision, 7:11-32.
par la FST de Fés-Saiss (Maroc) le 13-14 octobre 1999. [28] P. Hall, D. Marshall & R. Martin. Incremental
[9] P. Garnesson, G. Giraudon & P. Montesino eigenanalysis for classification. In Proc. of British Vision
« Messie : un système multi-spécialiste en vision Machine Conference (BMVC' 98), pages 286-295, 1998.
Application en imagerie aérienne », Rapport de [29] Hornegger, J. and Niemann, H. “Statistical learning,
recherche, INRIA, avril 1999. localization and identification of objects”. In ICCV’95
[10] Weidner U. and Förstner W. « Towards Fifth InternationalConference on Computer Vision, pp.
Automatic building Extraction from High Resolution 914–919. 1995.
Digital Elevation Models » ISPRS Journal, 50, pp 38-49 [30] D. Comaniciu and P. Meer. Mean shift analysis and
[11] Walter V. « Automatic classification of Remote applications. In 7th International Conference on
Sensing Data for GIS Database Revision » In IAPRS Vol. Computer Vision, pages 1197–1203, Kerkyra, Greece,
32, Part 4, pp 545-552. (1998). September 1999.
[12] Airault S. & Jamet O, « Détection et restitution [31] S. Derrode and W. Pieczynski, SAR image
automatique du réseau routier sur images aériennes » segmentation using using generalized Pairwise Markov
Traitement du Signal, Vol. 12 Numéro 2 1995. Chains, SPIE' s International Symposium on Remote
Sensing, September 22-27, Crete, Grece, 2002.