Professional Documents
Culture Documents
Visual Control of A Robotic Hand
Visual Control of A Robotic Hand
net/publication/4045874
CITATIONS READS
24 279
4 authors, including:
Irene Macaluso
Trinity College Dublin
100 PUBLICATIONS 936 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Self-Consciousness and Theory of Mind for a Robot Developing Trust Relationships View project
All content following this page was uploaded by Antonio Chella on 21 May 2014.
Abstract -- The paper deals with the design and control the robotic hand. This paper is structured as follows:
implementation of a visual control of a robotic system composed section II describes the whole system and the
of a dexterous hand and stereo cameras. The aim of the proposed experimentation setup; section III deals with the fingertip
system is to reproduce the movements of a human hand in order localization and the tracking on image sequence; section IV
to learn complex manipulation tasks. A novelty algorithm for a concerns the control of robotic hand by inverse kinematics
robust and fast fingertips localization and tracking is presented.
Moreover, a simulator is integrated in the system to give useful
computation; section V reports the details about various
feedbacks to the users during operations, and provide robust experimentations done; finally, some conclusions and future
testing framework for real experiments (see video). work are drawn in section VI.
The control of robotic systems has reached a high level Our proposal deals with the design and implementation
of precision and accuracy, but often the high complexity and of a visual control of a robotic system composed by a
task specificity are limiting factors for large scale uses. dexterous anthropomorphic hand (the DIST-Hand [1]) and
Today, robots are requested to be both “intelligent” and calibrated stereo cameras. The aim of the present work is to
“easy to use”, allowing a natural and useful interaction reproduce real-time movements of a human hand in order to
with human operators and users. In this paper, we deal with learn complex manipulation tasks.
the (remote) control of an anthropomorphic robotic hand: The human hand is placed in front of the stereo couple. A
our approach is to provide a Human-Computer-Interface black panel is used as background and illumination is
[10][17][15] based on natural hand gesture imitation of the controlled. Fingertips and wrist are extracted at each frame
operator showing his hand [8]. Different methods have been and their 3D coordinates are computed by a standard
proposed to capture human hand motion. Rehg and Kanade triangulation algorithm. Given a hand model, inverse
[11][12] introduced the use of a highly articulated 3D hand kinematics is used to compute the joint angles which are
model for the tracking of a human hand [13][14]. For translated to suitable commands sent to the DIST-Hand
tracking, the axes of the truncated cylinders that are used to controller.
model phalanges, are projected onto the image, and local The system (see figure 1) is composed by some common
edges are found. Fingertip positions are measured through a modules that work in parallel on the two flows of images
similar procedure. A nonlinear least squares method is used sourcing from the stereo rig:
to minimize the error between the measured joint and tip
locations and the locations predicted by the model. The 1) image pre-processing operation (noise filtering,
system runs in real-time, however, dealing with occlusions tresholding, edge detecting);
and handling background clutter remains a problem. Heap 2) the fingertip tracking following the first estimation;
and Hogg [6] used a deformable 3D hand shape model. The 3) fingertip localization on image based on previous
hand is modeled as a surface mesh which is constructed via position.
PCA from training examples. Real-time tracking is achieved
by finding the closest possibly deformed model matching The coordinates of the fingertips are the input of the other
the image. In [16], Cipolla and Mendoca presented a stereo software modules responsible of:
hand tracking system using a 2D model deformable by
affine transformations. Wu and Huang [19] proposed a two- 1) 3D coordinate computation;
step algorithm to estimate the hand pose, first estimating the 2) inverse kinematics;
global pose and subsequently finding the configuration of 3) actuator commands computation.
the joints. Their algorithm relies on the assumption that all
fingertips are visible. III. FINGERTIP LOCALIZATION AND TRACKING
Our method uses fingertips as features, considering each
finger except the thumb as planar manipulator, and starting This section deals with the localization of the fingertips
from this hypothesis we compute inverse kinematics to in long image sequences. In order to achieve real-time
performances state-space estimation techniques are used to where (r(k), c(k)) represent the fingertip pixel position at
reduce the search space from the full camera view to a kth frame, and (vr(k) , vc(k)) be its velocity in r and c
smaller search window centered on the prediction of the directions.
future position of the fingertips. In the following, a novel
algorithm is proposed to detect the fingertip position inside
the search window.
I*
Left Image Image Fingertip xreal 3D Coordinate
Preprocessing xpred Extraction Computation
Right Image Module Module Module
a b
Kalman Filter Inverse
Module Kinematics
Module
Qhuman
c
Initial estimates
Human-To-
Robot Mapping Fig.2 (a) Initial hand configuration used to calculate the first estimation of
Module the fingertip positions; (b) three points (yellow squares) with high
correlation response individuate a group and provide a fingertip candidate
Qrobot computed as centroid of such a group (red arrow); (c) circular templates
used to search fingertips.
To the communication
units Under the assumption that a fingertip moves with
constant translation over the long sequence and a
sufficiently small sampling interval (T), the plant equation
for the recursive tracking algorithm can be written as
Fig.1 The system architecture: two parallel set of procedures perform
separately the estimation of the fingertips on right and left images; the
coordinates extracted are the input of the procedures responsible for the X(k+1)=Ax(k)+w(k)
control of the robotic hand.
Where
The visual servoing procedure starts with a preliminary
setup phase, based on various image processing operations. 1 0 T 0
The procedure individuates the hand/arm on image, 0 1 0 T
compares it with robotic one and calculates exact size ratios. A =
0 0 1 0
Relevant features are tracked on image sequences, and 0 0 0 1
reconstructed in a three-dimensional space, using standard
computer vision algorithms [3]. These features are used as a and the plant noise w(k) is assumed to be zero mean with
guide to perform 3D reconstruction of the simulated covariance matrix Q(k).
arm/hand, and real system positioning. The reconstructed For each fingertip, the measurement used for the
movements are reproduced both by the simulator and the corresponding recursive tracking filter at time tk is the
robotic system in real time. image plane coordinates of the corresponding point in the
kth image. Thus the measurement is related to the state
A. Fingertip Tracking vector as:
B. Fingertip Extraction