Professional Documents
Culture Documents
聽音辨位的研究及其應用
聽音辨位的研究及其應用
100032
TTSA
2011
2011
ii
48
1.18
iii
(Abstract)
A Study on Voice Source Localization and its Application
Our research is enlightened by the magical beating-drum skill Voice Source
Localization in the movie House of Flying Daggers. To explore the theory of
Voice Source Localization, we use a pair of microphones syncing the time
difference which is produced when the microphones are receiving sound waves
during record. From the preceding procedure, we confirm the feasibility of Voice
Source Localization. Utilizing the theory, we make use of the common computer
sound cards and cheap sonic wave sensory elements (Piezo Buzzer) and apply
them to the point of impact immediate display system. The practice and testing
results prove that our method can directly judge the time difference of right and left
channels in sounds data and then immediately display the position of the point of
impact. On the 48x48cm target, the average error is 1.18cm, which is a pretty good
result.
iv
BB
3.5mm Jack
2mm 3mm
1
1-1
1-1
1-2
1-2
x2 y2 x2
y2
1
a2 b2 a2 c2 a2
a 1/2
2
v t t t L
t R t 0 t 0
t 0 c
mL mR / 2
a
c
2a
1 v (t L t R )
cos 1 v t
cos
m m
2c
R
mL mR
L
cos 1 cos 1
[ 1]
t 0 0 90 t 0 90 180 t 0 90
( x, y)
x2
y2
x2
y2
1 [ 2]
a 2 c 2 a 2 (1 / 2 v t ) 2 ( mL mR / 2) 2 (1 / 2 v t ) 2
t 0 x 0 t 0 x 0
t 0 y
[ 1]
mL mR
3
(1)
Goldwave
192,000Hz96,000Hz44,100Hz22,050Hz11,025Hz
192,000HzGoldWave
0.107% 192,000Hz
2-1
2-2
2-3 GoldWave
760Hz
758.21Hz
0.235%
350Hz
349.34Hz
0.187%
5517Hz
5517.24Hz
0.004%
512Hz
512.82Hz
0.160%
66080Hz
66115.70Hz
0.054%
1000Hz
1000.00Hz
0.000%
192,000Hz
(2)
4 GoldWave
3 45 GoldWave
6
4 GoldWave
0.2588 0.2540 0.0018
0.9m 27 347.2 m / sec
[ 1]
v t
mL mR
cos 1
0.9
x2
y2
1
(1 / 2 347.2 0.0018) 2 (0.9 / 2) 2 (1 / 2 347.2 0.0018) 2
x2
y2
1
0.31248 2 0.45 2 0.31248 2
t 0
5
Mic In BB
1 2
[ 2]
[5][6][7][8]
6-1
6-2
6-3
6-4
GoldWave
7
102
801
8-1
8-5
8-2
8-3
8-4
8-6
8-7 102
8-8 801
Mic In
4cm 48cm48cm
10
(1)
5 10
GoldWave 192,000Hz
BB 3
3
GoldWave
t
v d / t d
(2)
11
GoldWave
10-5
BB
10
(3)
BB
BB
12
[1][2]
0.5%
BB
11 Visual Basic 6
BB
11 BB
BB
Sound (i)
13
2Bytes 215 32768 1Byte 2 7 128
12
12
14
2.5%
5%10%15%20%25%
30%
25%
13
2.5%
5%
10%
15%
20%
25%
30%
5476.66 1104.99
972.09
663.29
660.67
656.34
626.39
2103.14
553.73
553.73
587.05
590.83
587.05
590.83
1107.29
646.66
704.93
651.00
603.97
611.59
603.97
682.75
618.10
575.19
604.23
613.70
604.23
587.74
599.67
710.08
759.51
665.70
640.90
604.53
595.67
847.26
717.59
683.39
685.80
663.85
661.60
659.37
694.86
641.09
612.82
631.69
631.69
631.69
631.69
1865.35
816.48
733.72
732.62
693.84
625.80
598.02
1189.76
710.08
773.96
692.49
693.57
605.35
605.35
10
1543.72
855.38
579.63
571.68
575.99
604.06
647.58
(m/s) 1611.05
737.42
694.90
648.55
636.90
619.22
614.66
157.29
126.18
50.38
41.15
24.33
25.04
7.77%
6.46%
3.93%
4.07%
1452.70
15
(4)
25%
619.22 m/s
d (cm)
t (sec)
(m / sec)
( 20 , 20 )
33.41
0.000509
656.34
( 16 , -16 )
15.28
0.000206
587.05
( 12 , -4 )
16.17
0.000264
611.59
( -16 , 12 )
14.36
0.000238
604.23
( -24 , -8 )
14.91
0.000247
604.53
d (cm)
t (sec)
(m / sec)
( 16 , 24 )
32.51
0.000491
661.60
( 12 , -16 )
14.36
0.000227
631.69
( -4 , 12 )
16.17
0.000258
625.80
( -8 , -24 )
14.91
0.000246
605.35
10
( -16 , 16 )
15.28
0.000253
604.06
(1)
BB ( x, y) 1
2 t1 t 2
16
t 1 0 t 2 0
( x 6) 2
( y 28) 2
1
(1 / 2 v t1 ) 2 ( 24 12 / 2) 2 (1 / 2 v t1 ) 2
( y 6) 2
( x 28) 2
1
(1 / 2 v t 2 ) 2 ( 24 12 / 2) 2 (1 / 2 v t 2 ) 2
[ 3]
t 1 0 t 2 0
x 6
( y 6) 2
( x 28) 2
1
(1 / 2 v t 2 ) 2 ( 24 12 / 2) 2 (1 / 2 v t 2 ) 2
t 1 0 t 2 0
( x 6) 2
( y 28) 2
1
(1 / 2 v t1 ) 2 ( 24 12 / 2) 2 (1 / 2 v t1 ) 2
y 6
t 1 0 t 2 0
x6
y 6
(2)
[ 3]
t1 t 2 14
17
x 6 y 6 t1 0 t 2 0
1 Part I t1 0 t 2 0 2
Part II
t1 0 t 2 0 3 Part III
t1 0
t1 0 t 2 0 4 Part IV
x 6 t 2 0 y 6
14 t 1 t 2
[ 3]
1 y
[24,24] 0.1 481 t1
x
18
t1 0
x 6
(1 / 2 v t1 ) 2 [( y 28) 2 18 2 (1 / 2 v t1 ) 2 ]
18 2 (1 / 2 v t1 ) 2
t1 0
x 6
t1 0
x 6
(1 / 2 v t1 ) 2 [( y 28) 2 18 2 (1 / 2 v t1 ) 2 ]
18 2 (1 / 2 v t1 ) 2
2
x' [24,24]
0.1 481 t 2
y '
t 2 0
y' 6
(1 / 2 v t 2 ) 2 [( x 28) 2 18 2 (1 / 2 v t 2 ) 2 ]
18 2 (1 / 2 v t 2 ) 2
t 2 0
y' 6
t 2 0
y' 6
(1 / 2 v t 2 ) 2 [( x 28) 2 18 2 (1 / 2 v t 2 ) 2 ]
18 2 (1 / 2 v t 2 ) 2
19
Client / Server 15
Client
BB
Server Server Client
15
(1) Client
BB 3
1 Buffer1
2 Buffer2
20
3 Buffer3
1 BB
Server 1
3 1 2
16 Client
21
17 Client BB
(2) Server
Sever
Client
Client
18 Server
Client
22
19 Server
10
BB
1.18cm
48cm 48cm
1
t1
( 16 , -12 )
0.000274
( 12 , 0)
2
t2
cm
-0.000184
( 17.7 , -12.7 )
1.8
0.000274
0.000144
( 11.3 , 0.2 )
0.7
(8,8)
0.000274
0.000250
( 8.6 , 6.2 )
1.9
( 8 , -12 )
0.000187
-0.000090
( 8.4 , -10.2 )
1.8
( 4 , -8 )
0.000146
-0.000021
( 4.3 , -7.1 )
0.9
(0,0)
0.000099
0.000092
( -0.1 , -0.6 )
0.6
( -4 , 4 )
0.000034
0.000127
( -4.1 , 2.3 )
1.7
( -8 , 0 )
-0.000031
0.000081
( -7.8 , -0.3 )
0.4
( -8 , -8 )
-0.000026
-0.000026
( -7.8 , -7.8 )
0.3
10
( -16 , -16 )
-0.000130
-0.000128
( -17.2 , -17.1 )
1.6
23
20 ()()
0.322516 cm/sample
24
[5][6][7][8]
21-1
21-2
21-1
21-2
25
619.22 m/s
619.22 m/s
22-1 BB 22-2
P
S
2.5%5%10%15%20%25% 30%
22-1
22-2
48
1.18
26
MAYA44
V3
http://neural.cs.nthu.edu.tw/jang/books/audioSignalProcessing/
, , , 2004
, Visual Basic 6.0 , , 1999
http://web.ncyu.edu.tw/~andrew/
http://zh.wikipedia.org/zh-tw/
http://en.wikipedia.org/wiki/
, ,
32 , 1995
,
, 44 , 2004
27
28