Professional Documents
Culture Documents
An Efficient Method For Determining Threshold Voltage, Series Resistance and Effective Geometry MOS Transistors
An Efficient Method For Determining Threshold Voltage, Series Resistance and Effective Geometry MOS Transistors
Abstract- An accurate and robust method of extracting the (after Terada and Muta [5], and Chern et al. [ 6 ] ) and the
threshold voltage, the series resistance and the effective geometry CMP method (after Ciofi, Maccucci and Pellegrini [7]), which
of MOS transistors is presented. The method is based on efficient methods are also discussed by Ng and Brews [3]. McAndrew
nonlinear optimization using an iterative linear regression proce-
dure which usually converges in less than four rounds. Thereby and Layman [SI developed a nonlinear optimization procedure
extracted parameters are obtained from analytical expressions and claimed a significantly more accurate and robust method
for the solutions to a linear system of equations whereby time- after careful comparisons with the other methods mentioned
consuming numerical differentiationsare avoided. MOSFET pa- above.
rameters are explicitely identified as parameters of an underlying In this paper we will evaluate a new method for determining
widely used device model that is a good approximation for
operation in the linear region. The method is particularly suitable the threshold voltage, the series resistance and the effective
for process characterizationand can be used on as few as twelve geometries of MOS transistors [9] and emphasize its potential
data points (three data points from each of four different size for becoming a standard method for rapid and accurate process
transistors). By connecting external resistors in series with the control and technology characterization. The proposed method
transistors, we show that the extracted values of the parameters is simple and robust and uses nonlinear optimization by means
are independent of the series resistance.
of iterative least square fitting procedures to determine triplets
of parameters rather than pairs of parameters.
I. INTRODUCTION
(the maximum channel conductance) In three subsequent steps linear regression of Eo on L,, AEo
on L, - AL, and A E on Eo yields AL, R s and 0.
(3) The second method is the CMP method (after Ciofi, Mac-
cucci and Pellegrini [7]) which was the first method for process
characterization to determine VT consistent with /3 and 0, even
(the average threshold voltage at mid-channel, sometimes
referred to as vnt [ll]), and
if the V&/2-term in (1) was unnecessarily dropped in this
process. Although not explicitely stated in the paper and not
very CPU efficient in its implementation, the CMP method
(4) introduced a nonlinear fit to the data. Assuming VT known a
linear regression of 1/(VGs -VT) on l / I D s was used to obtain
where VDSis the external drain-source voltage (VDS= VAS + +
,B from the slope and H = 19 2PRs from the intercept. This
2 R , l ~ s )the
, model can be written as a rational function in procedure was repeated for different assumed values of VT
the observables until a minimum was found. The error function they used for
optimization can very roughly be approximated by
(5)
The sum of squares of all differences between the measure- gains (i.e., having equal channel widths but different channel
ments and the model, (6), can be written lengths).
N r Step 3) Effective Geometiy Extraction: Finally, the effec-
tive geometries are determined from the transistor gains of a
number of transistors of different geometries using the model
in (9). This equation, being a rational function of the same
where N ( > 3 ) is the number of data points. Equation (11) can type as (3,means that k’,AW, and A L can be determined
also be expressed as using the same iterative least square fitting method as used in
step 1. This method opens the opportunity of determining one
value of AL common for all channel widths while in previous
methods the value of A L is determined for each channel width.
Step 4 ) Iterative Improvements: Once the series resistance
For a rational function the unknown observables occur is known, the internal voltages are also known and the first two
nonlinearly in the normal equations which makes them difficult steps of the extraction procedure can be repeated using the
to solve for. Hamming [12] suggested an iterative procedure calculated internal voltages instead of the external voltages.
to get around these difficulties by assuming the value of From the new 0’ vs. P relationship a correction term to the
c’ in the denominator to be known. We use c’ = 0 if no series resistance can be obtained together with a new value of
good initial approximation is known for c’. The unknown 8. This procedure is then repeated until only negligible correc-
observables now appear linearly in the normal equations and tion terms are obtained for the series resistance. However, with
can be solved for. The value obtained for c’ can then be used in parameter extraction based on measurements well in the linear
the denominator in the next round of extraction. Usually the region (VGS - V - >> V D Sonly ) negligible improvements are
extracted values converge in less than four rounds, making expected for each parameter.
the procedure computationally efficient. It is also possible to It is also possible to increase the accuracy of the extraction
use a preestimate value for c’ in the denominator for direct procedure by accounting for the body effect of each device.
extraction of the unknown observables if one is willing to This can be done by repeating the single device parameter
accept a nonstandard error function for the least squares fitting. extraction step for a second substrate bias using the same
Alternatively, the sum of the squared relative differences, approximation as McAndrew and Layman for the threshold
2 voltage shift AV- due to the body effect, i.e.,
[IDS% - a ’ s v ; ~ ~ ]
&=E a=1 I&%
(13) AI+ = D v IDSRs (14)
where D v = aV,/aV& and V- is the threshold voltage at
can be minimized following a similar procedure which in-
the substrate bias VSB of interest.. Again, with measurements
cludes replacing [y - c’I2 in the denominator of (12) by
performed deep in the linear region (VGS - VT >> V D S )a
I&%[y - c’I2. Besides being more computationally efficient
small body effect (V& - VT >> AV-) is not expected to alter
than the CMP method, the proposed method has the important
the values of the extracted parameter significantly.
advantage of offering freedom of choice when it comes to
Similarly, it is possible to handle the influence of the
selecting the error function, while the CMP method restrains
substrate bias on the mobility and the mobility reduction factor
the user to a useful but nonstandard error function.
for each device. If, by repeating the single device parameter
To obtain accurate values of the extracted parameters it is
extraction step at a second substrate bias, we find that the
very important that all data points are in the correct region of
internal substrate bias IDSRScauses a shift AD and/or A@of
operation, in this case deep in the linear region. We therefore
the transistor gain and mobility reduction factor, respectively,
use a small drain-source voltage (VDS = 50 mV) to obtain a
the values of x and y for each measurement should be modified
constant transverse field along the channel, and a gate-source
according to
voltage such that VGS - V- >> VDSto guarantee that all data
points are well within the linear region. A procedure has been
developed by which data measured for gate voltages within
0.3 V of the threshold voltage are filtered out. and
Step 2) Series Resistance Extraction: In the first step the Ab’
y = VAS - AV- - -(VAS - VT) (16)
threshold voltage for each individual transistor was calcu- e
lated from the observable b. In the second step the mobility
reduction factor and the series resistance are obtained by and the parameters reextracted using the values extracted in
the previous round for P, 0, and V - to calculate x and y.
linear regression’ of l / ( b - c’ - vDS/2) = 0 2PRs on +
a’/(b - e’ - vDS/2) = 0.This step is identical to the CMP
method in that it effectively plots the equivalent mobility I v . MEASUREMENT
SYSTEM AND
reduction factor against the transistor gain for a number of EXPERIMENTAL
RESULTS
transistors having the same series resistance but different To evaluate the accuracy and robustness of the new method
‘Altematively, linear regression of l / a ‘ = O/p + 2Rs on ( b - c’ - several tests were performed using a semiconductor parameter
V~s/2)/a‘= 1/p as in the SJ-method [2]. analyzer (HP 4145A) for the measurements. A test chip
218 EEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 9, NO. 2, MAY 1996
TABLE I 0.35 I I /
P rllAnn1 0.10
e' [VI]
total error
0.05
relative emr 53
0 0.5 1.o
Gain [mA/V2]
Fig. 1. The effective mobility reduction factor plotted versus the gain with
the channel width as a parameter for n-channel devices. The series resistances
are proportional to the slopes of these plots.
KARLSSON AND JEPPSON: AN EFFICIENT METHOD FOR DETERMINING THRESHOLD VOLTAGE 219
4.5 L
4.0 .-g
lE 1.0 1; -
A
Y
. A
Aw A
3.5 m -
C
'5
0.8 -
f
3.0
E 2.5 2
U
U s 0.6 -
2.0 a
.-K
3l 1.5
1.o
0.5
0
0 5 10 15 20 25 30 0 50 100 150 200 250
Channel Width [pm] External Resistance [Ohms]
(a)
Fig. 4. Channel shortening and channel narrowing versus extemal series
resistance extracted for five different external resistors.
300
I4O m
7 250
E
cu'
> 200
Y
.-ac e [V-~I
(Wfi1.440.8) vT [VI 1"21 Rs [a1
150
-
(3
Model values 0.800 0.1300 0.05 300
8 Round 1 0.800 0.1305 0.049985 300.25
2 100 Round 2 0.800 0.1300 0.049999 300.00
.-
Q
0
&! 50
are shown in Fig. 4 which confirms that neither AL nor A W
was affected by the series resistance.
0 The accuracy of the extraction procedure has been investi-
0 5 10 15 20 25 30 gated and the results compared to those obtained after iterative
Channel Length [pm] improvement. During this experiment we used synthetic data
(b) generated from the MOSFET model to avoid problems with
Fig. 3. (a) The gain versus the channel width, and (b) the reciprocal gain measurement noise and model inaccuracies. Four transistors
versus the channel length for n-channel transistors. The insert plots are with different channel lengths and but equal channel widths
enlargements of the intercept with the z-axis. were simulated. Different values were assumed for the thresh-
old voltage of each channel length. The results obtained for
fitted to measurements, yielding one single value of A W and one of the transistor sizes are compiled in Table 11.
one single value of AL. This is consistent with an underlying The values extracted for the model parameters are within
model which assumes the same channel shortening A L for 0.5% of the model values already after the first round. In the
all channel widths and the same channel narrowing A W due second round, the value obtained from the first round for the
to oxide encroachment for all channel lengths. For the n- series resistance was used to calculate the internal voltages.
channel devices of the 1 pm CMOS process manufactured by New values of VT,p, and 13' were then extracted for each
AMS and delivered through Norchip, we obtained a channel device and linear regression of 0' on ,6 was used to extract
narrowing A W = 1.0 pm and a channel shortening AL = a correction term A R s to the series resistance and a new
0.4 pm. value of .'6 In the second round, the substrate bias dependence
Since the existence of series resistances has been shown to of the threshold voltage (and other parameters) could also be
lead to an underestimation of A L when ,B is determined from accounted for.
the peak-g, method [3], we wanted to verify that the values We have also investigated the sensitivity to noise in mea-
extracted for the channel shortening and channel narrowing surement data since this is an important characteristic of any
using our method are not influenced by the series resistance. extraction method. This aspect of parameter extraction has
The extraction procedure was therefore repeated using external been extensively studied by McAndrew and Layman who
resistors connected in series with the transistors. The results compared several well-known extraction methods. For our
220 E E E TRANSACTIONS ON SEMICONDUCTORMANUFACTURING, VOL. 9, NO. 2, MAY 1996
c
c
was used for each device, while one set of values common for
P
5 -0.8
all transistors in the group was used for IC', 8, AW, and AL.
0
._
S
s
V. DISCUSSION -0.4
extraction of parameter triplets, rather than repeated extrac- Gate-Source Voltage [VI
tion of parameter pairs (repeated until the best combination Fig. 5. Measured data (squares) and model playback with data simulated
of parameter values is found), provides an efficient means using extracted parameter values (solid lines) for eight p-channel devices of
for rapid device characterization. However, the method also different channel lengths but identical channel width.
provides certain advantages over the CMP method: first, it
does not ignore the V&/2 term when determining V, thereby ne1 shortening and one value A W for the channel narrowing.
significantly improving the accuracy of VT as discussed by This is in contrast to the conventional methods where A L is
McAndrew and Layman [8], second, it offers the user complete determined from 1/p vs. Lm-plots for one transistor width
freedom when it comes to choosing the error function for the at a time, which means that different values of A L can be
least square fitting procedure (instead of restraining him to obtained for each channel width. Similarly, when A W is
a nonstandard error function), third, it provides a means of determined from the ,D vs. Wm-plots, different values of A W
determining one single value of A L common to all channel can be obtained for each channel length. However, the channel
widths consistently with the transistor model (while still re- narrowing due to the combined effects of oxide encroachment
taining the option of determining AL for each channel width and over-etching of the gate is a parameter not discussed
for the purpose of checking the validity of the model). by CMP or by McAndrew and Layman [8]. Their method
Further, the method extends the direct extraction method was, however, recently most elegantly extended to include
proposed by Hamer [lo], which was limited to three data the channel narrowing 1131. By use of external resistors
points for extracting the three observables, so it becomes useful we have also shown that the values extracted for AW and
for any number of data points. The technique is based on linear AL are not influenced by the series resistance as is the case
regression which means that it is simple and efficient and when using the peak-g, method to determine the transistor
that the observables are obtained from analytical expressions gain ,D.
for the solutions of a system of linear equations. In addition, In addition, the iterative improvement scheme in our method
by use of an iterative linear regression procedure it turns the offers the possibility of acGounting for first-order variations
direct procedure into a nonlinear optimization method suitable of VT with IDsRs-a phenomenon which previously was
for any number of data points. Again, it is a simple and only taken into account by the optimization technique used
efficient technique converging in less than four rounds where by McAndrew and Layman. Thereby the same accuracy can
the observables are obtained without use of numerical (or be obtained at a much lower computational cost. Also, in our
analytical) differentiations. method D v is obtained from values of VT determined con-
Our method uses the same transistor geometry model as sistently with the underlying MOSFET model at two different
Hamer to simultanously determine one value A L for the chan- values of V&, while McAndrew and Layman obtained Dv
KARLSSON AND JEPPSON AN EFFICIENT METHOD FOR DETERMINING THRESHOLD VOLTAGE 221
from VT determined at two different values of VBSby using the that simply minimizes the differences between measurements
peak-g, method. Of course, the value of the iterative improve- and model by circuit designers.
ment can be discussed since the parameter values extracted
typically vary by less than 1% if the dependence of VT on VI. CONCLUSION
IDS& is neglected. On the other hand this is a phenomenon A new and simple method for accurate extraction of thresh-
that may become important for devices with heavily doped old voltages, series resistances, and effective transistor geome-
substrates and small feature sizes, as discussed by McAndrew tries of MOS transistors has been presented and validated for
and Layman. The iterative improvement scheme also offers accuracy and robustness. The new method explicitely inter-
the possibility of accounting for first-order variations of both prets MOS device parameters as parameters of an underlying
p and 6’ with I D & y - a phenomenon which McAndrew and MOSFET model. Parameters are extracted consistent with this
Layman discussed but did not account for. Again, note that model using a rapidly converging iterative linear regression
the extracted parameters depend on VBS and different values procedure to fit measured data to the nonlinear model. The
are obtained for each specified value of VBS. method does not require any numerical differentiations and is
Most of the model shortcomings are discussed in great computationally efficient.
detail by McAndrew and Layman and need not be repeated The accuracy and robustness of the method have also been
here. As in their work (and most other works), the mobility verified. By accuracy we mean that the method 1) should
degradation parameter is assumed to be independent of channel return the correct model parameters when parameter extraction
length. In our model this assumption is used to separate the is performed from synthetic data generated by the model,
influence of the series resistance from the influence of the and, 2 ) should result in small errors when measurements
mobility degradation. An interesting observation we made, are compared to model playbacks using extracted parameter
and that we have not seen discussed elsewhere, is that we values. By robustness we mean that the method 1) should
found the mobility degradation parameter to depend on the have both a small shift of mean parameter value with noise
channel width. This observation can be explained as being level and a small standard deviation at any given noise level,
the result of the gate-voltage dependence of the channel and, 2) should return parameter values that are insensitive to
width-when the gate voltage overdrive increases, the channel the series resistance (and the initial value of e’).
electrically widens into the oxide encroachment region [13]. If In our opinion, the method is well suited for automated
this phenomenon is not specifically included in the underlying measurement systems and that its simplicity and robustness,
MOSFET model, but the channel width (and hence ,f3) is together with its potential for rapid process monitoring, should
assumed to be independent of the gate voltage, it will instead make it attractive for industrial use.
result in a width dependent contribution to 6’ [9].
To retain accuracy and physical meaning of model param- REFERENCES
eters it is important that measurements be performed well
[l] D. T&acs, W. Muller, and U. Schwabe, “Electrical measurements of
in the linear region, i.e., VGS- VT >> VDS. In many other feature sizes in MOS Si2-gate VLSI technology,” IEEE Trans. Electron
works, a typical value of Vis - VT M 1OVDs is used to Devices, vol. ED-27, pp. 1368-1373, Aug. 1980.
[2] P. I. Suciu and R. L. Johnston, “Experimental derivation of the source
filter out measurements too close to the moderate inversion or and drain resistance of MOS transistors,” IEEE Trans. Electron Devices,
saturation regions. In our experience it is not necessary to be vol. ED-27, pp. 18461848, Sept. 1980.
that conservative. We have usually filtered out measurements K. K. Ng and J. R. Brews, “Measuring the effective channel length of
MOSFET’s,” IEEE Circ. and Devices, pp. 33-38, Nov. 1990.
within 0.3 V of V, only, using VDS = 50 mV. Actually, it P. R. Karlsson and K. 0. Jeppson, “Extraction of series-resistance-
is important to base the extraction on data points as close as independent MOS transistor model parameters,” IEEE Electron Device
possible to VT,particularly for determining accurate values Letters, vol. 13, pp. 581-583, Nov. 1992.
K. Terada and H. Muta, “A new method to determine effective MOSFET
of VT and when working with few data points, since this channel length,” Jpn. J. Appl. Phys., vol. 18, no. 5, pp. 953-959, 1979.
significantly reduces the sensitivity to noise. However, the gate J. G . J. Chem, P. Chang, R. F. Motta, and N. Godinho, “A new method
to determine MOSFET channel length,” IEEE Electron Device Lett., vol.
voltage overdrive VGS- VT must be large enough, say at least EDL-1, pp. 170-173, Sept. 1980.
a few hundreds of millivolts, to ensure the validity of (1) and C. Ciofi, M. Macucci, and B. Pellegrini, “A new measurement method
to minimize the influence of measurement errors, given the of MOS transistor parameters,” Solid-state Electron., vol. 33, no. 8, pp.
1065-1069, 1990.
absolute accuracy of the measurement equipment. C. C. McAndrew and P. A. Layman, “MOSFET effective channel
Finally, we would like to emphasize the importance of length, threshold voltage and series resistance determination by robust
extracting physical device parameters with respect to an un- optimization,” IEEE Trans. Electron Devices, vol. 39, pp. 2298-23 11,
Oct. 1992.
derlying device model. A “good” physical model is believed P. R. Karlsson and K. 0. Jeppson, “A direct method to extract effective
to yield “good” physical parameters. Measurements and op- geometries and series resistances of MOS transistors,” in Proc. IEEE
timization should be performed in such a way that physical Int. Con5 Microelectronic Test Structures, 1994, pp. 184-189.
M. F. Hamer, “First-order parameter extraction on enhancement silicon
properties of the extracted parameters are retained. This is true MOS transistors,” IEE Proc., vol. 133, pt. I, no. 2, pp. 49-54, 1986.
both for process characterization and for circuit simulation. E. Ricco, H. C. D e Graaf, and F. M. Klaassen, “MOS and bipolar
transistors,” in IEDM Course Notes on Adv. Dev. Characterization and
Ideally, parameters should have the same meaning for both Test Method., 1994.
process engineers and circuit designers. Today, even such R. W. Hamming, Numerical Methods for Scientists and Engineers.
a “simple” parameter as the threshold voltage has different New York Dover, 1973, pp. 497498.
R. A. Ashton, P. A. Layman, and C. C. McAndrew, “Modeling and
meanings when it is routinely derived by the peak-g, method characterization of MOSFET width dependencies,” in Proc. IEEE Int.
by process engineers, and looked upon as a fitting parameter Con$ Microelectronic Test Structures, 1993, pp. 133-137.
222 IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING, VOL. 9, NO. 2, MAY 1996
Peter R.Karlsson received the M.S. degree in elec- Kjell 0. Jeppson (S’68-M76-SM’83) received
trical engineenng in 1988 and the Ph.D. degree in the M.S. degree in electrical engineering in 1970
solid-state electronics in 1994, both from Chalmers and the Ph.D in solid-state electronics in 1977,
University of Technology, Goteborg, Sweden. both from Chalmers University of Technology,
Between 1988 and 1994, he was a member of the Goteborg, Sweden.
research and teaching staff, Department of Solid- He has been a lecturer (associate professor) in
State Electronics, Chalmers University. His main the Department of Solid-state Electronics, Chalmers
research interest involved transistor modeling and University, since 1978. He spent the academic year
parameter extraction. Since 1995, he has been with 1973-74 with Rockwell International, Anaheim,
Bofors Missiles, Goteborg, Sweden, where he works CA, and the fall semester 1985 at the Southampton
with microsensor and microelectronics systems. University Microelectronics Centre, UK. His main
research interest is focused on MOS devices and CMOS VLSI design. He has
published several papers on MNOS nonvolatile memories, MOS transistor
modeling and parameter extraction, CMOS gate delay, and hierarchical
DRC of VLSI circuits. He has also authored a textbook (in Swedish) on
semiconductor devices.