Machine Learning Framework For Predicting Reliability of Solder Joints

Machine learning framework for predicting
reliability of solder joints

Sung Yi
Mechanical and Materials Engineering Department, Portland State University, Portland, Oregon, USA, and
Robert Jones
Portland State University, Portland, Oregon, USA
Abstract
Purpose – This paper aims to present a machine learning framework for using big data analytics to predict the reliability of solder joints. The
purpose of this study is to accurately predict the reliability of solder joints by using big data analytics.
Design/methodology/approach – A machine learning framework for using big data analytics is proposed to predict the reliability of solder joints
accurately.
Findings – A machine learning framework for predicting the life of solder joints accurately has been developed in this study. To validate its accuracy
and efficiency, it is applied to predict the long-term reliability of lead-free Sn96.5Ag3.0Cu0.5 (SAC305) for three commonly used surface finishes
such OSP, ENIG and IAg. The obtained results show that the predicted failure based on the machine learning method is much more accurate than
the Weibull method. In addition, solder ball/bump joint failure modes are identified based on various solder joint failures reported in the literature.
Originality/value – The ability to predict thermal fatigue life accurately is extremely valuable to the industry because it saves time and cost for
product development and optimization.
Keywords Thermal fatigue, Machine learning, Solder joints failure prediction, Thermal cycling, Cracks, Drop
Paper type Research paper
1. Introduction pack, burn-in, etc. When the solder joint of an electronic

package fractures, eventually electrical operational failures
Solder joints have been used in the electronic industry as both
occur. Therefore, the mechanical property and strength of
structural and electrical interconnections between electronic
solder joints become critical to the reliability of electronic
devices and printed circuit boards (PCB) for many years. The
devices and systems.
roles of solders are to make an electrical connection as well as to
Solder Joint Reliability (SJR) is the probability that solder
protect solder joints from external mechanical shock or external
joints will provide proper electrical and mechanical connections
chemical environment. However, when solder joints are under
over a specific period of time (design life) under the design
thermal cyclic loading, mismatch in the coefficient of thermal
operating conditions without failures. Solder joint reliability
expansions (CTE) between PCB and solder balls creates
thermal strains and stresses on the joints, which may finally can be classified as component-level solder joint reliability and
result in cracking (Zhong and Yi (1999), Zhong et al. (2000); board-level solder joint reliability. Component-level solder
Zhong et al. (2002), Kim et al. (2009); Chu et al. (2014), Yao joint reliability addresses the reliability of solder joints within
et al. (2019)). Consequently, the mechanical interconnection is the single package prior to board mounting, while board-level
lost, leading to electrical failures (such as hard/intermittent solder joint reliability deals with the reliability of the solder
open and parametric failure), which in turn causes malfunction joints of an electronic package after it has been mounted on a
of the circuit or whole system. Such failure mechanism is called board or substrate, encompassing both the solder-to-package
thermal fatigue. Thermal fatigue is a major failure mechanism and solder-to-board interfaces. Assurance of board-level solder
of solder joints and plays an important role in determining the joint reliability of a package is essential for electronic systems.
reliability of electronic packages. Moreover, solder balls in ball Many previous works (Tapani Nurmi and Olavi Ristolainen
grid array (BGA) packages are subjected to external forces (2002), Syed (2004); Kang et al. (2004), Hegde et al. (2009);
during post surface mount technology processes such as socket Andersson et al. (2009), Collins et al. (2012); Liu et al. (2017),
insertion, testing, shipment, etc. Poor mechanical properties of Jiang et al. (2018)) have been carried out to understand the
solder balls often lead to missing balls during testing, mark and reliability of solder joints. Various factors including thermal
deformation, fundamental mechanical properties such as
elastic modulus, yield strength, shear strength, fatigue, creep,
The current issue and full text archive of this journal is available on microstructure, intermatallic compound formation,
Emerald Insight at: www.emeraldinsight.com/0954-0911.htm electromigration, void formation, and metallurgy and phase
transformation affect solder joint reliability. Failure of the
Soldering & Surface Mount Technology Received 2 April 2019

© Emerald Publishing Limited [ISSN 0954-0911] Revised 28 May 2019
[DOI 10.1108/SSMT-04-2019-0013] Accepted 28 June 2019
Machine learning framework Soldering & Surface Mount Technology
Sung Yi and Robert Jones
solder joint in electronic products can arise either when solder joint reliability by Weibull distributions can be found in
individual overstress events exceed the strength of the solder many previous works (Tapani Nurmi and Olavi Ristolainen
joint; or when the accumulation of damage due to cyclic (2002), Kang et al. (2004); Suhling et al. (2004), Arulvanan
loading exceeds its fatigue strength. Failure accelerators can et al. (2006); Yew et al. (2009), Herkommer et al. (2010);
include voids, solder mask defects, misalignment of the Collins et al. (2012), Arfaei et al. (2013); Libot et al. (2018)).
terminal, low terminal height, poor intermetallic formation, A typical example is shown in Figure 1. Collins et al. (2012)
and lack of wetting at the terminal and pad interfaces. studied the long-term reliability of 2512 ceramic chip resistors
Reliability and lifetime predictions of solder joints in with Sn96.5Ag3.0Cu0.5 (SAC305) solder joints under
electronic packages have been one of the most difficult accelerated temperature cycling (ATC) conditions. Three
problems in the electronics industry. Assurance of board-level different surface finishes such as organic solderability preservative
solder joint reliability of a package is essential for electronic (OSP), immersion silver (IAg) and electroless nickel immersion
systems. The ability to predict thermal fatigue life accurately is gold (ENIG) was considered. The test vehicles and coupons are
valuable to the industry because it saves time and cost for subjected to a temperature cycles profile with thermal cycling
product development and optimization. The package attributes between 0°C and 100°C, a 10°C/min ramp rate, and 30 min
and test conditions can be simulated using constitutive dwells.
equations for solders. The solder joint responses such as MIL-HDBK-217 (McLeish, 2010) has had a long history as a
partitioned creep strain, total creep strain, and creep strain tool for use in performing reliability predictions on electronic
energy density are then used to determine the parameters of components and systems. The reliability community in general
damage mechanism based life prediction models (Syed (2004), adopted this tool for their own use. As a result, MIL-HDBK-217
Hegde et al. (2009); Chen et al. (2017)). To predict thermal became and still is the most widely known and used reliability
fatigue life accurately, an appropriate constitutive model, prediction method. However, this traditional approach to
experimentally determined material properties, and finite predicting the long-term reliability of devices in field use assumes
element analysis are required all of these contribute that the components of the system have constant failure rates with
significantly to the accuracy of life prediction of solder joints.
“pi-factor” modifiers to account for various quality, operating, and
Phenomenal-based models are often used to establish the
environmental conditions. Therefore, it is inaccurate for predicting
constitutive relationship of materials. The mechanical behavior
actual field failure events and may result in poor designs.
of solders is viscoplastic because its deformation is rate-
IEEE Standard 1413.1 (IEEE Standard:1413.1-2002, 2003)
dependent. As most solder alloys operate at a high homologous
promotes reliability prediction method of using the load (stress)
temperature even at room temperature, creep and stress
and damage models and provides a summary of the common
relaxation are prominent deformation processes occurring in
methods for reliability prediction of electronic system. Weibull,
solder joints during service. In the past, many constitutive
lognormal or other statistical distributions can be used to describe
models were proposed for solders (Darveaux and Banerji,
hazard rates during both infant mortality and wear-out.
1992; Yi et al., 2000; Wang et al., 2001; Yi et al., 2002; Wiese
However, the failure rate could be different at different stage of
et al., 2002; Ma, 2009). However, though many experimental
life of product and may be a combination of many distributions
studies show the strong influence of microstructure change,
intermetallic compound (IMC) layer formation and void representing many different failure modes. As shown in Figure 1,
formation on solder mechanical properties, these phenomena the single Weibull distribution cannot predict the reliability and
have not been considered in most constitutive relationships. the mean life to failure of solder joints accurately.
Therefore, in general, the life prediction model models based The underestimation of the times to solder joint failures may
on those constitutive relationships are believed to be grossly result in fatal failures. For example, on December 28, 2014,
inaccurate and to over-estimate the thermal fatigue life.
Accelerated testing is a powerful tool to understand a physics Figure 1 Weibull distribution
of failure and improve reliability at different stages of the
product development. The standard JEDEC thermal cycle
conditions are well defined in the JEDEC Standard JESD22-
A104D test specification (JESD22-A104D (2005)). However,
the experimental determination of the fatigue life of the solder
joints in electronic packages is a very time consuming process.
First of all, to conduct accelerated temperature cycle tests,
daisy chained packages are assembled and are mounted on test
boards of various thicknesses. After package assembly on
board, the assembled boards are placed inside thermal cycle
chambers. Then thermal cycles that cover the range of
accelerated conditions typically used by the industry are
applied and in situ resistance is measured to detect solder joint
failures. Then the failure data is further analyzed by the Weibull
analysis software to determine the Weibull parameters and the
mean life. However, the Weibull analysis may underestimate or
overestimate the reliability and the mean life to failure
significantly. Such underestimation or overestimation of the
Indonesia Air Asia Flight 8501, an Airbus A320-216, crashed increasing of their thicknesses. During thermal cycles, the
into the Java Sea during bad weather, killing all 155 passengers fatigue failure occurs mainly in the Cu3Sn and Cu6Sn5
and seven crew on board. On December 1, 2015, Indonesia’s interface.
National Transportation Safety Committee stated that the Figure 3 shows the schematic diagram of the IMC formation.
cracking of a solder joint of both channel A and B resulted in After the reflow process, a thin layer of Cu6Sn5 is formed at the
loss of electrical continuity and two flight augmentation interface of Cu pads and solders and the thickness of Cu6Sn5
computer (FAC) fault, which caused the autopilot and auto- layer increases as the reflow time increases. The thickness of
thrust disengaged and contributed to the subsequent loss of Cu6Sn5 layer on the bottom pad without Ni barrier on the PCB
control and a malfunction in two of the plane’s rudder travel side is larger than the one formed near the Ni barrier on the
limiter units (Komite Nasional Keselamatan Transportasi, package side. After thermal cycles or long thermal storage time,
2014). On the other hand, the overestimation of the times to the Cu3Sn layer is formed at the interface of Cu6Sn5 and Cu
failure may lead to consequently underestimating the in the bottom IMC layer on the PCB side, while it is not formed
maintenance interval, which may unnecessarily increase the in the top IMC layer on the package side since Ni is a good
maintenance cost. barrier (Zhong and Yi (1999), Tseng and Duh (2013); Yoon
The goal of this study is to propose a general machine et al. (2015)). When the ENIG surface-treated substrate is
learning (ML) framework to predict the reliability of solder soldered with SAC solders, the formation of the IMC layer
joints accurately for electronic devices and systems. In this proceeds to the process. During the reflow, Au that has a high
study, it will be demonstrated how to achieve this goal using the solubility and a high diffusion rate in the molten solder is
ML method. instantaneously dissolved in the solder and the Ni surface is
exposed to the molten solder. Thereafter, Ni is melted as a
2. Primary factors influencing the reliability of molten solder and Ni in the molten solder is supersaturated to
form a (Ni, Cu)3Sn4 layer (Zhong and Yi (1999)). At this time,
solder joints
Ni3Sn4 or (Cu, Ni)6Sn5 layer may be formed depending on the
ML models for solder joint reliability assessment can be type of the solder. Then, in the Ni-P layer, Ni participates in the
constructed from three parts: training data, a set of attributes formation of (Ni, Cu)3Sn4, and the concentration of P
that influence the failures of solder joints, and a ML algorithm increases at the top surface of the Ni-P layer. Finally, a very thin
to map attributes to reliability. Figure 2 summarizes the Ni-Sn-P ternary IMC layer is formed between the (Ni,
fundamental factors influencing the reliability of solder joints. Cu)3Sn4 layer and the Ni3P layer. This Ni-Sn-P layer has voids
of several nanometers in size, but the mechanism of void
2.1 Formations of intermetallic compounds and voids formation has not yet been completely elucidated. Kirkendall
The Cu/Sn system is one of the simplest metallurgical options void is considered to be due to the difference in the diffusion
for solder joints. During the reflow, the IMC layer is formed rate between Sn and Ni, and that voids formed in the top layer
between the Cu-pad and the solder joint(Zhong and Yi (1999), of Ni-P during immersion Au process exist in IMC during
Gong et al. (2008), Kim and Yu (2010); Zhong et al. (2010), soldering. This void formation appears to be one of the main
Kim et al. (2009); Tseng and Duh (2013), Yoon et al. causes of brittle fracture of the solder joint. The voids of the Ni-
(2015); Park et al. (2018)). The longer the reflow time, the Sn-P layer after reflow are relatively larger in the ENIG than in
thicker the IMC layer. The IMC formation is believed to be one the ENEPIG, which is attributed to the fact that the ENIG
of the necessary conditions for a good solder joint. The surface brittle fracture reliability is lower than that of ENEPIG (Tseng
finish of materials to be jointed also plays an important role in and Duh (2013)). Table I shows various IMC layers formed
deciding the joint life and failure modes. Many previous studies
(Zhong and Yi, 1999; Gong et al., 2008; Kim et al., 2009; Figure 3 Schematic diagram of IMC formation on Cu pad
Zhong et al., 2010; Yi, 2014; Kim and Yu, 2010; Zhong et al.,
2010; Tseng and Duh, 2013; Yoon et al., 2015) show that the
tensile and shear strengths of IMC layers deteriorate with
Figure 2 Fundamental factors influencing the reliability of solder joints

Table I Intermetallic compound formation

Aging
Reflow temp. temp.(8C)/
IMC Kirkendall void Solder Pad Surface finish (°C)/Time(min) Time(hrs) Reference
Ni3Sn4, (Au, 37Pb/63Sn Cu Electrolytic Ni/ 210/1 150/1000 Zhong and Yi
Ni)Sn4 Au & ENIG (1999)
Cu6 Sn5 Cu3Sn/Cu Sn-3.5Ag Cu 260/1 As Reflowed Kim and Yu
interface (2010)
Cu6Sn5, Cu3Sn Cu6Sn5, Cu3Sn/ Sn-3.5Ag Cu 260/1 150/1000/ Kim and Yu
Cu interface 3000 (2010)
(Cu, Ni)6Sn5, in Ni3 P SAC305 Cu ENEPIG 240/1,3 Tseng and Duh
Ni3P (2013)
(Cu, Ni, Pd)6- in Ni3 P SAC305 Cu ENIG 240/5,10 Tseng and Duh
Sn5, Ni3P (2013)
Ni3Sn4, P-rich Sn-1Ag Cu ENIG 260/ 1,10,30,60 Yoon et al.
Ni (2015)
between Cu-pads and solder joints during the reflow and Figure 4 Crack propagation along the Cu3Sn/Cu6Sn5 interface
thermal storage.
2.2 Mechanical failure modes

Lee et al. (2005) observed three fracture modes have been
observed in the fatigue testing. Those modes include in solder,
mixed solder/IMC and in IMC fracture modes. Mattila and
Kivilahti (2006) also investigated the failure mechanisms in
solder joints after drop testing and also after consecutive
thermal and mechanical loading, using different surface
finishes in copper pads. The test assemblies were thermally
cycled (45°C/1125°C, 15 min. dwell time, 750 cycles) or
isothermally annealed (125°C, 500 h) before the standard drop
test. The average number of drops to failure increased when the
thermal cycling was performed before the drop test (1,500 G
deceleration, 0.5 ms half-sine pulse). They observed that four
different failure modes:
1 cracking of the reaction layers on either side of the
interconnections;
2 cracking of the bulk solder;
3 mixed mode of component-side intermetallic and bulk
solder cracking; and
4 void assisted cracking of the component-side Cu3Sn layer. and drop test (De Monlevade and Peng (2007), Zhang et al.
Similarly, Lai et al. (2006) reported four failure modes such as (2009); Libot et al. (2018)). Fracture surfaces or interfaces vary
bulk solder failure, pad peeling, mandatory IMC fracturing, and often depend on test methods. The solder ball/bump joint
and IMC fracturing with a certain portion of solders. They failure modes can be divided into six categories below:
observed that most of failures occurred typically at the 1 bulk solder fracture;
interconnection on the package side of along the intermetallic 2 within IMC region: Cu3Sn/Cu6Sn5 interface or (Ni,
phase layer and pad peeling took place on the board side only. Cu)3Sn4, (Cu, Ni)6Sn5;
De Monlevade and Peng (2007) performed JEDEC standard 3 bulk solder intermetallic layer: Solder/Cu6Sn5 interface;
board-level drop testing after thermal aging at three different 4 broken trace lines;
temperatures for different times to study preferred crack 5 void assisted cracking; and
propagation paths in Sn-3.8 per cent, Ag-0.7 per cent, Cu 6 pad cratering.
solder joints and reported that failures occurred typically along The schematic diagram for possible failure modes on a solder
the ternary Cu3Sn/Cu6Sn5 intermetallic layer as shown in ball joint is shown in Figure 5. Table II shows various solder
Figure 4. Libot et al. (2018) studied a crack through the solder joint failure modes reported in the literatures.
joint for the WLP900 assembly and reported that the crack is
located in the solder bulk on the component side.
3. Reliability
Solder ball or bump joints may fracture during various
reliability tests such as isothermal aging, thermal cycling, If t is a random variable denoting the time to failure, the
thermal shock, mechanical loading, bending test, impact test, reliability function at time t can be defined as:
Figure 5 Failure modes for solder joints where
P ðt > tÞ ¼ 1 P ðt < tÞ ¼ 1 F ðtÞ (6)
The conditional probability function h(t) defined by:
P ðt < t t 1 Dtjt > tÞ P ðt < t t 1 DtÞ

hðtÞ ¼ lim ¼ lim
Dt!0 Dt Dt!0 DtP ðt > tÞ
(7)
The conditional function is generally called the hazard rate.

Substituting equations (4) and (6) into equation (7) yields:
f ðtÞ f ðt Þ
hðtÞ ¼ ¼ (8)
RðtÞ ¼ P ðt > tÞ: (1) 1 F ðtÞ RðtÞ
Then the cumulative distribution function of failure F(t) The main reason for defining the h(t) function is that it is often
becomes: more convenient to work with than f(t). The hazard failure rate
h(t) measures the instantaneous failure, while the failure
F ðtÞ ¼ P ðt < tÞ ¼ 1 P ðt > tÞ ¼ 1 RðtÞ (2) density (PDF) is defined as the derivative of the failure
probability and measures the overall rate of failures. In general,
where F(t) is the fraction of all units in the population the hazard function is a combination of a decreasing hazard of
which fails by time t. If the time to failure t has a probability early failure, an increasing hazard of wear-out failure, and some
density function f(t), then F(t) can expressed as: constant hazard of random failure.
ðt The failure times of this population follow a distribution with
F ðtÞ ¼ f ðt0 Þdt0 (3) a probability density function of f(t), then the population
0
MTTF can be mathematically calculated by:
where the failure density function f(t) is defined by: ðt ðt
MTTF ¼ RðtÞdt ¼ tf ðtÞdt (9)
P ðt < t t 1 DtÞ ¼ f ðtÞDt (4) 0 0
Sometimes it is more convenient to deal with the

probability of failure between time t and t 1 D t given 4. Multi-Layer Feed-Forward neural network
that there were no failures up to time t. The
Artificial Neural Network (ANN) technique is the mimics of
probability of failure between time t and t 1 D t can be
human brain and is capable to elevated functions, such as
defined as:
adaptation or learning, fault tolerance, input-output mapping,
P ðt < t t 1 DtÞ non-liner function and lower level functions such as data pre-
P ðt < t t 1 Dtjt > tÞ ¼ (5)
P ðt > tÞ processing for different kinds of inputs. Neural network (NN)
systems are combination of many simple processing elements
Table II Solder joint failure modes

De
Mattila and Monlevade Zhang Putaala
Kivilahti Lai et al. and Peng Lee et al. Wu et al. et al. et al. Libot et al.
Place Failure mode (2006) (2006) (2007) (2005) (2018) (2009) (2014) (2018)
Package Bulk solder fracture x x x x x
side
IMCs interface x x x x x x x x
Solder/ IMCs x x x x x x
Broken trace lines
Void assisted crack x x
Pad cratering
Board side Bulk solder fracture x x
IMCs interface x x x
Solder/ IMCs x x x x
Broken trace lines x
Void assisted crack x
Pad cratering x x
which operate in parallel (also called parallel distributed 5.1 General model and dataset description
function) and have been inspired both by human biological The feed-forward neural network created for this study
nervous systems and mathematical theories of learning, employs the TensorFlow framework in Python. In this case,
information processing and control (Bishop,1995; Haykin, four attributes were used for two distinct categories, namely the
2009). number of cycles to failure and the SAC305 surface finish
A multi-layer feed-forward neural network structure, trained treatment. One-hot encoding was used to break-up the surface
by back-propagation, is used in this study. In general, an ANN finish treatment category into three binary attributes
structure consists of several layers and each layer consists of corresponding to ENIG, Immersion Silver (IAg) and OSP,
several neurons, which is also called processing elements (PE). respectively.
Figure 6 shows a typical structure of a feed-forward ANN The complete dataset was divided randomly using seventy-
model, in which the left column is input layer, the right most per cent of the total points for training points, and the
column is output layer, in between input layer and output layer remaining thirty per cent used for testing the trained model. No
is a hidden layer. Generally there could be more than one artificial discretion was imposed regarding the makeup of the
hidden layer. training and testing data. In some cases, the dataset parsing
The neuron is a processing element (PE) whose output is may affect the accuracy of the ML model in predicting test
calculated by multiplying its inputs by a weight vector, outputs in certain regimes of the model. Increasing the total
summing the results, and applying an activation function to the dataset size and using categorical discretion are ways to increase
sum. Each processing element has several inputs and one model fidelity. A pseudo-randomized dataset was also created
output. The relation between input, x, and output, y, of a single by modulating the cycles to failure coordinates within the
processing element can be expressed as: domain for each surface finish treatment category to plot
! the output of the network and test the prediction capability of
Xm
yi ¼ f wji xj 1 u i (10) the fully-trained model. The pseudo-random points are a
j¼1 means to examine potential prediction results for range
reasonability, as there are no corresponding failure rate
where wji are multiplying factors (or weights), u i is a constant coordinates for this data. For the complete dataset used for
(normally referred to as threshold or bias), m is the number of training and testing data points, the failure rate coordinate was
inputs, f is called activation function, which is normally selected used to calculate the error that the model output represents.
from a list of S-shaped functions such as sigmoid, hyperbolic Both the attributes and the output were pre- and post-
tangent or similar. Data flow in this type of network is from the processed using the min-max scaler. In general, it is desirable
input layer to the hidden layer and there on to the output layer. that the dataset contain at least one order of magnitude greater
points than the number of features used by the neural network.
5. Solder joint reliability assessment For this study, a total of 90 data points were used, which
enabled the ML model to effectively characterize and predict
ML models for solder joint reliability assessment can be the trends of the tested solder joint reliability.
constructed from three parts: training data, a set of attributes
that influence the failures of solder joints, and a ML algorithm 5.2 Machine learning model structure
to map attributes to reliability. The structure of the network used herein incorporates the input
To demonstrate the reliability assessment capability based on layer, three hidden layers, and an output layer. The input layer
ML, the long-term reliability of lead-free Sn96.5Ag3.0Cu0.5 is comprised of four nodes that take in the scaled attribute data
(SAC305) for three commonly used surface finishes such OSP, for each data point and passes it to the nodes in the first hidden
ENIG and IAg was evaluated. The failure data reported by layer. For this model, 50 nodes are used in both the first and
Collins et al. (2012) are employed. third hidden layers, and the second hidden layer uses 100
nodes. This model is fully-interconnected, meaning that each
node of a layer is passed the output from each of the nodes of
the previous layer. The output layer is also comprised of a single
node, which represents the model prediction of the failure rate
Figure 6 Schematic diagram of multi-layer feed-forward neural
for that data point. The structure of the model used in this
network
study is shown in Figure 7.
In both training and testing, the mean squared error (MSE)
is calculated between the output layer and the observed failure
rate for the input data as shown in equation (11). The
computed error is used during training by the optimization
algorithm to back-propagate corrections to the parameters of
the hidden and output nodes.
X
MSE ¼ î 2
¼1n Yi Y (11)
i
Each node of the hidden layers exhibits the same behavior

for the model in this study. Each is characterized by a bias
qffiffiffiffiffiffiffiffiffiffiffiffiffiffi
Figure 7 ML algorithm
w :¼ w a vcorrected
dw = corrected
sdw 1e (12)
qffiffiffiffiffiffiffiffiffiffiffiffiffiffi
b :¼ b a vcorrected
db = s corrected
db 1 e (13)
The momentum terms (vdw, vdb) and the RMSprop terms (sdw,
sdb) used in equations (12) and (13) are calculated via equations
(14) and (15) using their initialized values, the most current
calculated derivatives (dw, dv) obtained from gradient descent
in the current training mini-batch and pre-defined hyper
parameters ( b 1, b 2).
vdw ¼ b 1 vdw 1 ð1 b 1 Þdw (14a)
vdb ¼ b 1 vdb 1 ð1 b 1 Þdb (14b)
sdw ¼ b 2 sdw 1 ð1 b 2 Þdw2 (15a)
sdb ¼ b 2 sdb 1 ð1 b 2 Þdb2 (15b)
This implementation of the Adam algorithm uses bias

correction through exponential decay of ( b 1, b 2) according to
equations (16) and (17).
and weight, referred to as parameters, and an activation
vcorrected
dw ¼ vdw = 1 b t1 (16a)
function. The biases are all initialized as zero from the
outset, and the weights are initialized by Xavier

initialization designed to keep the scale of gradients vcorrected
db ¼ vdb = 1 b t1 (16b)
consistent in all layers. The activation function determines
the output of the node in response to the node inputs,
biases and weights. In this study, each node uses a rectified scorrected
dw ¼ sdw = 1 b t2 (17a)
linear unit (relu) which returns zero for any negative value,
or for any positive value returns that value. The relu
scorrected
db ¼ sdb = 1 b t2 (17b)
function can be written f(x)= max(0, x). The output layer
uses the same bias and weight properties and initialization The learning rate, a, is fixed in this algorithm for all layers at
scheme as the hidden layers. The activation function,
0.001, based on experimental tuning for this implementation.
however, uses direct matrix multiplication of the third
For the remaining hyperparameters, b 1 = 0.9, b 2 = 0.999, e =
hidden layer outputs and the output layer weights, then
108 were chosen as recommended in Kingma and Ba (2014).
adds in the output layer biases.
For 2,450 epochs the final training cost was calculated at
1.38 104, with a corresponding testing cost of 5.19 104.
5.3 ML Model training, testing and modification
For each training epoch, a testing epoch is accomplished using
The model in this study was trained using the training data
the respective dataset. The difference between training and
points randomly selected from the aggregated dataset. The
testing execution is merely that parameter optimization does
points were shuffled and passed into the network for
processing. Each pass through the entire training dataset is not occur after testing. A depiction of the error and its evolution
known as an epoch. At the completion of an epoch of throughout training is depicted below.
training, an optimized differential to each bias and weight is
computed using the vectorized MSE for each point. The 6. Machine learning model results
adaptive moment estimation algorithm (colloquially called
Adam) was used as a computationally efficient alternative to The ML model was trained through 2500 epochs. The
stochastic gradient decent for iterative changes to the resulting cost after each pass through the dataset was logged,
parameters with respect to the objective Kingma and Ba and the completed log was plotted for both the training and
(2014). Adam combines the advantages of gradient descent testing mini-batches, as shown in Figure 8. The final training
with momentum and RMSprop to compute the iterative cost was found to be 1.65 104. Because the parameter
parameter corrections. For a given iteration t, Adam modifications result from a training epoch, the testing cost is
computes weight and bias corrections, respectively, as normally higher. The final testing cost in this case was
shown in equations (12) and (13). determined to be 5.53 104.
Figure 8 ML cost versus epochs of training
For each additional epoch of training, it should be noted that

there is a stochastic distribution of the cost function that Figure 9 SAC 305 failure data with ML validation
increases in magnitude after approximately 103 epochs of
training. However, it is also apparent that while the training
smoothed cost function continues to decline, the testing
smoothed cost function declines at a more moderate rate.
Continuous optimization of the ML model is possible to some
extent with additional training, although the diminishing
optimization return for each added epoch should be weighed
against the computational cost of doing so. It was noted
through experimentation that beyond 5000 training epochs, the
stochastic variation of the final training and testing costs did not
exhibit consistently lower cost than the model achieved after
2500 epochs.
The model in this study was tested for predictive capability
after the completion of training. The results for the model
predictions are shown in Figure 9. The actual failure data is
plotted along with the model predictions for a visual
comparison of model fidelity. Additionally, pseudo-
randomized failure data were created for each surface finish
treatment within similar ranges of the original data sets. Those
points were fed into the model to “predict” the failure rate for a
given failure cycle.
The error between the experimental results and the predicted
failures is defined as:
evaluated and are given in Table III. As shown in Table III, the
jMeasured failure Predicted failurej present ML algorithm predicts the failures of solder joints
Error ¼ (18) much more accurately comparing with the Weibull
Measured failure
distribution.
The average errors for ENIG, IAg and OSP cases based on Substantial improvement in optimization, however, could be
both the present ML algorithm and the Weibull distribution are achieved by increasing the number and distribution of data
Table III Average errors for predicted failures relationships between the attributes are learned at the same
time, not individually. Prior to commencing training, the
Average errors (%)
dataset is shuffled randomly to prevent the model from
Surface treatment ML Weibull (Collins et al. (2012))
‘memorizing’ data and inferring relationships that may or may
IAg 5.2 22.0 not be consequential. As is shown in the previous figures, the
ENIG 5.1 11.7 model is able to determine the importance of surface finish
OSP 5.0 21.5 treatment category on its own and therefore applies weights and
biases to accommodate this learned behavior.
points in each relevant attribute category, to allow the model to

7. Conclusions
effectively refine its predictions consistent with the behavior of
more complete data sets. While the so-called “10” rule This method can be used to predict the life of solder joints
generally holds true for ML datasets, it is necessary to screen accurately. The ability to predict thermal fatigue life accurately
the data to ensure that rule is complied with in all categories. is valuable to the industry because it saves time and cost for
For the case of the data used in this study, there were 36 data product development and optimization. The proposed method
points available with respect to both ENIG and IAg surface is applied to predict the long-term reliability of lead-free
finish treatments, but only 19 available for the OSP surface Sn96.5Ag3.0Cu0.5 (SAC305) for three commonly used
finish treatment. Because the data was parsed into mini-batches surface finishes such OSP, ENIG, and IAg to validate its
randomly, it is possible that entire regions of data for a specific accuracy and efficiency. The error analysis shows that the
attribute be omitted from training. This is particularly true if predicted failures based on the ML method are much more
the data is sparse for an attribute in any given region. accurate than ones by the Weibull method.
In this study, data were weighted for each surface finish
treatment as shown in Figure 10. For the ENIG surface finish
treatment, the mean cycles to failure was 2010.0, with a References
standard deviation of 681.2. Similarly for IAg and OSP, the Andersson, C., Vandevelde, B., Noritake, C., Sun, P.,
means were 1605.8 and 2070.9 with standard deviations of Tegehall, P.E., Andersson, D.R., Wetter, G. and Liu, J.
729.4 and 709.5, respectively. It was expected that the fidelity (2009), “Thermal cycling of lead-free sn-3.8 ag-0.7 cu
of model predictions in sparse data regimes used for training 388pbga packages”, Soldering & Surface Mount Technology,
would suffer, for example, at the low or high failure rate Vol. 21 No. 2, pp. 28-38.
margins. It is interesting to note from the results depicted in Arfaei, B., Mahin-Shirazi, S., Joshi, S., Anselm, M., Borgesen,
Figure 9 that even in these sparse data regimes, the model is P., Cotts, E., Wilcox, J. and Coyle, R. (2013), “Reliability
able to predict in a manner consistent with expected behavior of and failure mechanism of solder joints in thermal cycling
both the aggregate data as well as the surface finish treatment tests”, Electronic Components and Technology Conference
specific curve. The mutual reliance of this model on overall (ECTC), 2013 IEEE 63rd, IEEE, pp. 976-985.
data behavior and attribute-specific relationships helps Arulvanan, P., Zhong, Z. and Shi, X. (2006), “Effects of
illustrate one powerful capability of ML predictive models. process conditions on reliability, microstructure evolution
The model is able to discern the importance of attributes and failure modes of snagcu solder joints”, Microelectronics
quite well. Using one-hot encoding, a three-parameter attribute Reliability, Vol. 46 Nos 2/4, pp. 432-439.
is broken distinctly into three separate attributes. The training Bishop, C.M. (1995), Neural Networks for Pattern Recognition,
mini-batch is passed through the model as a whole. The Clarendon Press, Oxford.
Chen, Y., Jin, Y. and Kang, R. (2017), “Coupling damage and
Figure 10 SAC305 reliability data histogram reliability modeling for creep and fatigue of solder joint”,
Microelectronics Reliability, Vol. 75, pp. 233-238.
Chu, Y.F.S., Yi, P. and Geng, (2014), “Thermal fatigue life
prediction of solder joints of plastic ball grid array packages”,
International Journal of Materials and Structural Integrity,
Vol. 8 Nos 1/2/3, pp. 32-20.
Collins, M.N., Punch, J. and Coyle, R. (2012), “Surface finish
effect on reliability of sac 305 soldered chip resistors”,
Soldering & Surface Mount Technology, Vol. 24 No. 4,
pp. 240-248.
Darveaux, R. and Banerji, K. (1992), “Constitutive relations
for tin-based solder joints”, IEEE Transactions on
Components, Hybrids, and Manufacturing Technology, Vol. 15
No. 6, pp. 1013-1024.
De Monlevade, E.F. and Peng, W. (2007), “Failure mechanisms
and crack propagation paths in thermally aged pb-free solder
interconnects”, Journal of Electronic Materials, Vol. 36 No. 7,
pp. 783-797, doi: 10.1007/s11664-006-0062-8, ISSN 1543-
186X, available at: https://doi.org/10.1007/s11664-006-0062-8
Gong, J., Liu, C., Conway, P.P. and Silberschmidt, V.V. Proceedings – Annual Reliability and Maintainability
(2008), “Evolution of cusn intermetallics between molten Symposium (RAMS), pp. 1-6, doi: 10.1109/
snagcu solder and cu substrate”, Acta Materialia, Vol. 56 RAMS.2010.5448044.
No. 16, pp. 4291-4297. Ma, H. (2009), “Constitutive models of creep for lead-free
Haykin, S.S. (2009), Neural Networks and Learning Machines/ solders”, Journal of Materials Science, Vol. 44 No. 14,
Simon Haykin, Prentice Hall, New York, NY. pp. 3841-3851.
Hegde, P., Whalley, D.C. and Silberschmidt, V.V. (2009), Mattila, T.T. and Kivilahti, J.K. (2006), “Reliability of lead-
“Creep damage study at powercycling of lead-free free interconnections under consecutive thermal and
surface mount device”, Computational Materials Science, mechanical loadings”, Journal of Electronic Materials, Vol. 35
Vol. 45 No. 3, pp. 638-645. No. 2, pp. 250-256, doi: 10.1007/BF02692443, ISSN 1543-
Herkommer, D., Punch, J. and Reid, M. (2010), “A 186X, available at: https://doi.org/10.1007/BF02692443
reliability model for sac solder covering isothermal Park, H.-P., Seo, G., Kim, S. and Kim, Y.-H. (2018), “Effects
mechanical cycling and thermal cycling conditions”, of solder volume and reflow conditions on self-alignment
Microelectronics Reliability, Vol. 50 No. 1, pp. 116-126. accuracy for fan-out package applications”, Journal of
IEEE Standard:1413.1-2002 (2003), IEEE guide for selecting Electronic Materials, Vol. 47 No. 1, pp. 133-141.
and using reliability predictions based on IEEE 1413, IEEE, Putaala, J., Salmela, O., Nousiainen, O., Kangasvieri, T.,
New York. Vähäkangas, J., Uusimäki, A. and Lappalainen, J. (2014),
JESD22-A104D (2005), Temperature Cycling. JEDEC “Lifetime prediction and design aspects of reliable lead-free
Standard. non-collapsible bga joints in ltcc packages for rf/microwave
Jiang, M., Fu, G., Wan, B., Xue, P., Qiu, Y. and Li, Y. (2018), telecommunication applications”, Soldering & Surface Mount
“Failure analysis of solder layer in power transistor”, Technology, Vol. 26 No. 3, pp. 117-128.
Soldering & Surface Mount Technology, Vol. 30 No. 1, Suhling, J.C., Gale, H.S., Wayne Johnson, R., Nokibul Islam,
pp. 49-56. M., Shete, T., Lall, P. and Thompson, J.R. (2004),
Kang, S.K., Lauro, P., Shih, D.-Y., Henderson, D.W., “Thermal cycling reliability of lead-free chip resistor solder
Gosselin, T., Bartelo, J., Steve R, C., Goldsmith, C., Puttlitz, joints”, Soldering & Surface Mount Technology, Vol. 16 No. 2,
K.J. and Hwang, T.-K. (2004), “Evaluation of thermal pp. 77-87.
fatigue life and failure mechanisms of sn-ag-cu solder joints Syed, A. (2004), “Accumulated creep strain and energy density
with reduced ag contents”, Electronic Components and based thermal fatigue life prediction models for snagcu
Technology Conference, 2004. Proceedings. 54th, Vol. 1, IEEE, solder joints”, Electronic Components and Technology
pp. 661-667. Conference, 2004. Proceedings. 54th, Vol. 1, IEEE,
Kim, S.H. and Yu, J. (2010), “Secondary imc formation pp. 737-746.
induced by kirkendall voiding in cu/sn–3.5 ag solder Tapani Nurmi, S. and Olavi Ristolainen, E. (2002),
joints”, Journal of Materials Research, Vol. 25 No. 9, “Reliability of tin-lead balled bgas soldered with lead-free
pp. 1854-1858. solder paste”, Soldering & Surface Mount Technology, Vol. 14
Kim, Y.M., Oh, C.Y., Roh, H.R. and Kim, Y.H. (2009), “A No. 2, pp. 35-39.
new cu-zn solder wetting layer for improved impact Tseng, C.-F. and Duh, J.-G. (2013), “Correlation between
reliability”, 59th Electronic Components and Technology microstructure evolution and mechanical strength in the sn–
Conference, pp. 1008-1013. 3.0 ag–0.5 cu/enepig solder joint”, Materials Science and
Kingma, D.P. and Ba, J. (2014), “Adam: a method for Engineering: A, Vol. 580, pp. 169-174.
stochastic optimization”, arXiv Preprint arXiv, Vol. 1412, Wang, G.Z., Cheng, Z.N., Becker, K. and Wilde, J. (2001),
p. 6980. “Applying anand model to represent the viscoplastic
Komite Nasional Keselamatan Transportasi (2014), “Aircraft deformation behavior of solder alloys”, Journal of Electronic
accident investigation report.” Packaging, Vol. 123 No. 3, pp. 247-253.
Lai, Y.S., Yang, P.-F. and Yeh, C.-L. (2006), “Experimental Wiese, S., Feustel, F. and Meusel, E. (2002),
studies of board-level reliability of chip-scale packages “Characterisation of constitutive behaviour of snag, snagcu
subjected to jedec drop test condition”, Microelectronics and snpb solder in flip chip joints”, Sensors and Actuators A:
Reliability, Vol. 46 Nos 2/4, pp. 645-650. Physical, Vol. 99 Nos 1-2, pp. 188-193.
Lee, H.-T., Lin, H.-S., Lee, C.-S. and Chen, P.-W. Wu, I.C., Wang, M.H. and Jang, L.S. (2018), “Experimental
(2005), “Reliability of sn–ag–sb lead-free solder joints”, location of damage in microelectronic solder joints after a
Materials Science and Engineering: A, Vol. 407 Nos 1/2, board level reliability evaluation”, Engineering Failure
pp. 36-44. Analysis, Vol. 83, pp. 131-140.
Libot, J.B., Joël, A., Dalverny, O., Arnaud, L., Milesi, P. and Yao, P., Li, X., Xu, H. and Xu, L. (2019), “Shear strength and
Dulondel, F. (2018), “Microstructural evolutions of sn-3.0 fracture mechanism for full cu-sn imcs solder joints with
ag-0.5 cu solder joints during thermal cycling”, different cu3sn proportion and joints with conventional
Microelectronics Reliability, Vol. 83, pp. 64-76. interfacial structure in electronic packaging”, Soldering &
Liu, F., Zhou, J. and Yan, N. (2017), “Thermal cycling effect Surface Mount Technology, Vol. 31 No. 1, pp. 6-19.
on the drop reliability of bga lead-free solder joints”, Soldering Yew, M.C., Tsai, M., Hu, D.C., Yang, W.K. and Chiang, K.
& Surface Mount Technology, Vol. 29 No. 4, pp. 199-202. N. (2009), “Reliability analysis of a novel fan-out type wlp”,
McLeish, J.G. (2010), “Enhancing mil-hdbk-217 reliability Soldering & Surface Mount Technology, Vol. 21 No. 3,
predictions with physics of failure methods”, 2010 pp. 30-38.
Yi, S. (2014), “Shear strengths of cbga/pbga solder ball Zhong, C.H. and Yi, S. (1999), “Solder joint reliability of
joints with lead-free solder pastes”, International Journal of plastic ball grid array packages”, Soldering & Surface Mount
Materials and Structural Integrity, Vol. 8 Nos 1/2/3, Technology, Vol. 11 No. 1, pp. 44-48.
pp. 62-75. Zhong, C.H., Yi, S. and Whalley, D.C. (2002), “Solder ball
Yi, S., Luo, G. and Chian, K.S. (2002), “A viscoplastic failure mechanisms in plastic ball grid array packages”,
constitutive model for 63sn37pb eutectic solders”, Journal of Soldering & Surface Mount Technology, Vol. 14 No. 2,
Electronic Packaging, Vol. 124 No. 2, pp. 91-96. pp. 40-50.
Yi, S., Luo, G., Chian, K.S. and Chen, W.T. (2000), “A Zhong, W., Qin, F., An, T. and Wang, T. (2010), “Mechanical
viscoplastic constitutive model for 63sn37pb eutectic properties of intermetallic compounds in solder joints”, 11th
solders”, Electronic Materials and Packaging, 2000. International Conference on Electronic Packaging Technology
(EMAP 2000). International Symposium on, IEEE, High Density Packaging, Elsevier, pp. 520-524, doi: 10.1109/
pp. 183-190. ICEPT.2010.5582669.
Yoon, J.-W., Bang, J.-H., Lee, C.-W. and Jung, S.-B. (2015), Zhong, C.H., Yi, S., Mui, Y.C., Howe, C.P., Olsen, D. and
“Interfacial reaction and intermetallic compound formation Chen, W.T. (2000), “Missing solder ball failure mechanisms
of sn–1ag/enig and sn–1ag/enepig solder joints”, Journal of in plastic ball grid array packages”, Electronic Components &
Alloys and Compounds, Vol. 627, pp. 276-280. Technology Conference, 2000. 2000 Proceedings. 50th, IEEE,
Zhang, B., Ding, H. and Sheng, X. (2009), “Reliability study of pp. 151-159.
board-level lead-free interconnections under sequential
thermal cycling and drop impact”, Microelectronics Reliability, Corresponding author
Vol. 49 No. 5, pp. 530-536. Sung Yi can be contacted at: syi@pdx.edu
For instructions on how to order reprints of this article, please visit our website:
www.emeraldgrouppublishing.com/licensing/reprints.htm
Or contact us for further details: permissions@emeraldinsight.com

Machine Learning Framework For Predicting Reliability of Solder Joints

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Machine Learning Framework For Predicting Reliability of Solder Joints

Uploaded by

Copyright:

Available Formats

Machine learning framework for predicting

reliability of solder joints

1. Introduction pack, burn-in, etc. When the solder joint of an electronic

Soldering & Surface Mount Technology Received 2 April 2019

Figure 2 Fundamental factors inﬂuencing the reliability of solder joints

Table I Intermetallic compound formation

2.2 Mechanical failure modes

Figure 5 Failure modes for solder joints where

P ðt > tÞ ¼ 1 P ðt < tÞ ¼ 1 F ðtÞ (6)

The conditional probability function h(t) deﬁned by:

P ðt < t t 1 Dtjt > tÞ P ðt < t t 1 DtÞ

The conditional function is generally called the hazard rate.

Sometimes it is more convenient to deal with the

Table II Solder joint failure modes

Each node of the hidden layers exhibits the same behavior

vdw ¼ b 1 vdw 1 ð1 b 1 Þdw (14a)

vdb ¼ b 1 vdb 1 ð1 b 1 Þdb (14b)

sdw ¼ b 2 sdw 1 ð1 b 2 Þdw2 (15a)

sdb ¼ b 2 sdb 1 ð1 b 2 Þdb2 (15b)

This implementation of the Adam algorithm uses bias

Figure 8 ML cost versus epochs of training

For each additional epoch of training, it should be noted that

points in each relevant attribute category, to allow the model to

You might also like