Professional Documents
Culture Documents
Rami Mangoubi
Rami Mangoubi
CELL CHROMATIN
level. Thus, given a texture decomposition that provides B H0 : X1 , . . . , Xb , . . . XS ∼ f0 = f0b (xsb )(5)
wavelet bands, the characterizing pdf for that texture is b=1 s=1
B
S
f (x1 , . . . , xB ; ω1 , . . . , ωB , p1 , . . . , pB ) = H1 : X 1 , . . . , X b , . . . , X S ∼ f1 = f1b (xsb )(6)
B b=1 s=1
pb pb
e−(|xb |/ωb ) (2)
2ωb Γ(1/pb )
b=1 where each of the densities p0b and p1b , b = 1, . . . , B is a
generalized Gaussian density function given in (1), with re-
The above texture characterization is an approximation; it as-
spective parameters (ω1 , p1 ), . . . , (ωB , pB ).
sumes that the wavelet coefficients are statistically indepen-
dent across resolution levels. Though it may be approximate,
this statistical model of texture decomposition has been val- 2.2.1. The χp random variables
idated on many types of textures [3]. In particular, it was
shown to be applicable to the analysis of stem cell colonies To derive the log-likelihood ratio statistic in Section 2.2.2 for
in [5, 7] and to nuclei of various stem cell lineages in [10]. the above test and its density functions, we need to generalize
In all these references, the Kullback-Leibler distance is the χ2 random variable. For a generalized Gaussian random
used as a basis for classifying the texture. The KL distance variable x with parameters ω and p, define
between two density functions f1 and f2 is given by x p
χp = (7)
f2 (x) ω
DKL (1, 2) = f1 (x) ln dx. (3)
f1 (x)
For p = 2, we have the χ2 random variable, which is the
That is, given a new texture that needs to be assigned to one square of a standard normal variable. Likewise, χp is a gen-
class from among C classes, we compute the wavelet coef- eralization of χ2 with respect to the generalized Gaussian
ficients from samples for the new texture, and we use these variable with width parameter ω = 1, raised to the power
sample decompositions to estimate the generalized Gaussian p, whose pdf is
parameters [3]. We now have knowledge of the new texture’s
pdf, fnew , and can select the class c∗ whose pdf has the short- 1
fχp (x) = e−x |x|−1+1/p , x≥0 (8)
est KL distance from the new texture’s pdf Γ(1/p)
c∗ = arg min DKL (fnew , fc ) (4) The χ2N , or the χ2 random variable with N degrees of free-
c
dom, is simply the sum of N independent χ2 random vari-
As noted in [3], KL distance based classification is equivalent ables. Likewise, we can define the χpN random variable to be
to maximum likelihood classification only in the asymptotic the sum of N independent χp random variables.
sense. For finite samples, however, this equivalence does not To go a step further, if the shape parameters are different
hold, except for the unrealistic case where the shape parame- for each of the random variables in the sum, we can define,
ters have the same value in (2), when p1 = p2 = . . . = pB . with −
→p = (p1 , . . . , pN ), the random variable
381
Binary hypotesis tests against class 1
1
0.95
0.85
0.8
Fig. 1. (A-C) with increasing chromatin granularity, com-
0.75
pared to a control somatic endothelial cell (D). Bar in A is 10
μm. 0.7
0.65
0.55
The log-likelihood ratio for the hypothesis test (5-6) is ex-
pressed in terms of these new random variables. With N = 0.5
0 0.02 0.04 0.06 0.08 0.1 0.12 0.14 0.16
Probability of Class 1 False Positive
SB, where S is the number of samples for the texture and B
the number of wavelet bands per sample, we have
B S Fig. 2. Probability of correctly identifying a pluripotent nu-
b=1 s=1 f1b (xsb ) cleus (class 1) vs. probability of a false positive from partially
Λ (X1 , . . . , Xs ) = ln B S
f0b (xsb ) or totally differentiated nuclei.
b=1 s=1
N
= χpi00i − χpi01i + K that chromatin became more granular and did not vary over
i=1
→
− →
− time, unlike pluripotent cells. By 5 weeks (Figure 1 C), dif-
p
= χ0 0
− χ1p 1 +K (11) ferentiated stem cells were nearly as granular as an adult hu-
man vascular endothelial cell (Figure 1 D). Pluripotent nuclei
where K is a constant dependent on the scale and shape pa- are physically very plastic and become less compliant during
rameters. Now we have the test differentiation due in part to chromatin condensation [9]. The
H1 bright fluorescent regions within the nuclei that we observed
→
− →
− reflect compact chromatin supercoiling which limits acces-
χ0p 0 − χ1p 1 ≷ T (12)
H0 sibility of DNA to soluble proteins [4]. Chromatin conden-
sation is biologically significant because transcription factors
Here, T denotes the classification threshold that absorbs K. and activators need to have access to DNA in order to ex-
The above methodology has been validated on generic tex- press genes. The granularity of chromatin therefore reflects
tures as well as images of various stem cell nuclei. In the next the segregation of the nucleus into domains of high density
section, we describe one such application. (bright areas, heterochromatin) and low density (dim areas,
euchromatin). Since heterochromatin generally contains si-
lenced genes, texture analysis provides a direct measure of
3. APPLICATION TO STEM CELL NUCLEI
the degree of gene silencing by chromatin remodeling.
Human embryonic stem cells (hESC, line UC06 from the NIH-
approved registry) were grown under standard conditions on 4. PERFORMANCE RESULTS
mouse feeder cells. Pluripotency of hESC was routinely con-
firmed by immunostaining for the pluripotency marker, Oct- Figure 1 shows nuclear images of 4 cells at first time in a
4. hESCs were induced to differentiate for up to 5 weeks by time-lapse series of 9 images over a 10 minutes duration. The
plating on feeder cells at half the normal density, which in- last two classes are very close and are expected to be indis-
duced differentiation to early neuronal lineages as determined tinguishable. Of utmost importance is the ability to identify
by the neural marker, nestin [8]. We visualized chromatin totally pluripotent nuclei, while minimizing false positives.
in living cells with a fluorescent histone that bound to DNA. Figure 2 plots the probability of correctly identifying such a
Cells were transiently transfected with a plasmid expressing nucleus, against the probability of misclassifying a differenti-
the histone H2B labeled with the fluorescent protein GFP [6]. ated nucleus from each of the other three classes as pluripo-
4-D movies were acquired with a spinning disk microscope tent. Each differentiated class is shown by a separate curve.
(Perkin Elmer) using a 40x 1.3NA Nikon objective with a It is shown, for instance, that for a probability of misclassi-
resolution of 0.2 μm. We observed that nuclei in pluripotent fication of less than 0.05, we have a correct classification of
cells were small and chromatin was generally smooth textured larger than 0.95, for any of the three alternate classes.
(Figure 1 A). During differentiation (Figure 1 B) we found Pairwise comparison of cells from various classes is shown
382
4 H0 = A; H1 = B 4 H0 = A; H1 = C Pp.3125-27.
x 10 x 10
15 6
[2] M. Desai, R. Mangoubi, J. Shah, W. Karl, D. Kennedy, H. Pien,
5
10
and A. Worth, “Functional MRI activity characterization using
4
curve evolution,” IEEE Trans. on Medical Imaging , Vol.21,
Λ(x)
Λ(x)
3 No. 11, Nov, 2002, pp. 1402-1412.
5
2
[3] M. N. Do and M. Vetterli, “Wavelet-based texture retrieval
0 1
0 5 10 0 5 10 using generalized Gaussian density and Kullback-Leibler dis-
cell sample cell sample
4 H0 = A; H1 = D H0 = C; H1 = D tance,” IEEE Transactions on Image Processing, Vol. 11, no. 2,
x 10
3000 , Feb. 2002, pp. 146–158.
6
2500 [4] S. M. Gorisch, M. Wachsmuth, et al. “Histone acetylation in-
creases chromatin accessibility”, J. Cell Science, Vol. 118, Pt.
Λ(x)
Λ(x)
4 2000
34, 2005, pp. 5835-34.
1500
2 [5] C. Jeffreys, “Support vector machine and parametric wavelet-
1000
0 5 10 0 5 10 based texture classification of stem cell images,” M.S., Mas-
cell sample cell sample sachusetts Institute of Technology, Cambridge, MA, June
2004.
Fig. 3. Pairwise hypothesis testing between classes A and B [6] T. Kanda, K. Sullivan, et al. “Histone-GFP fusion protein en-
(top left), A and C (top right), A and D (bottom left), and C ables sensitive analysis of chromosome dynamics in living
and D (bottom right). Cells for which the null hypothesis H0 mammalian cells” , Curr. Biology Vol. 8, No. 7, 1998, pp. 377-
is true are plotted in blue, while cells for which the alternative 85.
hypothesis H1 is true are plotted in red. Results indicate that [7] R. Mangoubi, C. Jeffreys, A. Copeland, M. Desai, and P. Sam-
the likelihood ratio statistics show clear separation between mak, “Non-invasive image based support vector machine clas-
pairs of classes. sification of human embryonic stem cells,” Proc. IEEE Int’l
Symp. on Biomed. Imaging, Washington, D.C., April, 2007.
[8] J. A. Ozolek, E. P. Jane, et al. “Human embryonic stem cells
in Figure 3. The marker GFP-H2B is imaged in each of the (HSF-6) show greater proliferation and apoptoses when grown
4 cells shown in Figure 1 at 9 time points (Pluripotent cell A; on glioblastoma cells than mouse embryonic fibroblasts at day
differentiated cell B; differentiated cell C; control endothe- 19 in culture: comparison of proliferation, survival, and neural
lial cell D). Because the cells from each class carry the same differentiation on two different feeder cell types.” Stem Cell
color, and cells are segregated by colors, the results show that Dev. Vol. 16, No.3, 2007, pp.403-12.
the likelihood ratio test’s statistic described in (12) enables [9] J. D. Pajerowski, K. N. Dahl, et al. “From the Cover: Physi-
clear separation between cells of distinct classes. cal plasticity of the nucleus in stem cell differentiation.” Proc.
Natl. Acad. Sci. USA , Vol 104, No.40, 2007, pp.15619-24.
6. REFERENCES
383