Professional Documents
Culture Documents
Algorithmic Design of Self Assembling ST
Algorithmic Design of Self Assembling ST
Algorithmic Design of Self Assembling ST
Henry Cohn
Microsoft Research New England, One Memorial Drive, Cambridge, MA 02142∗
Abhinav Kumar
Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139†
We study inverse statistical mechanics: how can one design a potential function so as to produce
arXiv:0906.3550v1 [cond-mat.stat-mech] 18 Jun 2009
a specified ground state? In this paper, we show that unexpectedly simple potential functions suffice
for certain symmetrical configurations, and we apply techniques from coding and information theory
to provide mathematical proof that the ground state has been achieved. These potential functions
are required to be decreasing and convex, which rules out the use of potential wells. Furthermore,
we give an algorithm for constructing a potential function with a desired ground state.
I. INTRODUCTION 100
From the static perspective, we wish to understand 2. The 20 vertices of a regular dodecahedron, with 12
what the ground state is (i.e., which configuration mini- pentagonal facets.
mizes energy) and what other local minima exist. From
the dynamic perspective, we wish to understand the 3. The 16 vertices of a hypercube in four dimensions,
movement of particles and the basins of attraction of the with 8 cubic facets (see Fig. 3).
local minima for energy. There are several fundamental
questions: 4. The 600 vertices of a regular 120-cell in four dimen-
sions, with 120 dodecahedral facets (see Fig. 4).
1. Given a configuration C, can one choose an isotropic
pair potential V under which C is the unique ground The latter two configurations exist in four dimensions,
state for |C| points? but as discussed above the sphere containing them is a
three-dimensional space, so they are intrinsically three-
2. How simple can V be? Can it be decreasing? Con- dimensional.
vex? These configurations are important test cases, because
3. How large can the basin of attraction be made? they are elegant and symmetrical yet at the same time
not at all easy to build. The problem is that their facets
In this paper, we give a complete answer to the first are too large, which makes them highly unstable. Un-
question, giving necessary and sufficient conditions for der ordinary potential functions, such as inverse power
such a potential to exist. The second question is more laws, these configurations are never even local minima,
subtle, but we rigorously answer it for several important let alone global minima. In the case of the cube, one can
cases. In particular, we show that one can often use re- typically improve it by rotating two opposite facets so
markably simple potential functions. The third question they are no longer aligned. That lowers the energy, and
is the most subtle of all, and there is little hope of provid- indeed the global minimum appears to be the antiprism
ing rigorous proofs; instead, experimental evidence must arrived at via a 45◦ rotation (and subsequent adjustment
suffice. of the edge lengths). It might appear that this process
The second question is particularly relevant for exper- always works, and that the cube can never minimize a
imental work, for example with colloids, because only a convex, decreasing potential function. However, careful
limited range of potentials can be manipulated in the lab. calculation shows that this argument is mistaken, and
Inverse statistical mechanics with simple potential func- we will exhibit an explicit convex, decreasing potential
tions was therefore raised as a challenge for future work function for which the cube is provably the unique global
in ref. [1]. minimum.
4
are all d. In other words, as ε tends to 0 the distance distances, we approximate them by imposing these con-
distributions of all ground states must approach d. Be- ditions on a large but finite subset of the interval.)
cause the number of points at each given distance is an Given a finite set of competitors C1 , . . . , Cℓ to C, we
integer, it follows that when ε is sufficiently small, for can choose the coefficients by solving a linear program.
each r ∈ supp(d), there are exactly d(r) distances in Specifically, we add an additional variable ∆ and impose
each ground state that are within ε of r. Because Vε the constraints
has a strict local minimum at each point in supp(d), it
follows that it is minimized at d (and only at d) when
ε is sufficiently small. The conclusion of the theorem EV (Ci ) ≥ EV (C) + ∆
then follows from our assumption that C is the unique
configuration in S n−1 with distance distribution d.
It would be interesting to have a version of Theorem 1 for 1 ≤ i ≤ ℓ, in addition to any additional constraints (as
for infinite collections of particles in Euclidean space, but in the previous paragraph). We then choose λ1 , . . . , λk
there are technical obstacles. Having infinitely many dis- and ∆ so as to maximize ∆ subject to these constraints.
tances between particles makes the analysis more com- Because this maximization problem is a linear program,
plicated, and one particular difficulty is what happens if its solution is easily found.
the set of distances has an accumulation point or is even If the coefficients can be chosen so that C is the global
dense (for example in the case of a disordered packing). minimum, then ∆ will be positive and this procedure
In such a case there seems to be no simple way to use will produce a potential function for which C has en-
potential wells, but in fact a continuous function with a ergy less than each of C1 , . . . , Cℓ . The difficulty is how
fractal structure can have a dense set of strict local min- to choose these competitors. In some cases, it is easy to
ima, and perhaps it could in theory serve as a potential guess the best choices: for example, the natural competi-
function. tors to a cube are the square antiprisms. In others, it
is far from easy. Which configurations compete with the
regular 120-cell in S 3 ?
III. SIMULATION-GUIDED OPTIMIZATION
Our simulation-guided algorithm iteratively builds a
In this section we describe an algorithm for optimiz- list of competitors and an improved potential function.
ing the potential function to create a specified ground We start with any choice of coefficients, say λ1 = 1 and
state. Our algorithm is similar to, and inspired by, the λ2 = · · · = λk = 0, and the empty list of competitors.
zero-temperature optimization procedure introduced in We then choose |C| random points on S n−1 and minimize
ref. [3]; the key difference is that their algorithm is based energy by gradient descent to produce a competitor to C,
on a fixed list of competing configurations and uses simu- which we add to the list (if it is different from C) and
lated annealing, whereas ours dynamically updates that use to update the choice of coefficients. This alternation
list and uses linear programming. (We also omit certain between gradient descent and linear programming con-
conditions on the phonon spectrum that ensure mechani- tinues until either we are satisfied that C is the global
cal stability. In our algorithm, they appear to be implied minimum of the potential function, or we find a list of
automatically once the list of competitors is sufficiently competitors for which linear programming shows that ∆
large.) must be negative (in which case no choice of coefficients
Suppose the allowed potential functions are the linear makes C the ground state).
combinations of a finite set V1 , . . . , Vk of specified func- This procedure is only a heuristic algorithm. When
tions. In practice, this may model a situation in which ∆ is negative, it proves that C cannot be the ground
only certain potential functions are physically realizable, state (using linear combinations of V1 , . . . , Vk satisfying
with relative strengths that can be adjusted within a the desired constraints), but otherwise nothing is proved.
specified range, while in theory we may choose the basic As the number of iterations grows large, the algorithm
potential functions so that their linear combinations can is almost certain to make C the ground state if that is
approximate any reasonable function arbitrarily closely possible, because eventually all possible competitors will
as k becomes large. be located. However, we have no bounds on the rate at
Given a configuration C ⊂ S n−1 , we wish to choose a which this occurs.
linear combination
We hope that C will not only be the ground state, but
V = λ1 V1 + · · · + λk Vk will also have a large basin of attraction under gradi-
ent descent. Maximizing the energy difference ∆ seems
so that C is the global minimum for EV . We may also to be a reasonable approach, but other criteria may do
wish to impose other conditions on V , such as monotonic- even better. In practice, simulation-guided optimization
ity or convexity. We assume that all additional conditions does not always produce a large basin of attraction, even
are given by finitely many linear inequalities in the coef- when one is theoretically possible. Sometimes it helps to
ficients λ1 , . . . , λk . (For conditions such as monotonicity remove the first handful of competitors from the list once
or convexity, which apply over the entire interval (0, 2] of the algorithm has progressed far enough.
6
√
IV. RIGOROUS ANALYSIS t ∈ [−1, 1). Note that 2 − 2t is the Euclidean dis-
tance between two unit vectors with inner product t, be-
The numerical method described in the previous sec- cause |x − y|2 = |x|2 + |y|2 − 2hx, yi = 2 − 2hx, yi when
tion appears to work well, but it is not supported by |x| = |y| = 1. We view h as a function of the inner prod-
rigorous proofs. In this section we provide such proofs in uct, and the previous inequality simply says that it is a
several important cases. The key observation is that the lower bound for V .
conditions for proving a sharp bound in Proposition 3 be- We say the configuration C is compatible with√h if two
low are themselves linear and can be added as constraints conditions hold. The first is that h(t) = V 2 − 2t
in the simulation-guided optimization. While this does whenever t is the inner product between two distinct
not always lead to a solution, when it does the solution points in C. The
P second is that whenever αi > 0 with
is provably optimal (and in fact no simulations are then i > 0, we have x,y∈C Pi (hx, yi) = 0. This equation holds
if and only if for every z ∈ S n−1 , x∈C Pi (hx, zi) = 0.
P
needed). The prototypical example is the following the-
orem: (The subtle direction follows from Theorem 9.6.3 in
ref. [14].)
Theorem 2. Let the potential function V : (0, 2] → R be
defined by Proposition 3 (Yudin [12]). Given the hypotheses listed
above for h, every N -point configuration in S n−1 has V -
1 1.13 0.523 potential energy at least (N 2 α0 − N h(1))/2. If C is com-
V (r) = − 6 + 9 .
r3 r r patible with h, then it is a global minimum for energy
among all |C|-point configurations in S n−1 , and every
Then the cube is the unique global energy minimum such global minimum must be compatible with h.
among 8-point configurations on S 2 . The function V is
decreasing and strictly convex. Proof: Let C ⊂ S n−1 be any finite configuration with
N points (not necessarily compatible with h). Then
Theorem 2 is stated in terms of a specific potential
function, but of course many others could be found using 1 X p
our algorithm. Furthermore, as discussed in the con- EV (C) = V 2 − 2hx, yi
2
clusions below, the proof techniques are robust and any x,y∈C, x6=y
potential function sufficiently close to this one works. 1 X
≥ h hx, yi
The potential function used in Theorem 2 is modeled 2
x,y∈C, x6=y
after the Lennard-Jones potential. The simplest general-
ization (namely a linear combination of two inverse power N h(1) 1 X
=− + h hx, yi
laws) cannot work here, but three inverse power laws suf- 2 2
x,y∈C
fice. The potential function in Theorem 2 is in fact de-
d
creasing and convex on the entire right half-line, although N h(1) 1 X X
=− + αi Pi hx, yi
only the values on (0, 2] are relevant to the problem at 2 2 i=0
x,y∈C
hand and the potential function could be extended in an
arbitrary manner beyond that interval. N h(1) α0 X
≥− + P0 hx, yi
To prove Theorem 2, we will apply linear programming 2 2
x,y∈C
bounds, in particular, Yudin’s version for potential energy 2
[12]. Let Pi denote the ith degree Gegenbauer polynomial N α0 − N h(1)
= .
for S n−1 [i.e., with parameter (n − 3)/2, which we sup- 2
press in our notation for simplicity], normalized to have The first inequality holds because h is a lower bound
Pi (1) = 1. These are a family of orthogonal polynomials for V , and the second holds because all the Pi -sums are
that arise naturally in the study of harmonic analysis on nonnegative (as are the coefficients αi ). The lower bound
S n−1 . The fundamental property they have is that for for energy is attained by C if and only if both inequalities
every finite configuration C ⊂ S n−1 , are tight, which holds if and only if C is compatible with
X h, as desired.
Pi (hx, yi) ≥ 0. Proof of Theorem 2: It is straightforward to check that
x,y∈C
V is decreasing and strictly convex. To prove that the
cube is the unique local minimum, we will use linear pro-
(Here, hx, yi denotes the inner product, or dot product,
gramming bounds.
between x and y.) See section 2.2 of ref. [13] for further
Let h be the unique polynomial of the form
background.
The linear programming bound makes use of an aux- h(t) = α0 + α1 P1 (t) + α2 P2 (t) + α3 P3 (t) + α5 P5 (t)
iliary function h to produce a lower bound on poten-
tial energy. The function h will be a polynomial h(t) = (note
Pd √ that P4 is missing) such that h(t) agrees with
i=0 αi Pi (t) with coefficients α0 , . . . , α√
d ≥ 0. It will V 2 − t to order 2 at t = ±1/3 and to order 1 at
also be required to satisfy h(t) ≤ V 2 − 2t for all t = −1. These values of t are the inner products between
7
distinct points in the cube. One can easily compute the where t = 1 − r2 /2. Then the hypercube is the unique
coefficients of h by solving linear equations and verify global energy minimum among 16-point configurations on
that they are all positive. Furthermore,
√ it is straightfor- S 3 . The function V is decreasing and strictly convex.
ward to check that h(t) ≤ V 2 − 2t for all t ∈ [−1, 1),
with equality only for t ∈ {−1, −1/3, 1/3}. For the 120-cell, let q(t) be the monic polynomial
The cube is compatible with h, and to complete the whose roots are the inner product between distinct points
proof all that remains is to show that it is the only 8- in the 120-cell; in other words,
point configuration that is compatible with h. Every such
configuration C can have only −1, −1/3, and 1/3 as inner q(t) = t(t + 1)(4t + 1)(4t − 1)(4t + 3)(4t − 3)
products between distinct points. For each y ∈ C and · (2t + 1)(2t − 1)(16t2 − 5)
1 ≤ i ≤ 3,
X · (4t2 + 2t − 1)(4t2 − 2t − 1)
Pi (hx, yi) = 0. · (16t2 + 4t − 1)(16t2 − 4t − 1)
x∈C
· (16t2 + 4t − 11)(16t2 − 4t − 11)
If there are Nt points in C that have inner product t with
y, then · (16t2 + 12t + 1)(16t2 − 12t + 1)
· (16t2 + 20t + 5)(16t2 − 20t + 5)/250 .
Pi (−1)N−1 + Pi (−1/3)N−1/3 + Pi (1/3)N1/3 + Pi (1) = 0
for 1 ≤ i ≤ 3. These linear equations have the unique Let m1 , . . . , m29 be the integers 2, 4, 6, 8, 10, 14, 16,
solution N−1 = 1, N±1/3 = 3. 18, 22, 26, 28, 34, 38, 46, 1, 3, 5, 7, 9, 11, 13, 15, 17, 19,
In other words, not only is the complete distance distri- 21, 23, 25, 27, 29 (in order), and let c1 , . . . , c17 be 1, 2/3,
bution of C determined, but the distances from each point 4/9, 1/4, 1/9, 1/20, 1/20, 1/15, 1/15, 9/200, 3/190, 0,
to the others are independent of which point is chosen. 7/900, 1/40, 1/35, 3/190, and 1/285.
The remainder of the proof is straightforward. For each
point in C, consider its three nearest neighbors. They Theorem 6. Let the potential function V : (0, 2] → R be
must have inner product −1/3 with each other: no two defined by
can be antipodal to each other, and if any two were closer
17 29 2
together than in a cube, then some other pair would be X X Pmi (t) 5 q(t)
V (r) = ci Pi (t) + + 10 ,
farther (which is impossible). Thus, the local configura- 106 1−t
i=1 i=1
tion of neighbors is completely determined, and in this
case, that determines the entire structure. where t = 1−r2 /2. Then the regular 120-cell is the unique
Theorem 4. Let the potential function V : (0, 2] → R be global energy minimum among 600-point configurations
defined by on S 3 . The function V is decreasing and strictly convex.
(t + 1)2 (t − 1/3)2 (t + 1/3)2 (t2 − 5/9)2 The proofs of Theorems 5 and 6 use the same tech-
V (r) = (1+t)5 + ,
6(1 − t)2 niques as before. The most elaborate case is the 120-cell,
where t = 1 − r2 /2. Then the regular dodecahedron is the specifically the proof of uniqueness. The calculation of
unique global energy minimum among 20-point configu- the coefficients Nt , as in the proof of Theorem 2, proceeds
rations on S 2 . The function V is decreasing and strictly as before, except that the P12 sum does not vanish (note
convex. that the coefficient c12 of P12 in V is zero). Nevertheless,
there are enough simultaneous equations to calculate the
The proof is analogous to that of Theorem 2, except numbers Nt . Straightforward case analysis suffices to
that we choose h(t) = (1 + t)5 . The proof of unique- show then that the four neighbors of each point form
ness works similarly. Note that the potential function a regular tetrahedron, and the entire structure is deter-
used in Theorem 4 is physically unnatural. It does not mined by that.
seem worth carefully optimizing the form of this poten-
tial function when it is already several steps away from
real-world application. Instead, Theorems 4 through 6 V. CONCLUSIONS AND OPEN PROBLEMS
should be viewed as plausibility arguments, which prove
that there exists a convex, decreasing potential while al-
In this article, we have shown that symmetrical config-
lowing its form to be highly complicated.
urations can often be built by using surprisingly simple
Theorem 5. Let the potential function V : (0, 2] → R be potential functions, and we have given a new algorithm
defined by to search for such potential functions. However, many
−13 + 73t + 5t2 + 7t3 + t5 open problems remain.
V (r) = One natural problem is to extend the linear program-
120
ming bound analysis to Euclidean space. There is no
7(t + 1)(t + 1/2)2 t2 (t − 1/2)2 conceptual barrier to this (section 9 of ref. [13] develops
+ ,
120(1 − t) the necessary theory), but there are technical difficulties
8
[1] Torquato S (2009) Inverse optimization techniques Dispersions (Cambridge Univ Press, Cambridge).
for targeted self-assembly. Soft Matter 5:1157–1173, [8] Sadoc J-F, Mosseri R (1999) Geometrical Frustration
arXiv:0811.0040. (Cambridge Univ Press, Cambridge).
[2] Rechtsman M, Stillinger FH, Torquato S (2005) Op- [9] Dinsmore AD, et al. (2002) Colloidosomes: selectively
timized interactions for targeted self-assembly: appli- permeable capsules composed of colloidal particles. Sci-
cation to honeycomb lattice. Phys Rev Lett 95:228301, ence 298:1006–1009.
arXiv:cond-mat/0508495. [10] Bowick M, Giomi L (2009) Two-dimensional mat-
[3] Rechtsman M, Stillinger FH, Torquato S (2006) Designed ter: order, curvature and defects. Adv Phys, arXiv:
isotropic potentials via inverse methods for self-assembly. 0812.3064v1 [cond-mat.soft].
Phys Rev E 73:011406, arXiv:cond-mat/0603415. [11] Donev A, Torquato S, Stillinger FH, Connelly R (2004)
[4] Rechtsman M, Stillinger FH, Torquato S (2006) Jamming in hard sphere and disk packings. J Appl Phys
Self-assembly of the simple cubic lattice via 95:989–999.
an isotropic potential. Phys Rev E 74:021404, [12] Yudin VA (1993) Minimum potential energy of a point
arXiv:cond-mat/0606674. system of charges. Discrete Math Appl 3:75–81.
[5] Rechtsman M, Stillinger FH, Torquato S (2007) Syn- [13] Cohn H, Kumar A (2007) Universally optimal distribu-
thetic diamond and Wurtzite structures self-assemble tion of points on spheres. J Am Math Soc 20:99–148,
with isotropic pair interactions. Phys Rev E 75:031403, arXiv:math.MG/0607446.
arXiv:0709.3807. [14] Andrews G, Askey R, Roy R (1999) Special Functions
[6] Likos CN (2001) Effective interactions in soft condensed (Cambridge Univ Press, Cambridge).
matter physics. Phys Rep 348:267–439. [15] Terras A (1985) Harmonic Analysis on Symmetric Spaces
[7] Russel WB, Saville DA, Showalter WR (1989) Colloidal and Applications (Springer, New York), Vol. I.