Professional Documents
Culture Documents
Supplementary Online Appendix For Trade, Farmers Heterogeneity, and Agricultural Productivity
Supplementary Online Appendix For Trade, Farmers Heterogeneity, and Agricultural Productivity
A Data
This section describes the construction of the dataset used in our analysis. Specifically,
we describe: (1) the output and revenue data; (2) the measurement of farm capital and
farm labor, which we use in the construction of our measures of farm productivity; (3) the
geographic variables for each municipio; and (4) the construction of the instruments that we
use to evaluate the impact of distance to urban centers.
Output and Revenue. The agricultural census of Colombia, conducted by DANE, col-
lected information on the production volume for all crops planted in each farm plot during
2013. We compute the physical output per crop for each farm as the sum across all farm
plots, which incorporates the total production in all harvest seasons of 2013. We gather
information on crop prices from different sources to compute the total farm revenue. Our
final price data includes 258 crops and covers 85% of the total planted area in the country.
We only lack price data for timber trees and aromatic herbs that are usually consumed at
the farm. Our main source of price data is the System of Price Monitoring (SIPSA). SIPSA
collects weekly information of prices for food items in 61 wholesale markets. For each crop
and price, we first apply a discount factor used by government officials to convert wholesale
prices to producers prices. We then compute annual prices as the average of weekly prices
across all wholesale markets. We also bring in data from the producers association of cof-
fee, African palm, sugarcane, rice, cotton, and cocoa. In each case, producers associations
provide average producer prices across the country. We use conversion factors provided by
DANE to obtain units of measurement consistent between the census and the price data.
Panel A in Appendix Figure A.2 presents the distribution of our measure of farm revenue.
We exclude outliers with extremely high or low values representing less than 1 percent of
the data. In our final data, the median farm revenue is 2564 USD and the average is 12945
USD.
Farm capital. The census contains information on the type and number of machines
available in the farm for agriculture and livestock production. Since our primary interest is on
agricultural productivity, we focus on machines typically used for agriculture. This accounts
for 22 different machines. To compute a measure of farm capital, we combine the information
on machinery from the census with data that we gathered on machinery prices. We collect
price information online from 17 national suppliers of machinery and from advertisements
published in one specialized newspaper.42 We complement this information with data on
machinery values from FAOSTAT. Since the census does not provide information on specific
product references, we take as a proxy for each machine’s value the minimum price among
42
Source: https://www.agronegocios.co/
1
those prices found for machines of the same type.43 We were able to recover price information
for 76% of the machines that we selected. This allows us to construct a capital valuation
measure for 98% of the farms that report some type of machinery. Nevertheless, only 20%
of farms report having any machinery. For the rest of the farms, we make an imputation of
capital value by fitting a linear regression of the log of farm capital value on log farm size,
log workers, a dummy that indicates if the farm has more than 80% of the land in a single
crop, and dummy variables for departamento, for the main crop in the farm, and for several
types of livestock. To ensure that our results or not driven by such imputation, we test
the robustness of our main results to several different approaches, as reported in Appendix
Tables A.5 and A.10. Appendix Figure A.3 presents the distribution of our observed values
of capital and of the imputed values.
Farm labor. The census collects information on the number of permanent farmworkers and
the number of additional man-days hired on the farm the month before the interview. About
77% of farms have between 1 and 2 permanent farmworkers and 63% of farms did not hire
any additional labor the month before the interview. We use this information to compute
the total number of man-days of work employed at the farm per year. First, we convert
the number of permanent farmworkers on the farm to annual man-days by multiplying this
number by six days a week, four weeks a month, and 12 months a year. Second, we multiply
the number of additional man-days hired the previous month by 12 to measure annual
seasonal man-days.44
In addition, for our calibration of the model, we use data from the cost module of the
Municipality Agricultural Evaluation (EVA) of 2018. This data contains detailed information
on farms’ input use by production stage (land preparation, planting, maintenance, harvest,
and post-harvest) for the 33 crops selected according to their importance in the country. The
data was collected with a survey instrument applied to a sample of about 1,000 farmers and
local extensionists. The sample was designed to provide cost estimates that are representative
at the crop level. We used the microdata from this survey, which includes information on
where farmers usually sell their produce, and categorical variables that classify farms as
small, medium, or large depending on their land use. For the calibration of the model,
we construct a measure of total man-days hired by each farm. This measure includes all
seasonal workers hired during the production cycle and the labor hired to perform clerical
work, which is not available in the agricultural census.
43
We use minimum prices for three reasons. First, it is possible that the information that we were able to
gather online has an upward bias because providers of the cheapest and less sophisticated machines might
be less likely to be advertise their products online. Second, given the low levels of technology adoption that
have been documented in Colombian farms, the machines with the minimum values in our sample might be
the best proxies for the types of machines in most farms in the census. Third, using minimum prices allows
us to take into account capital depreciation. We obtain a similar distribution of capital values when we use
median prices instead.
44
For farmers interviewed off-peak, we underestimate the use of labor. To the extent that peak seasons
vary within crops across regions and within regions and across crops, and the timing of the interviews is
not correlated with crop cycles, we expect this measurement error to be uncorrelated with our variables of
interest.
2
Geographic characteristics. The geographic variables that we use are the altitude of
the municipality, a measure of the ruggedness of the terrain, total rainfall in the year of the
survey, proximity to rivers, and 6 variables capturing the share of the area in a municipality
under 6 different climatological zones. To compute these variables, we perform the geometric
intersection between the municipality boundaries and the raster and shapefiles containing
information on each of these variables. For elevation and terrain roughness, we use ASTER
GDEM rasters and compute the weighted average of all pixel values on each variable within
a municipality. For climatological zones, we use estimations from the Caldas-Lang model
for climatological zoning. This model classifies geographic areas according to temperature,
annual precipitation and elevation, ranging from warm and dry to extremely cold and humid.
We use the shapefiles with this information provided by the National Institute of Hydrology
and Meteorology (IDEAM). In addition, we use maps on the location of rivers to compute the
distance from the centroid of each municipality to the closest river and measures of annual
rainfall provided by IDEAM to compute a measures of water availability in the municipality
in the year of the survey. Finally, we use data from FAO-GAEZ to measure agricultural
suitability across regions in Colombia. Specifically, we measure the agricultural suitability
in coffee, sugarcane, cocoa, banana and oil palm for rain-fed and low-input technology use.
3
where ȳi represents the total output of a farm i in value based on the average price of each
crop at the country-level, li the sown area, ni the annual P man-days of work, and mi the
measure of machinery use. Specifically, we calculate ȳi = k qik p̄k , where qik is the quantity
produced by farm i of crop k and p̄k is the average price of crop k in the country. This
production function is consistent with the span-of-control function that we use in our model.
To compute Âi , we need values for the parameters of the production function, which we
pick from the literature. We follow estimates from Avila and Evenson (2010), which are
estimated based on data from Latin American countries. For Colombia, they estimate a
factor share of 0.55 for labor, 0.23 for land, 0.08 for fertilizers and chemicals, 0.06 for seeds,
0.07 for machinery and 0.04 for animals. For our baseline empirical results, we set labor
share to 0.55, capital to 0.10 (based on the summation of machinery and animals), and the
rest we attribute to land. Since we do not have data on fertilizer and chemical use, we
incorporate the application of these inputs into the factor share of land intensity.
As discussed in Appendix Section A, we use imputed data on machinery use for part of
the farms and we aggregated both permanent and temporary labor in our final measure of
labor employment. To test the robustness of our results to these choices, Appendix Table
A.10 report estimates of our main results using several different measurement approaches.
Panel A shows that our results for the effect of market access on agricultural productivity are
robust to an estimation that restricts the sample to farms that report machinery. Similarly,
Panel B shows that our results are robust to an estimation of farm productivity assuming
a production function without machinery. In both panels, we include as a control the share
of farmers in the municipality that report machinery ownership. Panel D, shows that the
share of farmers reporting machinery ownership falls with distance to urban centers. Panel
C presents results with an alternative measure of farm productivity that only takes into
account the number of permanent farmworkers to compute the use labor in the production
function. In sum, our main conclusions hold across these different measurement approaches.
4
The production of bananas and African oil palm takes place on large landholdings typ-
ically farther from urban centers. Due to its ecological conditions of humidity and tem-
perature, banana cultivation is concentrated in the Northern region, while the cultivation
of African oil palm is widely spread across several regions. Exports of cocoa have recently
gained importance in Colombia. Like coffee, cocoa is grown on small farms. There are two
large processing companies that buy the raw product from agricultural intermediaries in
the local market. Producers’ cooperatives have emerged recently seeking to reduce inter-
mediation costs and facilitate contracts with international buyers. Similar to African palm,
cooperatives play a smaller role in the distribution of cocoa relative to coffee.
Tubers, including potatoes, yams and yucca, and plantains are the most important food
crops in the country. Production of these crops usually takes place on small farms. The soil
and climate of the mountains are suitable for growing potatoes,while plantains, yam and
yucca are suited for lower elevation and warmer temperatures.
5
and the unit price of each.) Data on price by variety is available for these two types of
tomatos in key consumer markets between 2016 and 2019 for eight municipalities. We find
that the average price premium for tomatoes larga-vida relative to concho was 18%.45
Similarly, rice is graded according to the percentage of broken grains. First grade rice
has up to 10% of broken grains, while second grade rice has between 10% and 20% of broken
grains. The share of whole grains obtained after milling, and therefore, the quality of rice
depends on farming and milling practices. We have data on rice prices by quality type for
the period between 2016 and 2019 in seven municipalities. In this case, we find that the
average price premium equals 9%.
In the calibration of our model, we find that regions with larger urban centers have,
in average, higher price premium for exportable varieties of agricultural goods, since these
regions tend to have a larger share of farmers selecting into non-local markets. Appendix
Figure A.5 presents the relationship between the total population in an urban center and
the price premia. We find a positive relationship between these. We have also checked
the relationship between the price premium and distance to urban centers. We find that
the correlation between the log of the price premium and the log of distance to major
departamento capitals of Bogota (Cali, Bogota and Medellin) to be -0.63, which is consistent
with farmers having lower incentives to produce these types of varieties.
(τij yi )κ
µij = P κ.
i (τij yi )
We parametrize based on τij = (distij )−τ1 . We pick τ1 = 0.05, which is in line with values
obtained in Bryan and Morten (2019) and Pellegrina and Sotelo (2021). Appendix Table
A.16 reports results for this case. In both tests, our main conclusions are similar if we use
this version of the model.
45
These data is provided by the price monitoring unit of the Colombian Statistical Unit SIPSA.
6
E Details of the Model
This section provides details about the model. Section E.1 to E.4 derive the key expressions
presented in the main body of the paper. Section E.5 proves the parameter conditions
that are necessary and sufficient for the existence of each type of local equilibria. Section
E.6 proves the propositions in the main body of the paper. Lastly, Section E.7 present
the algorithm that we apply to solve for the spatial equilibrium in the model. To save on
notation, in what follows, we often drop the index for locality, unless otherwise indicated.
l = λqγα/r (13)
n = λqγ (1 − α) /w (14)
Isolate the langrangian λ in the expression above and substitute it into equations (13) and
(14) to get
(1 − α) γ−1 1
n= [As] γ (r)α (w)−α (q) γ (15)
α0
α γ−1 1
l= [As] γ (r)α−1 (w)1−α (q) γ (16)
α0
7
which we will use later. Using equations (15) and (16), we can write the cost function as
C = wn + rl
1 γ−1 1
= [As] γ (r)α (w)1−α (q) γ .
α0
The profit maximization problem of a farmer with ability s is thus
1 γ−1
α 1−α 1
max pq − [As] γ (r) (w) (q) γ
q α0
1
! 1−γ
p0
π̃ = κπ A γ
(r)α (w)1−α
sπ̃ ≥ w.
8
An invididual who is indifferent between becoming worker or a farmer must thus satisfy
w
s= . (22)
π̃
A farmer chooses the exportable variety over the conventional one if
1
sδ 1−γ π̃ − f w ≥ sπ̃.
A farmer who is indifferent between the two varieties must thus satisfy
fw
s̄ = , (23)
ρπ̃
1
where ρ ≡ δ 1−γ − 1 is a profit premium for producing exportable varieties.
where we defined
Using similar steps, integrate over the optimal land use in equation (19) to get
1
! 1−γ
1 γ p0
L = N Sαγ 1−γ (α0 ) 1−γ A . (25)
(r)1−γ(1−α) (w)γ(1−α)
rL = γαY. (26)
9
Following analogous steps that we took to derive equation (26), we get
wNV = γ (1 − α) Y, (27)
where NV is the employment of variable labor in a region. In what follows, we write variable
labor as
NV = N V,
where we defined
V ≡ Gs (s) − f (1 − Gs (s̄)) .
Lastly, the total profit of farmers is
Z
Π= π ∗ (s) dGs (s) .
s
Π = N S π̃
1
! 1−γ
γ
h γ 1
i p
= N S (α0 ) 1−γ (γ) 1−γ − (γ) 1−γ A γ .
(r)α (w)1−α
π̃N S = (1 − γ) Y. (28)
Dividing the expression above by N gives the value added per capita
γα
1−γ 1−γ γ(1−α) L
y = p0 (A) (S) (V ) .
N
10
and (iii) an autarky equilibrium in which all farmers produce conventional varieties. This
section proves the conditions on the parameters for each equilibria to hold.
Before we turn to the proof of each type of local equilibria, we notice that, for any value
of ρ and f > 0, there is a positive mass of individuals who choose to become a farmer.
This result comes directly from an inspection of equation (21). Since s is unbounded from
above, for any value of w, we can always pick a value of s that is large enough so that
π̃s > w is satisfied. Therefore, any local equilibria must have some individuals choosing to
become a farmer. In addition, we also highlight that we assume that individuals who are
indifferent between becoming a farmer who produces exportable varieties and conventional
ones will choose to produce exportable varieties, and individuals who are indifferent between
becoming farmers or farmworkers will choose to become farmers.
Autarky Equilibrium. Lastly, we prove that an autarky equilibrium in which all farmers
produce conventional varieties holds if and only if 0 ≥ ρ. We know that for ρ ≥ f we
have a fully integrated equilibrium and for f > ρ > 0 we have a partially integrated one.
Therefore, given that in any local equilibrium some individuals become farmers since s is
11
unbouded from above, then we know that for 0 > ρ all farmers must choose to produce the
conventional variety.
f
s̄ = s. (29)
ρ
Since there is no variable labor, full employment implies that the total mass of individuals
becoming farmworkers must equal the total demand for farmworkers from farmers producing
exportable variety of
Gs (s) = f (1 − Gs (s̄)) . (30)
Insert equation (29) into equation (30) and differentiate the expression above, treating s as
a function of ρ
ds f f ds
gs (s) = −f gs (s̄) − 2 s +
dρ ρ ρ dρ
12
where gs () is the pdf of the distribution of ability. Rearrange the expression above to get
2
f
sgs (s̄)
ds ρ
= > 0. (31)
dρ (f )2
gs (s) + gs (s̄)
ρ
Therefore, an increase in ρ leads to an increase in the cutoff s. Because the average measured
farm productivity is given by Ai E (s|s > s), we know that the average farm productivity rises
with increases in ρ.
Step 2: Impact of ρ on s̄. To show the impact of ρ on s̄, we will take advantage of the
result derived in step 1. First, differentiate equation (30), making s̄ a function of s
ds̄
gs (s) = −f gs (s̄) . (32)
ds
Rearrange to get
ds̄ 1 g (s)
=− < 0. (33)
ds f g (s̄)
Combining equation (33) above with equation (31), it gives the result that an increase in
ρ generates a drop in s̄. Therefore, the share of farmers selecting into non-local markets
increases.
Step 3: Impact of ρ on average farm size. If we define farm size as the ratio of land
relative to the number of farms, our average farm size will be given by
L
FS = .
N (1 − G (s))
From the previous steps, we know that s increases when ρ increases. Therefore, if N is fixed,
then farm size should necessarily increase with ρ, since there is a reduction in the total mass
of farmers. If we consider individuals location choices, then both N and s change with ρ.
Intuitively, N should rise with a ρ, since the region becomes more attractive. Recall that
the mass of farmworkers in region i is given by
(yi )θ
Ni = N.
Ξθ
As such, the impact of ρ on F S depends on yi , which is a function of Si and Vi (defined in
Section 4.4). The impact of ρ on farm size will depend on how much individuals will move
to a region in reaction to an increase in δ, which is directly related to θ. When θ is larger,
more inviduals will move to region when ρ, which tends to reduce farm size.
13
E.6.3 Impact of ρ on S
To study the impact of ρ on S, it is useful to work with the following expression for S
Z ∞ Z ∞
S= sg(s)ds + ρ sg (s) ds.
s(ρ) s̄(s(ρ))
ds̄
Using equations (32) and (29), we get −sg (s) − ρg (s̄) = 0, which we can use to write
ds
Z ∞
dS
= sg (s) ds.
dρ s̄
R∞
Since s̄
sg (s) ds > 0, we know that an increase in ρ always increases S.
γ (1 − α) Si
si = , (34)
(1 − γ) Vi
which can be derived by combining equations (22), (27) and (28). Second,
f
s̄i = s, (35)
ρi i
which comes from equations (22) and (23). In addition, we use the previous definitions for
Si and Vi .
For the fully integrated equilibrium, we need a single equation to solve for the equilibrium,
14
given by
γ (1 − α) 1+f Si
s̄i = ,
(1 − γ) 1 + ρi Vi
where we used s̄i = (1 + f ) w/ (1 + ρi ) π̃, which must hold for the fully integrated equilib-
rium. Notice that here Si = (1 + ρi ) (1 − Gs (s̄i )) E (s|s > s̄i ) and Vi = Gs (s̄i )−f (1 − Gs (s̄i )).
Lastly, for autarky equilibrium, we use
γ (1 − α) Si
si = ,
(1 − γ) Vi
where we used the condition that si = w/π̃, which must hold for the autarky equilibrium.
Notice that in this final case we have Si = (1 − Gs (si )) E (s|s > si ) and Vi = Gs (si ).
Step 2: Spatial Equilibrium. Second, we compute the spatial equilibrium across regions.
Given Ai , Si , Vi , N , pi0 , and pi , we solve for the value of Ni , using the following expressions
γα
1−γ 1−γ γ(1−α) Li
yi = pi0 (Ai ) (Si ) (Vi ) , (36)
Ni
(yi /pi )θ
Ni = P θ
N, (37)
i∈I (yi /pi )
X
N= Ni . (38)
i∈I
Equations (36) and (37) gives the relative employment of labor across regions and equation
38 pins down the level.
Individuals observe this productivity term after observing their location preferences and
managements abilities, and after making their location and occupation choices.
The technology to produce a good is given by
γ
qik (s) = Aik (zk (s) s)1−γ (likv (s))α (nikv (s))1−α .
15
Notice that, relative to the main formulation, we rearranged the productivity term Aik in
the production function, this facilitates the algebra in our derivations. The optimal land use
is given by
1
! 1−γ
∗ A ik p ikv
likv (s) = κL szk (s) , (39)
(ri )1−γ(1−α) (wi )γ(1−α)
the use of variable labor is
1
1−γ
Aik pikv
n∗ikv (s) = κN szk (s) , (40)
(ri ) (wi )1−γα
γα
From equation (39), we can see that farmers with higher management ability s will also
have, on average, larger farms. This will occur even within a crop choice (that is, on k being
∗
the choice that maximizes πikv (s)).
In this extension of the model, farmers make discrete choices about which crop to produce
in their farm. We can integrate these discrete choices over continuum of farmers using our
distributional assumption that zk (s) is Fréchet. Similar to derivations in Sotelo (2016), we
get the following share of farmers producing crop k
ξ
(Aik pik ) 1−γ
λik = ,
Φθi
hP ξ
i 1ξ
where Φi ≡ κ k (A p
ik ik0 ) 1−γ . Here, Φi is a term that captures the role of comparative
advantage forces in land use. For example, an increase in pik0 of a single crop changes Φi .
The extent to which this price affects overall productivity depends on ξ, which controls the
heterogeneity in crop productivity across farmers. When ξ is low, so that there is high crop
heterogeneity, changes in pik0 generate smaller changes in the share of farmers λik dedicated
to a given crop. Notice that, because s and δ operate as a profit shifter that is proportional
to all crops, they do not affect crop-decisions.
Lastly, one can show that the average value added per worker becomes
γα
1−γ 1−γ γ(1−α) Li
yi = (Φi ) × (Si ) × (Vi ) × .
Ni
| {z } | {z } | {z } | {z }
Comparative Advantage Ability Variable Labor Congestion
In the equation above, different from equation (10), the natural advantage become an en-
dogenous term that captures the role of prices and selection of farmers into different crops.
16
E.9 First-Order Approximation to Counterfactuals
This section derives the first-order approximation of a change in value added (that is, pay-
ments to farmworkers, managers and land) in response to a change in the price premium for
the production of exportable agricultural goods (δi ).
The problem of individuals in a region i is equivalent to that of a planner in that region
that chooses land use, labor use, and occupational choices to maximize total revenues, subject
to the endowment of land and skill distribution. Dropping the region index for simplicity,
we get:
XZ γ
VA= max pv (As)1−γ (lv (s))α (nv (s))1−α ds
lv (s),nv (s),Σv ,Σw s∈Σv
v
subject to
XZ
N lv (s) dG (s) = L
v s∈Σv
XZ Z
nv (s) dG (s) = sdG(s)
v s∈Σv s∈Σiw
XZ Z
sdG(s) + sdG(s) = 1.
v s∈Σv s∈Σw
Recall that p1 = p0 δ and denote by lv∗ (s) , n∗v (s) , Σ∗v and Σ∗w the optimal choices of the
central planner. Now consider a change in δ, holding these optimal choices fixed. The
envelope theorem implies
Z
γ
dV A = p0 dδ (As)1−γ (lv∗ (s))α (n∗v (s))1−α ds
s∈Σ∗v
Z
dδ γ
= pv (As)1−γ (lv∗ (s))α (n∗v (s))1−α ds.
δ s∈Σ∗v
17
F Additional Tables and Figures
Figure A.1: Average Farm Productivity and Average Distance to Urban Centers across
Municipalities of Colombia
−.2 −.4
Log(avg farm productivity)
−.8 −.6
−1
−1.2
−2 −1 0 1
Log(avg distance to urban center)
Notes: This figure presents a binscatter plot illustrating the conditional relationship between average farm productivity and
average distance to urban centers across municipalities of Colombia. The y-axis represents the log of average of measured
productivity across farms in each municipality and the x-axis represents the log of distance between the municipality and the
nearest urban center. Urban centers are defined as departamento capitals. Measured farm productivity assumes a Cobb-Douglas
function with land, labor, and machinery as inputs (see Appendix Section B for details). The X and Y variables are residualized
and the mean of each variable is added back to its residuals before plotting. Control variables include: total rainfall, average
terrain slope, ruggedness, altitude, and the average distance from each municipality to the closest river. The line represents a
linear fit.
Figure A.2: Distribution of Farm Output in Value and Measured Farm Productivity
(a) Farm Output in Value (b) Measured Farm Productivity
.25
.3
.2
.2
.15
Density
Density
.1
.1
.05
0
10 15 20 25 5 10 15 20 25
Log of farm output in value Log of measured farm productivity
Notes: Panel A shows the distribution of log of the farm output in value computed with the information on volume of
production and crop prices. Panel B shows the disrtribution of our baseline measured farm productivity, which incorporates
information on land use, labor use, and machinery. See text for details on the construction of these variables.
18
Figure A.3: Distribution of Farm Capital Before and After Imputations
(a) Sample with information (b) Whole sample after imputation
1
1
.8
.8
.6
Density
Density
.6
.4
.4
.2
.2
0
0
5 10 15 5 10 15
Log of the value of farm capital Log of value of farm capital
Notes: Panel A shows the distribution of the log of the farm capital computed with the information on farm machinery and
machinery prices for the farms that report machines. Panel B shows the distribution of this variable after the imputation for
capital value in farms that do not report any machinery. See text for details on the construction of these variables.
Figure A.4: Current Departamento Capitals and Historical Origins of Urban Centers
(a) Colonial Gold Mines (b) Indigenous Settlements
Notes: Panel A shows the location of colonial gold mines (triangles) and departamento capitals (dots) by year of establishment
(lighter colors represent earlier years). Panel B shows with stars the location of indigenous settlements between 1510 and
1561. Data on mines was compiled from historical records presented in Urrutia and Ortiz (2015), Colmenares (1972). Data
on indigenous settlements available in CEDE-Data Center from Universidad de los Andes and compiled from historical records
presented in Melo (1995). We compute the distance from the centroid of every municipality to the nearest indigenous settlements
and the colonial goldmines. We then used these as instrumental variables to evaluate the impact of distance of a municipality
to the nearest urban center nowadays.
19
Figure A.5: Price Premium and Population for Selected Crops
1.6
Tomato
Rice
1.4
Price Premium
1.2 1
11 12 13 14 15 16
Log of Population
Notes: This figure shows the relationship between the price premium for exportable (high-quality) types of tomatoes and rice.
Specifically, for tomatoes, data on prices is available for two types: “tomate larga vida” and “tomate chonto”, the former is
export-oriented. For rice, data is available for a high and low quality categories. On the y-axis, we plot the ratio of the average
ratio of the price of the exportable relative to the conventional variety between 2016 and 2019 in a given market. On the x-axis,
we plot the log of the population in that given market. The red line presents the slope of a regression of the price premia
against the log of the population, which equals 0.046.
Notes: This figure shows the two types of tomatoes with measures of price premia. Tomatoes “larga-vida” are considered of
higher quality and more often exported, whereas tomatoes “chonto” are of lower quality and tends to be consumed domestically.
Prices measured in June 2021.
20
Figure A.7: Distribution of the Impact of Endogenous Reallocation of Farmers
(a) Within Region Integration (b) Between Region Integration
Notes: This figure shows the additional impact of changes in the price premia when we allow farmers to re-optimize their
occupational choices, which changes the pool of farmers in the market. Panel A presents the additional impact in the first set
of counterfactuals, in which we change the price premia within the catchment area of different urban centers. Panel B shows
the additional impact in the second set of counterfactuals, in which we equalize the price premia between the catchement area
of different urban centers.
Notes: This figure shows the distribution of changes in Si and Vi when we change the exportable price premia in our first set
of counterfactuals.
21
Table A.1: Gap in Value Added between Municipalities
Quintile Ratio value added per worker
Q1 1.0
Q2 2.1
Q3 3.1
Q4 4.8
Q5 12.1
Notes: This table presents the gap on the municipality’s average value added per worker across quintiles of the value added
distribution. The gaps are computed relative to average value added in municipalities in Q1.
Table A.2: Initial Inspection of the Relationship between Farm Productivity and Distance
to Departamento Capital
(1) (2) (3) (4) (5) (6) (7)
Log(Avg dist) -0.269* -0.261* -0.230 -0.232 -0.228 -0.253* -0.269*
(0.143) (0.147) (0.140) (0.144) (0.144) (0.148) 0.156
R2 0.148 0.155 0.187 0.168 0.172 0.157 0.214
Obs 32 32 32 32 32 32 32
Rainfall Y Y
Slope Y Y
Altitude Y Y
Ruggedness Y Y
Dist river Y Y
Notes: This table provides an initial inspection of the relationship between distance to urban centers and measured farm
productivity in Colombia. The dependent variable is the average of the measured farm productivity across municipalities in
Colombia—weighted by the production area— and the explanatory variable is the average distance to the departamento capital.
Each column controls for a different geographic control, which is the average of that geographic variable across municipalities
within departamentos, with the exception of rainfall, which we take the sum. Robust standard errors in parenthesis. ***
denotes significance at 1% level, ** significance at 5% and * at 10%.
22
Table A.4: Relationships between Farm Size, Measured Farm Productivity, and Selection
into Non-local Markets within Veredas
(1) (2) (3)
Panel A: Log farm productivity
Log(Farm Size) 0.334*** 0.357*** 0.339***
(0.011) (0.007) (0.008)
R2 0.181 0.434 0.664
Obs 854946 854946 854946
Table A.5: Relationships between Alternative Measures of Farm productivity and Farm size
within Municipalities and Crop-Choices
(1) (2) (3)
Panel A: Log farm productivity (only farms with machinery)
Log(Farm Size) 0.249*** 0.312*** 0.283***
(0.002) (0.002) (0.002)
R2 0.096 0.317 0.548
Obs 163929 163929 163929
23
Table A.6: The Impact of Access to Urban Centers - Flexible IV Specification
OLS OLS IV IV IV IV
(1) (2) (3) (4) (5) (6)
Panel A: Dep var log of average agricultural productivity
log(DistUC) -0.192*** -0.140*** -0.374*** -0.432*** -0.370*** -0.400***
(0.048) (0.048) (0.138) (0.116) (0.124) (0.145)
R2 or F 0.416 0.474 18.044 9.684 14.266 8.502
Table A.7: The Impact of Access to Urban Centers - Specification with Distance to Nearest
Urban Center in Levels
OLS OLS IV IV IV IV
(1) (2) (3) (4) (5) (6)
Panel A: Dep var is log of average measured farm productivity
log(DistUC) -0.106*** -0.081** -0.207*** -0.194*** -0.198*** -0.207***
(0.029) (0.032) (0.063) (0.058) (0.058) (0.059)
R2 or F 0.417 0.475 31.023 59.948 27.704 43.626
24
Table A.8: The Impact of Access to Urban Centers - Unweighted Estimations
OLS OLS IV IV IV IV
(1) (2) (3) (4) (5) (6)
Panel A: Dep var is log of average measured farm productivity
log(DistUC) -0.109*** -0.092*** -0.153 -0.155 -0.153 -0.182
(0.029) (0.029) (0.129) (0.148) (0.129) (0.133)
R2 or F 0.253 0.345 132.443 82.296 66.081 67.824
25
Table A.10: The Impact of Access to Urban Centers - Alternative Measures of Productivity
OLS OLS IV IV IV IV
(1) (2) (3) (4) (5) (6)
Panel A: log of avg ag productivity (only include farms with machinery)
log(DistUC) -0.094 -0.052 -0.428** -0.122 -0.281 -0.306**
(0.058) (0.058) (0.213) (0.230) (0.218) (0.148)
R2 or F 0.310 0.388 30.205 39.354 16.756 23.633
Table A.11: The Impact of Access to Urban Centers - Alternative Measures of Participation
in Non-local Markets
OLS OLS IV IV IV IV
(1) (2) (3) (4) (5) (6)
Panel A: Dep var is the share of farms selling to non-local markets (broader measure)
log(DistUC) -0.145*** -0.093*** -0.344*** -0.382*** -0.362*** -0.375***
(0.022) (0.016) (0.071) (0.078) (0.072) (0.074)
R2 or F 0.549 0.678 29.594 36.338 16.381 18.971
Panel B: Dep var is the share of farms selling to non-local markets (more restrictive measure)
log(DistUC) -0.066*** -0.033*** -0.172*** -0.176*** -0.174*** -0.183***
(0.017) (0.012) (0.045) (0.045) (0.044) (0.046)
R2 or F 0.470 0.630 29.594 36.338 16.381 18.971
Panel C: Dep var is the share farms producing for subsistence or barter trade
log(DistUC) 0.053*** 0.061*** 0.122** 0.115* 0.119** 0.135**
(0.019) (0.018) (0.055) (0.060) (0.055) (0.063)
R2 or F 0.406 0.494 29.594 36.338 16.381 18.971
IV Mines Ind Both Both
Obs 1082 1082 1082 1082 1082 1082
Nearest UC FE Y Y Y Y Y Y
Geo+Suitability Y Y Y Y Y
Population 1760-1808 Y
Non-agricultural labor Y
Notes: This table shows the results from Table 1 with alternative measures of non-local markets. Robust standard errors in
parenthesis. *** denotes significance at 1% level, ** significance at 5% and * at 10%. The dependent variable in Panel A adds
to our main definition of non-local markets farmers who sell their produce to the industry and in local food markets. Panel B
excludes from our main definition farmers who sell to food wholesalers and farm-gate sales, which excludes farmers who use
part of their produce for household consumption or barter trade. The dependent variable in Panel C is the share of farms in
the municipality that produce for household consumption or barter trade. See table 1 for a description of the control variables.
. OLS columns report R2 and IV columns report F from Kleibergen-Paap weak instrument test.
26
Table A.12: The Impact of Access to Urban Centers - Robustness to Alternative Mechanisms
IV IV IV IV IV
(1) (2) (3) (4) (5)
Panel A: Dep var log of average agricultural productivity
log(DistUC) -0.582*** -0.522*** -0.582*** -0.582*** -0.416***
(0.176) (0.176) (0.176) (0.176) (0.125)
R2 or F 22.534 20.218 22.534 22.534 33.252
Table A.13: The Impact of Access to Urban Centers on the Share of Individuals who are
Farmworkers relative to Farm Managers and on Soil Improvements
OLS OLS IV IV IV IV
(1) (2) (3) (4) (5) (6)
Panel A: Dep var is the share of farmworkers relative to farmers
log(DistUC) -0.021*** -0.014* -0.068*** -0.065** -0.066*** -0.067**
(0.007) (0.008) (0.025) (0.025) (0.024) (0.027)
R2 or F 0.264 0.369 31.721 48.348 19.590 22.534
27
Table A.14: The Impact of Access to Urban Centers on the Price and Profit Premia of
Exportable Varieties of Agricultural Goods
Price Premium (δi )
OLS IV
(1) (2)
log(DistUC) -0.008*** -0.042***
(0.002) (0.013)
R2 or F 0.297 67.337
IV Both
Obs 1082 1082
Nearest UC FE Y Y
Geo+Suitability Y Y
Notes:This table shows the impact of access to urban centers on the log of the model-implied price premium. Specifications in
this table replicates columns (2) and (5) from Table 1. Robust standard errors in parenthesis. *** denotes significance at 1%
level, ** significance at 5% and * at 10%.
28
Table A.15: Aggregate Impacts of Changes in the Price Premia of Exportable Varieties of
Agricultural Goods - Pareto distribution
Types of Market Integration
Within Regions Between Regions Between and Within
Endogenous Endogenous Endogenous
Reallocations Reallocations Reallocations
FOA Partial Full FOA Partial Full FOA Partial Full
(1) (2) (3) (4) (5) (6) (7) (8) (9)
Panel A: Aggregate Effects
Aggregate VA 4.29 4.73 4.77 1.56 1.68 1.71 5.93 6.69 6.76
Non-local sales - 35.16 35.16 - 22.89 22.89 - 41.03 41.03
Avg farm size - 23.14 23.95 - 8.02 8.37 - 39.94 39.77
Table A.16: Aggregate Impacts of Changes in the Price Premia of Exportable Varieties of
Agricultural Goods - Bilateral Migration Costs
Types of Market Integration
Within Regions Between Regions Between and Within
Endogenous Endogenous Endogenous
Reallocations Reallocations Reallocations
FOA Partial Full FOA Partial Full FOA Partial Full
(1) (2) (3) (4) (5) (6) (7) (8) (9)
Panel A: Aggregate Effects
Aggregate VA 9.13 9.76 9.95 3.21 3.39 3.51 12.65 13.75 13.99
Non-local sales - 34.48 34.48 - 22.21 22.21 - 40.38 40.38
Avg farm size - 51.21 57.10 - 15.08 19.96 - 88.11 91.40
29