Professional Documents
Culture Documents
Longitudinal Ageing Study in India (LASI) : Wave 1
Longitudinal Ageing Study in India (LASI) : Wave 1
Longitudinal Ageing Study in India (LASI) : Wave 1
(LASI)
Wave 1, 2017-18
This document provides a brief user guide of LASI Wave 1 micro data with
information on sampling design, data structure, and questionnaire content, as well
as information about publicly accessible data.
For additional information about the Longitudinal Ageing Study in India (LASI) Wave 1
Please contact:
LASI Wave 1 was funded by the Ministry of Health and Family Welfare, Government of India, the
National Institute on Aging (R01 AG042778), and United Nations Population Fund, India. LASI is
a collaborative study of three partnering institutions: International Institute for Population Sciences
(IIPS), Harvard T.H. Chan School of Public Health (HSPH), and University of Southern California
(USC) and several other national and international institutions.
LASI is the world’s largest and India’s first longitudinal ageing study. It is well-positioned to evaluate
the effect of changing policies on the behavioural outcomes in India. LASI survey design, tools and
protocols are harmonised with the Health and Retirement Study (HRS) in the United States and its
sister surveys in Asia, Europe, and elsewhere.
The main objective of the LASI is to provide a comprehensive scientific evidence base on
demographics, household economic status, chronic health conditions, symptom-based health
conditions, functional health, mental health (cognition and depression), biomarkers, health insurance
and healthcare utilization, family and social networks, social welfare programmes, work and
employment, retirement, satisfaction, and life expectations.
The LASI has embraced the state of the art of large-scale survey protocols and field implementation
strategies with one or more of the following innovative attributes that existing studies lack:
representative sample of India and its states and by socio-economic spectrum, an expansive topical
focus, harmonization with HRS, a longitudinal design, coverage of comprehensive biomarkers, and
the use of Computer Assisted Personal Interviewing (CAPI) technology for data collection, quality
control and geographic information system (GIS) use. No other survey in India collects detailed data
on health and biomarkers together with data on family and social network, income, assets, and
consumption.
The LASI survey instrument comprises the household (HH) schedule, individual schedule, biomarker
surveys and community schedule. The household survey was conducted with a selected key
informant in each household. The individual and biomarker surveys were administered to each
selected respondent.
For more detailed information about LASI, its methodology and findings, please refer LASI India
Report (2020) at www.iipsindia.ac.in/lasi.
Eligible households are defined as those with at least one member 45 years of age or above. Eligible
individuals are those in these eligible households who were 45 years of age or older and their spouses,
regardless of age. In addition, LASI oversamples individuals at age 65 and older from all states and
major cities.
Wave 1 of the LASI covered a panel sample of 73,396 individuals age 45 and above and their spouses,
including 31,902 elderly (age 60 and above) and 6,880 oldest-old persons (age 75 and above) from
all states and union territories of India (Table 1).
The LASI Wave 1 field survey was conducted across India from April 2017 to December 2018*. The
HH response rate is defined as the number of HHs which participated in the survey divided by the
total number of age-eligible sampled HHs. The overall household response rate is 96.6%. Household
response rate in urban areas is lower (94.7%) than the rural areas (97.5%). Individual response rate
is defined as the number of individuals who participated in the survey divided by the total number of
eligible individuals in age-eligible sampled HHs. Individual response rate is higher in rural (89.7%)
than in urban areas (83.7%) (Table 2).
HH Rosters
Age- Age-eligible Age-eligible Individual
Completed
State/UTs Eligible HH Interviews Individuals Interviews
(age-eligible and
HHs Completed Identified Completed
non-age-eligible)
Haryana 1,821 1,285 1,251 2,391 1,898
Delhi 1,283 776 754 1,494 1,319
Rajasthan 2,200 1,336 1,302 2,493 2,244
Uttar Pradesh 4,721 2,820 2,747 4,965 4,567
Bihar 3,336 2,109 2,083 3,828 3,520
Chhattisgarh 1,943 1,259 1,189 2,272 2,055
Madhya Pradesh 2,790 1,727 1,690 3,241 2,914
Gujarat 2,159 1,579 1,455 3,039 2,341
Daman & Diu (UT) 1,236 662 577 1,271 991
Dadra & Nagar Haveli (UT) 1,370 741 631 1,373 1,090
Karnataka 2,018 1,555 1,488 2,981 2,420
Goa 1,147 958 877 1,857 1,427
Lakshadweep (UT) 894 694 627 1,328 1,139
Kerala 1883 1,542 1,411 3,000 2,497
Tamil Nadu 3,248 2,176 2,150 3,845 3,530
Puducherry (UT) 1,173 848 839 1,542 1,428
Jammu & Kashmir 1,435 963 957 1,813 1,613
Himachal Pradesh 1,081 838 805 1,683 1,388
Uttarakhand 1,256 878 863 1,566 1,358
Punjab 1769 1,296 1,234 2,519 2,124
Chandigarh (UT) 1,107 708 651 1,373 1,026
Arunachal Pradesh 1,163 704 702 1,291 1,215
Nagaland 1,207 803 799 1,367 1,316
Sikkim 1,029 649 635 1,209 1,146
Manipur 1,180 862 860 1,594 1,369
Mizoram 1,188 765 732 1,410 1,246
Tripura 1,099 748 721 1,374 1,195
Assam 2,281 1,540 1,511 2,817 2,366
West Bengal 3,201 2,296 2,279 4,428 3,933
Jharkhand 2,236 1,451 1,408 2,758 2,464
Odisha 2,306 1,670 1,645 3,102 2,917
Maharashtra 3,293 2,446 2,421 4,675 3,973
Andhra Pradesh 2,264 1,568 1,511 2,854 2,679
Andaman & Nicobar Islands (UT) 1,089 736 725 1,347 1,244
Telangana 2,272 1,487 1,418 2,703 2,475
Meghalaya 666 636 636 1,056 969
India 66,371 45,111 43,584 83,859 73,396
A more detailed discussion on sampling frame, survey weights and procedures adopted is provided
in the chapter 2 (methodology section) of India report.
The household survey collected data from a selected key informant, who could be any knowledgeable
adult aged 18 or older. It was administered per household and collected information about the
household finances and living conditions for all persons in the household. The data consists of five
modules, as listed below in Table 3.
Housing & Environment (HE) It covers information related to housing and environmental
condition of surveyed households.
Household Consumption (CO) This section covers information on household expenditure and
consumption of food and non-food items, including home-grown
goods
Household Assets and Debts (AD) Covers information related to home ownership, land ownership,
agricultural and business assets, financial and non-financial assets
and household debts.
Household Income (IN) Covers information related to agricultural income, non- agricultural
business income, individual earning, and household income from
government transfers, private transfers, other household income,
tax related questions, and overall economic condition.
Household Health Insurance (HI) Collect household level information about awareness of health
insurance, type of health insurance, type of coverage and health
insurance premiums.
The individual survey was collected for each age-eligible respondent at least 45 years of age and their
spouse regardless of age. A proxy interview is done if the selected respondent is severely cognitively
or physically impaired or incapable of understanding or responding to human interaction. Proxy
interviews do not include biomarkers, psychosocial questions, cognitive tests, or experimental
modules. The individual survey consists of eight modules listed below in Table 4. Respondents
received one of four experimental modules, which were randomly assigned.
The biomarker data was collected for each consenting age-eligible respondent and their spouse in the
individual survey. Given the lack of health care services in India, biological markers (e.g.
anthropometrics, blood pressure, and dried blood spots) and performance measures (e.g. gait speed,
grip strength, balance, and vision) allow researchers to assess the measured health of the respondents.
Health (HT) Self-reported health status, diagnosed chronic diseases, functional health, family
medical history, mental health, cognitive health, and health behaviours.
Health Care Access and Collects information regarding in- patient visit, out-patient visit, health insurance,
Utilization (HC) quality of care, out of pocket (OOP) expenditure, etc.
Family & Social Networks (FS) Information related to the relationship with children, grandchildren, parents,
sibling and friends. This module also collects information on living arrangement,
available social support, and involvement in social activities, social security
programs meant for the elderly and psychological measure is collected under family
and social networks section.
Social Welfare Schemes (SW) Covers information about the awareness, utilizations of different national and state
level schemes and programs meant for senior citizens. Amount received under the
schemes and utilization of this money is also collected. Information on problem
faced in availing the benefits and grievance system is also collected. Awareness
about different types of concessions provided by the government to older people
is also asked.
Experimental Modules Questions about time use, economic expectations, social connectedness, and
(TU,EE,ES,EV) vignettes.
Biomarkers (BM) Performance tests (blood pressure, grip strength/hand strength, timed walk,
balance tests, vision tests), and anthropometrics.
It is essential to gather information at the community level (Rural and Urban) in addition to the
household and individual level information.
3.4.1. Rural community: Village is the smallest administrative unit in India. The educational, health
and other infrastructure facilities/amenities vary considerably from village to village. The key
informants for this schedule were village community head (Sarpanch/Pradhan/Up-
Pradhan/Panchayat Chairperson) and Village Officer/Secretary (administrative person in-charge of
the village). In addition, the survey team also contacted health personnel (ANM/Medical Officer),
school teacher, Gram -Sevak and other functionaries available in the village to gather the community
related information.
3.4.2. Urban community: LASI envisions collecting some information from urban areas using the
Urban Community Survey Schedule. The ward level information from the key informant who was
residing in the area for at least 2 years or knows about the area (elected representative/
officer/engineer/secretary/medical officer/health worker) specifically about the basic facilities, safety
and security of the particular ward was contacted. The CEB level information was collected from a
knowledgeable person of the community who were staying in that CEB for more than 20 years. For
the section which includes information regarding health, the key informant was any doctor or health
worker practicing in that area for at least 2 years.
Individual Module
Social Welfare Schemes
Biomarker, Experimental
As per ethics protocols, consent forms were administered to each HH and age-eligible individual. In
accordance with Human Subjects Protection, four consent forms were used in the LASI: Household
Informed Consent, Individual Informed Consent, Consent for Blood Sample Collection for Storage
and Future Use (DBS), and Proxy Consent.
LASI variables are indexed by module and serially by question number. The variable prefix is an
abbreviated version of the module name. Above tables 3 & 4 lists the variable prefix for each
household and individual module.
The LASI sample weights account for selection probabilities and adjusted for non-response and post-
stratification to accurately represent the population characteristics. The LASI weights are available
for household data and individual data.
There are two types of household weight: one is national-level (indiahhweight) that should be used
to produce estimates at national level; and one is state-level (statehhweight) that should be used to
provide state level estimates.
Similarly, there are two types of individual weights are available: one is national-level
(indiaindividualweight) that should be used to produce nationally representative population
estimates; and one is state-level (stateindividualweight) that should be used to produce state
representative estimates.
Respondents who refused or could not answer the questions are recoded with special missing codes:
“.d” for “Don’t know”, “.r” for “Refused”, and “.m” for “Missing”. The missing code (.E/.e/.E_) is
also used in the data for other reasons, including respondents were not asked specific questions, either
because of survey skip patterns or because respondents were not eligible to answer them.
Like other HRS studies, the LASI survey includes “unfolding bracket” questions to help respondents
to identify specific monetary evaluations of their assets, earnings, and consumption patterns.
Unfolding brackets ask respondents to indicate whether the amount in question is above or below a
certain threshold, randomly chosen from a set of five thresholds. These thresholds were derived from
National Sample Survey (NSS) to reflect the distribution of various consumption patterns in rural
and urban settings across the country. Respondents typically proceed through three brackets until an
upper and lower bound can be identified by deduction.
Typically, for these unfolding bracket questions, there are seven corresponding auxiliary variables in
the data. Consider the question ad502 from the Household Assets and Debts module in the household
LASI interview. It asks the current value of all farming/agriculture assets/equipment that the
respondent’s household owns.
Data quality assessment indicated that economic data, particularly income data, was the most difficult
part of LASI questionnaire for field investigators. There was a tendency among some
households/individuals to underreport their actual income. In some cases, investigator typo error of
adding more zeros inflated the income i.e. Rs. 5,000 became Rs. 50,000. Both extreme negative
income and upper values make major difference to the state’s median income. In these cases, data
cleaning and imputation were required. Household income data imputation was done in case the
following two types of missing information.
Case 1
Households reporting zero income: If any households reported zero income from various sources
namely, agricultural and allied activities, non-agricultural business or self-employed activities,
wages/salary, pension, and government transfers, in that case, the cases were matched between the
individual data file and the household data file. Income information was also collected at individual
level (Please note that the work section of individual data file gives information about the income for
each and every individual) and case-wise income data were merged accordingly for the households
if income was reported in the individual survey. The total income of all individuals of a particular
household was summed up and this value was replaced in the household section variable showing
zero income.
Secondly, in some of the households, zero income was reported for specific sources, essentially
implying “missing values”. In this case, the median value of that particular SSU was imputed and
missing values were replaced with source-wise income for the households.
Third, there were outliers in the reported income (either very high income or very low income). In
some of the households, income was reported either very high or very low (outliers) from single or
multiple sources of income. With the help of box plots, data was visualised which depicted the
presence of outliers. Further, frequency checks were conducted to detect and segregate outlier cases
(households).
For households with low or high outliers, work status of individuals, education, land holding & land
size and business information available for the household were checked. Data was thoroughly
scrutinised and income imputation was done based on the available information such as type of
source of income, level of education, type of occupation, place of residence, family size, land use
and availability of assets.
Case 2
Misreporting of income information: For example, screening questions which is the key for income
from different sources at the household level, inconsistencies were checked with respective sources.
Screening variables such as in001, in002, in003, in004, in005 which provides information about type
of activity involved were checked and corrected in case of misreporting. If there was any income
reported (different sources) for the later set of questions of the subsequent section and there were no
missing values, then the screening question response was corrected as “1” (yes). In case of no income,
then the screening variable outcome was corrected as ”2” (no).
L A S I W A V E 1 – D A T A U S E R G U I D E | 10
Comparison with other sources: In consideration of all these cases, the cleaning and imputation of
income data was done and also the validation of the income data with other available government
data (Central Statistics Office under the Ministry of Statistics and Programme Implementation) was
compared. After cleaning and imputation of missing values, it was found that the rank correlation
was more than 0.9 between LASI HH income and CSO (2017-18) income data across states of India.
Table 7: Variables used for computing the source-wise income variables by using screening questions.
Variables used for computation Computed variables
in103a in103b in103c in103d in103e in107. Agriculture_income : Total income from agriculture
in107a_thres in107b_thres in107c_thres
In the LASI survey, data on consumption expenditure are collected using the abridged version of the
consumption schedule of the National Sample Survey (NSS). Sets of 11 and 29 questions on the
expenditures on food and non-food items, respectively, was used to canvas the sample households.
Food expenditure was collected based on a reference period of seven days, and non-food expenditure
was collected based on reference periods of 30 days and 365 days. Food and non-food expenditures
have been standardised to the 30-day reference period. The monthly per capita consumption
expenditure (MPCE quintile) is computed and used as the summary measure of consumption.
Monthly per capita consumption expenditure = Total consumption expenditure/Household size.
Please note that for calculation of monthly per capita consumption expenditure (MPCE) population
weight (weight*household size) was used.
11 | L A S I W A V E 1 – D A T A U S E R G U I D E
6. Merging of Data
The all datasets (household roster, household, individual, biomarker and community) can be merged
using a prim_key or hhid or ssuid variables given in the datasets as required.
Note: The count mismatch on merging the individual & housing datasets is possible because there
are some households where LASI household (housing sections) information has been collected but
LASI eligible individual has not been interviewed from that household (due to refusal/un-availability
etc.) and vice versa. Therefore, on merging individual & housing datasets, only matched cases will
show data from both the files.
Housing data available but no Individual The housing sections data from 1,100 households
data are available but no LASI eligible individual has
been interviewed from these households, so no
individual sections data available for these
households.
Individual data available but no Housing The individual sections data for 1,126
data individuals are available but their corresponding
875 distinct households housing sections data
not available.
The LASI released the data through the IIPS website (www.iipsindia.ac.in/lasi ). A data request form
is available on IIPS website (https://iipsindia.ac.in/content/data-request ), which every user need to
submit along with a copy of valid photo identity card. The request will be processed by ICT Centre
of IIPS.
L A S I W A V E 1 – D A T A U S E R G U I D E | 12
References
International Institute for Population Sciences (IIPS), NPHCE, MoHFW, Harvard T. H. Chan School
of Public Health (HSPH) and University of Southern California (USC) (2020). Longitudinal
Ageing Study in India (LASI)Wave 1, 2017-18, India Report, International Institute for
Population Sciences, Mumbai.
International Institute for Population Sciences (IIPS) and NPHCE, MoHFW (2020). Longitudinal Ageing
Study in India (LASI) Wave 1, 2017-18, Factsheets – States/UTs, International Institute for
Population Sciences, Mumbai.
13 | L A S I W A V E 1 – D A T A U S E R G U I D E