Philippines Census Data Processing2
2007 Census of Population
Data Processing
Valentino C. Abuan
Director, Information Resources Dept.
National Statistics Office (Philippines)
Coding Verification
- 20% sample for Forms 1, 2 and 4
- 100% verification for Forms 5 and 7
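The verification rates above can be sketched as a simple selection rule. This is a minimal illustration only: the slide does not say whether the 20% sample is drawn over questionnaires, batches, or coded entries, so the sampling unit, the function name `select_for_verification`, and the use of simple random sampling are all assumptions.

```python
import random

# Verification rates as stated on the slide, keyed by CP form number.
VERIFICATION_RATE = {1: 0.20, 2: 0.20, 4: 0.20, 5: 1.00, 7: 1.00}


def select_for_verification(batch, form_type, seed=None):
    """Return the items of `batch` that go to coding verification.

    `batch` is any sequence of item identifiers (assumed unit: one coded
    questionnaire). Forms 5 and 7 are verified in full; Forms 1, 2 and 4
    are sampled at 20% by simple random sampling (an assumption).
    """
    items = list(batch)
    rate = VERIFICATION_RATE[form_type]
    if rate >= 1.0 or not items:
        return items
    # Assumption: always verify at least one item from a non-empty batch.
    k = max(1, round(len(items) * rate))
    return random.Random(seed).sample(items, k)
```

Passing a fixed `seed` makes the sample reproducible, which helps if a verification batch has to be re-drawn or audited later.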
(more)
UN Regional Workshop on Data Processing, Bangkok, 15-19 Sep 2008 5
Provincial Processing Tasks
(Continuation)
Image QA
- Form recognition
- Visual check for dark, blurred, blanks, folded, etc.
(more)
Regional Processing Tasks
(Continuation)
1st Pass Capture from Images
- Interpretation/recognition of mark fields
- ICR-based interpretation/recognition of geographic & household ID fields with 100% key verification
- Used Eyes and Hands for Forms (EHF) software
Consistency check (certification pass) of CP Forms 5 and 7
Completeness Check (CP Forms 2 & 4)
Transmittal of Files in DVD to Central DP Center
Central Data Processing Tasks
Edit/Imputation
Tabulations
Creation of Microdata Public-Use-Files
ICR Use: 2000 CPH and 2007 POPCEN
Scanners
- 2000 CPH: Kodak 3900D (22 units)
- 2007 POPCEN: Kodak i610 (22 units)
Software
- 2000 CPH: Eyes and Hands for Forms (EHF)
- 2007 POPCEN: EHF (same version used in 2000 CPH)
Scanning Sites
- 2000 CPH: 4 Data Capture Centers (3 regional + 1 central)
- 2007 POPCEN: Regional DP Centers (17 sites)
Data Capture Strategy (from images)
- 2000 CPH: Capture on Single Pass (all fields captured thru ICR in the same pass; done in all the 4 Data Capture Centers)
- 2007 POPCEN: Capture on Two Passes (1st pass on mark fields and ID fields done in Regional DP Centers; 2nd pass on write-in fields via key-from-image done centrally)
ICR Application Developer
- 2000 CPH: System Supplier
- 2007 POPCEN: In-House Staff
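The two-pass strategy used in 2007 amounts to routing each form field to a capture pass by field type. The sketch below illustrates that routing only. The `Field` class, the `kind` labels, and the example field names are hypothetical and are not taken from the EHF software or the actual form definitions.

```python
from dataclasses import dataclass

# Hypothetical field-type labels; the slides name mark fields, ID fields
# and write-in fields but do not define a coding scheme for them.
FIRST_PASS_KINDS = {"mark", "id"}   # captured at the Regional DP Centers
SECOND_PASS_KINDS = {"write_in"}    # keyed from image at the central site


@dataclass
class Field:
    name: str
    kind: str  # one of "mark", "id", "write_in"


def plan_capture(fields):
    """Split form fields into the two capture passes described above."""
    plan = {"regional_first_pass": [], "central_second_pass": []}
    for field in fields:
        if field.kind in FIRST_PASS_KINDS:
            plan["regional_first_pass"].append(field.name)
        elif field.kind in SECOND_PASS_KINDS:
            plan["central_second_pass"].append(field.name)
        else:
            raise ValueError(f"unknown field kind: {field.kind!r}")
    return plan
```

For example, a form with a marked `sex` field, a `household_id` field and a written `occupation` field would place the first two in the regional pass and the last in the central key-from-image pass.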
(more)
ICR Use: 2000 CPH and 2007 POPCEN
(Continuation)
Problems encountered, 2000 CPH
• Emphasis on clear and legible entries led to some amount of rewriting into new questionnaires
• Unanticipated questionnaire shortage led to reprinting of additional questionnaires
• Newly reprinted questionnaires did not exactly match the defined form templates for ICR
Problems encountered, 2007 POPCEN
• Prescribed/provided pencil lead not used by some
• Image QA not strictly implemented by some regional DP centers
• File corruption in image files during DVD writing
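File corruption during DVD writing is the kind of problem a checksum manifest can detect before files reach the central site. The sketch below shows one common safeguard and is not something the slides say NSO used: the sender records a SHA-256 digest per image file, and the receiver recomputes the digests after reading the disc. The function names and folder layout are assumptions.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(folder):
    """Map each file's path (relative to `folder`) to its digest."""
    folder = Path(folder)
    return {
        p.relative_to(folder).as_posix(): sha256_of(p)
        for p in sorted(folder.rglob("*"))
        if p.is_file()
    }


def find_corrupted(folder, manifest):
    """Return relative paths that are missing or whose digest differs."""
    folder = Path(folder)
    bad = []
    for rel_path, expected in manifest.items():
        target = folder / rel_path
        if not target.is_file() or sha256_of(target) != expected:
            bad.append(rel_path)
    return bad
```

In this arrangement the manifest would be built from the source folder before burning and checked against the files read back from the disc, so a damaged image is caught at the regional site rather than during central processing.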
Thank You!