Rwanda - Integrated Household Living Conditions Survey 2005-2006
Reference ID | RWA-NISR-EICV-2005-2006-V1.1 |
Year | 2005 - 2006 |
Country | Rwanda |
Producer(s) | NISR (National Institute of Statistics, Rwanda) - Government of Rwanda |
Sponsor(s) | The Government of Rwanda - GOR - Government funding Department for Intenational Development - DIFD - Bilateral funding assistance |
Metadata | Documentation in PDF |
Created on
Jul 20, 2013
Last modified
Jun 28, 2016
Page views
305917
Data Processing
Data Editing
Data Editing (see external resource entilted: Final Data Processing Report)
Questionnaires were reviewd by the controller in the field before they were dispatched for data entry. A control sheet was provided to the contollers to assist in the process of manually editing the questionnaires. Questionnaire structures were verified when the questionnaires were checked in prior to data entry. Three contracted persons reviewed the questionnaire and filled in a form that served as a primary data control sheet. Automated data editing was largely done during the data entry phase (see "Other Data Processing" for details). Some batch edit programs were used to identify inconsistent data.
Data Imputation
Data iimputation was largely done during the analysis phase by analysts. However, a "structural" imputation on the microdata was required for the own consumption data. This was done to adjust for erroneous pricing when the unit for measuring own consumption was buckets. For more information, please refer to the SPSS su=yntax files orthe data processing report.
Primary Data Issues
Coding of products was based on sequential codes for each section. Sequential coding was used to correlate the indexed position of the item for locating the record in the data processing system with the actual row number or sequence. For the poverty study, a recode was done to expenditures to the EICV-1 codes. The recodes are available in the syntax files. However a general recode to standardize commodites to a standard (such as COICOP) was not done.
Other Processing
Data Entry (see external resource entilted: Final Data Processing Report)
New systems and techniques were used to capture and edit the data for the EICV-2. Many improvements were implemented to the data entry system for the EICV-2. The EICV-1 used the DOS based software called IMPS for both data entry and data editing (CENTRY and CONCOR modules respectively). In addition, EICV-1 used various short term and intermittent consultant inputs for the design and implementation of the data processing system. The first five months of the data entry process during the EICV-1 suffered greatly from a lack of quality control. This lack of cohesive support during both the design phase and initiation of the data processing system likely impacted the quality of the data despite attempts made to correct the system during mid-survey.
For the EICV-2, long-term and continuous technical support was provided by the OPM consulting firm and better trained and more committed local supervisors followed through in implementing and maintaining the system. In addition and more importantly, the EICV-2 data processing activities followed quickly behind the processing of the DHS (Demographic and Health Survey). It was clearly advantageous to simply adapt the DHS data processing system for the EICV-2. The DHS data processing system is a broadly used and dynamic system designed for use with the data processing software CSPro (Census and Survey Processing System). In fact, CSPro is designed with the DHS as its model survey. Furthermore, this system of managing the data processing activities is also being used by UNICEF to process the MICS. Applying a robust system and modifying it for use during the EICV-2 saved a great deal of time and effort in training and development. The staff was already familiar with the DHS data processing and editing system and porting the system to the EICV-2 over the long term and through the extent of the survey proved very useful. Some of the specifications that are used by the DHS, MICS and the EICV are:
a. An integrated sample design control sheet used to check in questionnaires.
b. A data entry system designed as "system control". A system controlled application is a very tight control system where the path of data entry cannot be circumvented by the data entry clerk. The path is fully programmed and must include: skips and pre-defined keys for: missing, other or incoherent data.
c. Full double-entry for independent verification.
d. A systematic control of data files from: primary-verified-raw-edit-final data files.
e. Full reconstruction of the consolidated data file with the primary cluster file.
f. All corrections done on the lowest ASCII cluster level.
The data entry was done centrally in the NISR headquarters. Activity was initiated in the old Census building in Remera on October 20. On December 16, 2006, the NISR consolidated its offices and moved the Census activities to its current location in the old MINIPLAN building. The move required the establishment of the new data entry operations in the new building and the transfer of all machinery to the building. This operation did not adversely affect the keying operations. The remainder of the survey was keyed in the MINPLAN building.
All computers were set up in a LAN with data being copied and written to the supervisor machines and backed up daily.
The questionnaires were received and checked into a central repository. Data was entered by the cluster (9 urban questionnaires or 12 rural questionnaires). Two archivists managed the check-in and distribution of questionnaires to the data entry supervisors. A sample of the check-in forms is provided in Annex 1.
Once the questionnaires were received and logged on a control sheet, the control sheet was entered in an automated control system by the data entry supervisors prior to being assigned to the data entry clerk. This system maintained by the supervisors assured that the sample design was strictly adhered to and that the coding and tracking of the questionnaires was properly initiated and followed. This system was built on the DHS control system and used CSPro to manage the flow and assignment of the questionnaires.
There was a 100% full independent double data entry of the questionnaires. This assured virtual certainty that inconsistencies found in the data were mostly due to errors and misreported items from the field.
Average data processing time to process all three questionnaires related to a cluster was 21.3 days