Back to Search Start Over

Algorithms for automatic data editing.

Authors :
Riera-Ledesma, Jorge
Salazar-Gonzalez, Juan-José
Source :
Statistical Journal of the UN Economic Commission for Europe. 2003, Vol. 20 Issue 3/4, p255-264. 10p. 3 Charts.
Publication Year :
2003

Abstract

This paper concerns the optimization problem arising when a statistical agency must modify microdata to guarantee that the records satisfy a set of rules (called edits). Indeed, before using a collection of data records to infer statistical properties of some groups of responders, the agencies must check and possibly correct the consistence of the collected data. To this end, the edits must be tested on each record and whenever a record does not satisfy all the edits, then the agency must determine the fields in the record to be modified, as well as imputing the new values. Among all the possible solutions, the statistical agency is interested in finding those requiring a minimum number of modified fields, thus leading to a combinatorial optimization problem known as Editing-and-Imputation Problem. This paper presents an Integer Linear Programming model for the specific case in which all edits are linear constraints, and proposes a branch-and-cut algorithm for the exact solution. An extensive computational analysis shows the good performance of our proposal on randomly-generated and artificial instances. The algorithm is used in a new software (TEIDE) to be applied by the Statistical Office of Canary Islands (ISTAC). [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01678000
Volume :
20
Issue :
3/4
Database :
Academic Search Index
Journal :
Statistical Journal of the UN Economic Commission for Europe
Publication Type :
Academic Journal
Accession number :
14169698