TY - JOUR
T1 - Statistical Disclosure in Two-Dimensional Tables: General Tables
AU - Dellaert, Nico P.
AU - Duarte de Carvalho, Filipa
AU - de Sanches Osorio, Margarida
PY - 1994/12/1
Y1 - 1994/12/1
N2 - Confidentiality protection of data published in tables is a major problem for statistical offices. To obtain full cooperation of the respondents, it is required that information of individual respondents with a confidential character is kept from being disclosed. One method to avoid disclosure is the method of cell suppression, in which the values of a number of statistical cells are not published but are suppressed from publication. We discuss the method of cell suppression in general two-dimensional tables, in which row totals and column totals are always published. The values of the sensitive cells are replaced by a cross (X). Usually, additional suppressions are necessary to prevent the values of the sensitive cells from being calculated from the row or column totals. Because of these additional suppressions, useful information gets lost. We want to minimize the loss of information by making the best choice for the additional suppressions. Therefore, we introduce and compare the performance of some heuristics for solving this problem.
AB - Confidentiality protection of data published in tables is a major problem for statistical offices. To obtain full cooperation of the respondents, it is required that information of individual respondents with a confidential character is kept from being disclosed. One method to avoid disclosure is the method of cell suppression, in which the values of a number of statistical cells are not published but are suppressed from publication. We discuss the method of cell suppression in general two-dimensional tables, in which row totals and column totals are always published. The values of the sensitive cells are replaced by a cross (X). Usually, additional suppressions are necessary to prevent the values of the sensitive cells from being calculated from the row or column totals. Because of these additional suppressions, useful information gets lost. We want to minimize the loss of information by making the best choice for the additional suppressions. Therefore, we introduce and compare the performance of some heuristics for solving this problem.
KW - statistical disclosure
U2 - 10.1080/01621459.1994.10476895
DO - 10.1080/01621459.1994.10476895
M3 - Article
SN - 0162-1459
VL - 89
SP - 1547
EP - 1557
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 428
ER -