|
|
||||||
Regulating Access to Web-published Databy Pierangela Samarati The overall goal of the FASTER project (Flexible Access to Statistics Tables and Electronic Resources) is the development of a flexible and open system for controlled dissemination of statistical information. The system includes two major security components: a Statistical Disclosure component, for the sanitization of sensitive tables, and an Access Control component, allowing enforcement of protection requirements on published data. Todays society places great demand on the dissemination and sharing of information. With the development and wide-spread use of the Internet and the World Wide Web, organizations in the private and public sectors are increasingly required to make their data available to the outside world. A growing amount of data is being collected by statistical agencies and census bureaus for analysis and subsequent distribution to the general public or to requesting organizations (eg, research institutions, government offices). Data producers can release their data directly, as in the case of national statistical institutions, or through the mediation of archive institutions (data publishers) that collect data from various sources for subsequent distribution. This data distribution process is clearly selective: data cannot just be released to anybody. For instance, certain sensitive data can only be released to authorised individuals and/or for authorised purposes (eg, health data). Some data is subject to time restrictions and can only be released to the general public after a certain period; some data can be released only for non-commercial purposes; other data can only be released on payment. These few examples already give an idea of the variety of protection requirements that may have to be enforced. There is thus the need for a powerful and flexible access control system able to enforce the different requirements that the data producers (or publishers) may want to impose on the data access. In the context of the FASTER project, we have developed an Access Control System for specifying and enforcing protection requirements on published data, such as statistical tables that have already undergone a statistical disclosure control process, or survey results, etc. The access control component is based on a simple, expressive language for the specification of protection requirements. The approach has the following features:
The Access Control System has been implemented and integrated in the FASTER architecture, which has been developed in a collaboration between the Data Archive at Essex University (UK), the Information Technology Dept. of the University of Milan (Italy), the Norwegian Social Science Data Services (Norway), the Dansk Data Arkiv(Denmark), the CentraalBureau voor de Statistiek (Netherlands), the Central Statistics Office (Ireland), the Statistik Sentralbyra (Norway), and the Centre National de la Recherche Scientifique (France). The access control language and component developed at the University of Milan are being adopted by the project partners to express and enforce protection requirements on the data they make available. The approach used to develop an access control for web-publishing is now being extended to the support of credentials and certified statements (instead of requiring them to be stored at the server as metadata) and to policy composition. Policy composition refers to the controlled combination of access constraints independently specified by different authorities (eg, data respondent, publisher, producer, and privacy advocates and regulators). Links: Please contact: |