The "Data Liberation Initiative" (DLI) provides
Canadian academic institutions access to Statistics Canada data files
and databases for teaching and research. The list of data sets is
extensive and includes the Census, the General Social Survey,
SLID, FAMEX, the Agricultural Census, and many more (see Statistics Canada for
a complete list).
The DLI is a co-operative effort among the Humanities and Social Sciences
Federation of Canada (HSSFC), the Canadian Association of Research Libraries
(CARL), the Canadian Association of Public Data Users (CAPDU), the Canadian
Association of Small University Libraries (CASUL), Statistics Canada
and other government departments.
As subscribing institutions, the University of Guelph, University of
Waterloo, and Wilfrid Laurier University pay a yearly fee which allows
individuals or organizations directly involved in teaching or academic
research to use these data files. The files are held at Statistics
Canada. Once a user submits a request to the TDR, the data is transferred
to the TDR and stored here for the current user and for any future
use.
The data usually arrives in a 'raw' format, accompanied by electronic codebooks
and sample programs that read the data into SAS or SPSS.
Each data set is unique, and the documentation varies. If it is apparent
that the data will be heavily used, TDR staff will mount the data
on our web retrieval system, which allows for easy extraction and some
statistical analysis. If it is a 'one-off' request, TDR staff from
the home institution of the requester will help the user prepare
the necessary SAS or SPSS program to read the data.
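For users who want to see what reading a 'raw' file involves, the following is a minimal sketch in Python rather than SAS or SPSS. It assumes the file is fixed-width and that the codebook gives a start and end column for each variable. The variable names, column positions, and sample records are hypothetical and do not come from any actual DLI data set; the real layout must always be taken from the codebook supplied with the data.

```python
import io

# Hypothetical codebook: variable name -> (start column, end column),
# 1-based and inclusive, as column positions are usually listed in a codebook.
CODEBOOK = {
    "PROVINCE": (1, 2),
    "AGE": (3, 5),
    "INCOME": (6, 12),
}


def parse_record(line, codebook):
    """Slice one fixed-width record into named integer fields."""
    record = {}
    for name, (start, end) in codebook.items():
        field = line[start - 1:end].strip()
        # A blank field is treated as a missing value.
        record[name] = int(field) if field else None
    return record


def read_raw(stream, codebook):
    """Read every non-empty line of a raw data stream into a list of dicts."""
    return [
        parse_record(line.rstrip("\n"), codebook)
        for line in stream
        if line.strip()
    ]


# Two made-up records standing in for a raw data file.
SAMPLE = "35 42  52000\n24 19       \n"
rows = read_raw(io.StringIO(SAMPLE), CODEBOOK)
```

In practice the stream would be an opened data file instead of the in-memory sample, and real codebooks often include non-numeric variables and special missing-value codes that this sketch does not handle.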
Acquiring data can take time. Currently, the DLI honours requests only
for 'current data' (data generated within the last few years). This
will change shortly. If the data has already been requested by another
institution, it can be transferred almost immediately; otherwise,
users should expect a delay of one week to one month, depending
on the division from which the data is requested.
Making the data available on our web site can usually be accomplished
within a few days, provided there are supporting SAS programs. There
may be slight delays, depending on staff availability.
The DLI also provides aggregate data files in a format called IVISION B2020.
These are pre-packaged tables whose data can be manipulated
and customized on the fly by the individual user. The data can then be exported
into spreadsheets for further manipulation.