Cornell University CISER

CISER Data Archive: Tutorial

How can I find out what's in the Data Archive?

The Data Archive catalog of holdings

Search our holdings by words and phrases in study titles, principal investigators or sponsoring agencies, producers, CISER codebook number (our version of a call number), and ICPSR study number. Use a simple search or a compound field search. A search hints page has tips on how to use the search features. In addition, you can browse studies by broad subject categories.

The catalog also tells you whether a study is on the file server or on CD-ROM/DVD, and where it is located. Here are some examples.

A study on CD-ROM

National Longitudinal Surveys: Old Cohort Databases -
Mature Men 45-59 in 1966, 1966-1990

The Center for Human Resource Research. -- R 2.0 -- Columbus, OH: Center for Human Resource Research [producer]. Columbus, OH: Center for Human Resource Research, Ohio State University [distributor]. Files on CDROM#: 312.

 

A study on the CISER file server

The example below shows the catalog record for the May 1999 Current Population Survey dataset. In addition to a data file, this study includes a machine-readable codebook (in PDF format) and a data dictionary file. The Directory\Filename field tells you where each file is located on the file server (U:\ArchiveData\cph\005) and what the files are named (cpsmay99.dat, cpsmay99.pdf, cpsmay99.ddf). This information is especially important if you use Archive data from a CISER Research Computing node.

If this were a real catalog search, you could also download these files from the website using the hyperlinks in the file information display. The information icon explains this feature of the catalog in more detail.

Current Population Survey, May 1999: Tobacco Use Supplement

US Bureau of the Census for the National Cancer Institute. Washington, DC: Bureau of the Census; [producer]. Washington, DC: Bureau of the Census, 2001 [distributor]. Codebook: CPH-005(1999).

File Information:

Type of File: Data
Directory\Filename: U:\ArchiveData\cph\005\cpsmay99.dat
Logical Record Length (LRECL): 1030
Number of Records: 134994
Record Format (RECFM): C
Bytes (compressed): 12977278
Bytes (uncompressed): 139313808

Type of File: Codebook (PDF Format)
Directory\Filename: U:\ArchiveData\cph\005\cpsmay99.pdf
Technote: Binary - use Adobe Acrobat to view.
Logical Record Length (LRECL): 402
Number of Records: 8026
Record Format (RECFM): V
Bytes (compressed): 1130107
Bytes (uncompressed): 1373695

Type of File: Data Dictionary
Directory\Filename: U:\ArchiveData\cph\005\cpsmay99.ddf
Logical Record Length (LRECL): 58
Number of Records: 4264
Record Format (RECFM): V
Bytes (compressed): 23082
Bytes (uncompressed): 119488
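
The fields in a file information listing can be used to sanity-check a file after you copy it. The Python sketch below is illustrative only: it parses two of the entries above into dictionaries and compares the data file's uncompressed size with its record length times its record count. The function names (parse_file_info, expected_bytes) are hypothetical, and the two-byte line terminator per record is an assumption. It happens to reproduce the byte count listed for cpsmay99.dat, but the catalog does not say how that figure is calculated.

```python
# Sample listing copied from the catalog record above (two of the three files).
LISTING = r"""
Type of File: Data
Directory\Filename: U:\ArchiveData\cph\005\cpsmay99.dat
Logical Record Length (LRECL): 1030
Number of Records: 134994
Record Format (RECFM): C
Bytes (compressed): 12977278
Bytes (uncompressed): 139313808

Type of File: Data Dictionary
Directory\Filename: U:\ArchiveData\cph\005\cpsmay99.ddf
Logical Record Length (LRECL): 58
Number of Records: 4264
Record Format (RECFM): V
Bytes (compressed): 23082
Bytes (uncompressed): 119488
"""


def parse_file_info(text):
    """Split a 'File Information' listing into one dict per file."""
    files = []
    for line in text.strip().splitlines():
        line = line.strip()
        if not line:
            continue
        # Split on the first ": " so the colon in "U:\..." is left alone.
        key, _, value = line.partition(": ")
        if key == "Type of File":
            files.append({})  # each "Type of File" line starts a new entry
        if files:
            files[-1][key] = value
    return files


def expected_bytes(lrecl, n_records, terminator_len=2):
    """Size of a file of equal-length records.

    terminator_len=2 assumes each record ends in a two-byte line
    terminator (for example CR+LF); this is an assumption, not
    something the catalog states.
    """
    return (lrecl + terminator_len) * n_records


files = parse_file_info(LISTING)
data = files[0]
size = expected_bytes(
    int(data["Logical Record Length (LRECL)"]),
    int(data["Number of Records"]),
)
# Compare the computed size with the size reported by the catalog.
print(data[r"Directory\Filename"], size == int(data["Bytes (uncompressed)"]))
```

For the data file, (1030 + 2) x 134994 = 139313808, which equals the listed uncompressed size. The same check does not apply to the codebook or data dictionary entries, whose record format is listed as V and whose LRECL therefore cannot be treated as a fixed record size.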

Not all data documentation is machine readable. Some datasets have documentation in both machine-readable and hardcopy form; for example, a codebook or data dictionary may be on the file server, while a survey or data collection instrument may be held in paper format only.

In addition, staff print copies of most machine-readable documentation for use in the Archive. Hardcopy documentation in the Archive is shelved according to the codebook number; in this case, CPH-005(1999).
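
If you keep your own list of studies, it can help to split codebook numbers into their parts. The Python sketch below is a rough illustration that assumes every codebook number follows the pattern of the two that appear on this page, CPH-005(1999) and SIND-002(2004): a letter prefix, a hyphen, a number, and a four-digit year in parentheses. The function name parse_codebook_number is hypothetical, and sorting by prefix, number, and year is only a guess at a sensible order, not a description of how the Archive shelves its volumes.

```python
import re

# Assumed pattern: letters, hyphen, digits, then a four-digit year in parentheses.
CODEBOOK_RE = re.compile(r"^([A-Z]+)-(\d+)\((\d{4})\)$")


def parse_codebook_number(code):
    """Return (prefix, number, year) for a code such as 'CPH-005(1999)'."""
    match = CODEBOOK_RE.match(code.strip())
    if match is None:
        raise ValueError(f"unrecognized codebook number: {code!r}")
    prefix, number, year = match.groups()
    return prefix, int(number), int(year)


codes = ["SIND-002(2004)", "CPH-005(1999)"]
# Sort by prefix first, then by number, then by year.
for code in sorted(codes, key=parse_codebook_number):
    print(code, parse_codebook_number(code))
```

A code that does not fit the assumed pattern raises ValueError rather than being silently mis-sorted.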

The Cornell University Library Catalog

The Cornell University Library (CUL) online catalog includes information about most studies on the CISER file server. For complete information about a study, including a list of its files, use the Connect to CISER link. The CUL catalog does not contain information on CD-ROM/DVD holdings in the Data Archive. Here is an example of a CUL catalog record.

Author/Creator: Davis, James A.
Title: General Social Survey, 2004 [electronic resource] / Davis, James A.; Smith, Tom W.; Marsden, Peter V.
Published: Chicago: National Opinion Research Center, 2005; Storrs, CT: Roper Center for Public Opinion Research.
Description: Computer data
Electronic Access: Connect via CISER
Other Names: Smith, Tom W. (Tom William), 1949-; Marsden, Peter V.
Location: CISER Data Archive
Call Number: SIND-002(2004)
