README for pub/data

Bronwyn H. Hall, University of California at Berkeley

 

This document describes the data available in the directory pub/data on my website. These data files are made available to other researchers on an AS IS basis. Because of the volume of requests and time constraints, I am unable to answer inquiries about them; users and potential users are referred to the publications cited within the documentation files for further information. Copies of these publications are available on my website.

Potential users of pstar, rvaluni3, hmuspan2, and pan91hdr should be aware that these data are drawn primarily from the Compustat files of Standard and Poor, and as such are proprietary to S&P. Before using these files, you should ascertain that your home institution is a Compustat subscriber.

Each file is stored as a zip file containing two files, one with extension doc (documentation) and one with extension dat (the data in ASCII format). The only exception is patrhgh, which contains 2 datafiles (see the documentation file itself for further information).

List of files available with links for downloading:

 

patrhgh

The patents-R&D data used in Hall, Griliches, and Hausman, "Patents and R&D: Is There a Lag?," IER 27(1986): 265-283.

patpan85

The patents data at the individual firm level, from the 1985 R&D master file. Total patent applications and grants for the firms, from  about 1965 to 1979 (1981 for grants). By CUSIP and YEAR, this can be matched to the other files.

pan91hdr

A dataset containing header information for all the firms in the 1991 manufacturing sector master file (not yet released, but the source of data in pstar, rvaluni3, and hmuspan2 below). This list contains the names and stock symbols of firms, and can be used to ascertain the presence of a particular firm.

pubexit

A list of firms that exited from the 1991 manufacturing sector master file, together with the reason for their exit, and the name (and CUSIP, where available) of the successor firm.

hmuspan2

The data used for productivity estimation in Hall, "Industrial Research During the 1980s: Did the Rate of Return Fall?," BPEA Micro (1993,2): 289-344, and Mairesse and Hall, "Estimating the Productivity of Research and Development: An Exploration of GMM Methods Using Data on French and United States Manufacturing Firms.” Approximately 22,000 obs, 1960-1990. Cleaned; R&D-doing firms only.

hmuscln8

The cleaned and deflated data file used in Mairesse and Hall, based on hmuspan2. 10,957 obs, 1,180 firms with data for at least 4 years between 1977 and 1989.

rvaluni3

The data used for market value estimation in Hall, BPEA Micro (1993,2), and in Hall, "The Stock Market Value of Research and Development Investment During the 1980s," AER 83 (1993): 259-264. Approximately 44,000 obs, 1959-1991.  All firms; not very clean.

pstar

The data used in Hall and Hall, "The Value and Performance of US Corporations," BPEA 1993(1): 1-50.

 

Last updated by BHH on 11 June 2006