Advanced search
431
Views
9
CrossRef citations to date
0
Altmetric
Articles

Linking the 1940 U.S. Census with modern data

, , , &
Pages 246-257
Published online: 20 Dec 2018
 

Abstract

The U.S. Census Bureau has created a set of linkable census, survey, and administrative records that provides longitudinal data on the American population across the past eight decades. While these files include modern decennial censuses, Census Bureau surveys, and administrative records files from other federal agencies, the long time span is only possible with the addition of the complete count 1940 Census microdata. In this paper, we discuss the development of this linked data infrastructure and provide an overview of the record linkage techniques used. We primarily focus on the techniques used to produce a beta version of a linkable 1940 Census microdata file and discuss the potential to further document and extend the infrastructure.

Acknowledgements

The authors gratefully acknowledge support from Constructing Life Course and Intergenerational Data Through Censes Data Linkages (R21 HD087914) and the Minnesota Population Center (P2C HD041023), both funded from the Eunice Kennedy Shriver National Institute for Child Health and Human Development (NICHD). We also thank the participants in the 2015 Minnesota Population Center Record Linkage workshop and the 2015 Population Association of America Annual Meetings for their comments and suggestions. This research was conducted as a part of the Census Longitudinal Infrastructure Project (CLIP). Trent Alexander and Catherine Massey conducted this work while employees of the U.S. Census Bureau. Any opinions and conclusions expressed herein are those of the authors and do not necessarily represent the views of the U.S. Census Bureau.

Notes

1 Transactions occur when corrections or name changes are made on a record of a particular SSN. This allows us to observe both married and maiden names for women. The average number of transactions per SSN is 2.1 (Harris 2014 Harris, B. 2014. Transgender labor supply, employment, and earnings gaps: Evidence from the Federal Administrative Records and the American Community Survey. CARRA Working Paper. [Google Scholar]).

2 The PVS string comparator was developed by Winkler (1995) and measures the distance between two strings on a scale from 0 to 900, where a distance score of 0 is given if there is no similarity between two text strings and a score of 900 is given for an exact match. The cutoff value for the string distance is set to 750 in the Name Search module.

3 In modern linking conducted at the Census Bureau, linkages to the corresponding tax year provide addresses in the reference file (e.g. 1999 tax year returns are used to PIK the 2000 Census). We are unaware of any digitized historical administrative data that would provide a reference point for addresses in the 1940 Census.

4 We use Black et al. (2015 Black, D., Sanders, S., Taylor, E. & Taylor, L. (2015). The impact of the Great Migration on mortality of African Americans: Evidence from the Deep South. American Economic Review, 105 (2), 477503.[Crossref], [PubMed], [Web of Science ®] [Google Scholar]) GNIS codes for city of birth listed in the Numident. We then aggregate city up to the county level for the match.

5 Although address stings are available for many records in the 1940 Census, we do not observe addresses for observations in the reference file. Consequently, 1940 respondent address is not useful for PIK assignment.

6 Outside of the Census Bureau, many scholars link person records across public (pre-1950) decennial census data (for example, see Abramitzky et al. 2014 Abramitzky, R., L. Boustan, and K. Eriksson. 2014. A nation of immigrants: Assimilation and economic outcomes in the age of mass migration. Journal of Political Economy 122 (3):467506.[Crossref], [PubMed], [Web of Science ®] [Google Scholar]; Collins and Wanamaker 2014 Collins, W. J., and M. H. Wanamaker. 2014. Selection and economic gains in the great migration of African Americans: New evidence from linked census data. American Economic Journal: Applied Economics 6 (1):22052.[Crossref], [Web of Science ®] [Google Scholar]; Ferrie 1996 Ferrie, J. 1996. A new sample of Americans linked from the 1850 public use micro sample of the federal census of population to the 1860 federal census manuscript schedules. Historical Methods 34:14156.[Taylor & Francis Online] [Google Scholar]; Long and Ferrie 2013 Long, J., and J. Ferrie. 2013. Intergenerational occupational mobility in Great Britain and the United States since 1850. American Economic Review 103 (5):204137.[Crossref] [Google Scholar]). At the forefront, the IPUMS project produced the most comprehensive linkages of individuals across the 1850–1930 U.S. censuses with their IPUMS Linked Representative Samples (Goeken et al. 2011 Goeken, R., L. Huynh, T. Lenius, and R. Vick. 2011. New methods of census record linking. Historical Methods 44 (1):714.[Taylor & Francis Online], [Web of Science ®] [Google Scholar]; Ruggles, 2011 Ruggles, S. 2011. Intergenerational coresidence and family transitions in the United States. Journal of Marriage and Family 73 (1):13648.[Crossref], [Web of Science ®] [Google Scholar]). These linkages began with individuals in the 1880 complete-count census file linked to the other earlier censuses to produce seven pairs of linked samples. The IPUMS is expanding their historical linkages to the full-count historical population censuses to construct the Multigenerational Longitudinal Panel (IPUMS-MLP). A collaborator of IPUMS-MLP, the Longitudinal, Intergenerational Family Electronic Micro-database (LIFE-M) project, uses historical vital records to improve the linkages of women and minorities across the complete-count 1880–1940 censuses (Bailey et al. 2017 Bailey, M. J., C. Cole, M. Henderson, and C. G. Massey. 2017. “How do automated linking methods perform? Evidence from the Life-M Project. http://www-personal.umich.edu/∼baileymj/Bailey_Cole_Henderson_Massey.pdf (accessed September 21, 2017). [Google Scholar]). Similarly, the Early Indicator’s project links Union Army enlistees and their families across numerous decennial censuses (Costa et al. 2017 Costa, D. L., H. DeSomer, E. Hanss, C. Roudiez, S. E. Wilson, and N. Yetter. 2017. Union army veterans, all grown up. Historical Methods 50 (2):7995.[Taylor & Francis Online], [Web of Science ®] [Google Scholar]).

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
EUR 40.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
EUR 101.00 Add to cart

Purchase access via tokens

  • Choose from packages of 10, 20, and 30 tokens
  • Can use on articles across multiple libraries & subject collections
  • Article PDFs can be downloaded & printed
From EUR 400.00
per package
Learn more
* Local tax will be added as applicable
 

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.