Below is some context about the data provided in the paper.
The SGM [Social Genome Model] is constructed using two data sets from the Bureau of Labor Statistics' National Longitudinal Surveys. Our primary data set is the ‘Children of the NLSY79’ (CNLSY). It represents children born mainly in the 1980s and 90s, and is the source of our data for the birth, early and middle childhood, and adolescent stages. No respondent in the CNLSY is yet old enough to track through adulthood, so we impute their adult values with help from a second dataset: the ‘National Longitudinal Survey of Youth 1979’ (NLSY79).
The result is a longitudinal dataset in which synthetic individuals, part actual CNLSY data and part imputed data, pass through five life stages from birth to adulthood. This includes 5,783 children from the CNLSY, born between 1971 and 2009.
The paper was published by The Federal Reserve Bank of Boston.