Datasets
auctions
Auction data
Data on eBay auctions, based upon the paper “Econometrics of Auctions by Least Squares” by Leonardo Rezende, Journal of Applied Econometrics, 2008, 23:925-948. The dataset consists of eBay auctions for Apple iPod mini devices in June and July 2006, restricted to auctions for the 4GB model.
babynames
Popular names data
Data on the names of all babies born in the United States in 2022, as provided by the Social Security Administration. Each observation corresponds to a specific name and gender, with a count of that name provided. For confidentiality reasons, the minimum count for any name is 5. All other names (with fewer than 5 occurrences in the U.S.) are included within the observation having “OTHER” as the name. There are two “OTHER” observations, one for female babies and one for male babies. Data are sorted alphabetically by name.
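Because the two “OTHER” observations aggregate many rare names, they usually need to be separated from the named observations before computing name-level statistics. A minimal Python sketch of that split, using made-up toy records and hypothetical field names (name, sex, count) that are assumptions rather than the dataset's actual variable names:

```python
# Toy records with made-up counts; the field names are hypothetical and
# may not match the variable names in the actual dataset.
records = [
    {"name": "Ava", "sex": "F", "count": 12},
    {"name": "OTHER", "sex": "F", "count": 30},
    {"name": "Liam", "sex": "M", "count": 20},
    {"name": "OTHER", "sex": "M", "count": 25},
]


def split_named_and_other(rows):
    """Separate ordinary name rows from the aggregated OTHER rows."""
    named = [r for r in rows if r["name"] != "OTHER"]
    other = [r for r in rows if r["name"] == "OTHER"]
    return named, other


def total_births(rows):
    """Sum the counts over a list of rows."""
    return sum(r["count"] for r in rows)


named, other = split_named_and_other(records)

# Share of births whose name is reported individually (count of 5 or more).
share_named = total_births(named) / total_births(records)
print(len(named), len(other), round(share_named, 3))
```

Keeping the “OTHER” rows in the denominator preserves the total number of births, while excluding them from the numerator avoids treating the aggregate as if it were a single name.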
brands
Brand data
Data on the purchase behavior of customers at a specific market. The dataset consists of customers who purchased one of five candy-bar brands in their previous visit to the market and records whether or not they make a purchase during this visit and, if so, which brand they purchase. The dataset is adapted from the full dataset that is referenced in the source citation.
congress
Congressional election data
Data on congressional election outcomes in the United States between 1948 and 1990, based upon the paper “Do Voters Affect or Elect Policies? Evidence from the U.S. House” by David S. Lee, Enrico Moretti, Matthew J. Butler, 2004, Quarterly Journal of Economics, 119: 807-859. The sample is restricted to elections in which the incumbent (i) is running for re-election and (ii) is not running unopposed. There are 9,788 observations, and demographic variables are available for 6,774 of them.
dictator
Dictator-game data
Data on the results from “dictator games” played in an experimental study, based on the paper “Giving and taking in dictator games – differences by gender? A replication study of Chowdhury et al.”, Journal of Comments and Replications in Economics, 2023. Each observation corresponds to one play of the game. Earnings are those of the dictator. The two game variants are the “giving game” (the dictator starts with the endowment) and the “taking game” (the recipient starts with the endowment).
houseprices
Housing price data
Data on house sales in Ames, Iowa between 2006 and 2010. The dataset is limited to one-family homes with public utilities and excludes new home sales.
hrs
Health-expenditure data
Data on healthcare utilization and expenditures for adults 50 years and older in the United States, taken from the Health and Retirement Study (HRS) and Asset and Health Dynamics Among the Oldest Old (AHEAD). The data were originally used in the paper “On the distribution and dynamics of health care costs” by Eric French and John Bailey Jones, 2004, Journal of Applied Econometrics, 19: 705-721. This dataset is restricted to non-married individuals in the year 2000.
inflation_expectations
Inflation expectations data
Data on individual inflation expectations, based on the paper “Measuring consumer uncertainty about future inflation” by Wändi Bruine de Bruin, Charles F. Manski, Giorgio Topa, Wilbert van der Klaauw, 2011, Journal of Applied Econometrics, 26: 454-478. This dataset includes only the observations with point estimates of inflation, for individuals between 30 and 70 years of age. The survey took place in 2007 and 2008. As a benchmark, actual inflation was 3.2% in 2006, 2.9% in 2007, and 3.8% in 2008.
metricsgrades
Econometrics course data
Data on performance in a graduate econometrics course, with GRE test information and domestic/international status available.
mutualfunds
Mutual-fund performance data
Data on mutual funds categorized as “Large Blend Equity” funds by Morningstar, limited to funds in existence for more than 10 years. Data were captured on 2/28/2023.
premier
Premier League soccer data
Data on all game results for the 2020 Premier League soccer season. The Premier League consists of 20 teams. Each team plays every other team twice (home and away) during the season, so there are a total of 38 rounds in the season and 380 total games.
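The round and game counts stated above follow from the double round-robin format. A minimal Python sketch of that arithmetic (the function name is illustrative and not part of the dataset), assuming every team plays exactly once per round:

```python
from itertools import permutations


def double_round_robin_counts(n_teams):
    """Return (rounds, total_games) for a home-and-away round robin.

    Assumes an even number of teams, each playing exactly once per round.
    """
    # Each ordered pair (home, away) of distinct teams is one game.
    fixtures = list(permutations(range(n_teams), 2))
    total_games = len(fixtures)
    # With every team playing once per round, a round has n_teams / 2 games.
    games_per_round = n_teams // 2
    rounds = total_games // games_per_round
    return rounds, total_games


rounds, games = double_round_robin_counts(20)
print(rounds, games)  # prints: 38 380
```

Equivalently, each team plays 2 × 19 = 38 games, and 20 × 19 = 380 ordered home-away pairings make up the season.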
resume
Resume response data
Data on responses to hypothetical resumes that were created for an experimental study, based upon “Ban the Box, Criminal Records, and Racial Discrimination: A Field Experiment” by Amanda Agan and Sonja Starr, 2018, Quarterly Journal of Economics, 133: 191-235. This dataset considers only the subsample from before the ban-the-box initiative.