UW Data Collaborative

Data licensing details

| Data set | Description | PI | General use? | Licensing details |
| --- | --- | --- | --- | --- |
| Centers for Medicare & Medicaid Services Minimum Data Set and Master Beneficiary Summary File | The Centers for Medicare & Medicaid Services (CMS) provides a number of data sets, including the Minimum Data Set (MDS) from a nursing home resident survey and the Master Beneficiary Summary File (MBSF) Base data covering beneficiary enrollment. Data are 2018 versions. | Anirban Basu; Tracy Mroz; Rachel Prusynski | No | Available only to named researchers on the specific study; contact PI for more information |
| Chitwan Valley Family Study data | A comprehensive family panel study of individuals, households, and communities in the Chitwan Valley of Nepal | Nathalie Williams | Possible | Permission of PI |
| Data Axle (Infogroup) Historical Business and Consumer data | (1) Geo-referenced information on millions of households, with basic consumer profiles; and (2) address-level data on US business entities and other organizations. | UW Libraries | Yes | Available to all UW members |
| Eurostat Microdata | Data consist of records with information on individual persons, households, or businesses in the EU and are used in the production of Eurostat’s official statistics and aggregate tables. | Arthur Acolin | No | Collaboration with PI |
| Gallup Micro-Level Polling Data | The Gallup World Poll measures factors such as well-being, employment, law and order, food and shelter, migration, personal health, financial issues, civic engagement, and communications as they pertain to world development indicators. | IHME | Possible | Approval by IHME |
| IBM Watson MarketScan Data | The IBM MarketScan® Research Databases are a family of research data sets that fully integrate de-identified patient-level data. The UW-licensed data include administrative health records (inpatient, outpatient, and prescription drugs) and productivity (workplace absence, short- and long-term disability, and workers’ compensation). | Doug Barthold | | (1) Primarily for members of the Schools of Medicine, Pharmacy, and Public Health; (2) PI-countersigned data use agreement; (3) Not to be used for grant-funded research without paying extra fees |
| Restricted Access Add Health | The National Longitudinal Study of Adolescent to Adult Health (Add Health) is a longitudinal study of a nationally representative sample of adolescents in grades 7-12 in the United States during the 1994-95 school year. | Sara Curran | | Approval by Add Health review committee at UNC |
| Scraped Online Rental Listings | This database is part of an ongoing collaboration between CSDE and the Department of Sociology to generate information on rental housing markets via scraped online housing advertisements. Individual listings are scraped, processed, and geocoded. | Kyle Crowder | | Approval by PI |
| SEER-Medicare Linked Records | The SEER-Medicare data reflect the linkage of two large population-based sources of data that provide detailed information about Medicare beneficiaries with cancer. | Anirban Basu | | Available only to named researchers on the specific study |
| Washington Merged Longitudinal Administrative Data | State-level merged and geocoded administrative data from multiple state agencies, assembled to examine employment and earnings outcomes. | Jennie Romich; Scott Allard; Mark Long | | (1) Permission of PIs; (2) Approval by WA State IRB |
| Zillow Assessor and Real Estate Database (ZTRAX) | The Zillow Transaction and Assessment Dataset (ZTRAX) is the nation’s largest real estate database made freely available to academic, non-profit, and government researchers. | Arthur Acolin; Kyle Crowder | Possible | Data use contract expires Sept 30, 2023. Zillow is not taking new contracts. Additional projects may be added by having PIs contact Zillow. See https://www.zillow.com/research/ztrax/. |