Data is Plural archive

Trawl through the backlog, or roll the dice for a random dataset 🎲

Jul 2025

Subway art.

The Metropolitan Transportation Authority’s Permanent Art Program commissions public artworks for New York City Transit, Metro-North Railroad, Long Island Rail Road, and even one of its tunnels. The program publishes a dataset of the collection’s 380+ pieces, which include mosaics, stained glass windows, sculptures, passageway floors, murals, fences, and more. The dataset lists each piece’s location, transit agency, artist, artwork name, date constructed, material, description, and online catalog link. As seen in: Stephanie Dang’s Art Off the Rails, winner of the 2024 MTA Open Data Challenge. [h/t Matt Yarri]

Jul 2025

British literary prizes.

The Selected British Literary Prizes dataset, created by Katherine Binhammer and colleagues, “contains information on nine major literary prizes in the U.K. from 1990 to 2022 and demographic information on 682 prize winners and shortlisted authors.” The demographic attributes include “gender, sexuality, UK residency, ethnicity, geography and details of educational background.” The documentation includes a section on how the researchers approached the ethnicity categorizations. Previously: US literary prizewinners (DIP 2022.12.07), also via the Post45 Data Collective. [h/t Melanie Walsh]

data.post45.org data.post45.org/posts/british-literary-prizes/ apps.ualberta.ca apps.ualberta.ca/directory/person/kb1 data.post45.org data.post45.org/posts/british-literary-prizes/#notes-on-ethnic-identifications data.post45.org data.post45.org/posts/the-index-of-major-literary-prizes-in-the-us/ data-is-plural.com data-is-plural.com/archive/2022-12-07-edition/ data.post45.org data.post45.org/

Jul 2025

AI legal flubs.

Damien Charlotin’s AI Hallucination Cases database “tracks legal decisions in cases where generative AI produced hallucinated content – typically fake citations, but also other types of arguments.” Charlotin, who teaches a course called “Large Language Models and the Future of the Legal Profession”, has collected 212 examples so far. The database lists each case’s name, jurisdiction, decision date, party responsible, AI tool used, “nature of hallucination”, outcome, penalties, and description. [h/t Simon Willison + Avi Levin]

damiencharlotin.com damiencharlotin.com/ damiencharlotin.com damiencharlotin.com/hallucinations/ damiencharlotin.com damiencharlotin.com/cv/

Jul 2025

Car crash datasets.

Transportation policy scholars Hannah Younes and Robert B. Noland have compiled a catalog of US states’ car crash data resources. Thirty states and DC publish raw data, 14 states provide only a dashboard or map, while six provide no public data, according to the authors’ survey. They’ve provided links to each of the resources, such as New Jersey’s data downloads, Arizona’s dashboard, and Wisconsin’s map. The data sources vary in detail, time span, and ease of access. Previously: The National Highway Traffic Safety Administration’s Fatality Analysis Reporting System (DIP 2016.08.31). [h/t Michael Allen]

vtc.rutgers.edu vtc.rutgers.edu/people/younes/ bloustein.rutgers.edu bloustein.rutgers.edu/people/noland/ tandfonline.com tandfonline.com/eprint/4VRRJASI2CYVQDFQRPTX/full arcgis.com arcgis.com/home/item.html?id=3ef625ffe7ce4ef38be81ee6d80a5385 nj.gov nj.gov/transportation/refdata/accident/rawdata01-current.shtm azdhs.gov azdhs.gov/preparedness/emergency-medical-services-trauma-system/data-visualization/index.php#dashboards-mvt-related-trauma transportal.cee.wisc.edu transportal.cee.wisc.edu/partners/community-maps/crash/search/BasicSearch.do nhtsa.gov nhtsa.gov/research-data/fatality-analysis-reporting-system-fars data-is-plural.com data-is-plural.com/archive/2016-08-31-edition/
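
As a quick sanity check on the survey counts quoted above, the tally can be sketched in a few lines of Python. The category labels are shorthand invented for this sketch, not the authors' own terms; the numbers come straight from the description.

```python
# Counts as quoted in the blurb; the labels are informal shorthand.
access = {
    "raw data": 30 + 1,           # thirty states plus DC
    "dashboard or map only": 14,  # states
    "no public data": 6,          # states
}

# All fifty states plus DC should be accounted for.
total = sum(access.values())

for label, count in access.items():
    print(f"{label}: {count} of {total} ({count / total:.0%})")
```

The three groups add up to 51 jurisdictions, which matches fifty states plus DC.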

Jul 2025

Landfills.

The EPA’s Landfill Methane Outreach Program, “a voluntary program that works cooperatively with industry stakeholders and waste officials to reduce or avoid methane emissions,” maintains a database of 2,600+ municipal solid waste (MSW) landfills in the United States. One table provides each site’s name, location, owners, operators, ownership type, year opened, year closed, size, waste capacity, latest waste tonnage, methane-related metrics, and more. Another table lists “landfill gas energy projects in various stages, such as planned, under-construction, operational and shutdown.” As seen in: “America’s Hot Garbage Problem” (Bloomberg). [h/t Laura Bliss]

epa.gov epa.gov/lmop/about-landfill-methane-outreach-program epa.gov epa.gov/lmop/lmop-landfill-and-project-database epa.gov epa.gov/lmop/landfill-technical-data epa.gov epa.gov/lmop/landfill-gas-energy-project-data bloomberg.com bloomberg.com/graphics/2025-america-hot-garbage-problem-toxic-landfills/

May 2025

Long- and short-lived states.

Marten Scheffer et al.’s Mortality of States Index “documents commonly agreed state formation and end dates for over 440 different states, covering approximately 5,000 years from 3100 BCE (Egyptian Dynasties I and II) to 2021.” The data — built from sources including Seshat, the Correlates of War Project, and The Encyclopedia of Empire — “cover a broad set of entities ranging from persistent empires to fleeting polities such as the Maukhari dynasty of Northern India or multiple Khaganates that lasted under a century.”

pnas.org pnas.org/doi/10.1073/pnas.2218834120 pnas.org pnas.org/doi/10.1073/pnas.2218834120#supplementary-materials seshat-db.com seshat-db.com/ correlatesofwar.org correlatesofwar.org/ onlinelibrary.wiley.com onlinelibrary.wiley.com/doi/book/10.1002/9781118455074
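
Working with formation and end dates that straddle the BCE/CE boundary takes a little care, because the historical calendar has no year 0. The sketch below assumes years are stored as signed integers (negative for BCE); that encoding and the function name `year_span` are illustrative choices, not something the dataset documentation specifies.

```python
def year_span(start_year: int, end_year: int) -> int:
    """Years elapsed between two calendar years.

    Negative values mean BCE (-3100 is 3100 BCE). Because there is no
    year 0, a span that crosses from BCE into CE is one year shorter
    than plain subtraction suggests.
    """
    if start_year == 0 or end_year == 0:
        raise ValueError("there is no year 0 in BCE/CE numbering")
    if end_year < start_year:
        raise ValueError("end_year must not precede start_year")
    span = end_year - start_year
    if start_year < 0 < end_year:
        span -= 1
    return span


# The coverage window quoted above: 3100 BCE to 2021.
print(year_span(-3100, 2021))  # 5120
```

That figure is consistent with the "approximately 5,000 years" in the description.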

May 2025

Who runs Italy?

Sesso è Potere, an ongoing project from info.nodes and onData, examines gender representation in positions of power in Italy. The 2025 report draws on individual-level datasets of leaders in politics (national lawmakers, ambassadors, regional legislators, mayors, city council members, etc.), business, media, higher education, and other fields. Read more: The project’s 2023, 2022, and 2021 reports. [h/t Liberiamoli Tutti]

infonodes.org infonodes.org/sesso-%C3%A8-potere infonodes.org infonodes.org/ ondata.it ondata.it/ drive.google.com drive.google.com/file/d/15NPjII-W5_yFpBVJLyEVYk1GUsK2v64e/view github.com github.com/ondata/sesso-e-potere/tree/main/dati/2025 irp.cdn-website.com irp.cdn-website.com/6c73ff89/files/uploaded/MARLA%20sesso%20e%20potere%202023%2027%2011.pdf infonodes.org infonodes.org/sessoepotere22 infonodes.org infonodes.org/sessoepotere2021

May 2025

Corporate contracts.

Peter Adelson and Julian Nyarko’s Material Contracts Corpus contains “over one million contracts filed by public companies with the U.S. Securities and Exchange Commission (SEC) between 2000 and 2023,” which the authors collected from the SEC’s EDGAR filings database. In addition to the text of the contracts, the dataset provides metadata — including party names and contract types — extracted from the documents using machine-learning techniques. The dataset is available to download in bulk and search online. [h/t Alice Kalinowski]

arxiv.org arxiv.org/abs/2504.02864 mcc.law.stanford.edu mcc.law.stanford.edu/ sec.gov sec.gov/search-filings/edgar-search-assistance/accessing-edgar-data mcc.law.stanford.edu mcc.law.stanford.edu/download/contracts/ mcc.law.stanford.edu mcc.law.stanford.edu/search/

May 2025

FiveThirtyEight’s public data.

ABC News shut down FiveThirtyEight earlier this year. Datasets built and collected by the publication were featured often in this newsletter. (For part of 2022, FiveThirtyEight also paid to republish Data Is Plural.) Its still-available data repository contains 160+ subdirectories, covering sports predictions, presidential cabinet turnover, media mentions, surveys on comma usage and steak preferences, the Bechdel Test, competitive Scrabble, and much more. [h/t Jan Willem Tulp]

niemanlab.org niemanlab.org/2025/03/fivethirtyeight-is-shutting-down-as-part-of-broader-cuts-at-abc-and-disney/ en.wikipedia.org en.wikipedia.org/wiki/FiveThirtyEight fivethirtyeight.com fivethirtyeight.com/tag/data-is-plural/ github.com github.com/fivethirtyeight/data github.com github.com/fivethirtyeight/data/tree/master/cabinet-turnover github.com github.com/fivethirtyeight/data/tree/master/puerto-rico-media github.com github.com/fivethirtyeight/data/tree/master/comma-survey github.com github.com/fivethirtyeight/data/tree/master/steak-survey github.com github.com/fivethirtyeight/data/tree/master/bechdel github.com github.com/fivethirtyeight/data/tree/master/scrabble-games

May 2025

Unemployment insurance.

The US Department of Labor’s Office of Unemployment Insurance publishes dozens of datasets collected through its coordination of state-administered programs. These include quantifications — reported regularly by each state — of claimant demographics, denials of eligibility, appeal caseloads, overpayments, disaster unemployment assistance, and much more. For example: Data from ETA 9050 reports, available monthly and going back to the late 1990s, indicate how many of each state’s claimants received their first payments within one week, two weeks, et cetera. The office also provides a chartbook and various report-generating tools.

oui.doleta.gov oui.doleta.gov/unemploy/index.asp oui.doleta.gov oui.doleta.gov/unemploy/DataDownloads.asp oui.doleta.gov oui.doleta.gov/unemploy/chartbook.asp oui.doleta.gov oui.doleta.gov/unemploy/DataDashboard.asp

May 2025

What the nose knows.

Antonie Louise Bierling et al. have published a dataset of “descriptions, evaluative ratings, and qualitative labels for 74 chemically diverse mono-molecular odors, rated by a large sample of young adults.” Another paper by Bierling et al. “elicited body odor descriptions from 2,607 participants across 17 countries and 13 languages” to assemble “a standardized lexicon of body odor words.” Related: The Pyrfume Project provides “tools, models, and data for odorant-linked research.”

nature.com nature.com/articles/s41597-025-04644-2 zenodo.org zenodo.org/records/14727277 nature.com nature.com/articles/s41597-025-04630-8 osf.io osf.io/rpzjk/ pyrfume.org pyrfume.org/

May 2025

US dams.

The National Inventory of Dams “documents all known dams in the U.S. and its territories that meet certain criteria” related to the dam’s height, reservoir size, and likely impacts of its “failure or mis-operation.” The inventory, maintained by the US Army Corps of Engineers since the 1970s, now includes 92,000+ structures. The data — available via a searchable map, bulk downloads, and an API — indicate each dam’s name, location, year built, structural characteristics, purpose, operational status, and much more. Previously: Global Dam Watch’s datasets (DIP 2020.01.29) and the USGS’s National Hydrography Dataset (DIP 2022.10.12).

May 2025

California ghost guns.

“Ghost guns have been a uniquely Californian issue,” with the state accounting for a majority of the untraceable firearms that are reported to the ATF, according to The Trace. Earlier this year, on its Gun Violence Data Hub, the publication posted datasets counting the ghost guns recovered by California law enforcement agencies, as well as “firearm-level data on guns reported lost or stolen in the state.” [h/t Aaron Mendelson]

thetrace.org thetrace.org/2024/11/ghost-guns-decline-regulation-biden-atf/ en.wikipedia.org en.wikipedia.org/wiki/Bureau_of_Alcohol,_Tobacco,_Firearms_and_Explosives thetrace.org thetrace.org/ datahub.thetrace.org datahub.thetrace.org/ datahub.thetrace.org datahub.thetrace.org/dataset/california-ghost-guns-stolen-guns-and-more/

May 2025

European workforces.

Each quarter, dozens of countries collectively conduct more than 1.7 million interviews for the European Union Labour Force Survey. The survey, the continent’s largest, aims “to classify people into 3 groups that are mutually exclusive and cover the whole target population”: employed, unemployed, and outside the labor force. Eurostat publishes aggregate results, with breakdowns by age, sex, country, nationality, citizenship status, education level, sector, and more. Detailed microdata are also available to approved researchers. As seen in: Bruegel’s labor market dashboard. [h/t Nina Ruer]

ec.europa.eu ec.europa.eu/eurostat/web/lfs/information-data ec.europa.eu ec.europa.eu/eurostat/web/lfs ec.europa.eu ec.europa.eu/eurostat/web/lfs/database ec.europa.eu ec.europa.eu/eurostat/web/microdata bruegel.org bruegel.org/dataset/eu-labour-market-outlook-dashboard

May 2025

Deportation records.

The Deportation Data Project, run by a team of academics and lawyers, “collects and posts public, anonymized U.S. government immigration enforcement datasets.” These include data from border apprehensions, deportations, Title 42 expulsions, ICE arrests and detentions, ICE-operated flights, and more. Some of the data files come directly from the government, while others were initially obtained from the government by other organizations, such as the University of Washington Center for Human Rights. The project also posts information about its Freedom of Information Act requests. Read more: The project’s “U.S. Immigration Enforcement Data: A Short Guide.” As seen in: “The Rising Cost of ICE Flying Immigrants to Far-Flung Detention Centers” (Bloomberg). [h/t Alex Albright]

deportationdata.org deportationdata.org/ deportationdata.org deportationdata.org/team.html deportationdata.org deportationdata.org/data.html en.wikipedia.org en.wikipedia.org/wiki/Title_42_expulsion github.com github.com/UWCHR deportationdata.org deportationdata.org/foia.html deportationdata.org deportationdata.org/guide.html bloomberg.com bloomberg.com/graphics/2025-trump-ice-immigrant-move-costs-taxpayers/

Apr 2025

Canoe marathons.

Paddle UK’s Marathon Racing Committee promotes endurance canoe and kayak competitions that range “from a couple of miles or kilometres to the ultimate challenge of the 125-mile Devizes to Westminster Canoe Race.” The organization publishes race results online, which data scientist Andrew Collier has collected into structured data files that indicate each competition’s date, name, region, and category, as well as each paddler’s name, club, division, class, finishing time, position, and points.

paddleuk.org.uk paddleuk.org.uk/ canoemarathon.org.uk canoemarathon.org.uk/governance/marathon-racing-committee/ canoemarathon.org.uk canoemarathon.org.uk/what-is-canoe-marathon/ entries.canoemarathon.org.uk entries.canoemarathon.org.uk/results datawookie.dev datawookie.dev/ datawookie.gitlab.io datawookie.gitlab.io/british-canoeing-results/

Apr 2025

Previously unmapped waterways.

WaterNet Global Waterways is “a new global dataset that predicts the locations of waterways around the world” using an AI model trained on satellite imagery and elevation data. A collaboration between Bridges to Prosperity and the Better Planet Laboratory, the dataset — available as raster files, vector files, and an interactive map — “triples the known extent of mapped waterways globally, adding 124 million kilometers to the previously mapped 54 million kilometers.” [h/t Cameron Kruse]

source.coop source.coop/repositories/fika/waternet/description medium.com medium.com/fika-blog/waternet-ai-powered-global-water-mapping-triples-known-waterways-bc3095783661 bridgestoprosperity.org bridgestoprosperity.org/ betterplanetlab.com betterplanetlab.com/ apps.fikamap.com apps.fikamap.com/waternet
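
The "triples" claim follows from the two figures in the quote, as this small arithmetic check shows. The variable names are made up for the sketch; the kilometer figures are the ones quoted above.

```python
previously_mapped_km = 54_000_000   # from the quoted description
newly_added_km = 124_000_000        # from the quoted description

total_km = previously_mapped_km + newly_added_km
growth_factor = total_km / previously_mapped_km

print(f"{total_km / 1e6:.0f} million km in total")
print(f"{growth_factor:.1f}x the previously mapped extent")
```

The total comes to 178 million kilometers, a bit more than three times the earlier figure.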

Apr 2025

US sewer overflow sites.

“There are approximately 700 communities in the United States that have combined sewer systems and experience combined sewer overflow (CSO) discharges,” according to the EPA, whose National Combined Sewer Overflow Inventory lists 8,600+ outfalls across those communities. The downloadable inventory, last updated in September 2023, provides each outfall’s location and relevant information from the National Pollutant Discharge Elimination System’s permit database. As seen in: “Minority communities twice as likely to have sewage polluting nearby river or creek, CBS News analysis shows”. Previously: Sewer overflows in England (DIP 2024.05.15).

epa.gov epa.gov/npdes/where-combined-sewer-overflow-outfalls-are-located echo.epa.gov echo.epa.gov/tools/data-downloads/cso-inventory-summary epa.gov epa.gov/npdes/combined-sewer-overflows-csos echo.epa.gov echo.epa.gov/tools/data-downloads#downloads epa.gov epa.gov/npdes echo.epa.gov echo.epa.gov/help/loading-tool/monitoring-data-download-help cbsnews.com cbsnews.com/news/sewage-river-creek-us-minority-community/ environment.data.gov.uk environment.data.gov.uk/dataset/21e15f12-0df8-4bfc-b763-45226c16a8ac data-is-plural.com data-is-plural.com/archive/2024-05-15-edition/

Apr 2025

Tens of millions of flights.

Sebastiaan Menger has developed a series of quarterly datasets “featuring global, high-level flight schedules extracted from worldwide aircraft ADS-B position transmissions,” going back to early 2024. Each quarterly extract, derived from the ADSB.lol flight-tracking initiative’s open data, features 10–13 million flights. Each flight’s entry indicates the aircraft’s registration number, type, call sign, airline (when applicable), approximate liftoff/touchdown times, and origin/destination airports.

linkedin.com linkedin.com/in/sebastiaanmenger/ github.com github.com/MrAirspace/aircraft-flight-schedules github.com github.com/adsblol adsb.lol adsb.lol/docs/overview/introduction/

Apr 2025

Refugee and asylum policies.

The Dataset of World Refugee and Asylum Policies “offers a complete dataset of de jure asylum and refugee policies” across 190+ countries and 70+ years, from 1951 to 2022. The project, developed by Christopher W. Blair et al. and updated in collaboration with the Joint Data Center on Forced Displacement, evaluates 54 aspects of each policy across five dimensions: access, services, the ability to earn a livelihood, freedom of movement, and political inclusion. Each aspect is scored on a 0-1-2-3 scale. The results are available to download and to analyze online. [h/t Annika Younge]

datanalytics.worldbank.org datanalytics.worldbank.org/dwrap/ cambridge.org cambridge.org/core/journals/american-political-science-review/article/abs/liberal-displacement-policies-attract-forced-migrants-in-the-global-south/F6872E76FBB27F61B96B90193BDE9A1D jointdatacenter.org jointdatacenter.org/ datanalytics.worldbank.org datanalytics.worldbank.org/dwrap/_w_3baf84c2101c4832b34da12495c9dbfc/DWRAP_Tech_Guide_Sep2024.pdf datacatalog.worldbank.org datacatalog.worldbank.org/search/dataset/0066171/Dataset-of-World-Refugee-and-Asylum-Policies--DWRAP-

Feb 2025

Chord progressions.

Spyridon Kantarelis et al. have created CHORDONOMICON, a dataset identifying the progressions of 51 million chords in 667,000+ songs. The dataset draws on tablatures from the website Ultimate Guitar, which were then “annotated with structural parts, genre, and release date”. Most entries also include the song’s and artist’s IDs in Spotify’s system. [h/t Dale Debber]

arxiv.org arxiv.org/abs/2410.22046v1 huggingface.co huggingface.co/datasets/ailsntua/Chordonomicon en.wikipedia.org en.wikipedia.org/wiki/Tablature ultimate-guitar.com ultimate-guitar.com/ developer.spotify.com developer.spotify.com/documentation/web-api/concepts/spotify-uris-ids

Feb 2025

Argentine treaties.

Javier I. Santander, a career diplomat, has built a dataset of 8,200+ bilateral treaties signed by Argentina from 1810 to 2023. It lists each treaty’s title, status, date signed, and counterpart country. The dataset is based on the government’s Digital Library of Treaties, where you can find copies of the treaties themselves. The most common counterparts have been neighboring countries (Chile, Brazil, Bolivia, Paraguay, and Uruguay), followed by Germany, the US, and Italy.

jisantander.com jisantander.com/ jisantander.com jisantander.com/ressources/bilaterals/ tratados.cancilleria.gob.ar tratados.cancilleria.gob.ar/

Feb 2025

18 million deceased veterans.

BIRLS.org, a new website from Reclaim The Records, provides “an index to basic biographical information on more than 18 million deceased American veterans who received some sort of veterans benefits in their lifetime”. Those records, obtained through a FOIA lawsuit, represent a substantial chunk of the Department of Veterans Affairs’ Beneficiary Identification Records Locator Subsystem. The site also helps you file follow-up requests for any individual’s “full VA claims file, which may contain hundreds of pages of never-before-seen biographical and historical material about the veteran, their military service, and their interactions with the VA.” Note: The “database is not a comprehensive database of all American veterans, but rather a partial and incomplete index of veterans who were eligible for VA benefits or whose heirs had some kind of contact with the VA regarding benefits.”

birls.org birls.org/ reclaimtherecords.org reclaimtherecords.org/ archive.org archive.org/details/BIRLS_database reclaimtherecords.org reclaimtherecords.org/records-request/20/

Feb 2025

Water availability.

The US Geological Survey last month released its National Water Availability Assessment, “a pioneering scientific overview of water availability that offers first-of-its-kind insights into the balance between water supply and demand across the conterminous United States.” Alongside the report, USGS launched a “data companion” providing “regularly updated, model-based estimates” of monthly water usage within each of the country’s hydrologic units. Estimates for water availability and water supply are “coming soon,” while those for water quality and aquatic ecosystems are “coming later.”

usgs.gov usgs.gov/news/national-news-release/usgs-releases-a-comprehensive-look-water-resources-united-states usgs.gov usgs.gov/special-topics/integrated-water-availability-assessments/national-water-availability-assessments usgs.gov usgs.gov/publications/us-geological-survey-integrated-water-availability-assessment-2010-20 water.usgs.gov water.usgs.gov/nwaa-data/ water.usgs.gov water.usgs.gov/nwaa-data/data-overview water.usgs.gov water.usgs.gov/GIS/huc.html

Feb 2025

Presidential schedules.

Among its various White House–related undertakings, Roll Call Factba.se provides event-by-event structured data representing the public presidential calendars for Donald Trump and Joe Biden since the latter’s inauguration in January 2021. The schedules, available to download in bulk, provide each event’s day and time, location, a brief description, and other details. They contain 9,400+ entries from Biden’s four years in office plus another 300+ from Trump’s second term so far. The events include those from the official presidential schedule, those derived from pool reports, and press briefings. As seen in: POTUS Tracker. [h/t Dan Brady]

rollcall.com rollcall.com/factbase/ rollcall.com rollcall.com/factbase/trump/calendar/ rollcall.com rollcall.com/factbase/biden/calendar/ en.wikipedia.org en.wikipedia.org/wiki/Press_pool potustracker.us potustracker.us/faq

Jan 2025

A royal regatta.

The Henley Royal Regatta, a multi-day rowing competition, has been held on the River Thames nearly every year since 1839. Dominic Goymour has scraped the event’s online results into a dataset covering 7,500+ outcomes since 1999. It includes each race’s date, starting time, stage, boat class, cup, winning crew/club, losing crew/club, winning time, and more.

hrr.co.uk hrr.co.uk/ linkedin.com linkedin.com/in/dominic-goymour-gradiema-b660981ab/ hrr.co.uk hrr.co.uk/results github.com github.com/domigmr/henley

Jan 2025

Grocery ingredients.

To compile GroceryDB, Babak Ravandi et al. scraped data about 50,000+ food products available on the websites of Walmart, Target, and Whole Foods. For each product, they extracted the nutritional information and ingredient list, which they provide as structured data and use for estimating each product’s degree of processing. Related: TrueFood, a website the research team built with the findings.

github.com github.com/Barabasi-Lab/GroceryDB/ nature.com nature.com/articles/s43016-024-01095-7 github.com github.com/Barabasi-Lab/GroceryDB/tree/main/data nature.com nature.com/articles/s41467-023-37457-1 truefood.tech truefood.tech/ truefood.tech truefood.tech/about?store=all

Jan 2025

Hurricane landfalls.

NOAA’s Hurricane Research Division maintains a table of hurricanes that have made landfall on the continental US since the 1850s. It records the year and month of landfall, designated name, states affected, the highest Saffir-Simpson category, central pressure at landfall, and maximum sustained wind speed. The division publishes another table containing more details — such as the full date, latitude, and longitude of landfall — but with a gap in the late 1970s to early 1980s. [h/t Michael Ferragamo + Dale Debber]

aoml.noaa.gov aoml.noaa.gov/hurricane-research-division/ aoml.noaa.gov aoml.noaa.gov/hrd/hurdat/All_U.S._Hurricanes.html nhc.noaa.gov nhc.noaa.gov/aboutsshws.php aoml.noaa.gov aoml.noaa.gov/hrd/hurdat/UShurrs_detailed.html

Jan 2025

Private schools.

The National Center for Education Statistics’s Private School Universe Survey has been gathering data about private elementary and secondary schools every two years since the 1989–90 school year. It collects information on “religious orientation; level of school; size of school; length of school year, length of school day; total enrollment (K-12); number of high school graduates, whether a school is single-sexed or coeducational and enrollment by sex; number of teachers employed; program emphasis” and more. In the latest data, covering the 2021–22 school year, “there were 29,727 private schools, enrolling 4,731,303 students and employing 482,571 full-time teachers”. As seen in: ProPublica’s Private School Demographics lookup tool (webinar scheduled for January 31) and its reporting on “segregation academies”.

nces.ed.gov nces.ed.gov/ nces.ed.gov nces.ed.gov/surveys/pss/index.asp nces.ed.gov nces.ed.gov/surveys/pss/pssdata.asp projects.propublica.org projects.propublica.org/private-school-demographics propublica.org propublica.org/events/how-to-use-our-private-school-demographics-news-app propublica.org propublica.org/series/segregation-academies

Jan 2025

Hyperlocal Trump/Harris results.

Earlier this month, colleagues at The New York Times published “An Extremely Detailed Map of the 2024 Election” and made the underlying data available to download. The effort “currently includes results for more than 110,000 precincts, or 73 percent of all votes, and will be updated as more data is collected.” The dataset lists each precinct’s state, county FIPS code, votes received by Kamala Harris, votes received by Donald Trump, and total votes (including third parties and write-ins). It also provides each precinct’s geographical boundaries, derived from a mix of official sources and estimations. Previously: “An Extremely Detailed Map of the 2020 Election” and the data behind it (DIP 2021.02.10). See also: Precinct-level election results for 2020, 2018, 2016, and 2012 from the Voting and Election Science Team.

nytimes.com nytimes.com/interactive/2025/us/elections/2024-election-map-precinct-results.html nytimes.com nytimes.com/2025/01/15/us/elections/2024-election-map-data.html github.com github.com/nytimes/presidential-precinct-map-2024 nytimes.com nytimes.com/interactive/2021/upshot/2020-election-map.html github.com github.com/TheUpshot/presidential-precinct-map-2020 data-is-plural.com data-is-plural.com/archive/2021-02-10-edition/ dataverse.harvard.edu dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/K7760H election.lab.ufl.edu election.lab.ufl.edu/precinct-data/
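
For readers who want to turn the per-precinct vote counts into margins, here is a minimal sketch. The field names (`votes_harris`, `votes_trump`, `votes_total`) and the sample rows are hypothetical stand-ins for this example; check the repository's documentation for the actual column names before adapting it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Precinct:
    # Hypothetical field names; the published files may differ.
    state: str
    county_fips: str
    votes_harris: int
    votes_trump: int
    votes_total: int  # includes third parties and write-ins


def margin_points(p: Precinct) -> Optional[float]:
    """Harris-minus-Trump margin, in percentage points of all votes cast.

    Returns None for precincts with no recorded votes.
    """
    if p.votes_total == 0:
        return None
    return 100.0 * (p.votes_harris - p.votes_trump) / p.votes_total


def county_totals(precincts):
    """Sum (harris, trump, total) votes per county FIPS code."""
    totals = {}
    for p in precincts:
        h, t, n = totals.get(p.county_fips, (0, 0, 0))
        totals[p.county_fips] = (
            h + p.votes_harris,
            t + p.votes_trump,
            n + p.votes_total,
        )
    return totals


# Made-up rows for illustration only; these are not real results.
sample = [
    Precinct("XX", "00001", 300, 200, 520),
    Precinct("XX", "00001", 150, 250, 410),
    Precinct("XX", "00003", 0, 0, 0),
]

for p in sample:
    print(p.county_fips, margin_points(p))
print(county_totals(sample))
```

Dividing by total votes rather than the two-party sum is a design choice here; it keeps third-party and write-in ballots in the denominator, matching the way the dataset reports totals.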