Data Set Jeopardy host Alex Trebek poses with a winner.

Published on August 14th, 2014 | by


Fans Create Database of Over 200,000 Jeopardy Questions

Reddit users have created a machine-readable data set of over 200,000 Jeopardy questions. The data, which the dataset’s creators scraped from fan-created question repository J!-Archive, contains each question’s answer, along with category, dollar value, air date, and other data. One analysis using the data set showed how diverse Jeopardy’s question categories are: the 100 most commonly used categories span only 11 percent of total questions asked. The creator of that analysis noted that this extreme amount of variation “has given me a lot of sympathy for IBM’s Jeopardy!-playing robot Watson.”

Get the data.

Photo: Queen’s University

Tags: , , , , ,

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to Top ↑

Show Buttons
Hide Buttons