Published on December 13th, 2013 | by Travis Korte


Reddit’s Datasets

One of the most popular features of the social news site Reddit is its variety of discussion categories (called subreddits) organized around different niche interests. These include popular mainstream categories, like Pics, as well as smaller pages for specific groups, like PetMice. So it is no wonder that, among the more than 12,000 active subreddits, there is one named Datasets.

The subreddit has amassed around 6,500 subscribers since its creation four years ago, but it remains relatively little-known.

Highlights include a database of over 600 battles fought between 1600 and 1973, the last words of every inmate executed in Texas since 1984, and 10,000 photos of cats. The cat photos may sound frivolous, but such datasets have been used recently for cutting-edge machine vision research.

Get the datasets.

About the Author

Travis Korte is a research analyst at the Center for Data Innovation specializing in data science applications and open data. He has a background in journalism, computer science and statistics. Prior to joining the Center for Data Innovation, he launched the Science vertical of The Huffington Post and served as its Associate Editor, covering a wide range of science and technology topics. He has worked on data science projects with HuffPost and other organizations. Before this, he graduated with highest honors from the University of California, Berkeley, having studied critical theory and completed coursework in computer science and economics. His research interests are in computational social science and using data to engage with complex social systems. You can follow him on Twitter @traviskorte.
