Data Set A 10,000-cat database is available for research download.

Published on August 21st, 2014 | by Travis Korte


10,000 Cat Pictures (For Science)

Researchers from Microsoft Research Asia and the Chinese University of Hong Kong created a database with 10,000 photos containing cat heads to test image recognition algorithms, and now the data set is available freely for research purposes. The photos, which the researchers downloaded mainly from Flickr, are paired with data files that specify the location of each cat’s eyes, mouth, and ears. Using these features with their algorithmic method, the researchers were able to distinguish a cat photo from a non-cat photo with better accuracy than previous methods. The creators of the collection, which is affectionately known as the CAT Database, hope their data might be useful to other machine learning researchers working in facial recognition, image comprehension, and other computer vision topics.

Get the data.

Tags: , , , , ,

About the Author

Travis Korte is a research analyst at the Center for Data Innovation specializing in data science applications and open data. He has a background in journalism, computer science and statistics. Prior to joining the Center for Data Innovation, he launched the Science vertical of The Huffington Post and served as its Associate Editor, covering a wide range of science and technology topics. He has worked on data science projects with HuffPost and other organizations. Before this, he graduated with highest honors from the University of California, Berkeley, having studied critical theory and completed coursework in computer science and economics. His research interests are in computational social science and using data to engage with complex social systems. You can follow him on Twitter @traviskorte.

Leave a Reply

Your email address will not be published. Required fields are marked *

Back to Top ↑

Show Buttons
Hide Buttons