Google has released Facets, an open-source data visualization tool that helps users explore large datasets in order to build better machine learning systems. With Facets, users can compare the distribution of values in their training dataset with that in their test set to find unexpected values and possible skews. Users can also explore relationships among data points by organizing them by feature, such as age, education, or sex. Google has provided examples to help users get started with the tool.
Visualizing Big Data to Create Better Algorithms
Michael McLaughlin is a research assistant at the Center for Data Innovation. He previously worked at Oracle and held internships at USA TODAY and in local government. Prior to joining the Center for Data Innovation, Michael graduated from Wake Forest University, where he majored in Communication with minors in Politics and International Affairs and in Journalism. He is currently pursuing his Master’s in Communication at Stanford University, specializing in Data Journalism.