Accurate, Focused Research on Law, Technology and Knowledge Discovery Since 2002

Exploring Goodreads Data: An Analysis of 10 Million Books

Ammar Alyousfi’s Blog: “Goodreads is one of the largest book websites on the internet. It has data about millions and millions of books from different genres and in many languages. It’s hard not to find a book on Goodreads whether it’s published hundreds of years ago or just a few days ago. Today, I present the analysis results of more than 10 million books on Goodreads. In fact, the original dataset that I used had 50+ million books but I excluded 40 million of them for data quality reasons mentioned later in this article. Goodreads allows you to search for any book and view its info, but there is no way to see all the available books and interact with them. Using the data in this analysis, however, I was able to do just that with millions of titles. Below, I’ll share some interesting findings and provide a method for further exploration at the end. Continue reading to know more about the analysis and the data or you can jump directly to the results section. But don’t also forget to read about how to get the most out of this analysis.”

Sorry, comments are closed for this post.