Common Crawl is a non-profit foundation dedicated to providing an open repository of web crawl data that can be accessed and analyzed by everyone.
Check out Web Data Commons' new hyperlink graph analysis of the 2012 Common Crawl corpus!
The talented team at Web Data Commons extracted and analyzed the hyperlink graph
within the 2012 Common Crawl corpus. You can see the results on their website.