I recently started recording the CDN value in HTTP Archive. This is the CDN used for the base HTML page. This value is determined by WebPagetest (relevant code). This set of queries might be a controversial. If you see flaws or caveats please comment. One caveat is that “Google” includes a lot of blogs that run on Blogger (so classifying that as using Google as a “CDN” is debatable).
It’s interesting how Akamai leads for the topmost sites, but when we look across all 300K URLs Google and Cloudflare rise to the top. This might be due to cost of entry.
Top 1,000 Websites``` SELECT cdn, round(ratio*1000)/10 AS percent FROM ( SELECT cdn, count(*) AS total, RATIO_TO_REPORT(total) OVER() AS ratio FROM httparchive:runs.latest_pages where rank <= 1000 GROUP by cdn) order by percent desc; ```
Top 10,000 Websites(Change "1000" to "10000" in the query.)
Top 100,000 Websites(Change "1000" to "100000" in the query.)
Top 300,000 Websites(Change "1000" to "300000" in the query.)