Why number of URLs are changing in each month?

yajavu · March 30, 2020, 2:49pm

We have each month different number of URLs in httparchive dataset. This is because Google provides in its dataset differently or does HTTP Archive do some filtering?

rviscomi · March 30, 2020, 5:09pm

The URLs that HTTP Archive tests each month come from the latest Chrome UX Report (CrUX) dataset. CrUX is a collection of origins that reflect real UX trends on the web, so as usage changes the HTTP Archive dataset will fluctuate.

yajavu · April 21, 2020, 8:41am

Thanks! and do you know why the number of domains in CrUX varies?

rviscomi · April 21, 2020, 4:38pm

CrUX is a reflection of real world web usage month-to-month.

yajavu · April 22, 2020, 9:49am

and is there any information regarding discrepancy between Alexa and CrUX. I checked some URLs there are in both dataset. But I couldn’t find any info for example how many URL in Alexa is also in CrUX?

charlie.clark · April 22, 2020, 10:02am

Different datasets slurped using different user bases and updated
constantly so such comparisons are not really meaningful even if
possible.

Charlie

yajavu · April 22, 2020, 10:04am

And do you know if google provides also a ranking in CrUX?

charlie.clark · April 22, 2020, 10:21am

I’m not sure what you mean by “ranking” but you can find out everything
you need about CrUX from the website.

Charlie

rviscomi · April 22, 2020, 6:11pm

CrUX is not ranked. You can join the URLs in HTTP Archive with old Alexa datasets (see Getting domain rank with the new rank-less Chrome UX Report corpus) but the Alexa dataset only provides ranking at the domain granularity*.

About 2/3 of the domains in CrUX are also found in the ranked Alexa domain list.

Topic		Replies	Views
Alexa Rank for each url Meta	1	1951	January 27, 2020
Why are there no statistics for well-known websites on some dates?	2	955	February 24, 2019
Changes to the HTTP Archive corpus Announcements	0	7284	December 30, 2018
Use Tranco list instead of Alexa Top 1M Analysis	7	4347	March 13, 2019
Why are some of the crux pages missing? Analysis	1	75	February 21, 2026

Why number of URLs are changing in each month?

Related topics