Alexa rank attribute in the pages table


#1

The rank attribute in the pages table in appears to change from one run to another as shown below (but remains constant 15 Oct 17 onwards).

As per the following thread, the updated Alexa top million was non longer available
[Number of domains in HTTPArchive] and we could expect to see the same URLs and page ranks in all of the crawls this year.

So is the cutoff date 15 Oct 17?

SELECT
rank
FROM
httparchive.summary_pages.*
WHERE
_TABLE_SUFFIX >= ‘2017_08_01’ AND
ENDS_WITH(_TABLE_SUFFIX, ‘desktop’) AND
url IN (‘http://www.onlinesbi.com/’)

image

SELECT
SUBSTR(_TABLE_SUFFIX, 0, 10) AS date,
rank
FROM
httparchive.summary_pages.*
WHERE
_TABLE_SUFFIX >= ‘2017_08_01’ AND
ENDS_WITH(_TABLE_SUFFIX, ‘desktop’) AND
url IN (‘http://www.sbi.co.in/’)
ORDER BY
date

image


#2

Yes that sounds right.