Improving the HTTP Archive pipeline and dataset by 10x

One side effect of this migration is also worth mentioning: we’re planning to shut down legacy.httparchive.org. We deprecated it in 2018 and have put off the shutdown for as long as we could, but with this major infrastructure change it’s time to pull the plug.

What’s going away? The legacy.httparchive.org website will stop serving traffic as early as 90 days from now. Until then, you’ll get an annoying upgrade prompt :smile:. The downloadable CSV and database dumps will not be updated after April 2022 and will be completely inaccessible once the website stops serving traffic. All of the raw data will continue to exist in Google Cloud Storage, and you can export BigQuery tables in CSV format if needed.
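
If you relied on the CSV dumps, here is a minimal sketch of one way to do the BigQuery CSV export yourself. It only builds a BigQuery `EXPORT DATA` statement as a string, which you could then paste into the BigQuery console. The helper name `build_csv_export_sql`, the table name, and the bucket path are all placeholders made up for illustration; substitute the table you want and a Cloud Storage bucket you own.

```python
def build_csv_export_sql(table: str, destination_uri: str) -> str:
    """Return a BigQuery EXPORT DATA statement that writes `table` out as CSV.

    Hypothetical helper for illustration; it only formats SQL text and does
    not talk to BigQuery.
    """
    # EXPORT DATA expects the destination URI to contain a single '*'
    # wildcard, because large results are split across several files.
    if destination_uri.count("*") != 1:
        raise ValueError("destination_uri needs exactly one '*' wildcard")
    return (
        "EXPORT DATA OPTIONS(\n"
        f"  uri='{destination_uri}',\n"
        "  format='CSV',\n"
        "  overwrite=true,\n"
        "  header=true\n"
        ") AS\n"
        f"SELECT * FROM `{table}`"
    )


# Placeholder names: replace with a real table and your own bucket.
sql = build_csv_export_sql(
    "my-project.my_dataset.my_table",
    "gs://my-bucket/exports/my_table-*.csv",
)
print(sql)
```

Keep in mind that CSV is a flat format, so tables with nested or repeated columns would need to be flattened in the `SELECT` before exporting.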