Hey all, I am enjoying all of the data in this amazing archive, but I am afraid there is many "noise" of ads in the data. I guess the data includes all the network calls for ads and banners that are running on the domains, and they can take some 40% of the site's performance (e.g, 1 banner can have a "waterfall" of ads inside its JS and call around 10 different servers until it gets an ad).
So few questions:
1- Does the data includes network calls from ads technologies ?
2- How can one make a "clean" test/query and understand the real performance of the web (without the ads)?