I’d like to remove obvious adult sites from my analysis. Searching these forums for “adult” doesn’t turn up anything helpful, and the
_adult_site column in
summary_pages is always false (e.g. in
My best approach so far has just been to remove records with
REGEXP_CONTAINS(url, r"porn|xxx|adult"). Is there a better way? Maybe a table I’m missing?