Top MIME Types based on HTTPArchive

The following query would return the top MIME types seen on HTTPArchive

select count(requestid) as ct, mimeType 
 FROM [httparchive:runs.latest_requests]
  group by mimeType 
  having ct > 10000 
  order by ct desc;

same for mobile

select count(requestid) as ct, mimeType 
 FROM [httparchive:runs.latest_requests_mobile]
  group by mimeType 
  having ct > 10000 
  order by ct desc;

Just wanted to see if I am missing anything above? Also isnt MIME type the same as response Content-Type. If not the same queries would be

select count(requestid) as ct, resp_content_type 
 FROM [httparchive:runs.latest_requests]
  group by resp_content_type
  having ct > 10000 
  order by ct desc;

Same for mobile

select count(requestid) as ct, resp_content_type 
 FROM [httparchive:runs.latest_requests_mobile]
  group by resp_content_type
  having ct > 10000 
  order by ct desc;

Just wondering how does the community go about solving such issues and if what I did was valid?

You dig into the data… :slight_smile:

select domain(url), resp_server, mimeType, resp_content_type
 FROM [httparchive:runs.latest_requests]
 LIMIT 1000;

Paging through the results of above query shows that the difference is likely due to the extra charset specification in resp_content_type. The delta is not big, I think that would account for it…