The HTTP Archive tracks how the Web is built.

The 2692522254 is open source and the data is downloadable.
Write your own custom queries!