The world’s most visited websites attract staggering volumes of traffic, delivering petabytes of data to users worldwide. Given that images constitute over 60% of today’s Internet traffic, it’s reasonable to expect the top 100 websites to prioritize lightweight image content. After all, optimizing images could potentially save millions of dollars in annual bandwidth and storage expenses. Let’s delve into how visual content is managed on these influential platforms.
Methodology
The most reliable method to retrieve all images from these websites is by employing a real browser on a desktop machine. Consequently, we manually crawled the websites in this test and exported all network activity as HAR (HTTP Archive) files. While Puppeteer with the stealth
plugin and an auto-rotating proxy typically yields satisfactory results for most use cases, we wanted to ensure absolute accuracy in capturing all visuals. After all, the quality of test results is contingent upon the integrity of the input data.
The collected HAR files were then loaded into our test application and parsed. Subsequently, we extracted all requests with a matching image/*
MIME type, filtering out those with a body length of less than 100 bytes, which primarily consisted of 1x1 GIF tracking pixels.
A notable advantage of utilizing HAR files is that all binary responses are already base64
-encoded within them. Thus, we simply needed to load them into temporary buffers and save the binaries to disk:
(async () => {
const src = Buffer.from(response.content.text, "base64");
try {
await fs.writeFile(`./sites/${site_name}/images/${file_name}`, src);
} catch(err) {
console.error(err);
}
})();
Test Results
We processed all the source images through the Optidash API. Overall, we collected and optimized 8353 images, achieving an impressive 32.61% reduction in the initial file size.
The optimization and recompression outcomes were quantified using the following formula:
100 * (original_size - optimized_size) / original_size
Popular Image Formats
JPEG maintains its dominance as the most commonly utilized image format, commanding a 53% share, followed by PNG at 23% and WebP at 16%. It’s noteworthy that certain websites, such as StackOverflow, Amazon AWS, and Atlassian, heavily favor SVGs, employing them almost exclusively.
JPEG Images
- The most commonly used (24% of all the JPEGs) encoding quality is
87
followed by77
(8.6%) and100
(8.3%). - Only 31% of JPEG images use progressive coding.
4:2:0
is the most commonly used sampling scheme (69%) followed by4:4:4
(30%) and4:2:2
(1%)
The statistics above reveal significant potential for further reducing the file size of JPEG images. By implementing progressive coding alongside the 4:2:0
sampling scheme and determining the ideal Q value based on each image, substantial savings can be achieved.
The Rise of WebP images
It’s encouraging to see that exactly 33.33% of the top 100 websites prioritize WebP as their primary image format. However, serving WebP images is just one part of the equation; optimizing them is equally crucial. We’ve managed to enhance all WebP content further using a near-lossless algorithm, resulting in significant savings.
Interesting Stats
- The total weight of visual content across all 100 websites is 228.96MB.
- The website with the largest number of images (397) is Rakuten.
- The website with the heaviest visual content (11.1MB) is Sina.
- The top five heaviest images were served on Sina (3.7MB), Reddit (3.6MB), IMDB (3.0MB), Office (2.8MB) and BBC (1.9MB).
Detailed Results
Below we present a detailed list of the top 100 websites along with optimization results. Please reach out to us if you want to get access to the dataset used for compiling the following table.
Rank | Site | Images | Before | After | Savings | Savings % |
01 | google.com | 5 | 92 kB | 81.8 kB | 10.2 kB | 11.12% |
02 | youtube.com | 54 | 815 kB | 751 kB | 64.2 kB | 7.88% |
03 | tmall.com | 154 | 15.4 MB | 7.99 MB | 7.43 MB | 48.16% |
04 | qq.com | 37 | 630 kB | 340 kB | 290 kB | 46.02% |
05 | baidu.com | 20 | 83.4 kB | 36.1 kB | 47.2 kB | 56.64% |
06 | facebook.com | 8 | 116 kB | 56.4 kB | 59.8 kB | 51.44% |
07 | sohu.com | 184 | 7.82 MB | 5.15 MB | 2.66 MB | 34.07% |
08 | taobao.com | 142 | 2.61 MB | 2.28 MB | 327 kB | 12.53% |
09 | yahoo.com | 146 | 2.84 MB | 2.3 MB | 541 kB | 19.05% |
10 | amazon.com | 213 | 2.84 MB | 2.4 MB | 434 kB | 15.31% |
11 | jd.com | 166 | 4.65 MB | 3.02 MB | 1.63 MB | 35.09% |
12 | 360.cn | 140 | 3.77 MB | 2.28 MB | 1.48 MB | 39.32% |
13 | wikipedia.org | 4 | 86.4 kB | 83.9 kB | 2.45 kB | 2.84% |
14 | login.tmall.com | 7 | 162 kB | 114 kB | 47.9 kB | 29.61% |
15 | weibo.com | 159 | 2.38 MB | 1.89 MB | 490 kB | 20.55% |
16 | live.com | 48 | 4.38 MB | 3.95 MB | 427 kB | 9.74% |
17 | zoom.us | 73 | 2.44 MB | 1.05 MB | 1.38 MB | 56.76% |
18 | reddit.com | 151 | 9.72 MB | 4.34 MB | 5.38 MB | 55.38% |
19 | sina.com.cn | 187 | 11.1 MB | 5.78 MB | 5.28 MB | 47.71% |
20 | netflix.com | 7 | 603 kB | 480 kB | 124 kB | 20.53% |
21 | microsoft.com | 23 | 275 kB | 266 kB | 9.14 kB | 3.32% |
22 | xinhuanet.com | 292 | 8.35 MB | 3.84 MB | 4.51 MB | 54.05% |
23 | okezone.com | 132 | 1.6 MB | 1.44 MB | 165 kB | 10.28% |
24 | vk.com | 9 | 182 kB | 181 kB | 613 B | 0.34% |
25 | office.com | 12 | 3.37 MB | 2.86 MB | 512 kB | 15.16% |
26 | instagram.com | 10 | 990 kB | 525 kB | 464 kB | 46.93% |
27 | myshopify.com | 2 | 5.28 kB | 5.06 kB | 218 B | 4.13% |
28 | csdn.net | 107 | 3.83 MB | 1.4 MB | 2.43 MB | 63.38% |
29 | alipay.com | 11 | 699 kB | 448 kB | 251 kB | 35.86% |
30 | yahoo.co.jp | 168 | 1.44 MB | 1.23 MB | 209 kB | 14.52% |
31 | bing.com | 19 | 461 kB | 324 kB | 137 kB | 29.71% |
32 | bongacams.com | 84 | 1.6 MB | 1.5 MB | 102 kB | 6.38% |
33 | blogger.com | 43 | 1.5 MB | 1.4 MB | 103 kB | 6.88% |
34 | twitch.tv | 63 | 863 kB | 641 kB | 222 kB | 25.75% |
35 | aliexpress.com | 197 | 4 MB | 2.88 MB | 1.12 MB | 28.08% |
36 | google.com.hk | 5 | 91.9 kB | 81.7 kB | 10.2 kB | 11.13% |
37 | zhanqi.tv | 109 | 2.65 MB | 1.31 MB | 1.33 MB | 50.37% |
38 | naver.com | 53 | 1.56 MB | 904 kB | 656 kB | 42.05% |
39 | ebay.com | 16 | 227 kB | 146 kB | 81.1 kB | 35.69% |
40 | tianya.cn | 8 | 202 kB | 88.5 kB | 113 kB | 56.11% |
41 | livejasmin.com | 166 | 2.21 MB | 2 MB | 213 kB | 9.60% |
42 | amazon.co.jp | 247 | 4.57 MB | 4.18 MB | 387 kB | 8.47% |
43 | apple.com | 18 | 358 kB | 229 kB | 129 kB | 35.99% |
44 | google.co.in | 5 | 91.9 kB | 81.7 kB | 10.2 kB | 11.13% |
45 | chaturbate.com | 125 | 1.79 MB | 1.64 MB | 146 kB | 8.19% |
46 | adobe.com | 27 | 1.01 MB | 746 kB | 259 kB | 25.79% |
47 | china.com.cn | 111 | 6.3 MB | 2.91 MB | 3.39 MB | 53.80% |
48 | amazon.in | 314 | 4.91 MB | 4.43 MB | 480 kB | 9.77% |
49 | tribunnews.com | 151 | 1.2 MB | 1.03 MB | 171 kB | 14.25% |
50 | babytree.com | 77 | 1.89 MB | 862 kB | 1.03 MB | 54.38% |
51 | twitter.com | 15 | 298 kB | 262 kB | 36.1 kB | 12.13% |
52 | linkedin.com | 29 | 371 kB | 160 kB | 211 kB | 56.77% |
53 | msn.com | 181 | 3.79 MB | 2.08 MB | 1.72 MB | 45.30% |
54 | yandex.ru | 28 | 273 kB | 207 kB | 65.7 kB | 24.08% |
55 | sogou.com | 9 | 20 kB | 18.9 kB | 1.1 kB | 5.49% |
56 | huanqiu.com | 48 | 1.22 MB | 745 kB | 480 kB | 39.18% |
57 | aparat.com | 245 | 9.24 MB | 6.16 MB | 3.08 MB | 33.38% |
58 | yy.com | 188 | 5.41 MB | 3.34 MB | 2.07 MB | 38.31% |
59 | dropbox.com | 24 | 1.04 MB | 924 kB | 115 kB | 11.05% |
60 | wordpress.com | 38 | 1.66 MB | 1.66 MB | 1.21 kB | 0.07% |
61 | ok.ru | 21 | 1.88 MB | 1.13 MB | 750 kB | 39.91% |
62 | pornhub.com | 72 | 917 kB | 792 kB | 125 kB | 13.66% |
63 | mail.ru | 67 | 1.02 MB | 664 kB | 356 kB | 34.90% |
64 | whatsapp.com | 8 | 321 kB | 301 kB | 19.3 kB | 6.04% |
65 | google.co.jp | 5 | 91.9 kB | 81.7 kB | 10.2 kB | 11.13% |
66 | google.com.br | 5 | 91.9 kB | 81.7 kB | 10.2 kB | 11.13% |
67 | medium.com | 19 | 54.3 kB | 33.9 kB | 20.5 kB | 37.67% |
68 | imdb.com | 53 | 4.65 MB | 4.62 MB | 26.3 kB | 0.57% |
69 | err.tmall.com | 9 | 33.2 kB | 26.6 kB | 6.6 kB | 19.86% |
70 | imgur.com | 153 | 5.31 MB | 4.46 MB | 848 kB | 15.97% |
71 | 17ok.com | 165 | 7.97 MB | 6.29 MB | 1.68 MB | 21.13% |
72 | aws.amazon.com | 131 | 1.84 MB | 1.33 MB | 510 kB | 27.76% |
73 | cnn.com | 70 | 1.74 MB | 1.21 MB | 527 kB | 30.33% |
74 | spotify.com | 4 | 5.47 kB | 2.26 kB | 3.21 kB | 58.66% |
75 | indeed.com | 1 | 775 B | 775 B | 0 B | 0.00% |
76 | google.cn | 2 | 29.1 kB | 17.6 kB | 11.6 kB | 39.70% |
77 | nytimes.com | 131 | 6.67 MB | 4.35 MB | 2.31 MB | 34.70% |
78 | xvideos.com | 54 | 791 kB | 677 kB | 114 kB | 14.43% |
79 | jrj.com.cn | 132 | 2.41 MB | 1.44 MB | 963 kB | 40.00% |
80 | 1688.com | 234 | 9.84 MB | 5.19 MB | 4.65 MB | 47.26% |
81 | bbc.com | 68 | 9.7 MB | 7.47 MB | 2.22 MB | 22.93% |
82 | etsy.com | 9 | 349 kB | 276 kB | 72.9 kB | 20.92% |
83 | mama.cn | 102 | 3.32 MB | 1.54 MB | 1.78 MB | 53.69% |
84 | rakuten.co.jp | 397 | 3.2 MB | 2.45 MB | 749 kB | 23.41% |
85 | grid.id | 60 | 1.15 MB | 924 kB | 225 kB | 19.56% |
86 | amazon.de | 226 | 3.53 MB | 3.04 MB | 488 kB | 13.86% |
87 | udemy.com | 30 | 522 kB | 436 kB | 85.2 kB | 16.33% |
88 | amazon.co.uk | 305 | 4.8 MB | 4.05 MB | 746 kB | 15.55% |
89 | flipkart.com | 138 | 751 kB | 744 kB | 6.74 kB | 0.90% |
90 | paypal.com | 15 | 490 kB | 484 kB | 6.2 kB | 1.26% |
91 | google.de | 5 | 92 kB | 81.8 kB | 10.2 kB | 11.12% |
92 | roblox.com | 14 | 695 kB | 593 kB | 102 kB | 14.67% |
93 | okta.com | 30 | 438 kB | 340 kB | 98.4 kB | 22.46% |
94 | soundcloud.com | 28 | 2.7 MB | 2.17 MB | 527 kB | 19.53% |
95 | pixnet.net | 63 | 522 kB | 366 kB | 157 kB | 29.99% |
96 | office365.com | 11 | 1.65 MB | 492 kB | 1.16 MB | 70.15% |
97 | detik.com | 131 | 4.27 MB | 2.16 MB | 2.11 MB | 49.45% |
98 | booking.com | 59 | 1.28 MB | 1.12 MB | 162 kB | 12.67% |
99 | atlassian.com | 49 | 751 kB | 686 kB | 65 kB | 8.66% |
100 | stackoverflow.com | 32 | 181 kB | 139 kB | 42.3 kB | 23.34% |