Categories
Software

Googlebot and 15MB stuff | GoogleSearchCentral Blog | Google Developers


Tuesday, June 28, 2022

Over the past few days, we’ve been asked a lot of questions about recent updates to Googlebot’s documentation.That is, Googlebot has documented that it “sees” only the first 15
Megabyte (MB) When fetching
Specific file type.. This threshold is not new. It has been around for years. I’ve added it to the documentation as it can be useful when debugging and rarely changes.

This limitation is
Part-Time Job (Content) Received for the first request made by Googlebot, not the reference resource in the page.For example, when opened , The browser may first download the bytes of the HTML file and then request more external JavaScript, images, or anything else referenced in the URL in the HTML, based on those bytes. Googlebot does the same.

What does this 15MB limit mean to me?
There is almost nothing.There is
Very few pages

On the internet with a large size. Dear readers, you are unlikely to be the owner.
The median size of an HTML file is about 1/500: 30 Kilobyte (kB).. However, if you own an HTML page larger than 15 MB, you can move at least some inline scripts and CSS dust to an external file.

What happens to my content above 15 MB?
Content after the first 15MB will be removed by Googlebot and only the first 15MB will be transferred to the index.

Which content types does the 15 MB limit apply to?
The 15 MB limit applies to fetches made by Googlebot (Googlebot smartphones and Googlebot desktops) on fetch.
File types supported by Google Search..

Does this mean that Googlebot doesn’t recognize my images and videos?
No. Googlebot gets the videos and images referenced using URLs in HTML (for example)
<img src=" alt="cute puppy looking very disappointed" />
Apart from continuous fetches.

Is the data URI added to the HTML file size?
yes.use
Data URI

Since it is included in the HTML file, it affects the size of the HTML file.

How can I find out the size of the page?
There are several ways, but perhaps the easiest is to use your own browser and its developer tools. Load the page as you normally would, then launch the development tools,[ネットワーク]Switch to the tab. When you reload the page, you’ll see all the requests your browser needs to make to render the page.The top request is what you are looking for and the byte size of the page is[サイズ]It will be displayed in the column.

for example,
Chrome development tools
If the size column contains 150kB, then:

Chrome developer tools[ネットワーク]tab

If you are more adventurous, you can use curl
From the command line:

curl \
-A "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/103.0.0.0 Safari/537.36" \
-so /dev/null  -w '%{size_download}'

If you have any questions,
twitter
And
Search Central ForumAnd if you need more clarification about the documentation, please leave feedback on the page itself.





Standing Desk Reviews