Table of Contents
What is Google crawl?
How does Google crawler work?
JavaScript and HTML Rendering
When to use JavaScript for scraping?
in conclusion
Home Web Front-end JS Tutorial Will Google crawl JavaScript that contains body content?

Will Google crawl JavaScript that contains body content?

Aug 25, 2023 pm 02:33 PM

Google 是否会抓取包含正文内容的 JavaScript

Historically, search engine crawlers like Googlebot could only read static HTML source code and were unable to scan and index material written dynamically using JavaScript. However, this has changed with the rise of JavaScript-rich websites and frameworks such as Angular, React, and Vue.JS, as well as single-page applications (SPA) and progressive web applications (PWA). In order to display web pages correctly before indexing them, Google modified and discontinued its previous AJAX crawling technology. While Google can generally crawl and index most JavaScript information, they recommend against using client-side solutions because JavaScript "is difficult to process, and not all search engine crawlers can process it correctly or quickly." ”

What is Google crawl?

Google and other search engines use software called Google crawlers (also known as search bots or spiders) to scan the web. In other words, it "crawls" the Internet from page to website, looking for fresh or updated content that isn't already in Google's database.

Each search engine has a unique collection of crawlers. For Google, there are more than 15 different types of crawlers, with Googlebot being the main one. Since Googlebot does crawling and indexing, we'll examine its operation in more detail.

How does Google crawler work?

No search engine (including Google) maintains a central register of URLs and updates that URL every time a new page is created. This means that Google has to search the internet for new pages rather than automatically "alert" them. Googlebot is constantly prowling the Internet, looking for new web pages to add to Google's inventory of existing web pages.

Once a new website is found, Googlebot renders (or "visualizes") the site in the browser by loading all HTML, third-party code, JavaScript, and CSS. Search engines use this data saved in databases to index and rank pages. The page will be added to the Google index, which is an additional, very large Google database if it is indexed.

JavaScript and HTML Rendering

Lengthy code can be difficult for Googlebot to process and render. If the code is not clean, the crawler may not be able to render your site correctly, in which case it will be treated as empty.

Regarding JavaScript rendering, please keep in mind that the language is evolving rapidly and Googlebot may sometimes stop supporting the latest version. Make sure your JavaScript is compatible with Googlebot to avoid showing your site wrongly. Ensure JavaScript loads quickly. Googlebot will not render and index script-generated material if it takes longer than five seconds to load.

When to use JavaScript for scraping?

We still recommend selectively using JavaScript crawling when first analyzing a site for JavaScript, although Google will typically render every page. JavaScript is used to leverage known client-side dependencies for auditing and during deployment on large sites.

All resources (including JavaScript, CSS, and images) must be selectively crawled to display each web page and build the DOM in a headless browser behind the scenes. JavaScript crawling is slower and more labor intensive.

While this isn't a problem for smaller sites, it can have a significant impact on larger sites with hundreds or even millions of pages. If your website doesn't rely heavily on JavaScript to dynamically change web pages, there's no need to spend time or resources.

When processing JavaScript and web pages with dynamic content (DOM), the crawler must read and evaluate the Document Object Model. After all the code is loaded and processed, a fully displayed version of such a website must also be generated. Browsers are the easiest tool for us to view displayed web pages. For this reason, crawling JavaScript is sometimes described as using a "headless browser."

in conclusion

There will be more JavaScript in the next few years because it is here to stay. JavaScript can coexist peacefully with SEOs and crawlers as long as you discuss it with SEO early on when creating your website architecture. Crawlers are still just replicas of the behavior of actual search engine bots. In addition to JavaScript crawlers, we strongly recommend using log file analysis, Google's URL inspection tool, or mobile-friendly testing tools to understand what Google can crawl, render, and index.

The above is the detailed content of Will Google crawl JavaScript that contains body content?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

What should I do if I encounter garbled code printing for front-end thermal paper receipts? What should I do if I encounter garbled code printing for front-end thermal paper receipts? Apr 04, 2025 pm 02:42 PM

Frequently Asked Questions and Solutions for Front-end Thermal Paper Ticket Printing In Front-end Development, Ticket Printing is a common requirement. However, many developers are implementing...

Who gets paid more Python or JavaScript? Who gets paid more Python or JavaScript? Apr 04, 2025 am 12:09 AM

There is no absolute salary for Python and JavaScript developers, depending on skills and industry needs. 1. Python may be paid more in data science and machine learning. 2. JavaScript has great demand in front-end and full-stack development, and its salary is also considerable. 3. Influencing factors include experience, geographical location, company size and specific skills.

How to merge array elements with the same ID into one object using JavaScript? How to merge array elements with the same ID into one object using JavaScript? Apr 04, 2025 pm 05:09 PM

How to merge array elements with the same ID into one object in JavaScript? When processing data, we often encounter the need to have the same ID...

Demystifying JavaScript: What It Does and Why It Matters Demystifying JavaScript: What It Does and Why It Matters Apr 09, 2025 am 12:07 AM

JavaScript is the cornerstone of modern web development, and its main functions include event-driven programming, dynamic content generation and asynchronous programming. 1) Event-driven programming allows web pages to change dynamically according to user operations. 2) Dynamic content generation allows page content to be adjusted according to conditions. 3) Asynchronous programming ensures that the user interface is not blocked. JavaScript is widely used in web interaction, single-page application and server-side development, greatly improving the flexibility of user experience and cross-platform development.

The difference in console.log output result: Why are the two calls different? The difference in console.log output result: Why are the two calls different? Apr 04, 2025 pm 05:12 PM

In-depth discussion of the root causes of the difference in console.log output. This article will analyze the differences in the output results of console.log function in a piece of code and explain the reasons behind it. �...

How to achieve parallax scrolling and element animation effects, like Shiseido's official website?
or:
How can we achieve the animation effect accompanied by page scrolling like Shiseido's official website? How to achieve parallax scrolling and element animation effects, like Shiseido's official website? or: How can we achieve the animation effect accompanied by page scrolling like Shiseido's official website? Apr 04, 2025 pm 05:36 PM

Discussion on the realization of parallax scrolling and element animation effects in this article will explore how to achieve similar to Shiseido official website (https://www.shiseido.co.jp/sb/wonderland/)...

Can PowerPoint run JavaScript? Can PowerPoint run JavaScript? Apr 01, 2025 pm 05:17 PM

JavaScript can be run in PowerPoint, and can be implemented by calling external JavaScript files or embedding HTML files through VBA. 1. To use VBA to call JavaScript files, you need to enable macros and have VBA programming knowledge. 2. Embed HTML files containing JavaScript, which are simple and easy to use but are subject to security restrictions. Advantages include extended functions and flexibility, while disadvantages involve security, compatibility and complexity. In practice, attention should be paid to security, compatibility, performance and user experience.

Is JavaScript hard to learn? Is JavaScript hard to learn? Apr 03, 2025 am 12:20 AM

Learning JavaScript is not difficult, but it is challenging. 1) Understand basic concepts such as variables, data types, functions, etc. 2) Master asynchronous programming and implement it through event loops. 3) Use DOM operations and Promise to handle asynchronous requests. 4) Avoid common mistakes and use debugging techniques. 5) Optimize performance and follow best practices.

See all articles