Analyzing “How Google Search Works” Changes from Google
Google has made some new substantial changes to their “How Google Search Works” search documents for website owners. And as always when Google makes changes to important documents with impact on SEO, such as How Search Works and the Quality Rater Guidelines, there are some key insights SEOs can gleam from the new changes Google has made.Google對(duì)網(wǎng)站所有者的“ Google搜索的工作方式”搜索文檔進(jìn)行了一些新的重大更改。與往常一樣,當(dāng)Google更改對(duì)SEO有影響的重要文檔時(shí),例如Search How Works和Quality Rater Guidelines,SEO可以從Google所做的新更改中汲取一些重要見(jiàn)解。
?
Of particular note, Google detailing how it views a “document” as potentially comprising of more than one webpage, what Google considers primary and secondary crawls, as well as an update to their reference of “more than 200 ranking factors” which has been present in this document since 2013.特別值得注意的是,Google詳細(xì)介紹了其如何將“文檔”視為可能包含多個(gè)網(wǎng)頁(yè),Google認(rèn)為主要和次要抓取內(nèi)容,以及它們對(duì)“超過(guò)200個(gè)排名因素”的引用的更新 自2013年以來(lái)在本文檔中。
But here are the changes and what they mean for SEOs.但是,這里有變化及其對(duì)SEO的意義。
Contents
l?1?Crawling
l?1.1?Improving Your Crawling
l?2?The Long Version
l?3?Crawling
l?3.1?How does Google find a page?
l?3.2?Improving Your Crawling
l?4?Indexing
l?4.1?Improving your Indexing
l?4.1.1?What is a document?
l?5?Serving Results
l?6?Final Thoughts
l?6.0.1?Jennifer Slegg
l?6.0.2?Latest posts by Jennifer Slegg (see all)
-------------------------------------------------------------
Crawling
Google has greatly expanded this section.谷歌已經(jīng)大大擴(kuò)展了本節(jié)。
They made a slight change to wording, with “some pages are known because Google has already crawled them before” changed to “some pages are known because Google has already visited them before.”? ?This is a fairly minor change, primarily because Google decided to include an expanded section detailing what crawling actually is. 他們對(duì)措辭做了些微更改,將``某些頁(yè)面已知,因?yàn)镚oogle之前已經(jīng)對(duì)其進(jìn)行了爬網(wǎng)''更改為``某些頁(yè)面已知,因?yàn)镚oogle之前已經(jīng)對(duì)其進(jìn)行了訪問(wèn)''。這是一個(gè)相當(dāng)小的更改,主要是因?yàn)镚oogle決定包括一個(gè)擴(kuò)展部分,詳細(xì)說(shuō)明實(shí)際的爬網(wǎng)。
Google removed:
This process of discovery is called crawling. 發(fā)現(xiàn)的過(guò)程稱為爬網(wǎng)。
The removal of the crawling definition was simply because it was redundant. ?In Google’s expanded crawling section, they included a much more detailed definition and description of crawling instead. 刪除爬網(wǎng)定義僅僅是因?yàn)樗嵌嘤嗟?。在Google的展開(kāi)的抓取部分中,它們包含了更詳細(xì)的抓取定義和說(shuō)明。
The added definition:
Once Google discovers a page URL, it visits, or crawls, the page to find out what’s on it. Google renders the page and analyzes both the text and non-text content and overall visual layout to decide where it should appear in Search results. The better that Google can understand your site, the better we can match it to people who are looking for your content. Google發(fā)現(xiàn)網(wǎng)頁(yè)網(wǎng)址后,便會(huì)訪問(wèn)或爬網(wǎng)該網(wǎng)頁(yè)以查找其中的內(nèi)容。Google渲染頁(yè)面并分析文本和非文本內(nèi)容以及整體視覺(jué)布局,以決定頁(yè)面應(yīng)出現(xiàn)在搜索結(jié)果中的位置。Google越了解您的網(wǎng)站,我們就越能將其與正在尋找您內(nèi)容的人匹配。
There is still a great debate on how much page layout is taken into account. ?There was the page layout algo that was released many years, in order to penalize content that was pushed well below the fold in order to increase the odds a visitor might click on an advertisement that appeared above the fold instead. ?But with more traffic moving to mobile, and the addition of mobile first indexing, the importance of above and below the fold for on page layout seemingly was less important. 關(guān)于要考慮多少頁(yè)面布局仍存在很大爭(zhēng)議。有一種發(fā)布了多年的頁(yè)面布局算法,目的是對(duì)被遠(yuǎn)遠(yuǎn)低于首屏的內(nèi)容進(jìn)行處罰,以增加訪問(wèn)者點(diǎn)擊首屏上出現(xiàn)的廣告的幾率。但是,隨著越來(lái)越多的流量轉(zhuǎn)移到移動(dòng)設(shè)備上,并增加了移動(dòng)設(shè)備首次索引功能,頁(yè)面布局上下折疊的重要性似乎已變得不那么重要了。
When it comes to page layout and mobile first, Google says:在頁(yè)面布局和移動(dòng)優(yōu)先方面,Google表示:
Don’t let ads harm your mobile page ranking. Follow the?Better Ads Standard?when displaying ads on mobile devices. For example, ads at the top of the page can take up too much room on a mobile device, which is a bad user experience. 不要讓廣告損害您的移動(dòng)頁(yè)面排名。在移動(dòng)設(shè)備上展示廣告時(shí),請(qǐng)遵循“更好的廣告標(biāo)準(zhǔn)”。例如,頁(yè)面頂部的廣告可能會(huì)在移動(dòng)設(shè)備上占用過(guò)多的空間,這是糟糕的用戶體驗(yàn)。
But in How Google Search Works, Google is specifically calling attention to the “overall visual layout” with “where it should appear in Search results.”?但是在“ Google搜索的工作方式”中,Google特別呼吁人們注意“整體視覺(jué)布局”,即“其應(yīng)出現(xiàn)在搜索結(jié)果中的位置”。
It also brings attention to “non-text” content. ?While the most obvious of this refers to image content, the referral to it is quite open ended. ?Could this refer to OCR as well, which we know Google has been dabbling in? 它還引起對(duì)“非文本”內(nèi)容的關(guān)注。雖然最明顯的是圖像內(nèi)容,但對(duì)它的引用是開(kāi)放式的。難道這也指OCR,我們知道Google一直在涉足?
Improving Your Crawling
Under the “to improve your site crawling” section, Google has expanded this section significantly as well. 在“改善您的網(wǎng)站抓取”部分下,Google也在此部分進(jìn)行了顯著擴(kuò)展。
Google has added this point:
Verify that Google can reach the pages on your site, and that they look correct. Google accesses the web as an anonymous user (a user with no passwords or information). Google should also be able to see all the images and other elements of the page to be able to understand it correctly. You can do a quick check by typing your page URL in the Mobile-Friendly test tool. 確認(rèn)Google可以訪問(wèn)您網(wǎng)站上的頁(yè)面,并且看起來(lái)正確。Google以匿名用戶(沒(méi)有密碼或信息的用戶)訪問(wèn)網(wǎng)絡(luò)。Google還應(yīng)該能夠查看頁(yè)面中的所有圖像和其他元素,以便能夠正確理解它。您可以通過(guò)在適用于移動(dòng)設(shè)備的測(cè)試工具中輸入頁(yè)面網(wǎng)址來(lái)進(jìn)行快速檢查。
This is a good point – so many new site owners end up accidentally blocking Googlebot from crawling or not realizing their site is set to be only viewable by logged in users only. ?This makes it clear that site owners should try viewing their site without also being logged into it, to see if there are any unexpected accessibility or other issues that aren’t note when logged in as an admin or high level user. 這是一個(gè)好點(diǎn)-許多新的網(wǎng)站所有者最終無(wú)意間阻止了Googlebot抓取,或者沒(méi)有意識(shí)到自己的網(wǎng)站被設(shè)置為只能由登錄用戶查看。這清楚表明,網(wǎng)站所有者應(yīng)嘗試在未登錄的情況下查看其網(wǎng)站,以查看是否有意外的可訪問(wèn)性或其他以管理員或高級(jí)用戶身份登錄時(shí)未注意到的問(wèn)題。
Also recommending site owners check their site via the Mobile-Friendly testing tool is good, since even seasoned SEOs use the tool to quickly see if there are Googlebot specific issues with how Google is able to see, render and crawl a specific webpage – or a competitor’s page. 此外,建議網(wǎng)站所有者通過(guò)移動(dòng)設(shè)備友好的測(cè)試工具檢查其網(wǎng)站是否良好,因?yàn)榧词故墙?jīng)驗(yàn)豐富的SEO也會(huì)使用該工具來(lái)快速查看Googlebot在查看,呈現(xiàn)和抓取特定網(wǎng)頁(yè)方面是否存在特定問(wèn)題,或者 競(jìng)爭(zhēng)對(duì)手的頁(yè)面。
Google expanded their specific note about submitting a single page to the index. Google擴(kuò)展了有關(guān)將單個(gè)頁(yè)面提交到索引的特定注釋。
If you’ve created or updated a single page, you can submit an individual?URL to Google. To tell Google about many new or updated pages at once, use a sitemap. 如果您創(chuàng)建或更新了一個(gè)頁(yè)面,則可以向Google提交一個(gè)單獨(dú)的URL。要一次將許多新頁(yè)面或更新頁(yè)面告訴Google,請(qǐng)使用Sitemap。
Previously, it just mentioned submitting changes to a single page using the submit URL tool. ?This just adds clarification to those who are newer to SEO that they do not need to submit every single new or updated pages to Google individually, but that using sitemaps is the best way to do that. ?There have definitely been new site owners who add each page to Google using that tool because they don’t realize sitemaps is a thing. ?But part of this is that WordPress is such a prevalent way to create a new website, yet it does not have native support for sitemaps (yet), so site owners need to either install a specific sitemaps plugin or use one of the many SEO tool plugins that offer sitemaps as a feature. 以前,它只是提到使用“提交URL”工具將更改提交到單個(gè)頁(yè)面。這只是向那些剛接觸SEO的人增加了澄清,即他們不需要分別向Google提交每個(gè)新的或更新的頁(yè)面,但是使用站點(diǎn)地圖是最好的方法。肯定有新的網(wǎng)站所有者會(huì)使用該工具將每個(gè)頁(yè)面添加到Google,因?yàn)樗麄儧](méi)有意識(shí)到站點(diǎn)地圖是一回事。但這部分是因?yàn)閃ordPress是創(chuàng)建新網(wǎng)站的一種普遍方式,但是它還沒(méi)有對(duì)站點(diǎn)地圖的本機(jī)支持(因此),因此站點(diǎn)所有者需要安裝特定的站點(diǎn)地圖插件或使用眾多SEO工具之一 提供站點(diǎn)地圖功能的插件。
This new change also highlights using the tool for creating pages as well, instead of just the previous reference of “changes to a single page.”?這項(xiàng)新更改還強(qiáng)調(diào)了使用該工具來(lái)創(chuàng)建頁(yè)面,而不僅僅是以前提到的“更改到單個(gè)頁(yè)面”。
Google has also made a change to the section about “if you ask Google to crawl only one page” section as well. ?They are now referencing what Google views as a “small site” – according to Google, ?a smaller site is one with less than 1,000 pages. Google還對(duì)“如果您要求Google僅抓取一頁(yè)”這一部分進(jìn)行了更改。他們現(xiàn)在引用的是Google所說(shuō)的“小型網(wǎng)站”-根據(jù)Google的說(shuō)法,較小的網(wǎng)站就是少于1000頁(yè)的網(wǎng)站。
Google also stresses the importance of a strong navigation structure, even for sites it considers “small.”? It says site owners of small sites can just submit their homepage to Google, “provided that Google can reach all your other pages by following a path of links that start from your homepage.” Google還強(qiáng)調(diào)了強(qiáng)大的導(dǎo)航結(jié)構(gòu)的重要性,即使對(duì)于它認(rèn)為“很小”的網(wǎng)站也是如此。它說(shuō),小型站點(diǎn)的站點(diǎn)所有者只需將其主頁(yè)提交給Google,“前提是Google可以通過(guò)遵循從您的主頁(yè)開(kāi)始的鏈接路徑來(lái)訪問(wèn)您的所有其他頁(yè)面。”
With so many sites being on WordPress, it is less likely that there will be random orphaned pages that are not accessible by following links from the homepage ?But depending on the specific WordPress theme used, sometimes there can be orphaned pages from pages being added but not manually added to the pages menu… in these cases, if a sitemap is used as well, those pages shouldn’t be missed even if not directly linked from the homepage. WordPress上的站點(diǎn)如此之多,不太可能出現(xiàn)隨機(jī)的孤立頁(yè)面,這些頁(yè)面無(wú)法通過(guò)跟隨主頁(yè)上的鏈接進(jìn)行訪問(wèn),但是根據(jù)所使用的特定WordPress主題,有時(shí)可以添加頁(yè)面中的孤立頁(yè)面,但不能 手動(dòng)添加到頁(yè)面菜單中…在這種情況下,如果還使用了站點(diǎn)地圖,則即使不直接從首頁(yè)鏈接也不會(huì)錯(cuò)過(guò)這些頁(yè)面。
In the “get your page linked to by another page” section, Google has added that links in “advertisements links that you pay for in other sites, links in comments, or other links that don’t follow the Google Webmaster Guidelines won’t be followed by Google.”? A small change, but Google is making it clear that it is a Google specific thing that these links won’t be followed, but they might be followed by other search engines. Google在“讓您的頁(yè)面鏈接到另一個(gè)頁(yè)面”部分中,添加了“您在其他網(wǎng)站上付費(fèi)的廣告鏈接”,“評(píng)論中的鏈接”或不遵循“ Google網(wǎng)站站長(zhǎng)指南”的其他鏈接中的鏈接 被Google跟蹤。” 這是一個(gè)很小的變化,但是Google明確表示Google明確規(guī)定這些鏈接不會(huì)被跟蹤,但是其他搜索引擎可能會(huì)跟蹤它們。
But perhaps the most telling part of this is at the end of the crawling section, Google adds:?但Google補(bǔ)充說(shuō),其中最有說(shuō)服力的部分是在爬網(wǎng)部分的末尾:
Google doesn’t accept payment to crawl a site more frequently, or rank it higher. If anyone tells you otherwise, they’re wrong. Google不接受付款來(lái)更頻繁地抓取網(wǎng)站或?qū)ζ渑琶?。如果有人告訴你,他們錯(cuò)了。
It has long been an issue with scammy SEO companies to guarantee first positioning on Google, to increase rankings or requiring payment to submit a site to Google. ?And with the ambiguous Google Partner badge for AdWords, many use the Google Partners badge to imply ?they are certified by Google for SEO and organic ranking purposes. ?That said, most of those who are reading the How Search Works probably are already aware of this. ?But nice to see Google add this in writing again, for times when SEOs need to prove to clients that there is not a “pay to win” option, outside of AdWords, or simply to show someone who might be falling for some scammy SEO company’s claims of Google rankings. 狡猾的SEO公司長(zhǎng)期以來(lái)一直要保證在Google上的排名第一,提高排名或需要付費(fèi)才能將網(wǎng)站提交給Google一直是一個(gè)問(wèn)題。而且由于AdWords的Google合作伙伴徽章含糊不清,許多人都使用Google合作伙伴徽章來(lái)暗示他們已通過(guò)Google的SEO和自然排名認(rèn)證。就是說(shuō),大多數(shù)閱讀“搜索方式”的人可能已經(jīng)意識(shí)到這一點(diǎn)。但是很高興看到Google再次以書(shū)面形式添加此內(nèi)容,有時(shí)SEO需要向客戶證明除了AdWords之外沒(méi)有“支付即贏”的選擇,或者只是向那些可能會(huì)欺騙某些SEO公司的人展示 Google排名的聲明。
The Long Version
Google then gets into what they call the “l(fā)ong version” of How Google Search Works, with more details on the above sections, covering more nuances that impact SEO. 然后,Google進(jìn)入所謂的“ Google搜索的工作方式”的“長(zhǎng)篇版”,在上述部分中提供了更多詳細(xì)信息,涵蓋了影響SEO的更多細(xì)微差別。
Crawling
Google has changed how they refer to the “algorithmic process”. ?Previously, it stated “Googlebot uses an algorithmic process: computer programs determine which sites to crawl, how often and how many pages to fetch from each site.”? Curiously, they removed the reference to “computer programs”, which provoked the question about which computer programs exactly Google was using. Google改變了他們對(duì)“算法過(guò)程”的稱呼。以前,它說(shuō)過(guò)“ Googlebot使用算法過(guò)程:計(jì)算機(jī)程序確定要爬網(wǎng)的站點(diǎn),從每個(gè)站點(diǎn)獲取的頻率和數(shù)量?!?奇怪的是,他們刪除了對(duì)“計(jì)算機(jī)程序”的引用,這引發(fā)了關(guān)于Google確切使用的計(jì)算機(jī)程序的問(wèn)題。
The new updated version simply states:
Googlebot uses an algorithmic process to determine which sites to crawl, how often, and how many pages to fetch from each site. Googlebot使用算法過(guò)程來(lái)確定要爬網(wǎng)的站點(diǎn),從每個(gè)站點(diǎn)獲取的頻率以及獲取的頁(yè)面數(shù)量。
Google also updated the wording for the crawl process, changing that it is “augmented with sitemap data” to “augmented by sitemap” data. Google還更新了抓取過(guò)程的用語(yǔ),將其由“通過(guò)站點(diǎn)地圖數(shù)據(jù)增強(qiáng)”改為“通過(guò)站點(diǎn)地圖增強(qiáng)數(shù)據(jù)”。
Google also made a change where it referenced that Googlebot “detects” links and changed it to “finds” links, as well as changes from Googlebot visiting “each of these websites” to the much more specific “page”. ?This second change makes it more accurate since Google visiting a website won’t necessarily mean it crawls all links on all pages. ?The change to “page” makes it more accurate and specific for webmasters. ?Google還進(jìn)行了更改,引用Googlebot“檢測(cè)”鏈接并將其更改為“查找”鏈接,以及從Googlebot訪問(wèn)“每個(gè)網(wǎng)站”到更具體的“頁(yè)面”的更改。由于谷歌訪問(wèn)網(wǎng)站并不一定意味著它會(huì)爬網(wǎng)所有頁(yè)面上的所有鏈接,因此第二個(gè)更改使其更加準(zhǔn)確。對(duì)“頁(yè)面”的更改使它更加準(zhǔn)確和針對(duì)網(wǎng)站管理員。
Previously it read:
As Googlebot visits each of these websites it detects links on each page and adds them to its list of pages to crawl. 當(dāng)Googlebot訪問(wèn)這些網(wǎng)站中的每個(gè)網(wǎng)站時(shí),它會(huì)檢測(cè)每個(gè)頁(yè)面上的鏈接,并將它們添加到要抓取的頁(yè)面列表中。
Now it reads:
When Googlebot visits a page it finds links on the page and adds them to its list of pages to crawl. 當(dāng)Googlebot訪問(wèn)頁(yè)面時(shí),它會(huì)找到頁(yè)面上的鏈接并將其添加到要爬網(wǎng)的頁(yè)面列表中。
Google has added a new section about using Chrome to crawl:
During the crawl, Google renders the page using a recent version of Chrome. As part of the rendering process, it runs any page scripts it finds. If your site uses dynamically-generated content, be sure that you?follow the JavaScript SEO basics. 在抓取過(guò)程中,Google使用最新版本的Chrome渲染頁(yè)面。作為渲染過(guò)程的一部分,它將運(yùn)行找到的所有頁(yè)面腳本。如果您的網(wǎng)站使用動(dòng)態(tài)生成的內(nèi)容,請(qǐng)確保遵循JavaScript SEO基礎(chǔ)知識(shí)。
By referencing a recent version of Chrome, this addition is clarifying the change from last year where Googlebot was finally upgraded to the latest version of Chromium for crawling, an update from Google only crawling with Chrome 41 for years. 通過(guò)引用最新版本的Chrome,此次添加可以明確說(shuō)明自去年Googlebot最終升級(jí)到最新版本的Chromium進(jìn)行爬網(wǎng)以來(lái)所做的更改,這是Google多年來(lái)僅使用Chrome 41進(jìn)行爬網(wǎng)的更新。
Google also notes it runs “any page scripts it finds,” and advises site owners to be aware of possible crawl issues as a result of using dynamically-generated content with the use of JavaScript, specifying that site owners should ensure they follow their JavaScript SEO basics. Google還指出,它會(huì)運(yùn)行“找到的所有頁(yè)面腳本”,并建議網(wǎng)站所有者注意使用動(dòng)態(tài)生成的內(nèi)容和JavaScript所導(dǎo)致的爬網(wǎng)問(wèn)題,并指定網(wǎng)站所有者應(yīng)確保他們遵循其JavaScript SEO 基本。
Google also details the primary and secondary crawls, something that has garnered much confusion since Google revealed primary and secondary crawls, but Google’s details in this How Google Search Works documents detail it differently than how some SEOs previously interpreted it. Google還詳細(xì)介紹了主要爬網(wǎng)和輔助爬網(wǎng),自從Google揭示了主要爬網(wǎng)和輔助爬網(wǎng)以來(lái),這引起了很大的混亂,但是Google在“ Google搜索方式”文檔中的詳細(xì)信息與某些SEO先前對(duì)它的解釋不同。
Here is the entire new section for primary and secondary crawls:?這是主要和輔助爬網(wǎng)的整個(gè)新部分:
Primary crawl /?secondary crawl
Google uses two different crawlers for crawling websites: a mobile crawler and a desktop crawler. Each crawler type simulates a user visiting your page with a device of that type. Google使用兩種不同的搜尋器來(lái)搜尋網(wǎng)站:移動(dòng)搜尋器和桌面搜尋器。每種搜尋器類型都模擬一個(gè)用戶使用該類型的設(shè)備訪問(wèn)您的網(wǎng)頁(yè)。
Google uses one crawler type (mobile or desktop) as the primary crawler for your site. All pages on your site that are crawled by Google are crawled using the primary crawler. The primary crawler for all new websites is the mobile crawler. Google使用一種爬蟲(chóng)類型(移動(dòng)或桌面)作為您網(wǎng)站的主要爬蟲(chóng)。Google抓取的網(wǎng)站上的所有頁(yè)面都是使用主抓取工具抓取的。所有新網(wǎng)站的主要搜尋器是移動(dòng)搜尋器。
In addition, Google recrawls a few pages on your site with the other crawler type (mobile or desktop). This is called the secondary crawl, and is done to see how well your site works with the other device type. 此外,Google會(huì)使用其他搜尋器類型(移動(dòng)設(shè)備或臺(tái)式機(jī))來(lái)檢索您網(wǎng)站上的幾個(gè)頁(yè)面。這稱為第二次爬網(wǎng),用于查看您的站點(diǎn)在其他設(shè)備類型下的運(yùn)行情況。
In this section, Google refers to primary and secondary crawls as being specific to their two crawlers – the mobile crawler and the desktop crawler. ?Many SEOs think of primary and secondary crawling in reference to Googlebot making two passes over a page, where javascript is rendered on the secondary crawl. ?So while Google clarifies their use of desktop and mobile Googlebots, the use of language here does cause confusion for those who use this to refer to the primary and secondary crawls for javascript purposes. ?So to be clear, Google’s reference to their primary and secondary crawl has nothing to do with javascript rendering, but only to how they use both mobile and desktop Googlebots to crawl and check a page. Google在本節(jié)中將主要和輔助爬網(wǎng)稱為特定于它們的兩個(gè)爬網(wǎng)程序-移動(dòng)爬網(wǎng)程序和桌面爬網(wǎng)程序。許多SEO會(huì)參考Googlebot在頁(yè)面上進(jìn)行兩次傳遞來(lái)考慮主要和輔助爬網(wǎng),在頁(yè)面上會(huì)在輔助爬網(wǎng)上呈現(xiàn)javascript。因此,盡管Google澄清了他們對(duì)臺(tái)式機(jī)和移動(dòng)Googlebot的使用,但此處的語(yǔ)言使用確實(shí)使使用JavaScript來(lái)指代主要和次要抓取的人感到困惑。需要明確的是,Google提及其主抓取和輔助抓取與javascript渲染無(wú)關(guān),而僅涉及他們?nèi)绾问褂靡苿?dòng)和桌面Googlebots抓取和檢查頁(yè)面。
What Google is clarifying in this specific reference to primary and secondary crawl is that Google is using two crawlers – both mobile and desktop versions of Googlebot – and will crawl sites using a combination of both. Google在此特定參考中對(duì)主爬網(wǎng)和輔助爬網(wǎng)的澄清是,谷歌正在使用兩個(gè)爬網(wǎng)程序(包括Googlebot的移動(dòng)版和臺(tái)式機(jī)版本),并將結(jié)合使用這兩個(gè)爬網(wǎng)程序來(lái)爬網(wǎng)站。
Google did specifically state that new websites are crawled with the mobile crawler in their “Mobile-First Indexing Best Practices” document, as of July 2019. ?But this is the first time it has made an appearance in their How Google Search Works document. 谷歌確實(shí)在2019年7月的``移動(dòng)優(yōu)先索引最佳實(shí)踐''文檔中特別聲明了使用移動(dòng)爬網(wǎng)程序?qū)π戮W(wǎng)站進(jìn)行爬網(wǎng)。
Google does go into more detail about how it uses both the desktop and mobile Googlebots, particularly for sites that are currently considered mobile first by Google. ?It wasn’t clear just how much Google was checking desktop versions of sites if they were mobile first, and there have been some who have tried to take advantage of this by presenting a spammier version to desktop users, or in some cases completely different content. ?But Google is confirming it is still checking the alternate version of the page with their crawlers. Google確實(shí)更詳細(xì)地介紹了它如何同時(shí)使用臺(tái)式機(jī)和移動(dòng)Googlebot,尤其是對(duì)于那些目前被Google視為移動(dòng)優(yōu)先的網(wǎng)站。尚不清楚Google是否首先檢查了桌面版本的網(wǎng)站是否是移動(dòng)設(shè)備,并且有一些人試圖通過(guò)向桌面用戶提供垃圾郵件版本來(lái)利用此功能,或者在某些情況下,它們會(huì)提供完全不同的內(nèi)容 。但是Google確認(rèn)仍在與其抓取工具一起檢查網(wǎng)頁(yè)的備用版本。
So sites that are mobile first will see some of their pages crawled with the desktop crawler. ?However, it still isn’t clear how Google handles cases where they are vastly different, especially when done for spam reasons, as there doesn’t seem to be any penalty for doing so, aside from a possible spam manual action if it is checked or a spam report is submitted. ?And this would have been a perfect opportunity to be clearer about how Google will handle pages with vastly different content depending on whether it is viewed on desktop or on mobile. ?Even in the mobile friendly documents, Google only warns about ranking differences if content is on the desktop version of the page but is missing on the mobile version of the page. 因此,最先移動(dòng)的網(wǎng)站將看到其某些網(wǎng)頁(yè)已通過(guò)桌面搜尋器進(jìn)行了搜尋。但是,目前尚不清楚Google如何處理截然不同的情況,尤其是在由于垃圾郵件原因而處理的情況下,因?yàn)槌藱z查可能的垃圾郵件手動(dòng)操作外,這樣做似乎沒(méi)有任何懲罰?;蛱峤焕]件報(bào)告。這將是一個(gè)絕佳的機(jī)會(huì),可以使您更清楚地了解Google將如何處理在桌面上或在移動(dòng)設(shè)備上查看的內(nèi)容完全不同的頁(yè)面。即使在適合移動(dòng)設(shè)備的文檔中,如果內(nèi)容在頁(yè)面的桌面版本上,但在頁(yè)面的移動(dòng)版本上缺少,則Google僅警告等級(jí)差異。
How does Google find a page?
Google has removed this section entirely from the new version of the document. Google已從文檔的新版本中完全刪除了此部分。
Here is what was included in it:
How does Google find a page?
Google uses many techniques to find a page, including:
·?Following links from other sites or pages
·?Reading sitemaps
It isn’t clear why Google removed this specifically. ?It is slightly redundant, but it was missing the submitting a URL option as well. 尚不清楚Google為什么專門刪除此內(nèi)容。它有點(diǎn)多余,但是也缺少提交URL選項(xiàng)。
Improving Your Crawling
Google makes the use of hreflang a bit clearer, especially for those who might just be learning what hreflang is and how it works by providing a bit more detail. Google使hreflang的使用更加清晰,特別是對(duì)于那些可能只是通過(guò)提供更多細(xì)節(jié)來(lái)了解hreflang是什么以及它如何工作的人。
Formerly it said “Use hreflang to point to alternate language pages.”? Now it states “Use hreflang to point to alternate versions of your page in other languages.”?以前它說(shuō)“使用hreflang指向備用語(yǔ)言頁(yè)面。” 現(xiàn)在,它顯示“使用hreflang指向其他語(yǔ)言的頁(yè)面?zhèn)溆冒姹??!?/span>
Not a huge change, but a bit clearer. 變化不大,但更清晰。
Google has also added two new points, providing more detail about ensuring Googlebot is able to access all the content on the page, not just the content (words) specifically. Google還增加了兩個(gè)新點(diǎn),提供了更多有關(guān)確保Googlebot能夠訪問(wèn)頁(yè)面上所有內(nèi)容的詳細(xì)信息,而不僅僅是特定的內(nèi)容(單詞)。
First, Google added:
Be sure that Google can access the key pages, and also the important resources (images, CSS files, scripts) needed to render the page properly. 確保Google可以訪問(wèn)關(guān)鍵頁(yè)面以及正確呈現(xiàn)頁(yè)面所需的重要資源(圖像,CSS文件,腳本)。
So Google is stressing about ensuring Google can access all the important content. ?And it is also specifically calling attention to other types of elements on the page that Google wants to also have access to in order to properly crawl the page, including images, CSS and scripts. ?For those webmasters who went through the whole “mobile first indexing” launch, they are fairly familiar with issues surrounding blocking files, especially CSS and scripts, something that some CMS had blocked Googlebot from crawling by default. ?因此,Google強(qiáng)調(diào)要確保Google可以訪問(wèn)所有重要內(nèi)容。而且,它還特別要引起人們注意頁(yè)面上Google希望也可以訪問(wèn)的其他類型的元素,以便正確地爬行頁(yè)面,包括圖像,CSS和腳本。對(duì)于那些經(jīng)歷了整個(gè)“移動(dòng)優(yōu)先索引”發(fā)布的網(wǎng)站管理員,他們非常熟悉圍繞阻止文件(尤其是CSS和腳本)的問(wèn)題,某些CMS默認(rèn)已阻止了Googlebot進(jìn)行爬網(wǎng)。
But for newer site owners, they might not realize this was possible, or that they might be doing it. ?It would have been nice to see Google add specific information on how those newer to SEO can check for this, particularly for those who also might not be clear on what exactly “rendering” means. 但是對(duì)于新的網(wǎng)站所有者,他們可能沒(méi)有意識(shí)到這是可能的,或者他們可能正在這樣做。很高興看到Google添加有關(guān)SEO的新手如何檢查的特定信息,特別是對(duì)于那些可能還不清楚“渲染”到底是什么的人。
Google also added:
Confirm that Google can access and render your page properly by running the?URL Inspection tool?on the live page. 通過(guò)運(yùn)行實(shí)時(shí)頁(yè)面上的URL檢查工具,確認(rèn)Google可以正確訪問(wèn)和呈現(xiàn)您的頁(yè)面。
Here Google does add specific information about using the URL Inspection tool in order to see what site owners are blocking or content that is causing issues when Google tries to render it. ?I think these last two new points could have been combined, and made slightly clearer for how site owners can use the tool to check for all these issues. Google確實(shí)在此處添加了有關(guān)使用URL檢查工具的特定信息,以便查看哪些站點(diǎn)所有者正在阻止哪些內(nèi)容,或者哪些內(nèi)容導(dǎo)致了在Google嘗試呈現(xiàn)該問(wèn)題時(shí)引起問(wèn)題的內(nèi)容。我認(rèn)為可以將這最后兩個(gè)點(diǎn)結(jié)合起來(lái),使站點(diǎn)所有者可以使用該工具檢查所有這些問(wèn)題的方式更加清晰。
Indexing
Google has made significant changes to this section as well. And Google starts off with making major changes to the first paragraph. ?Here is the original version: Google也對(duì)該部分進(jìn)行了重大更改。Google首先對(duì)第一段進(jìn)行了重大更改。這是原始版本:
Googlebot processes each of the pages it crawls in order to compile a massive index of all the words it sees and their location on each page. In addition, we process information included in key content tags and attributes, such as <title> tags and alt attributes.
The updated version now reads:
Googlebot processes each page it crawls in order to understand the content of the page. This includes processing the textual content, key content tags and attributes, such as <title> tags and alt attributes, images, videos, and more. Googlebot處理其抓取的每個(gè)頁(yè)面,以了解該頁(yè)面的內(nèi)容。這包括處理文本內(nèi)容,關(guān)鍵內(nèi)容標(biāo)簽和屬性,例如<title>標(biāo)簽和alt屬性,圖像,視頻等。
Google no longer states it processes pages to “compile a massive index of all the words it sees and their location on each page.”? This was always a curious way for them to call attention to the fact they are simply indexing all words it comes across and their position on a page, when in reality it is a lot more complex than that. ?So it definitely clears that up. Google不再聲明將頁(yè)面處理為“對(duì)所看到的所有單詞及其在每個(gè)頁(yè)面上的位置進(jìn)行大規(guī)模索引編制”。對(duì)于他們來(lái)說(shuō),這總是一種奇怪的方式來(lái)引起人們注意,他們只是索引所有碰到的單詞以及它們?cè)陧?yè)面上的位置,而實(shí)際上這要復(fù)雜得多。因此,它肯定可以解決這一問(wèn)題。
They have also added that they are processing “textual content” which is basically calling attention to the fact it indexes the words on the page, something that was assumed by everyone. ?But it does differentiate between the new addition later in the paragraph regarding images, videos and more. 他們還補(bǔ)充說(shuō),他們正在處理“文本內(nèi)容”,這基本上是在引起人們注意它為頁(yè)面上的單詞建立索引的事實(shí),這是每個(gè)人都假定的。但這確實(shí)區(qū)分了本段后面有關(guān)圖像,視頻等的新增內(nèi)容。
Previously, Google simply made reference to attributes such as title and alt tags and attributes. ?But now it is getting more granular, specifically referring to “images, videos and more.”? However, this does mean Google is considering images, videos and “more” to understand the content on the page, which could affect rankings. ?以前,Google僅引用諸如標(biāo)題和alt標(biāo)簽以及屬性之類的屬性。但是現(xiàn)在,它變得越來(lái)越細(xì)化,特別是指“圖像,視頻等”。但是,這確實(shí)意味著Google正在考慮使用圖片,視頻和“更多”內(nèi)容來(lái)理解頁(yè)面上的內(nèi)容,這可能會(huì)影響排名。
Improving your Indexing
Google changed “read our SEO guide for more tips” to “Read our basic SEO guide and advanced user guide for more tips.” Google將“閱讀我們的SEO指南以獲取更多提示”更改為“閱讀我們的基本SEO指南和高級(jí)用戶指南以獲取更多提示?!?/span>
What is a document?
Google has added a massive section here called “What is a document?”? It talks specifically about how Google determines what is a document, but also includes details about how Google views multiple pages with identical content as a single document, even with different URLs, and how it determines canonicals. Google在此處添加了一個(gè)很大的部分,稱為“什么是文檔?” 它專門討論了Google如何確定什么是文檔,還包括有關(guān)Google如何查看具有與單個(gè)文檔相同的內(nèi)容(甚至具有不同的URL)的多個(gè)頁(yè)面以及如何確定規(guī)范的詳細(xì)信息。
First, here is the first part of this new section:
What is a “document”?
Internally, Google represents the web as an (enormous) set of documents. Each document represents one or more web pages. These pages are either identical or very similar, but are essentially the same content, reachable by different URLs. The different URLs in a document can lead to exactly the same page (for instance, example.com/dresses/summer/1234 and example.com?product=1234 might show the same page), or the same page with small variations intended for users on different devices (for example, example.com/mypage for desktop users and m.example.com/mypage for mobile users). 在內(nèi)部,Google將網(wǎng)絡(luò)表示為一組(巨大的)文檔。每個(gè)文檔代表一個(gè)或多個(gè)網(wǎng)頁(yè)。這些頁(yè)面是相同的或非常相似的,但是本質(zhì)上是相同的內(nèi)容,可以通過(guò)不同的URL訪問(wèn)。文檔中不同的URL可以導(dǎo)致完全相同的頁(yè)面(例如,example.com/dresses/summer/1234和example.com?product=1234可能顯示同一頁(yè)面),或者同一頁(yè)面具有較小的差異, 用戶(例如,臺(tái)式機(jī)用戶為example.com/mypage,移動(dòng)用戶為m.example.com/mypage)。
Google chooses one of the URLs in a document and defines it as the document’s canonical URL. The document’s canonical URL is the one that Google crawls and indexes most often; the other URLs are considered duplicates or alternates, and may occasionally be crawled, or served according to the user request: for instance, if a document’s canonical URL is the mobile URL, Google will still probably serve the desktop (alternate) URL for users searching on desktop. Google選擇文檔中的一個(gè)URL并將其定義為文檔的規(guī)范URL。文檔的規(guī)范網(wǎng)址是Google最常抓取和編制索引的網(wǎng)址。其他網(wǎng)址則被視為重復(fù)網(wǎng)址或備用網(wǎng)址,并可能會(huì)根據(jù)用戶請(qǐng)求進(jìn)行抓取或提供:例如,如果文檔的規(guī)范網(wǎng)址是移動(dòng)網(wǎng)址,則Google仍可能會(huì)為用戶搜索提供桌面(備用)網(wǎng)址 在桌面上。
Most reports in Search Console attribute data to the document’s canonical URL. Some tools (such as the Inspect URL tool) support testing alternate URLs, but inspecting the canonical URL should provide information about the alternate URLs as well. Search Console中的大多數(shù)報(bào)告都將數(shù)據(jù)歸于文檔的規(guī)范網(wǎng)址。某些工具(例如“檢查URL”工具)支持測(cè)試備用URL,但是檢查規(guī)范URL也應(yīng)提供有關(guān)備用URL的信息。
You can tell Google which URL you prefer to be canonical, but Google may choose a different canonical for various reasons. 您可以告訴Google您希望使用哪個(gè)規(guī)范網(wǎng)址,但是出于各種原因,Google可能會(huì)選擇其他規(guī)范網(wǎng)址。
So the tl:dr is that Google will view pages with identical ?or near-identical content as the same document, regardless of how many of them there are. ?For seasoned SEOs, we know this as internal duplicate content. 因此tl:dr是Google將查看與同一文檔具有相同或幾乎相同內(nèi)容的頁(yè)面,而不管其中有多少頁(yè)面。對(duì)于經(jīng)驗(yàn)豐富的SEO,我們將其稱為內(nèi)部重復(fù)內(nèi)容。
Google also states that when Google determines these duplicates, they may not be crawled as often. ?This is important to note for site owners that are working to de-duplicate content which Google is considering duplicate. ?So it would be more important to submit these URLs to be recrawled, or give those newly de-duplicated pages links from the homepage in order to ensure Google recrawls and indexed the new content, so Google de-dupes them properly. Google還聲明,當(dāng)Google確定這些重復(fù)項(xiàng)時(shí),它們可能不會(huì)被頻繁檢索。對(duì)于正在努力重復(fù)刪除Google認(rèn)為重復(fù)的內(nèi)容的網(wǎng)站所有者,這一點(diǎn)很重要。因此,更重要的是提交要重新爬網(wǎng)的這些URL,或從首頁(yè)提供那些新刪除重復(fù)的頁(yè)面的鏈接,以確保Google重新抓取并索引新內(nèi)容,以便Google正確地對(duì)它們進(jìn)行重復(fù)刪除。
It also brings up an important note about desktop versus mobile, that Google will still likely serve the desktop version of a page instead of the mobile version for desktop users, when a site has two different URLs for the same page where is designed for mobile users and the other for desktop. ?While many websites have changed to serving the same URL and content for both using responsive design, some sites still run two completely different sites and URLs for desktop and mobile users. 它還提出了有關(guān)臺(tái)式機(jī)與移動(dòng)設(shè)備的重要說(shuō)明,即當(dāng)網(wǎng)站為移動(dòng)用戶設(shè)計(jì)的同一頁(yè)面有兩個(gè)不同的URL時(shí),Google仍可能會(huì)為頁(yè)面用戶提供頁(yè)面的桌面版本,而不是面向桌面用戶的移動(dòng)版本。另一個(gè)用于桌面。盡管許多網(wǎng)站已更改為使用響應(yīng)式設(shè)計(jì)為兩個(gè)網(wǎng)站提供相同的URL和內(nèi)容,但某些網(wǎng)站仍為臺(tái)式機(jī)和移動(dòng)用戶運(yùn)行兩個(gè)完全不同的網(wǎng)站和URL。
Google also mentions that you can tell Google the URL you prefer Google to use as the canonical, but states they can chose a different URL “for various reasons.”? While Google doesn’t detail specifics about why Google might choose a different canonical than the one the site owner specifies, it is usually due to http vs https, if a page is included in a sitemap or not, page quality, if the pages appear to be completely different and should not be canonicalized, or due to significant incoming links to the non-canonical URL. Google還提到您可以告訴Google您希望Google用作規(guī)范的URL,但指出“出于各種原因”,他們可以選擇其他URL。盡管Google并未詳細(xì)說(shuō)明為何Google選擇與網(wǎng)站所有者指定的規(guī)范不同的規(guī)范,但這通常是由于http vs https造成的,如果站點(diǎn)地圖中是否包含某個(gè)頁(yè)面,則頁(yè)面質(zhì)量,如果頁(yè)面出現(xiàn) 完全不同且不應(yīng)規(guī)范化,或由于大量輸入到非規(guī)范URL的鏈接。
Google has also included definitions for many o the terms used by SEOs and in Google Search Console. Google還為SEO和Google Search Console中的許多術(shù)語(yǔ)提供了定義。
Document: A collection of similar pages. Has a canonical URL, and possibly alternate URLs, if your site has duplicate pages. URLs in the document can be from the same or different organization (the root domain, for example “google” in www.google.com). Google chooses the best URL to show in Search results according to the platform (mobile/desktop), user language? or location, and many other variables. Google discovers related pages on your site by organic crawling, or by site-implemented features such as redirects or <link rel=alternate/canonical> tags. Related pages on other organizations can only be marked as alternates if explicitly coded by your site (through redirects or link tags). 文件:相似頁(yè)面的集合。如果您的網(wǎng)站有重復(fù)的頁(yè)面,則有一個(gè)規(guī)范的URL,并可能有備用URL。文檔中的URL可以來(lái)自相同或不同的組織(根域,例如www.google.com中的“ google”)。Google會(huì)根據(jù)平臺(tái)(移動(dòng)/臺(tái)式機(jī)),用戶語(yǔ)言?或位置以及許多其他變量,選擇在搜索結(jié)果中顯示的最佳URL。Google通過(guò)自然爬網(wǎng)或網(wǎng)站實(shí)現(xiàn)的功能(例如重定向或<link rel = alternate / canonical>標(biāo)簽)發(fā)現(xiàn)您網(wǎng)站上的相關(guān)頁(yè)面。如果您的站點(diǎn)(通過(guò)重定向或鏈接標(biāo)記)明確編碼,則其他組織上的相關(guān)頁(yè)面只能標(biāo)記為替代頁(yè)面。
Again, Google is talking about the fact a single document can encompass more than just a single URL, as Google will consider a single document to potentially have many duplicate or near duplicate pages as well as pages assigned via canonical. ?Google makes specific mention about “alternates” that appear on other sites, that can only be considered alternates if the site owner specifically codes it. ?And that Google will choose the best URL from within the collection of documents to show. 再次,Google在談?wù)撘粋€(gè)事實(shí),即一個(gè)文檔可以包含多個(gè)URL,因?yàn)镚oogle會(huì)認(rèn)為一個(gè)文檔可能包含許多重復(fù)或接近重復(fù)的頁(yè)面以及通過(guò)規(guī)范分配的頁(yè)面。Google特別提到了其他網(wǎng)站上出現(xiàn)的“替代”,只有在網(wǎng)站所有者明確編碼后才能視為替代。而且Google將從顯示的文檔集中選擇最佳的URL。
But it fails to mention that Google can consider pages duplicate on other sites and will not show those duplicates, even if they aren’t from the same sites, something that site owners see happen frequently when someone steals content and sometimes sees the stolen version ranking over the original. 但是,它沒(méi)有提及Google可以認(rèn)為頁(yè)面在其他網(wǎng)站上重復(fù),即使這些頁(yè)面不是來(lái)自同一網(wǎng)站,也不會(huì)顯示這些重復(fù),當(dāng)某人竊取內(nèi)容并有時(shí)看到被盜版本時(shí),網(wǎng)站所有者經(jīng)常會(huì)看到這種情況 超過(guò)原來(lái)的。
There was a notation added for the above, dealing with hreflang. 上面為hreflang添加了一種表示法。
?Pages with the same content in different languages are stored in different documents that reference each other using?hreflang tags; this is why it’s important to use hreflang tags for translated content. ?具有不同語(yǔ)言的具有相同內(nèi)容的頁(yè)面存儲(chǔ)在使用hreflang標(biāo)記相互引用的不同文檔中; 這就是為什么將hreflang標(biāo)記用于翻譯內(nèi)容很重要的原因。
Google shows that it doesn’t include identical content under the same “document” when it is simply in a different language, which is interesting. ?But Google is tressing the importance of using hreflang in these cases. Google表示,僅使用另一種語(yǔ)言,就不會(huì)在同一“文檔”下包含相同的內(nèi)容,這很有趣。但是在這些情況下,Google強(qiáng)調(diào)使用hreflang的重要性。
URL: The URL used to reach a given piece of content on a site. The site might resolve different URLs to the same page. URL:用于訪問(wèn)網(wǎng)站上給定內(nèi)容的URL。該網(wǎng)站可能會(huì)將不同的URL解析到同一頁(yè)面。
Pretty self explanatory, although it does have reference to the fact different URLs can be resolved to the same page, presumably such as with redirects or alias. 盡管它確實(shí)提到了可以將不同的URL解析到同一頁(yè)面的事實(shí),這很容易解釋,大概是諸如重定向或別名之類的。
Page: A given web page, reached by one or more URLs. There can be different versions of a page, depending on the user’s platform (mobile, desktop, tablet, and so on). 頁(yè)面:給定的網(wǎng)頁(yè),可以通過(guò)一個(gè)或多個(gè)URL進(jìn)行訪問(wèn)。頁(yè)面的版本可能不同,具體取決于用戶的平臺(tái)(移動(dòng)設(shè)備,臺(tái)式機(jī),平板電腦等)。
Also pretty self explanatory, bringing up the specifics that some site owners can be served different versions of the same page, such as if they try and view the same page on a mobile device versus a desktop computer. 這也很不言自明,提出了可以為某些網(wǎng)站所有者提供同一頁(yè)面的不同版本的細(xì)節(jié),例如,如果他們嘗試在移動(dòng)設(shè)備而非臺(tái)式機(jī)上查看同一頁(yè)面。
Version: One variation of the page, typically categorized as “mobile,” “desktop,” and “AMP” (although AMP can itself have mobile and desktop versions). Each version can have a different URL (example.com vs m.example.com) or the same URL (if your site uses dynamic serving or responsive web design, the same URL can show different versions of the same page) depending on your site configuration. Language variations are not considered different versions, but different documents. 版本:頁(yè)面的一種變體,通常分為“移動(dòng)”,“桌面”和“ AMP”(盡管AMP本身可以具有移動(dòng)和桌面版本)。每個(gè)版本可以具有不同的URL(example.com與m.example.com)或相同的URL(如果您的站點(diǎn)使用動(dòng)態(tài)服務(wù)或響應(yīng)式網(wǎng)頁(yè)設(shè)計(jì),則相同的URL可以顯示同一頁(yè)面的不同版本),具體取決于您的站點(diǎn) 組態(tài)。語(yǔ)言變體不是不同的版本,而是不同的文檔。
Simply clarifying with greater details the different versions of a page, and how Google typically categorizes them as “mobile,” “desktop,” and “AMP”. 只需更詳細(xì)地說(shuō)明頁(yè)面的不同版本,以及Google通常如何將其分類為“移動(dòng)”,“桌面”和“ AMP”即可。
Canonical page or URL: The URL that Google considers as most representative of the document. Google always crawls this URL; duplicate URLs in the document are occasionally crawled as well. 規(guī)范頁(yè)面或URL:Google認(rèn)為最能代表文檔的URL。Google始終會(huì)抓取該URL;偶爾也會(huì)抓取文檔中重復(fù)的URL。
Google states here again that non-canonical pages are not crawled as often as the main canonical that a site owner assigns to a group of pages they want canonical. ?Google does not include specific mention here that they sometimes chose a different page as the canonical one, even if there is a specific page designated as the canonical one. Google在此再次聲明,非規(guī)范頁(yè)面的抓取頻率不如網(wǎng)站所有者分配給他們想要的規(guī)范頁(yè)面組的主要規(guī)范。Google此處未特別提及他們有時(shí)會(huì)選擇其他頁(yè)面作為規(guī)范頁(yè)面,即使有特定頁(yè)面被指定為規(guī)范頁(yè)面。
Alternate/duplicate page or URL: The document URL that Google might occasionally crawl. Google also serves these URLs if they are appropriate to the user and request (for example, an alternate URL for desktop users will be served for desktop requests rather than a canonical mobile URL). 備用/重復(fù)頁(yè)面或URL:Google可能偶爾抓取的文檔URL。如果這些URL適合用戶和請(qǐng)求,則Google也會(huì)提供這些URL(例如,將為桌面用戶提供替代URL,而非桌面移動(dòng)URL)。
The key takeaway here is that Google “might” occasionally crawl the site’s duplicate or alternative page. ?And here they stress that Google will serve these alternative URLs “if they are appropriate.”? It is unfortunate they don’t go into greater detail in why they might serve these pages instead of the canonical, outside of the mention of desktop versus mobile, as we have seen many cases where Google picks a different page to show other than the canonical for a myriad of reasons. 這里的主要要點(diǎn)是,Google有時(shí)可能會(huì)“抓取”該網(wǎng)站的重復(fù)頁(yè)面或替代頁(yè)面。他們?cè)谶@里強(qiáng)調(diào),Google將“在適當(dāng)時(shí)”提供這些替代網(wǎng)址。不幸的是,除了提及臺(tái)式機(jī)還是移動(dòng)設(shè)備之外,他們沒(méi)有更詳細(xì)地說(shuō)明為什么可以提供這些頁(yè)面而不是標(biāo)準(zhǔn)頁(yè)面,因?yàn)槲覀円呀?jīng)看到許多情況下Google選擇了不同的頁(yè)面來(lái)顯示標(biāo)準(zhǔn)頁(yè)面以外的內(nèi)容 由于種種原因。
Google also fails to mention how this impacts duplicate content found on other sites, we we do know Google will crawl those less often as well. Google也沒(méi)有提及這會(huì)對(duì)其他網(wǎng)站上的重復(fù)內(nèi)容產(chǎn)生怎樣的影響,我們知道Google也將減少對(duì)這些內(nèi)容的檢索。
Site: Usually used as a synonym for a website (a conceptually related set of web pages), but sometimes used as a synonym for a Search Console property, although a property can actually be defined as only part of a site. A site can span subdomains (and even domains, for properly linked AMP pages). 網(wǎng)站:通常用作網(wǎng)站(概念上相關(guān)的一組網(wǎng)頁(yè))的同義詞,但有時(shí)也用作Search Console屬性的同義詞,盡管實(shí)際上可以將屬性定義為網(wǎng)站的一部分。一個(gè)站點(diǎn)可以跨越子域(對(duì)于正確鏈接的AMP頁(yè)面,甚至可以是域)。
Interesting to note here what they consider a website – a conceptually related set of webpages – and how it related to the usage of a Google Search Console property, as “a property can actually be defined as only part of a site.”?在這里有趣地指出他們認(rèn)為網(wǎng)站是什么(概念上相關(guān)的一組網(wǎng)頁(yè)),以及它與Google Search Console屬性的使用如何相關(guān),因?yàn)椤皩傩詫?shí)際上只能定義為網(wǎng)站的一部分?!?/span>
Google does make mention that AMP, which technically appear on a different domain, are considered part of the main site. Google確實(shí)提到AMP,它在技術(shù)上出現(xiàn)在不同的域中,被視為主站點(diǎn)的一部分。
Serving Results
Google has made a pretty interesting specific change here in regards to their ranking factors. ?Previously, Google stated: Google在排名方面做了非常有趣的具體更改。谷歌此前曾表示:
Relevancy is determined by over 200 factors, and we always work on improving our algorithm. 相關(guān)性是由200多個(gè)因素決定的,我們一直在努力改進(jìn)算法。
Google has now updated this “over 200 factors” with a less specific one. 谷歌現(xiàn)在用一個(gè)不太具體的因素更新了“ 200多個(gè)因素”。
Relevancy is determined by hundreds of factors, and we always work on improving our algorithm. 相關(guān)性由數(shù)百個(gè)因素決定,我們一直在努力改進(jìn)算法。
The 200 factors in the How Google Search Works dates back to 2013 when the document was launched, although then it also made reference to PageRank (“Relevancy is determined by over 200 factors, one of which is the PageRank for a given page”) which Google removed when they redesigned their document in 2018. “ Google搜索的工作方式”中的200個(gè)因素可追溯到文檔發(fā)布時(shí)的2013年,盡管當(dāng)時(shí)它還引用了PageRank(“相關(guān)性由200多個(gè)因素決定,其中之一是給定頁(yè)面的PageRank”) Google在2018年重新設(shè)計(jì)文檔時(shí)將其刪除。
While Google doesn’t go into specifics on the number anymore, it can be assumed that a significant number of ranking factors have been added since 2013 when this was first claimed in this document. ?But I am sure some SEOs will be disappointed we don’t get a brand new shiny number like “over 500” ranking factors that SEOs can obsess about. 盡管Google不再詳細(xì)說(shuō)明該數(shù)字,但可以假設(shè),自2013年在本文檔中首次提出此要求以來(lái),已經(jīng)添加了大量的排名因素。但是我敢肯定,有些SEO會(huì)讓我們感到失望的,因?yàn)槲覀儧](méi)有獲得SEO可以關(guān)注的全新的閃亮數(shù)字,例如“超過(guò)500”的排名因素。
Final Thoughts
There are some pretty significant changes made to this document that SEOs can get a bit of insight from. SEO可以對(duì)此文檔進(jìn)行一些相當(dāng)重要的更改,以獲取一些見(jiàn)識(shí)。
Google’s description of what it considers a document and how it relates to other identical or near-identical pages on a site is interesting, as well as Google’s crawling behavior towards the pages within a document it considers as alternate pages. ?While this behavior has often been noted, it is more concrete information on how site owners should handle these duplicate and near-duplicate pages, particularly when they are trying to un-duplicate those pages and see them crawled and indexed as their own document. Google對(duì)它認(rèn)為是什么文檔以及它與網(wǎng)站上其他相同或接近相同頁(yè)面之間的關(guān)系的描述,以及Google朝著它認(rèn)為是替代頁(yè)面的文檔中頁(yè)面的爬行行為,都很有趣。盡管經(jīng)常會(huì)注意到這種行為,但它是有關(guān)網(wǎng)站所有者應(yīng)如何處理這些重復(fù)和近乎重復(fù)的頁(yè)面的更具體的信息,尤其是當(dāng)他們嘗試取消重復(fù)這些頁(yè)面并看到它們已作為自己的文檔進(jìn)行爬網(wǎng)和建立索引時(shí)。
They added a lot of useful advice for newer site owners, which is particularly helpful with so many new websites coming online this year due to the global pandemic. ?Things such as checking a site without being logged in, how to submit both pages and sites to Google, etc. 他們?yōu)樾抡军c(diǎn)所有者增加了很多有用的建議,這對(duì)于今年由于全球大流行而新上線的眾多新站點(diǎn)尤其有用。諸如檢查未登錄的站點(diǎn),如何將頁(yè)面和站點(diǎn)都提交給Google等之類的事情。
The mention of what Google considers a “small site” is interesting because it gives a more concrete reference point for how Google sees large versus small sites. ?For some, a small site could mean under 30 pages and the idea of a site with millions of pages being unfathomable. ?And the reinforcement of a strong navigation, even for “small sites” is useful for showing site owners and clients who might push for navigation that is more aesthetic than practical for both usability and SEO. 提及Google認(rèn)為“小型網(wǎng)站”很有趣,因?yàn)樗鼮镚oogle如何看待大型網(wǎng)站與小型網(wǎng)站提供了更為具體的參考點(diǎn)。對(duì)于某些人來(lái)說(shuō),一個(gè)小型站點(diǎn)可能意味著不到30頁(yè),而擁有數(shù)百萬(wàn)個(gè)頁(yè)面的站點(diǎn)的構(gòu)想?yún)s是深不可測(cè)的。而且,即使對(duì)于“小型網(wǎng)站”而言,增強(qiáng)的導(dǎo)航功能也很有用,它有助于向可能會(huì)要求導(dǎo)航的網(wǎng)站所有者和客戶展示對(duì)可用性和SEO而言比實(shí)際更美觀的導(dǎo)航。
The primary and secondary crawl additions will probably cause some confusion for those who think of primary and secondary in terms of how Google processes scripts on a page when it crawls it. ?But it is nice to have more concrete information on how and when Google will crawl using the alternate version of Googlebot for sites that are usually crawled with either the mobile Googlebot or the desktop one。?主要和次要爬網(wǎng)選項(xiàng)可能會(huì)使那些想到主要和次要爬網(wǎng)的人感到困惑,因?yàn)樗麄儠?huì)在Google對(duì)其進(jìn)行爬網(wǎng)時(shí)如何處理頁(yè)面上的腳本。但是,對(duì)于通常使用移動(dòng)Googlebot或臺(tái)式機(jī)爬網(wǎng)的網(wǎng)站,有更具體的信息來(lái)說(shuō)明如何使用Googlebot的替代版本以及何時(shí)爬網(wǎng)Google會(huì)更好。
Lastly, the change from the “200 ranking factors” to a less specific, but presumably much higher number of ranking factors will disappoint some SEOs who liked having some kind of specific number of potential ranking factors to work out. 最后,從“ 200個(gè)排名因素”到不太具體但可能更高的排名因素的變化將使一些SEO失望,他們喜歡使用某種特定數(shù)量的潛在排名因素來(lái)解決。
?來(lái)源:JENNIFER SLEGG
文章為作者獨(dú)立觀點(diǎn),不代表DLZ123立場(chǎng)。如有侵權(quán),請(qǐng)聯(lián)系我們。( 版權(quán)為作者所有,如需轉(zhuǎn)載,請(qǐng)聯(lián)系作者 )

網(wǎng)站運(yùn)營(yíng)至今,離不開(kāi)小伙伴們的支持。 為了給小伙伴們提供一個(gè)互相交流的平臺(tái)和資源的對(duì)接,特地開(kāi)通了獨(dú)立站交流群。
群里有不少運(yùn)營(yíng)大神,不時(shí)會(huì)分享一些運(yùn)營(yíng)技巧,更有一些資源收藏愛(ài)好者不時(shí)分享一些優(yōu)質(zhì)的學(xué)習(xí)資料。
現(xiàn)在可以掃碼進(jìn)群,備注【加群】。 ( 群完全免費(fèi),不廣告不賣課!)