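Google accidentally publishes internal API to GitHub

On March 27 this year, Google accidentally published part of its internal API documentation to GitHub, and the files were not removed until May 7. The internal API documents disclose sensitive information about Google's search engine. Google realized early on that improving search quality required complete clickstream data, that is, the URLs visited through the browser, and this was a major motivation for developing the Chrome browser. A system named NavBoost originally collected data from Google's Toolbar PageRank; it uses the number of searches for a given keyword to identify trending search demand, along with the number of clicks on search results and long versus short clicks. Google uses cookie history, logged-in Chrome data, and pattern detection as effective means of fighting manual and automated click spam. During the Covid-19 pandemic, Google used whitelists to put specific sites at the top of Covid-related search results; during elections, Google likewise applied whitelists to sites showing election information.

- https://www.solidot.org/story?sid=78286
- https://github.com/googleapis/elixir-google-api/commit/078b497fceb1011ee26e094029ce67e6b6778220 (content removed)
- https://sparktoro.com/blog/an-anonymous-source-shared-thousands-of-leaked-google-search-api-documents-with-me-everyone-in-seo-should-see-them/
- https://hexdocs.pm/google_api_content_warehouse/0.4.0/api-reference.html
- https://www.reddit.com/r/SEO/comments/1d2gllz/google_caught_in_their_lies_with_leaked_api_docs/
- https://www.blackhatworld.com/seo/anonymous-source-shared-leaked-google-search-api-documents.1602172/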
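To make the click signals attributed to NavBoost in the summary above more concrete, here is a toy sketch of how long and short clicks could be tallied per query and result. It is purely illustrative and is not taken from the leaked documents: the `Click` record, the `summarize_clicks` helper, and the 30-second `LONG_CLICK_SECONDS` cutoff are all assumptions, since the sources above do not say how Google defines a long click.

```python
from collections import Counter, defaultdict
from typing import NamedTuple

# Hypothetical cutoff: the sources give no dwell-time threshold for a "long" click.
LONG_CLICK_SECONDS = 30.0


class Click(NamedTuple):
    """One click on a search result (illustrative record, not a leaked schema)."""
    query: str
    url: str
    dwell_seconds: float  # assumed: time spent on the page before returning


def summarize_clicks(clicks):
    """Count total, long, and short clicks for each (query, url) pair."""
    stats = defaultdict(Counter)
    for click in clicks:
        kind = "long" if click.dwell_seconds >= LONG_CLICK_SECONDS else "short"
        key = (click.query, click.url)
        stats[key][kind] += 1
        stats[key]["total"] += 1
    # Convert Counters to plain dicts for easier inspection.
    return {key: dict(counts) for key, counts in stats.items()}


if __name__ == "__main__":
    log = [
        Click("covid symptoms", "https://example.org/a", 45.0),
        Click("covid symptoms", "https://example.org/a", 3.0),
        Click("covid symptoms", "https://example.org/b", 120.0),
    ]
    for key, counts in summarize_clicks(log).items():
        print(key, counts)
```

A real system would also need the spam defenses the summary mentions (cookie history, logged-in Chrome data, pattern detection) before trusting such counts; this sketch leaves them out.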
Secrets from the Algorithm: Google Search’s Internal Engineering Documentation Has Leaked

... An important thing we can all take away from this is: SEOs know what they are doing. After years of being told we’re wrong it’s good to see behind the curtain and find out we have been right all along. And, while there are interesting nuances of how Google works in these documents there is nothing that is going to make me dramatically change course in how I strategically do SEO. For those that dig in, these documents will primarily serve to validate what seasoned SEOs have long advocated. Understand your audience, identify what they want, make the best thing possible that aligns with that, make it technically accessible, and promote it until it ranks. To everyone in SEO that has been unsure of what they are doing, keep testing, keep learning, and keep growing businesses. Google can’t do what they do without us. ...

Source: https://ipullrank.com/google-algo-leak / https://archive.ph/tBVGt
TL;DR KEY TAKEAWAYS:

- Google claimed they don't use a "domain authority" metric, but the docs show they totally do - it's called "siteAuthority."
- G said clicks don't affect rankings, but there's a whole system called "NavBoost" that uses click data to change search results.
- Google denied having a "sandbox" that holds back new sites, but yep, the docs confirm it exists.
- G assured us Chrome data isn't used for ranking, but surprise! It is.
- The number and diversity of your backlinks still matter a lot.
- Having authors with expertise and authority helps.
- Putting keywords in your title tag and matching search queries is important.
- Google tracks the dates on your pages to determine freshness.
- A lot of long-held SEO theories have been validated, so trust your instincts.
- Creating great content and promoting it well is still the best approach.
- We should experiment more to see what works, rather than just listening to what Google says.

Source: https://www.reddit.com/r/SEO/comments/1d2gllz/comment/l60i5zl/
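It is as if a crowd of people had spent years feeding inputs into the black box that is Google Search and recording the patterns in its output, eventually distilling them into SEO techniques. Then Google suddenly opened the box a crack, and the people outside caught a glimpse of where some of the pipes run. Some of what they saw matches long-standing SEO theories, so closely that some people began to suspect Google leaked the documents on purpose, or even that the whole set is fake.

In practice, though, I think fabricating these 2,500 modules and 14,000 attributes would take a great deal of time, and the result would also have to be free of any logical errors that readers could spot. Even with AIGC, reviewing it would take a lot of manpower and time. If it really is fake, it is still of very high quality as hype and fake news go, and more useful than the information that came out of the Yandex source code leak.

Google's spokespeople mean well, but can we trust them?

> The quick answer is: not when you get too close to the core secrets.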