
Google index and punycode


jose
6th January 2008, 04:55 PM
Some members have raised questions about Google's indexing of IDNs and Punycode. It works like this:

Google knows how to convert correctly between Punycode and IDN.

Google does that conversion every time you search for your domain, whether you enter the domain itself or run a site: query with the IDN or the Punycode form (the result is the same).
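For anyone who wants to see the conversion itself, here is a minimal sketch using Python's built-in "idna" codec (which follows the older IDNA 2003 rules). The helper names to_punycode, to_idn and same_domain are made up for this example, and bücher.example is just a sample domain. This only shows the encoding round trip, not how Google implements it internally.

```python
# Sketch only: helper names are hypothetical, and the built-in "idna"
# codec implements IDNA 2003, which may differ from what a search
# engine actually uses.

def to_punycode(domain: str) -> str:
    """Return the ASCII (Punycode) form of a domain name."""
    return domain.encode("idna").decode("ascii")


def to_idn(domain: str) -> str:
    """Return the Unicode (IDN) form of an ASCII/Punycode domain name."""
    return domain.encode("ascii").decode("idna")


def same_domain(a: str, b: str) -> bool:
    """Treat two spellings as equal if their Punycode forms match."""
    return to_punycode(a).lower() == to_punycode(b).lower()


print(to_punycode("bücher.example"))        # xn--bcher-kva.example
print(to_idn("xn--bcher-kva.example"))      # bücher.example
print(same_domain("bücher.example", "xn--bcher-kva.example"))  # True
```

Both spellings reduce to the same ASCII string, which is why it makes sense for a search on either form to be treated as the same domain.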

Google sees the Punycode form of your domain as a broken link and autocorrects it to the IDN every time. I reported this a while ago in a thread titled "typo domains valuation just went south", or something like that. More info here (http://googlesystem.blogspot.com/2007/10/google-tries-to-fix-broken-links.html)

The Google index of sandboxed results (which used to appear as “supplemental results”) is much, much bigger than the standard index.

The auto-typo correction feature draws on the wider index (which includes the sandboxed results).

If your IDN site is still sandboxed, a funny thing can happen: a search can trigger the auto-typo correction, so the site is pulled from the wider Google index, but a search using the Punycode form returns no result. This gives the wrong impression that Google indexes only the IDN version and not the Punycode version.


This is IMHO!