![]() |
![]() |
| idnforums | idntools |
|
|||||||
| Indian IDN Domains IDN Domains in Indo languages. |
![]() |
|
|
Thread Tools | Display Modes |
|
#1
|
||||
|
||||
|
Devanagari Encoding Mystery
साड़ी.com
xn--12bmg5i.com 136 google साड़ी.com xn--e2b9bngm.com 436 google These domains look the same in devanagari, but the unicode and pubycode is different, and google finds different pages for each. Does anyone know why?
__________________
Sign up with NameDrive. Last edited by blastfromthepast; 05-06-2006 at 07:19 PM.. |
|
#2
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
Quote:
__________________
Quote:
Last edited by thegenius1; 05-06-2006 at 03:22 AM.. |
|
#3
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
Quote:
__________________
Sign up with NameDrive. |
|
#4
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
Quote:
Well after i slapped them into Microsoft Word this became an Enigma , but by closly looking at what you posted i can clearly see that the top ones "Dot" is further from the " S " and the 2nd ones is closer
__________________
Quote:
|
|
#5
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
Quote:
Both are written the same- they are for sure the same, but one is written in hindi and the other is probably written in sanskrit or any other language with the same script as hindi, now when google maps it for searches it converts it into puny code and sees them only as a punycode, and the puny code for the first one is different from the second one, so different results, they mean the same but it should be rectified, and this can happen probably for indian languages with same script, so their might be two different puny codes. But, nice error found.
__________________
"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." [color="DarkSlateBlue"]— Umberto Eco[/COLOR] |
|
#6
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
This matters a great deal, because it seems that some keyboards are keying in ड़ and some are keying in ड़.
Both are Devanagari and valid. Sanskrit should be using the same encoding as Hindi. The script is language independent. If you thought the Latin phishing domain issue was overblown, get ready for same-script phishing domains. A little bit of googling revealed that this problem appears in other Indian scripts as well. Not just in devanagari.
__________________
Sign up with NameDrive. Last edited by blastfromthepast; 05-06-2006 at 07:25 PM.. |
|
#7
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
Looks like it could be a major problem.
Here are 2 variations and the unicode sequence for the same. xn--e2b9bngm.com (साड़ी.com): = स ा ड़ ी and xn--12bmg5i.com (साड़ी.com): = स ा ड ़ ी
__________________
Ads.co.in -Internet Advertising in India Last edited by a2zofb2b; 05-06-2006 at 07:26 PM.. |
|
#8
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
Quote:
When unicode is converted to punycode to create a domain name, it is supposed to be normalized so that such problems don't occur. Quote:
I tried putting in both into IBM's punycode converter, and I get identical results (xn--e2b9bngm). If registrars implement the punycode conversion mechanism correctly then this shouldn't be a problem. Looks like some registrars had it wrong and weren't running the unicode through the nameprep routine and some people are now stuck with domain lookalikes because of errors in their registrars punycode conversion. http://www-950.ibm.com/software/glob...test&x=22&y=17
__________________
Sign up with NameDrive. Last edited by blastfromthepast; 05-06-2006 at 08:08 PM.. Reason: Automerged Doublepost |
|
#9
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
Quote:
What other languages have you found this error ?
__________________
"I have come to believe that the whole world is an enigma, a harmless enigma that is made terrible by our own mad attempt to interpret it as though it had an underlying truth." [color="DarkSlateBlue"]— Umberto Eco[/COLOR] |
|
#10
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
Quote:
1. Google is not combining results for differently-ordered text in Indic scripts - this applies to any script where you can enter characters in a different order to produce the same character. This should be resolved by google in the future. Maybe we should let them know. 2. Some registrars were not performing the nameprep routine, which is supposed to resolve such differences and produce a single punycode. So domains that were registered early on may be affected.
__________________
Sign up with NameDrive. |
|
#12
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
Quote:
__________________
Sign up with NameDrive. |
|
#13
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
It is getting like swimming with sharks in the dark these days.
Who knows who is on the guest list?
__________________
Premium Domains, large selection of most of the heavily speculated languages. PM me for details. All offers over 1 week old are null and void. dnlocal.com |
|
#14
|
|||
|
|||
|
Re: Devanagari Encoding Mystery
Quote:
__________________
off to watch some TV. Those doing NameDrive translations please send to me at the email address in my original PM; and I'll ensure Namedrive action them. Thanks. |
|
#15
|
||||
|
||||
|
Re: Devanagari Encoding Mystery
I wouldn't be surprised. They certainly have got enough mentions here during this past year; enough to be noticed...
. |
|
#16
|
|||
|
|||
|
Re: Devanagari Encoding Mystery
Quote:
hey, maybe it's you.
__________________
off to watch some TV. Those doing NameDrive translations please send to me at the email address in my original PM; and I'll ensure Namedrive action them. Thanks. |
![]() |
| Thread Tools | |
| Display Modes | |
|
|