Discussion:
Problem passing URL with non-ASCII characters
Jernej Simončič
17 years ago
Permalink
I received an e-mail with this URL:
<http://en.wikipedia.org/wiki/Nikkō_Tōshō-gū>, however when I click on
the URL in The Bat, Opera instead receives
<http://en.wikipedia.org/wiki/Nikkô_Tôshô-gű>, which leads to a
nonexistent page (interestingly enough, this only happens while
viewing the message - if I double-click the URL in editor, the correct
one is passed to Opera).

There's even more weirdness though - when I view this message in my
Outbox, the URL ends at the first non-ASCII character, which suggests
some kind of a problem with UTF-8 URL detection.

The original message was in ISO-8859-13 charset, and my system is set
up to use Slovenian language for non-Unicode programs (Windows-1252
codepage).
--
< Jernej Simončič ><><><><>< http://eternallybored.org/ >

If it should exist, it doesn't.
-- Arnold's First Law of Documentation

[The Bat! v4.0.26.4 on Windows XP Professional x64 Edition 5.2.3790.Service Pack 2]



________________________________________________________
Current beta is 4.0.26.4 | 'Using TBBETA' information:
http://www.silvers
Thomas Fernandez
17 years ago
Permalink
Hello Jernej,

On Fri, 1 Aug 2008 17:47:20 +0200 GMT (01/08/2008, 22:47 +0700 GMT),
Jernej Simončič wrote:

Do you mean high-ASCII characters? I thought this is what they are
called.

JS> I received an e-mail with this URL:
JS> <http://en.wikipedia.org/wiki/Nikkō_Tōshō-gū>, however when I click on
JS> the URL in The Bat, Opera instead receives
JS> <http://en.wikipedia.org/wiki/Nikkô_Tôshô-gű>, which leads to a
JS> nonexistent page (interestingly enough, this only happens while
JS> viewing the message - if I double-click the URL in editor, the correct
JS> one is passed to Opera).

JS> There's even more weirdness though - when I view this message in my
JS> Outbox, the URL ends at the first non-ASCII character, which suggests
JS> some kind of a problem with UTF-8 URL detection.

That is the problem. MicroEd seems to be selective with high-ASCII
characters. The links you mentioned stopped being underlined and blue
after ".../nikk".

I just tried this: www.año.es The website doesn't exist, but the blue
underlining continues until the end. It tries to open
http://www.xn--ao-zja.es/ - is this correct?
--
Cheers,
Thomas.

I've heard people are more violently opposed to fur than leather
because it's safer to harass rich women than motorcycle gangs.
http://thomas.fernandez.hat-gar-keine-homepage.de/

Message reply created with The Bat! 4.0.26.3
under Windows XP 5.1 Build 2600 Service Pack 2



________________________________________________________
Current beta is 4.0.26.4 | 'Using TBBETA' information:
http
Jernej Simončič
17 years ago
Permalink
Post by Thomas Fernandez
That is the problem. MicroEd seems to be selective with high-ASCII
characters. The links you mentioned stopped being underlined and blue
after ".../nikk".
Except that I'm not using MicroEd, but the Rich Text viewer.
Post by Thomas Fernandez
I just tried this: www.año.es The website doesn't exist, but the blue
underlining continues until the end. It tries to open
http://www.xn--ao-zja.es/ - is this correct?
Yes, that's punycode. You can try <http://čšž.ena.si/>, which does
exist.
--
< Jernej Simončič ><><><><>< http://eternallybored.org/ >

[The Bat! v4.0.26.4 on Windows XP Professional x64 Edition 5.2.3790.Service Pack 2]

Social innovations tend to the level of minimum tolerable well being.
-- Albrecht's Law


________________________________________________________
Current beta is 4.0.26.4 | 'Using TBBETA' information:
http://www.
Thomas Fernandez
17 years ago
Permalink
Hello Jernej,

On Fri, 1 Aug 2008 22:15:01 +0200 GMT (02/08/2008, 03:15 +0700 GMT),
Post by Thomas Fernandez
That is the problem. MicroEd seems to be selective with high-ASCII
characters. The links you mentioned stopped being underlined and blue
after ".../nikk".
JS> Except that I'm not using MicroEd, but the Rich Text viewer.

I see. The same problem seems to exist in both, then.
Post by Thomas Fernandez
I just tried this: www.año.es The website doesn't exist, but the blue
underlining continues until the end.
Funny, in the editor the URL was completely underlined, but in the
viewer (when I received your reply) not.
Post by Thomas Fernandez
It tries to open http://www.xn--ao-zja.es/ - is this correct?
JS> Yes, that's punycode. You can try <http://čšž.ena.si/>, which does
JS> exist.

This was not underlined in the viewer, but is underlined in the eidtor
while replying. When I double-click on it here in the editor, the
special characters are handed over to the browser correctly.
--
Cheers,
Thomas.

The real question for 1988 is whether we're going to go forward to
tomorrow or past to the--to the back! --V.P. Dan Quayle.
http://thomas.fernandez.hat-gar-keine-homepage.de/

Message reply created with The Bat! 4.0.26.3
under Windows XP 5.1 Build 2600 Service Pack 2



________________________________________________________
Current beta is 4.0.26.5 | 'Using TBBETA' information:
http://www.silverstones.com/thebat/TBUDLInfo.htm
Dwight Corrin
17 years ago
Permalink
Post by Jernej Simončič
<http://en.wikipedia.org/wiki/Nikkō_Tōshō-gū>, however when I click on
the URL in The Bat, Opera instead receives
<http://en.wikipedia.org/wiki/Nikkô_Tôshô-gű>, which leads to a
nonexistent page (interestingly enough, this only happens while
viewing the message - if I double-click the URL in editor, the correct
one is passed to Opera).
Can't confirm in xp, it worked fine for me, but in vista, my link is
marked only to 'nikk'

in the editor, though, the whole link is blue.

character set in vista is unicode.
--
Dwight A. Corrin
316.303.9385 phone ahead to fax
dcorrin at fastmail.fm
photo galleries at http://dcorrin.smugmug.com
Using IMAP with The Bat! 4.0.26.4 on Windows Vista version 6,0 (Service Pack 1)


________________________________________________________
Current beta is 4.0.26.4 | 'Using TBBETA' information:
http://www.si
Jernej Simončič
17 years ago
Permalink
Post by Dwight Corrin
Post by Jernej Simončič
<http://en.wikipedia.org/wiki/Nikkō_Tōshō-gū>,
Can't confirm in xp, it worked fine for me, but in vista, my link is
marked only to 'nikk'
It appears that the whole URL is underlined when using Plain Text
viewer (and when clicked, the correct characters are sent), but only
the ASCII part is underlined with Rich Text Viewer (when the message
is UTF-8 encoded; when it's ISO-something, the whole URL is
underlined, but wrong characters are sent to the browser).
--
< Jernej Simončič ><><><><>< http://eternallybored.org/ >

[The Bat! v4.0.26.4 on Windows XP Professional x64 Edition 5.2.3790.Service Pack 2]

Disorder expands proportionately to the tolerance for it.
-- Law of Organization


________________________________________________________
Current beta is 4.0.26.4 | 'Using TBBETA' information:
http://www.silverstones.com/thebat/TBUDLInfo.html
Dwight Corrin
17 years ago
Permalink
...
In your quoted text here, I get the whole url underlined, but in the
original message, only to nikk
--
Dwight A. Corrin
316.303.9385 phone ahead to fax
dcorrin at fastmail.fm
photo galleries at http://dcorrin.smugmug.com
Using IMAP with The Bat! 4.0.26.5 on Windows Vista version 6,0 (Service Pack 1)


________________________________________________________
Current beta is 4.0.26.5 | 'Using TBBETA' information:
http://www.silverstones.com/thebat/TBUDLInfo.html
Jernej Simončič
17 years ago
Permalink
Post by Dwight Corrin
In your quoted text here, I get the whole url underlined, but in the
original message, only to nikk
That's probably because the message is ISO-8859-4, while the original
one (where the URL wasn't underlined properly) was UTF-8.
--
< Jernej Simončič ><><><><>< http://eternallybored.org/ >

[The Bat! v4.0.26.5 on Windows XP Professional x64 Edition 5.2.3790.Service Pack 2]

It is easier to get forgiveness than permission.
-- Pope's Law of Retroactivity


________________________________________________________
Current beta is 4.0.26.5 | 'Using TBBETA' information:
http://www.silverstones.com/thebat/TBUDLInfo.html
Loading...