Advanced Search

Results 1 to 5 of 5

Thread: Funny characters

  1. #1
    Join Date
    Nov 2006
    Posts
    116
    Thanks
    6
    Thanked 0 Times in 0 Posts

    Default Funny characters

    Hi Guys,

    WHy do i get some funny characters on the address bar?

    Something like 20%
    any ideas????

  2. #2
    Join Date
    Sep 2006
    Location
    St. George, UT
    Posts
    2,769
    Thanks
    3
    Thanked 157 Times in 155 Posts

    Default

    %20 is a url encoded space. There are several more, just look at this page for the list of them:

    http://i-technica.com/whitestuff/urlencodechart.html

    Hope this helps
    "Computer games don't affect kids; I mean if Pac-Man affected us as kids, we'd all be running around in darkened rooms, munching magic pills and listening to repetitive electronic music." - Kristian Wilson, Nintendo, Inc, 1989
    TheUnlimitedHost | The Testing Site | Southern Utah Web Hosting and Design

  3. #3
    Join Date
    Nov 2006
    Posts
    116
    Thanks
    6
    Thanked 0 Times in 0 Posts

    Default

    Thanks a lot. so silly of me to put a space in the page name.

  4. #4
    Join Date
    Feb 2007
    Location
    🌎
    Posts
    528
    Thanks
    10
    Thanked 10 Times in 10 Posts
    Blog Entries
    2

    Default

    And an explanation of why would be that if you put special characters in the page name, including a space, turn into '%' then the hex code. The hex code for a space is 0020, and % + 20 = %20. You can encode any URL with hex codes by hand, but characters not in the UTF-8 character set are converted automatically. For example, the URL for this page is 'http://www.dynamicdrive.com/forums/showthread.php?t=21909,' which is the smae as 'http://%77%77%77%2E%64%79%6E%61%6D%69%63%64%72%69%76%65%2E%63%6F%6D%2F%66%6F%72%75%6D%73%2F%73%68%6F%77%74%68%72%65%61%64%2E%70%68%70%3F%74%3D%32%31%39%30%39.' A good idea to prevent this is use substitutes in the UTF-8 character set. For example, change ' ' to _, Ꮷ to d,  to OBJ, Ꭰ to D, 𐁘 to X, Ꭱ to R, Ꭲ to T, 𐀦 to Y, Ꭺ to A, Ꭻ to J, Ꭼ to E, Ꭹ to y, Ꭸ to h, Ꭵ to i, Ꮏ to t, Ϲ to C, Ϻ to M, ϻ to m, ϼ to p, ϸ to b, η to n, ο to o, etc.
    ....(o_ Penguins
    .---/(o_- techno_racing
    +(---//\-' in
    .+(_)--(_)' The McMurdo 500

  5. #5
    Join Date
    Jun 2005
    Location
    英国
    Posts
    11,878
    Thanks
    1
    Thanked 180 Times in 172 Posts
    Blog Entries
    2

    Default

    UTF-8 is a Unicode encoding, meaning that it can represent one of the largest numbers of characters of any encoding. All the "non-UTF-8" characters you've given above are in fact representable by UTF-8.

    Only ASCII characters are allowed in URLs, and only some of those. In general, try to avoid anything other than numbers, letters, underscores, and full stops.

    Unicode URLs are, in my opinion, one of the most needed upgrades to HTTP for the Web as it is now.
    Twey | I understand English | 日本語が分かります | mi jimpe fi le jbobau | mi esperanton komprenas | je comprends franšais | entiendo espa˝ol | t˘i Ýt hiểu tiếng Việt | ich verstehe ein bisschen Deutsch | beware XHTML | common coding mistakes | tutorials | various stuff | argh PHP!

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •