What should I do with %D8%B7%D8 pages??

Discussion in 'Google Sitemaps' started by metros, Oct 3, 2008.

  1. #1
    Hi,
    I have those pages:
    domain.com/tag/%D8%B7%D8

    Does Search Engines read them and translate it to the real language?
     
    metros, Oct 3, 2008 IP
  2. gbh1935

    gbh1935 Peon

    Messages:
    585
    Likes Received:
    11
    Best Answers:
    0
    Trophy Points:
    0
    #2
    no idea, but as a practice, I always try to make it as human readable as possible
     
    gbh1935, Oct 3, 2008 IP
  3. metros

    metros Notable Member

    Messages:
    3,978
    Likes Received:
    373
    Best Answers:
    0
    Trophy Points:
    245
    #3
    Well yeah, just wonder if the search engines auto-generate it to the real languages or shall I do that.
     
    metros, Oct 4, 2008 IP
  4. David Stone

    David Stone Peon

    Messages:
    21
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #4
    that kind code is ok for google, google can recognize it.
     
    David Stone, Oct 4, 2008 IP
  5. einfoway

    einfoway Member

    Messages:
    83
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    26
    #5
    I think google will not :)
     
    einfoway, Oct 7, 2008 IP
  6. metros

    metros Notable Member

    Messages:
    3,978
    Likes Received:
    373
    Best Answers:
    0
    Trophy Points:
    245
    #6
    Any one can confirm?
     
    metros, Oct 7, 2008 IP
  7. traitorworld

    traitorworld Banned

    Messages:
    544
    Likes Received:
    35
    Best Answers:
    0
    Trophy Points:
    0
    #7
    lol, if you cannot read %D8%B7%D8 then how can you expect the spiders to?
    They aren't so intelligent as we, so you should be doing all the editing job to be on the safe side.
     
    traitorworld, Oct 9, 2008 IP
  8. metros

    metros Notable Member

    Messages:
    3,978
    Likes Received:
    373
    Best Answers:
    0
    Trophy Points:
    245
    #8
    Jesus, you wanna kill me? hehe
    Can't edit millions of pages :p

    But I get lot of peoples tell me that SE's can understand it, after all they even gave same results..
     
    metros, Oct 9, 2008 IP
  9. websitetools

    websitetools Well-Known Member

    Messages:
    1,513
    Likes Received:
    25
    Best Answers:
    4
    Trophy Points:
    170
    #9
    What you have there is percentage encoded / url encoded URLs. In your cause the encoding seems to be from encoding non-ASCII characters as UTF-8 bytes. If that is indeed the cause, when the URLs and encoding are fully correct. I have just posted the first draft of my sitemaps FAQ item about character percentage encoding / url encoding.
     
    websitetools, Oct 10, 2008 IP
  10. metros

    metros Notable Member

    Messages:
    3,978
    Likes Received:
    373
    Best Answers:
    0
    Trophy Points:
    245
    #10
    Hi Thomas,
    Then can I put them on sitemaps and the SE's will translate them to the real "words" ?
     
    metros, Oct 10, 2008 IP
  11. websitetools

    websitetools Well-Known Member

    Messages:
    1,513
    Likes Received:
    25
    Best Answers:
    4
    Trophy Points:
    170
    #11
    It's an error if you do not url encode URLs in sitemaps / webserver. (Assuming that you have URLs that require to be encoded, e.g. if you use non-English characters etc.)

    Quote from official sitemaps.org protocol website:

    I will add that to my sitemap generator FAQ :)
     
    websitetools, Oct 10, 2008 IP
  12. webrickco

    webrickco Active Member

    Messages:
    268
    Likes Received:
    4
    Best Answers:
    0
    Trophy Points:
    60
    #12
    Escape codes, url encoded strings are read very well by spiders. In fact if your browser can read the url string and display the results from the server, then the spider can! Spiders work like pieces of browser while opening remote content, but instead of parsing the result, they just analyze it. Take a look at http://www.google.com/support/webmasters/bin/answer.py?answer=35769, and if you are not sure if your url are spider readable, try to paste them into lynx. If it reads them, then spiders will. And this is not because human are less intelligent! :)
    Also if you want to know the reason why your url won't be followed by spiders, have a look to http://www.webrickco.com/buildsitemap.php. It will generate a sitemap, display in red every url that are not accessible and if you click on one of those lines, states the reason why it is not followable.
     
    webrickco, Oct 11, 2008 IP
  13. metros

    metros Notable Member

    Messages:
    3,978
    Likes Received:
    373
    Best Answers:
    0
    Trophy Points:
    245
    #13
    Thanks for the info :)
     
    metros, Oct 18, 2008 IP