1. Advertising
    y u no do it?

    Advertising (learn more)

    Advertise virtually anything here, with CPM banner ads, CPM email ads and CPC contextual links. You can target relevant areas of the site and show ads based on geographical location of the user if you wish.

    Starts at just $1 per CPM or $0.10 per CPC.

robots.txt pointing to index page

Discussion in 'robots.txt' started by NewComputer, Sep 26, 2004.

  1. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #21
    The main page was cached October 28, 2004 -- how long has this site been up?

    I could find only 3 backlinks, 2 from forums.

    If it is a new site, Google has found it and will eventually be back to try to spider the remaining pages. In the meantime, you need to submit to directories and see what backlinks you can arrange.
     
    minstrel, Oct 29, 2004 IP
  2. leeds1

    leeds1 Peon

    Messages:
    585
    Likes Received:
    10
    Best Answers:
    0
    Trophy Points:
    0
    #22
    Does that code above say to every useragent (ie: every spider) please do not come to my site rather than spider everything

    I think that's why you are not showing - the code is incorrect

    <edit> My mistake - I have just seen this on the robots site:

    To exclude all robots from the entire server

    User-agent: *
    Disallow: /

    To allow all robots complete access

    User-agent: *
    Disallow:

    Or create an empty "/robots.txt" file.
    To exclude all robots from part of the server

    User-agent: *
    Disallow: /cgi-bin/
    Disallow: /tmp/
    Disallow: /private/

    To exclude a single robot

    User-agent: BadBot
    Disallow: /

    To allow a single robot

    User-agent: WebCrawler
    Disallow:

    User-agent: *
    Disallow: /
     
    leeds1, Oct 29, 2004 IP
  3. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #23
    No, leeds1.... the line "Disallow: " with nothing after the ":" says "Disallow nothing", i.e., "spider everything".
     
    minstrel, Oct 29, 2004 IP
  4. leeds1

    leeds1 Peon

    Messages:
    585
    Likes Received:
    10
    Best Answers:
    0
    Trophy Points:
    0
    #24
    yep - saw that hence the follow up edit thingy
     
    leeds1, Oct 29, 2004 IP
  5. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #25
    Ah, sorry, leeds1 -- I must have been posting while you were editing... when I replied that wasn't there... spooky :eek:
     
    minstrel, Oct 29, 2004 IP
  6. mim

    mim Peon

    Messages:
    2
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #26
    I have recently completed a makeover to a site which had excellent Google rating. I changed some of the page names to optimize them for keywords and used the .htaccess file to force the redirection of old pages to the new counterpart.Will this method pf redirection have any effect on the ranking?
     
    mim, Apr 19, 2005 IP
  7. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #27
    No. As long as it is clear that this is a permaent redirect (301), that is exactly what Google recommends:

    Google Information for Webmasters
     
    minstrel, Apr 19, 2005 IP
  8. mim

    mim Peon

    Messages:
    2
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #28
    Thanks for your reply, but I am unclear on where the 301 code should go and the syntax required. Should it be in the header of the .htaccess file. The pages from the old site had different names so don't exist anylonger. I first made a 404 error page which allowd them access to the new navigation, but this would be bypassed now with the redirect from the .htaccess.
     
    mim, Apr 19, 2005 IP
  9. noppid

    noppid gunnin' for the quota

    Messages:
    4,246
    Likes Received:
    232
    Best Answers:
    0
    Trophy Points:
    135
    #29
    After the rewrite rule you put [R=301] or [R=301,L] most likely.
     
    noppid, Apr 19, 2005 IP
  10. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #30
    Place one of the following lines in your .htaccess file for each file that you've renamed:

    Redirect 301 /oldpage.html [url]http://www.yoursite.com/newpage.html[/url]
    Code (markup):
    or

    Redirect permanent /oldpage.html http://www.yoursite.com/newpage.html
    Code (markup):
     
    minstrel, Apr 19, 2005 IP
  11. Blogmaster

    Blogmaster Blood Type Dating Affiliate Manager

    Messages:
    25,924
    Likes Received:
    1,354
    Best Answers:
    0
    Trophy Points:
    380
    #31
    could that be the reason http://www.imgascot.com is indexed with the url as the title instead of the title tags?
     
    Blogmaster, Apr 19, 2005 IP
  12. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #32
    Are you using a redirect on that site, sitetutor?

    I see this in Google: http://www.google.com/search?source...,GGLD:2005-12,GGLD:en&q=site:www.imgascot.com

    while the meta tags say this:

    <title>IMG Ascot - Indianapolis Mortgage and Loan Specialists Serving Indiana</title>
    <meta name="keywords" content="Indianapolis Mortgages, Indianapolis Indiana Mortgages, Indianapolis IN Mortgages, Indianapolis Mortgages and Loans, Indianapolis Indiana Mortgages and Loans, Indianapolis IN Mortgages and Loans, Indianapolis Home Loans, Indianapolis Indiana Home Loans, Indianapolis IN Home Loans, Indianapolis Loans, Indianapolis Indiana Loans, Indianapolis IN Loans">
    <meta name="description" content="Indiana mortgage, Indianapolis Mortgage Specialists Serving the State on Indiana, ">
    <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1" />
    
    Code (markup):
    So the "title" Google uses, if it IS reflecting the fact that it's a redirect, is interesting -- I've never seen that before.
     
    minstrel, Apr 19, 2005 IP
  13. Blogmaster

    Blogmaster Blood Type Dating Affiliate Manager

    Messages:
    25,924
    Likes Received:
    1,354
    Best Answers:
    0
    Trophy Points:
    380
  14. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #34
    Wait... the snippet is from an old version of the page... not the current version.

    Did the page EVER have that as a title?
     
    minstrel, Apr 19, 2005 IP
  15. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #35
    Oh. I'm not sure what that means fopr MSN search but Google does that for newly found but not yet fully spidered sites.

    How new IS this site?

    DOES it use a redirect?

    I'm still interested in that Google result...
     
    minstrel, Apr 19, 2005 IP
  16. noppid

    noppid gunnin' for the quota

    Messages:
    4,246
    Likes Received:
    232
    Best Answers:
    0
    Trophy Points:
    135
    #36
    That looks like a webhost's temp page when you activate an account. Your site seems to have been indexed before it was setup with what Minstrel shows.
     
    noppid, Apr 19, 2005 IP
  17. Blogmaster

    Blogmaster Blood Type Dating Affiliate Manager

    Messages:
    25,924
    Likes Received:
    1,354
    Best Answers:
    0
    Trophy Points:
    380
    #37
    no, we took the site over as a client but it has been indexed for a few months ... it seems like there is a robot.txt file in the way
     
    Blogmaster, Apr 19, 2005 IP
  18. noppid

    noppid gunnin' for the quota

    Messages:
    4,246
    Likes Received:
    232
    Best Answers:
    0
    Trophy Points:
    135
    #38
    It is ... http://www.imgascot.com/robots.txt

    # robots, nothing to see here. keep on move n...

    User-agent: *

    disallow /

    and it's written wrong too.
     
    noppid, Apr 19, 2005 IP
  19. minstrel

    minstrel Illustrious Member

    Messages:
    15,082
    Likes Received:
    1,243
    Best Answers:
    0
    Trophy Points:
    480
    #39
    Hey, good find there, noppid! That robots.txt file is telling Google (and all other spiders) not to index anything on the site... assuming the bots can figure it out since, as noppid noted, the syntax is incorrect.

    Change this:

    # robots, nothing to see here. keep on move n...
    
    User-agent: *
    
    disallow /
    
    Code (markup):
    to this

    User-agent: *
    Disallow:
    
    Code (markup):
    or just delete the file.
     
    minstrel, Apr 19, 2005 IP
  20. Blogmaster

    Blogmaster Blood Type Dating Affiliate Manager

    Messages:
    25,924
    Likes Received:
    1,354
    Best Answers:
    0
    Trophy Points:
    380
    #40
    Thank you guys, deleted that garbage!
     
    Blogmaster, Apr 19, 2005 IP