Need to find OUR URLs

Discussion in 'Services' started by Not Registered, May 26, 2006.

  1. #1
    Hello experts.

    Google indicates 20,000 pages from our site, but display only the first 1,000.
    That is a known issue.

    However wwc crawler has successfully crawled 5,000 pages. That's way more than Google actually gives us, but much less than the 20,000 pages.

    Can you suugest a solution?
    I'm very open to bids !!
     
    Not Registered, May 26, 2006 IP
  2. mad4

    mad4 Peon

    Messages:
    6,986
    Likes Received:
    493
    Best Answers:
    0
    Trophy Points:
    0
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #2
    Whats the url?
     
    mad4, May 26, 2006 IP
  3. Not Registered

    Not Registered Well-Known Member

    Messages:
    685
    Likes Received:
    58
    Best Answers:
    0
    Trophy Points:
    120
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #3
    PM sent, thanks.
     
    Not Registered, May 26, 2006 IP
  4. mad4

    mad4 Peon

    Messages:
    6,986
    Likes Received:
    493
    Best Answers:
    0
    Trophy Points:
    0
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #4
    I can see 17,600 pages in google.

    Are you trying to find out why google has listed so many pages when you think your site has fewer pages?

    If so it will be hard as google only shows the first 1000 results. You need to examine the site architecture and make sure all the old pages are 301 redirected to the new pages or to a 404 error page.
     
    mad4, May 26, 2006 IP
  5. MattUK

    MattUK Notable Member

    Messages:
    6,950
    Likes Received:
    377
    Best Answers:
    0
    Trophy Points:
    275
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #5
    MattUK, May 26, 2006 IP
  6. Not Registered

    Not Registered Well-Known Member

    Messages:
    685
    Likes Received:
    58
    Best Answers:
    0
    Trophy Points:
    120
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #6
    Instead of Google, I used WinWebCrawler to crawl the site, and found `6k pages. This bypasses the the 1K limit of Displayed pages by Google.

    The site changed hands, and in order to 301 Redirect the pages I want to crawl any Lost page; i.e 12K lost URLs need to be found.
    Will be happy to any solution possible

    Thanks
     
    Not Registered, May 26, 2006 IP
  7. Not Registered

    Not Registered Well-Known Member

    Messages:
    685
    Likes Received:
    58
    Best Answers:
    0
    Trophy Points:
    120
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #7
    Instead of Google, I used WinWebCrawler to crawl the site, and found `6k pages. This bypasses the the 1K limit of Displayed pages by Google.

    The site changed hands, and in order to 301 Redirect the pages I want to crawl any Lost page; i.e 12K lost URLs need to be found.
    Will be happy to any solution possible

    Thanks
     
    Not Registered, May 26, 2006 IP
  8. Not Registered

    Not Registered Well-Known Member

    Messages:
    685
    Likes Received:
    58
    Best Answers:
    0
    Trophy Points:
    120
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #8
    Cool tool, I'll try it out.
    Thanks!
     
    Not Registered, May 26, 2006 IP
  9. mad4

    mad4 Peon

    Messages:
    6,986
    Likes Received:
    493
    Best Answers:
    0
    Trophy Points:
    0
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #9
    Can you not just look at your server logs for the error messages? Find all the 404 and 500 errors and solve them one by won.
     
    mad4, May 26, 2006 IP
  10. Not Registered

    Not Registered Well-Known Member

    Messages:
    685
    Likes Received:
    58
    Best Answers:
    0
    Trophy Points:
    120
    As Seller:
    100% - 0
    As Buyer:
    100% - 0
    #10
    It appears that if I won't be able to relove the lost ones, that's what I'll be doing. Cheers :)
     
    Not Registered, May 26, 2006 IP