Hello, does anybody know here of a sitemap generator that would NOT harvest no-follow and robots.txt blocked links? Would be much appreciated!
gsitecrawler you can chose to import it or not, but why wouldn't you want to block the robots.txt in your sitemap?
The A1 Sitemap Generator program has "crawler filter" options for: download robots.txt obey robots.txt obey <meta> "robots" noindex obey <meta> "robots" nofollow obey <a> "rel" nofollow