Since the site has over a million pages, it must be a dynamic web site (PHP, ASPX, ASP, etc.). None of the SEs will follow a query string path, which is why they created the XML Sitemap concept. They will index a page with a query string, but they will not follow it. If you are not submitting extensive XML Sitemaps via Google's Webmaster Tools, you have no chance of ever getting the pages indexed. Second, since it is a dynamic site, the "template" used must create different pages (i.e. less than 50% common code) or they will be considered duplicate content. If the pages have a gray bar, this is what is happening. You must remove as much of the common code as possible and then add more unique content. Even without looking at the site, I'd say look closely at your navigation. It usually makes up over 80% of the code of a page and will therefore trigger the duplicate content penalty. What's the URL?
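For anyone who has not built a sitemap for a large dynamic site before, here is a minimal sketch of what the XML Sitemap submission mentioned above might look like. The function names and the example.com URLs are made up for illustration. The namespace and the 50,000-URLs-per-file limit come from the sitemaps.org protocol, which is why a big site needs several sitemap files plus an index file pointing at them.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_FILE = 50_000  # per-file URL limit in the sitemaps.org protocol


def chunk(items, size=MAX_URLS_PER_FILE):
    """Yield consecutive slices of at most `size` items."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def build_sitemap(urls):
    """Return one <urlset> document (as a string) listing the given URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        # ElementTree escapes "&" in query strings to "&amp;" on output.
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


def build_sitemap_index(sitemap_urls):
    """Return a <sitemapindex> document pointing at the sitemap files."""
    index = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for url in sitemap_urls:
        ET.SubElement(ET.SubElement(index, "sitemap"), "loc").text = url
    return ET.tostring(index, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical dynamic URLs; replace with the real pages of the site.
    pages = [f"http://example.com/page.php?id={i}&lang=en" for i in range(3)]
    print(build_sitemap(pages))
    print(build_sitemap_index(["http://example.com/sitemap-1.xml"]))
```

Each chunk from `chunk(all_urls)` would be written to its own file, and the index file is the one you submit in Webmaster Tools.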
OK, you are on free hosting, and your site in the subfolder has only 5 pages. It is not possible to submit this website Footbal.... to Google Webmaster Tools, because the domain is already registered in the Google tool by the free service. You need a confirmation key, but I am not 100% sure. Hmm. Do you follow me? My English is not too bad? Best regards
So, I am still not indexed and it's still very slow... I have submitted sitemaps etc. I have created an RSS feed for my website and submitted it to many RSS directories. Still nothing: it crawls one or two pages every 3-4 minutes. Any tips on how to speed it up?
How long have those "millions of pages" been on your site? If they haven't been indexed yet, I'm guessing not very long. If you have a new site with millions of pages, getting indexed is not your biggest issue. Your issue is that you either have duplicate content or no content. Either way, you'll never make it out of the supplemental index ...even if you do get indexed. You'll never rank high, because the search engines won't find you credible if you're going from 0 pages to millions of pages overnight. No site can do that with "good content". ...it might have worked 10 years ago ;-)
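If you want a rough number for the duplicate content question, one option is to compare the markup of two generated pages line by line. This is only a sketch using Python's standard difflib. It says nothing about how a search engine actually measures duplication, and the function name and sample pages are made up for illustration.

```python
from difflib import SequenceMatcher


def shared_markup_ratio(html_a, html_b):
    """Return a 0..1 similarity score for two pages, compared line by line."""
    lines_a = [line.strip() for line in html_a.splitlines() if line.strip()]
    lines_b = [line.strip() for line in html_b.splitlines() if line.strip()]
    return SequenceMatcher(None, lines_a, lines_b).ratio()


if __name__ == "__main__":
    # Two hypothetical pages that share the navigation and differ in the body.
    page_one = "<nav>\n<a href='/a'>A</a>\n<a href='/b'>B</a>\n</nav>\n<p>First</p>"
    page_two = "<nav>\n<a href='/a'>A</a>\n<a href='/b'>B</a>\n</nav>\n<p>Second</p>"
    print(f"shared markup: {shared_markup_ratio(page_one, page_two):.0%}")
```

A score close to 1 across many page pairs would suggest the template dominates the page and the unique part is small.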
It's not duplicate or zero content, and it's not very spammy either (some data generated from public databases). It does not have millions of pages yet; currently it has fewer than 20k. I have a similar site with about 5k pages, which was indexed quite fast and ranks very well now.