We currently use gSitecrawler to handle our very large sites. It automatically creates as many sitemap files as you need. Be aware, though, that it takes time: crawling 1,000,000 pages can run overnight.
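
For a sense of why more than one file is needed: the sitemaps.org protocol caps a single sitemap file at 50,000 URLs, so 1,000,000 pages need at least 20 files. Here is a rough sketch of that splitting in Python. This is not gSitecrawler's own code; the function names, the `example.com` URLs, and the `sitemap-N.xml` naming are all made up for illustration, and I'm assuming the files in question are sitemap files.

```python
MAX_URLS_PER_SITEMAP = 50_000  # per-file URL limit in the sitemaps.org protocol


def split_into_sitemaps(urls, limit=MAX_URLS_PER_SITEMAP):
    """Split a list of URLs into chunks that each fit in one sitemap file."""
    return [urls[i:i + limit] for i in range(0, len(urls), limit)]


def sitemap_filenames(chunk_count, prefix="sitemap"):
    """Hypothetical naming scheme: sitemap-1.xml, sitemap-2.xml, ..."""
    return [f"{prefix}-{n}.xml" for n in range(1, chunk_count + 1)]


# Placeholder URLs standing in for a crawled site of 1,000,000 pages.
urls = [f"https://example.com/page/{n}" for n in range(1_000_000)]
chunks = split_into_sitemaps(urls)
names = sitemap_filenames(len(chunks))

print(len(chunks))  # 20
print(names[0], names[-1])  # sitemap-1.xml sitemap-20.xml
```

The splitting itself is cheap; the overnight run time comes from fetching the pages, not from writing the files.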