Can someone recommend a command-line tool for Windows that converts HTML to plain text? The text doesn't need to look nice; I just need it to compute a keyword density cloud.
Hmm. I don't know of a specific command-line tool for Windows, but isn't HTML already plain text? Perhaps what you need is more like a crawler or parser that extracts the text from the HTML?
Try HTMLAsText from NirSoft. From the site: "HTMLAsText utility converts HTML documents to simple text files, by removing all HTML tags and formatting the text according to your preferences." Alternatively, Markup Remover from Aquatic Moon Software converts HTML into plain text and offers several output options.
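If installing a separate tool isn't an option, a short Python script using only the standard library can probably do the job as well. This is a rough sketch, not a polished converter: the names `TextExtractor`, `html_to_text` and `keyword_counts` are made up for illustration, the input is assumed to be UTF-8, and "keyword" here just means a lowercase run of letters, digits or apostrophes.

```python
import re
import sys
from collections import Counter
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects text nodes, ignoring the contents of <script> and <style>."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only text that is not inside a skipped element.
        if self._skip_depth == 0:
            self.parts.append(data)


def html_to_text(html):
    """Strip the tags and collapse all whitespace to single spaces."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.parts).split())


def keyword_counts(text):
    """Count lowercase word occurrences (a crude notion of 'keyword')."""
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


if __name__ == "__main__" and len(sys.argv) > 1:
    # Assumption: the first argument is the path to a UTF-8 HTML file.
    with open(sys.argv[1], encoding="utf-8", errors="replace") as f:
        text = html_to_text(f.read())
    for word, count in keyword_counts(text).most_common(20):
        print(count, word)
```

Saved as, say, html2text.py (a hypothetical name), it would be run from the Windows command prompt as `python html2text.py page.html`. Text fragments are joined with spaces, so a word split by inline tags (for example `<b>H</b>ello`) comes out as two tokens; for a rough keyword density count that is usually tolerable.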