Google stores web pages, but can anyone tell me how they store them on their computers? I mean, is it in ASCII or plain text, or do they use some other format? If anyone can share some info, that would be helpful. Thanks in advance.
Why would you need that information? I think they store the whole page source "as is", judging by their cache feature...
They use their own file system for storing such huge amounts of information. It is called the Google File System (GFS): http://en.wikipedia.org/wiki/Google_File_System And I don't think they need to store the images themselves at all. They just need the alt text so that they can serve the image in image search for the appropriate keyword.
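To make the alt text idea concrete, here is a minimal sketch of pulling `src` and `alt` out of a page's HTML with Python's standard `html.parser`. This is only an illustration of the idea, not how Google actually does it; the names `AltTextCollector` and `collect_alt_text` are made up for this example.

```python
from html.parser import HTMLParser


class AltTextCollector(HTMLParser):
    """Collects (src, alt) pairs for every <img> tag in a page (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.images = []

    def handle_starttag(self, tag, attrs):
        # The parser lowercases tag and attribute names for us.
        if tag == "img":
            attr_map = dict(attrs)
            # A missing or valueless alt attribute becomes an empty string.
            self.images.append((attr_map.get("src"), attr_map.get("alt") or ""))


def collect_alt_text(html):
    """Return a list of (src, alt) tuples found in the given HTML string."""
    parser = AltTextCollector()
    parser.feed(html)
    parser.close()
    return parser.images
```

An indexer built this way would only need to keep the image URL and its text, not the image file itself.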
I think they just store the HTML of the whole page. That's why some cached pages look wonky: the JavaScript, stylesheets, etc. are missing. This is just my guess, by the way.
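I am pretty sure that they save the source. Try opening the cached copy of a page that contains an image which no longer exists: the image will be broken, because the cache only holds the HTML and the image is still loaded from the original address. The source is probably stored in plain files rather than in a database, since a database would just hold it as text anyway.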
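On the original ASCII vs. text question: if the source is saved "as is", the simplest approach is to write the raw bytes to a file without converting them, so the page stays in whatever encoding the site used. Here is a rough sketch of that, assuming one file per URL named by a hash of the URL. The function names and the naming scheme are my own assumptions for illustration, not anything Google has documented.

```python
import hashlib
from pathlib import Path


def snapshot_path(cache_dir, url):
    """Build a file path for a URL (hypothetical scheme: SHA-256 of the URL)."""
    name = hashlib.sha256(url.encode("utf-8")).hexdigest() + ".html"
    return Path(cache_dir) / name


def save_snapshot(cache_dir, url, raw_bytes):
    """Write the page bytes exactly as received, with no re-encoding."""
    path = snapshot_path(cache_dir, url)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_bytes(raw_bytes)
    return path


def load_snapshot(cache_dir, url):
    """Read the stored bytes back; decoding is left to whoever displays them."""
    return snapshot_path(cache_dir, url).read_bytes()
```

Because only the HTML bytes are kept, an `<img>` tag in the snapshot still points at the original server, which matches the broken image behaviour described above.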
Google uses its cache feature, which stores the whole web page as it is (meaning the source code is copied and kept on Google's hard drives).