Hello, I am having some very random crashes on a server. Its been crashing randomly for about 5 months. I have tried several different OS installs. Linux, Server2003... it will always crash. Its random, although I get it to crash by doing certain things with software. I am pretty confident its hardware though since it happens with numerous OS installs. The event viewer doesnt really reveal a whole lot that I can figure out although I guess there will be something useful. Can anyone suggest ways I can figure out what is going on? If someone is willing to take a look I can pay for an hours work. Send me a PM with contact details if you can provide assistance in any form.
Sounds like you need to run some kind of hardware diagnostics utility. I don't know of any that you can just download... I just run what comes with my computers.
it will be taking out all the memory and just put back 1 extra stick at a time.. that is the only way you going to find out..
Lol. No it's not. Download http://www.memtest86.com/ and burn it to CD-ROM, then reboot the computer. Your server will be down during this test and you need to let it run for about 8 hours to really make sure the RAM is alright, not just one or two passes. If the computer gets through memtest without any memory errors for 8+ hours, then the RAM is fine. Also, what CPU are you using? Is it a well ventilated case? Have you checked the CPU temperatures? Perhaps the CPU is overheating. I've had systems like that. I had to buy a better heatsink/fan for the CPU and it fixed it. It also could be the motherboard. I had a friend who had a bad southbridge or northbridge chip (can't remember which one it was) that would overheat and crash randomly. Simply putting a small heatsink on it solved the problem. So, how about full system specs.
Specs: Dual 3.0Ghz Xeons Supermicro 6014HT 4GB RAM Its quite a big machine to have idle doing nothing. I ran a memory check that came back negative anywya