Its been a few months since I last posted saying I’d fixed my crashing linux box issue. As it turns out, I hadn’t fixed it.
My server was stable for a very long time… then started having issues again. After everything I’d done, I went out and bought a better motherboard and installed it. Seemed good…. a week later issues crept in again.
Then I noticed the kernel modules installed for Virtualbox were for the Open Source Edition, not the version I was running. So I nuked them and installed the correct ones. All good for another week.
And the issues came back.
So I ran memtest86 for a few hours on the RAM (I did this months ago too), with no issues. Put the RAM back in, ran the linux box till it had issues again. I immediately rebooted into memtest86… and got errors.
I’ve removed the two 2gig DIMMs that I had been using, and dropped in a single 2gig DIMM I yanked from my HTPC. Its been about a month since then, and its been ok. Problem is I’ve gone 3 months without issues before…. so unless this thing is rock solid until Christmas, I’m not counting it as fixed. I’ve learnt my lesson.
In other news, both of the NICs I have in the server have issues under linux. The first uses a Realtek 8169 chipset… which has no drivers available that I can find. So I can’t use that one. The other users the sky2 module. Googling tells me lots of users with cards using this module that have the exact same issue I do. Specifically that when under very high throughput load I get this “NETDEV WATCHDOG: eth0: transmit timed out”. Further googling found me a dodgy script someone had put together that may or may not resolve the issue. It basically downs eth0, removes the module, reinserts the modules and ups the interface again. I haven’t had time to force the problem to occur so I can test if the script actually restores network traffic or not.
There you have it. As it stands I’m pretty happy with the server… its been stable for a while now, and its doing everything I want it to do. Which is good, I’m busy enough with random faults at work. I hate dealing with them in my own time.