Have you ever had one of those days? I have to admit they are rare for me, but yesterday was one of those days.
The first problem was that three of my blade servers refused to power up. After some initial troubleshooting I discovered that this is a ‘known firmware issue’. What?!? According to the vendor if the blades are up for a really long time (yes, mine were) and you remove them from power (yes, I moved them) then they will refuse to power up. The solution is to replace the motherboard. Good thing they were all under warranty.
The second problem was related to the server and storage that was just moved to a hosted data center. The data center advertised fantastic reliability; dual this and dual that for redundancy. After finally working through the blade issue I notice that my hosted server had shut down due to high ambient air temperatures. When I call the data center I find out that both AC units failed, but they are working on it. I check my server’s logs and find that the ambient air temperature had reached 47 Celsius.
This morning I get to replace the motherboards and I hope that ends my exciting “see what can happen week”.