June 30, 2017
Downtime at MVD 01 Location.
Today we suffered a network outage due to a power grid failure on a 500 kVA line that brings power from the hydroelectric dams in the central part of the country.
This caused a power failure on Antel equipment, which left the datacenter's main network equipment without connectivity for ~30 minutes.
Regarding the equipment:
UPS: Held through the power outage without issue, dropping only ~8% of its capacity, with a current load of 17% on the redundant grid.
Router: Since Sunday we've been running on the backup for our main router, which has left some services offline. This occurred due to a RAID failure on the main router that caused it to drop out of the cluster.
Resolution: Maintenance is planned from tonight through tomorrow morning to reconfigure the network and avoid future issues.
Current Status: UPS capacity has been restored to 100%, leaving us ready for another outage.
No maintenance or manual intervention was required.
Temperature Warning: During the power outage, the main HVAC unit shut down to save power and stayed off until alarms went off 20 minutes after power loss.
HVAC was restored shortly after the alarm went off, resuming normal operation.
Las Flores remains offline until further notice because of an ongoing main power failure; this is not affecting any services. We will update once the situation normalizes.