This has to be the biggest outage effect from a small program update of all time. When I was making changes at Melb Water, the first thing you did was test it on a totally seperate system, then when you are ready to run it, you never ever ever did it on a Friday. The way forward here is to start in safe mode, apply the fix and restart normally. But down in the depths of server farms, the servers have no screens, they are just stacked in racks. Find the server, attach a screen, do the fix then move onto the next one. This could take the whole weekend. Manual warfare is a thing of the past.
![]() |
No comments:
Post a Comment