Brief Outage?


2 replies to this topic

#1 fshagan

fshagan

    Member

  • Members
  • 145 posts

Posted 20 November 2011 - 03:38 PM

I had a ~13 minute outage today on the Atlantis server, and at the time I couldn't log into MDDHosting either. Was this a network-wide outage? Or did you see it on your end at all? (I'm thinking someone with a backhoe could have dug up a cable or something.)

#2 MikeDVB

MikeDVB

    Forum Administrator

  • Staff Administrator
  • 2,128 posts
  • Gender:Male
  • Location:Central Indiana, USA

Posted 20 November 2011 - 04:08 PM

After checking with the networking team, it seems that a distribution switch in our cabinet decided to go kaput. Thankfully we have a secondary switch that takes over in this event, but the changeover can take 2 to 4 minutes, and there can be intermittent connectivity for 5 to 15 minutes afterwards. The total downtime registered by our monitoring is right at about 2 minutes, but it's possible individual routes to individual IPs took longer to update.

The switches are identical, so we will be swapping the failed switch with a replacement and then setting it up as the backup (so that there is no downtime switching back to a new switch).
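
As a rough sanity check on those figures, here is a minimal sketch that adds the two phases together. It assumes the changeover and the intermittent period simply run back to back, which is a simplification, and the names are made up for illustration.

```python
# Figures quoted in the post above, in minutes, as (min, max) ranges.
CHANGEOVER = (2, 4)       # time for the secondary switch to take over
INTERMITTENT = (5, 15)    # possible intermittent connectivity afterwards


def disruption_window(changeover, intermittent):
    """Add the (min, max) ranges of two back-to-back phases."""
    return (changeover[0] + intermittent[0], changeover[1] + intermittent[1])


window = disruption_window(CHANGEOVER, INTERMITTENT)
print(window)  # (7, 19)

# The ~13 minute outage reported in the first post falls inside this range.
observed = 13
within = window[0] <= observed <= window[1]
print(within)  # True
```

Under that assumption, a customer could see anywhere from about 7 to 19 minutes of disruption even though the monitoring registered only about 2 minutes of hard downtime.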

Michael Denney - MDDHosting, LLC - Professional Hosting Solutions
LiteSpeed Powered - Shared, Reseller, Semi-Dedicated, and VPS
Incremental R1Soft CDP Backups on all shared, semi-dedicated, and VPS services!
http://www.mddhosting.com/ - Follow us on Twitter!


#3 fshagan

fshagan

    Member

  • Members
  • 145 posts

Posted 21 November 2011 - 12:01 PM

Thanks, Mike. The polling interval on my monitors is 15 minutes, so an outage can "look" longer than it really was. And I've found that Firefox, at least, seems to cache the "server not found" error (CTRL-F5 will clear this).
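
To illustrate the polling effect, here is a small sketch. It assumes a monitor that polls at fixed times (0, 15, 30, ... minutes) and counts each failed poll as one full interval of downtime; real monitoring tools may report differently, and the function name is hypothetical.

```python
def reported_outage(outage_start, outage_length, poll_interval):
    """Minutes of downtime a fixed-interval poller would report.

    Polls happen at 0, poll_interval, 2 * poll_interval, ... (whole minutes).
    Assumption: every failed poll is counted as one full interval of downtime.
    """
    outage_end = outage_start + outage_length
    # Index of the first poll at or after the outage starts (ceiling division).
    first_failed = -(-outage_start // poll_interval)
    # Index of the first poll at or after the outage ends (that poll succeeds).
    first_ok = -(-outage_end // poll_interval)
    failed_polls = max(0, first_ok - first_failed)
    return failed_polls * poll_interval


# A 2-minute outage that happens to straddle a poll looks like 15 minutes.
print(reported_outage(outage_start=14, outage_length=2, poll_interval=15))  # 15

# The same 2-minute outage between polls is not seen at all.
print(reported_outage(outage_start=1, outage_length=2, poll_interval=15))   # 0
```

So with a 15 minute interval, the monitor can only say "about 15 minutes" or "nothing happened" for any outage shorter than the interval.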



