![]() |
Is the power out on both the A side and the B side, or is there a network issue because a device somewhere upstream of you only sources power from one of the independent feeds?
|
Quote:
Quote:
Quote:
Quote:
|
We csn't tell if our machines are powered or not as, like colo4.com itself until recently, there is no network route past the colo4 edge.
|
|
Quote:
Seems that 1 hours past and nothing was done. |
We get it, everybody is angry. I, like the majority of these other people following this thread are following it to get updates from Colo4, not to read about your frustration. It's business, you can only hope for the best and plan for the worst. We are currently working on lighting up another facility in Toronto to provide data center redundancy for our cloud clients. I wish we already had this in place, but it happens, welcome to the tech world.
We're all in the same boat here, but let's please keep the posts to a minimum so we can concentrate on updates from Colo4. I hope everything is back up as soon as possible for all of us. |
Quote:
Quote:
|
Quote:
|
Quote:
it has a couple of slightly complicated things to do - signal the auto-start panel for the generator, wait for the generator power output to stabilise and then make sure that one power input is completely disengaged before the other is engaged so that you don't short out hundreds of kilowatts or even several megawatts from two different power sources! |
Edit: Following Dedicatedone's lead, snarkiness removed...
|
The colo4 network devices at the edge / border and core of our range are offline / unreachable. Packets aren't getting anywhere near our racks.
|
Quote:
|
Power issues at this scale are both complex and lethal.
Everyone needs to take a deep breath and remember that colo4 are the experts, have everyone they need onsite and will fix this. |
Quote:
|
Quote:
|
ATS cant really be bypassed without endangering the systems.
|
Any updates on the ETA?
|
4 Hours of downtime please we need ETA !
|
estiesio -270
So they have :
Quote:
|
I bet more 4 or 5 hours of downtime, easy.
I have 23 servers offline. |
Why are they just putting us up for a short time and then going to take it back offline again? This makes no sense. Make the final fix now and don't create a second outage.
|
They don't have the equipment right now. Put something up now and get everybody online then work on a permanent solution when you have a proper plan in place, not while you're in an emergency situation.
|
Quote:
Also... to clarify some confusion... utilityA > ATS-A ---- your equipment generator utilityB > ATS-B ---- your equipment generator This is what colo4's been referencing... the building that is affected, has two "service entrances"... two points at which they deliver power into the affected building (their other building, with 4 utility entrance points, are not affected). If you pay for A/B service, you have 2 circuits to your rack that are serviced separately by these A/B service entrances. If you're not paying for A/B service, you may still have 2 circuits at your rack... but they will trace back to the same service entrance.... so with the service entrance's ATS down, you're completely down. My problem is that even with A/B, it doesn't matter if the HSRP router (managed by colo4) isn't online either... my equipment could be powered up on the B power circuit, but you can't reach it, because colo4's HSRP router is down... presumably because IT isn't on an A/B service. Still no answer from them on this... but I'm apparenlty not the only colo4 customer who sees this same scenario, based on similar updates here. Also - colo4 justupdated their site: ---------------------------------------- Current Update Thank you for your patience as we work to address the ATS issue with our #2 service entrance. We apologize for the situation and are working as quickly as possible to restore service. We have determined that the repairs for the ATS will take more time than anticipated, so we are putting into service a backup ATS that we have on-site as part of our emergency recovery plan. We are working with our power team to safely bring the replacement ATS into operation. We will update you as soon as we have an estimated time that the replacement ATS will be online. Later, once we have repaired the main ATS, we will schedule an update window to transition from the temporary power solution. We will provide advance notice and timelines to minimize any disruption to your business. Again, we apologize for the loss of connectivity and impact to your business. We are working diligently to get things back online for our customers. Please expect another update within the hour. |
Quote:
|
9 servers offline for me right now- got a flood of support tickets!
Hopefully this will be resolved soon! |
Quote:
They only have one power line? |
Quote:
The building that's affected, the ATS for ONE of those power connections (service entrace) has failed. So everyone whose racks have power from that connection is down UNLESS they paid for an apparently optional/upgrade service to have 2 circuits at their rack, serviced SEPARATELY by the two separate power connections (service entrance). service entrance = where the power company comes into the building. |
Quote:
There are multiple power lines from the utility company to the facility itself. That isn't the issue here. |
Quote:
|
Paul, could we please get another update? I'd just like something to send my customers to let them know progress is still being made. A simple status update would be just fine.
|
Quote:
Customers that paid for full A+B redundancy still have power from the B side. Customers that opted to only take a single power feed from the A side will have to wait until the problem with the A side is repaired. |
Well I guess that explains how my host screwed up. They made a bad decision that's going to give me some serious food for thought.
|
That's false info.
Lots of hosts with A/B are down in the DC effected. Colo4's own core network is also down (which one would imagine uses A+B power) |
Beginning of the end?
Servers are down for more than 6 hours; Our customers are getting agitated; Is this the beginning of the end for all of US and COLO4?
Steve |
Just seen on a twitter post someone who just got off phone to colo4. Provisional eta of 1800 CST (around 2.5 hours) ... Ouch.
|
Quote:
|
Quote:
|
Quote:
|
Doesn't matter, if they offered a solution and my host didn't take that solution then my host had no place advertising themselves to me as having any sort of proof against this kind of thing. Someone's to blame. I wish I really knew who so I could form a plan of action.
|
Quote:
I think most people would rather have Colo4 lied to them they need 8 hours to get it repair rather them knowing nothing. Although from a client / customer perspective. 8 hours of fixing and not telling them anything makes no different... Those who leave will leave....... |
Around noon is when we went down. It was shaping up to be a good, brisk day on my site too. It's become the year's worst disaster.
|
Most people will stay. Let's be realistic here, it's not like Colo4 has power outages every month. Yes, there have been a few DDoS attacks in the last couple of months that affected network stability but hardly anything to worry about. People panic and freak out when they start losing money, understandable, and when it comes back online most people will move on and put this behind them.
|
Quote:
|
I find it interesting that people are knocking the data-center for their redundancy. Complaints about clients leaving and businesses failing cause of not being able to receive e-mails and such. Where are your redundancy plans?
Contingencies need to be planned for businesses as well, not just data-centers. Need your e-mail just in case your data-center gets DDoS'd or Power goes out. Setup a backup domain. Just as one can't plan for the weather, make it into work when it snows for that day (thus not getting paid), plan for disasters. -.- |
Downage
We have A/B power on multiple servers and they are all still down. We are in the older facility. Even if we didn't have this addon, Colo4's statement regarding this was a poor excuse.
We will be leaving Colo4 after this fiasco. There is no way an outage should last this long from a power failure, that's why you have backup plans. Acts of God are understandable but not this. There are some major clients who are affected by this right now. An example is Radiant Systems. Tens of thousands of restaurants have major parts of their systems down. Other major clients are experiencing the same thing. |
Quote:
|
Mine is back up!
|
Quote:
An event like this is a rare occurrence at Colo4 and not something I'd pack up and leave over. |
Quote:
|
We have been a customer of colo4 for over 4yrs now. We are HAPPY.. This was a mechanical electric switch ( ATS ) failure. If customers have A-B Service feeds electrically to there cabinets AND wired correctly to the servers (dual power supplies) they are working. I DO. I paid for it.
Folks dont think of WHAT IF. asimb |
Quote:
|
Quote:
|
Wikipedia on High Availability...
Please note the following... "Availability is usually expressed as a percentage of uptime in a given year.: |
That's right, I was going on what someone said before that if my or any client had paid for A/B they would not be suffering this loss right now. Come to find out that isn't true.
|
Paul-- Do you think we will be back online within teh next 2 hours? Can you ask the engineers?
|
Any word yet? We're waiting. And I'm moving past where I can keep waiting. I have to get my site up, however I do that.
|
On geographical redundancy
On a web server serving large amounts of static data, what is the best way to achieve geographical redundancy? Is Round robin DNS the way to go? What other alternatives exist? A load balancer would still be a single point of failure.
|
Use a CMS like Wordpress that can host cached content on a cloud service like AWS. Very inexpensive and works automatically.
|
Quote:
|
I am not going to leave a provider over this - these things happen. We have been dealing with colo4 for years and have had good results for a long time. Stuff happens..
Having said this, I am a little upset at Paul over his comments here. This power outage is clearly something colo4 needs to deal with. Insinuating that customers are at fault for not having A+B power feeds is not reasonable. Firstly, Colo4's rates for A+B are not really inline with other facilities we work with. Secondly, we have A+B on some of our infrastructure with colo4 - and some of it is up - yes - but, some of it is also down. What this means is that some PDUs are serviced from the same power plant - so, even though we have A+B protection on parts of our infrastructure, the feeds are coming from different PDUs(but the same powerplant) - and we are still down. I just do not think it is appropriate to suggest that this may be colo4's customers fault because they dont have A+B. This is something that should have been discussed with your customers - not publicly. I am now answering questions about where we have A+B and where we dont and why - and frankly - the issue is colo4 lost a power plant.. lets try and remember that.. |
| All times are GMT -4. The time now is 02:41 PM. |
Powered by vBulletin
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.
© WebHostingTalk, 1998 - 2011. All Rights Reserved.