What should I do if the server is down? Server failure emergency plan

  
Yesterday, Ctrip.com's servers were unreachable. I'm sure everyone saw the spectacular sight of the Ctrip technology building lit up all night. So what should you do when a server goes down? In this article, we walk you through a server failure emergency plan.


First, distinguish the factors that can lead to a server failure:
1. External attacks
2. Insider attacks
3. Operations and maintenance errors
Whether it is an external attack or an internal failure, backup and redundancy measures can minimize downtime.
It may sound hard to believe, but in practice many companies have never established a tested backup system. The point of a backup is that it lets you quickly recover or rebuild your production system in a crisis. In enterprise networks, the problems that commonly occur are:
The backup process does not complete, or does not complete correctly.
Subsequent backups fail because the limited storage space is exhausted after a period of time.
The backup media is damaged, so the data cannot be restored.
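The failure modes listed above (incomplete backups, exhausted storage, damaged copies) can be caught early with a routine check. The following is a minimal Python sketch under stated assumptions, not a description of any particular backup product: it checks free space before copying and compares checksums afterward. The function names and the 1.2 safety factor are illustrative choices, not taken from the article.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path: Path) -> str:
    """Return the SHA-256 hex digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1024 * 1024), b""):
            digest.update(chunk)
    return digest.hexdigest()


def has_room_for(source: Path, backup_dir: Path, safety_factor: float = 1.2) -> bool:
    """Check that the backup volume can hold the source file with some margin."""
    free_bytes = shutil.disk_usage(backup_dir).free
    return free_bytes >= source.stat().st_size * safety_factor


def backup_and_verify(source: Path, backup_dir: Path) -> Path:
    """Copy source into backup_dir and confirm the copy matches the original."""
    if not has_room_for(source, backup_dir):
        raise OSError(f"not enough free space in {backup_dir}")
    target = backup_dir / source.name
    shutil.copy2(source, target)
    # A mismatch means the copy is incomplete or the media corrupted it.
    if sha256_of(source) != sha256_of(target):
        raise OSError(f"checksum mismatch for {target}")
    return target
```

A check like this only proves that the copy is intact at the moment it is written. A periodic test restore is still needed to show that the backup can actually rebuild the production system.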
Traditionally, tape has been an ideal backup medium because of its low cost and high storage density. However, several serious flaws of this traditional medium often make the data it holds inaccessible:
Lost tape index card
Tape media is susceptible to external magnetic fields during storage
Damage to the media itself
Damage to the read device during media reading
In addition, tape backup media are stored in a tape library, and the time needed to retrieve the required tape from the warehouse, transport it to the data center, and reload the data is usually considerable.
Even a backup system cannot protect you against every accident. In 2014, a fire at a Samsung data center interrupted its cloud services. Without an offsite backup, a fire like this makes recovery from local backups extremely difficult.
Redundancy
In an emergency, it is important to recover as quickly as possible, or to keep providing service throughout. This month, a well-known payment company suffered a period of service disruption because of a network connectivity failure at its data center. With a better redundancy scheme, the impact of such an incident would have been smaller, and it might even have been reduced to an internal incident that users never noticed.
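One way to keep a failure invisible to users is to route requests to a standby when the primary stops responding. The sketch below is a simplified illustration in Python, assuming a list of equivalent endpoints in priority order; `pick_healthy` and `tcp_check` are hypothetical helper names, and a real deployment would more likely rely on a load balancer or DNS failover.

```python
import socket
from typing import Callable, Optional, Sequence


def tcp_check(endpoint: str, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to "host:port" can be opened."""
    host, port = endpoint.rsplit(":", 1)
    with socket.create_connection((host, int(port)), timeout=timeout):
        return True


def pick_healthy(
    endpoints: Sequence[str],
    is_healthy: Callable[[str], bool] = tcp_check,
) -> Optional[str]:
    """Return the first endpoint that passes the health check, or None."""
    for endpoint in endpoints:
        try:
            if is_healthy(endpoint):
                return endpoint
        except OSError:
            # Connection errors count as "unhealthy"; try the next endpoint.
            continue
    return None
```

The health check is passed in as a parameter so the selection logic can be exercised without touching the network, for example with a dictionary lookup standing in for the real probe.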



Most servers have two independent power supply units (PSUs), and the failure of either one does not affect normal service. Generally, the two PSUs are connected to two different circuits or to an uninterruptible power supply (UPS) to guard against utility power failures. Most data centers are also equipped with UPSs and diesel generators to avoid service interruptions caused by unannounced outages on the power company's side. The same applies to the network: connecting to multiple ISP lines at the same time, with independent routing and with addresses announced on multiple lines, makes network services more robust.
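The benefit of duplicated power supplies or ISP lines can be estimated with simple arithmetic. The Python sketch below assumes that the redundant units fail independently and that any single working unit is enough to keep the service up. The 99% figure in the usage note is an illustrative number, not one taken from the article.

```python
HOURS_PER_YEAR = 24 * 365


def parallel_availability(single: float, copies: int) -> float:
    """Availability of `copies` redundant units when any one suffices.

    Assumes independent failures; a shared cause (same circuit,
    same room) breaks this assumption and lowers the real figure.
    """
    if not 0.0 <= single <= 1.0:
        raise ValueError("single must be between 0 and 1")
    if copies < 1:
        raise ValueError("copies must be at least 1")
    return 1.0 - (1.0 - single) ** copies


def downtime_hours_per_year(availability: float) -> float:
    """Expected hours of downtime per year for a given availability."""
    return (1.0 - availability) * HOURS_PER_YEAR
```

Under these assumptions, one unit with 99% availability is expected to be down about 87.6 hours a year, while two such units in parallel reach roughly 99.99%, or a little under one hour a year. This is also why the two PSUs are wired to different circuits: units that share a single point of failure do not fail independently.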
From a system perspective, availability improves only when backup and redundancy schemes are both in place. Together they prevent the long service interruptions that uncontrollable factors would otherwise cause.

Copyright © Windows knowledge All Rights Reserved