A fault tolerance is a setup or configuration that prevent a computer or network device from failing in the event of an unexpected problem or error. To make a computer or network fault tolerant requires that the user or company to think how a computer or network device may fail and take steps that help prevent that type of failure. Below are some examples of steps that can be taken.
How to design for fault tolerance
- Power Failure - Have the computer or network device running on a UPS. In the event of a power outage, make sure the UPS can notify an administrator and properly turn off the computer after a few minutes if the power is not restored.
- Power Surge - If no UPS is connected to the computer or the UPS does not provide surge protection, a surge protector connected to the computer or network device would help prevent the device from failing in the event of a power surge.
- Data loss - Run backups daily or at least monthly on the computer if important information is stored on it. Create a mirror of the data on an alternate location.
- Device or Computer failure - Have a second device, computer, or computer components available in the event of failure to prevent a long down time.
- Unauthorized access - If connected to a network, setup a firewall.
- Frequently check for updates - Make sure the operating system and any running programs have the latest updates.
- Lock device or password protect computer - When not in use lock the computer and store the computer or network device in a secure area.
- Overload - Setup an alternate computer or network device that can be used as an alternative access point or can share the load either through a load balancing or round robin setup.
- Virus - Make sure the computer has updated virus definitions.