Hard disk drive failure
   HOME

TheInfoList



OR:

A hard disk drive failure occurs when a
hard disk drive A hard disk drive (HDD), hard disk, hard drive, or fixed disk is an electro-mechanical data storage device that stores and retrieves digital data using magnetic storage with one or more rigid rapidly rotating platters coated with magne ...
malfunctions and the stored information cannot be accessed with a properly configured computer. A hard disk failure may occur in the course of normal operation, or due to an external factor such as exposure to fire or water or high magnetic fields, or suffering a sharp
impact Impact may refer to: * Impact (mechanics), a high force or shock (mechanics) over a short time period * Impact, Texas, a town in Taylor County, Texas, US Science and technology * Impact crater, a meteor crater caused by an impact event * Impac ...
or environmental contamination, which can lead to a head crash. The stored information on a hard drive may also be rendered inaccessible as a result of
data corruption In the pursuit of knowledge, data (; ) is a collection of discrete values that convey information, describing quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpreted. ...
, disruption or destruction of the hard drive's master boot record, or by malware deliberately destroying the disk's contents.


Causes

There are a number of causes for hard drives to fail including: human error, hardware failure, firmware corruption, media damage, heat, water damage, power issues and mishaps. Drive manufacturers typically specify a
mean time between failures Mean time between failures (MTBF) is the predicted elapsed time between inherent failures of a mechanical or electronic system during normal system operation. MTBF can be calculated as the arithmetic mean (average) time between failures of a system ...
(MTBF) or an
annualized failure rate Annualized failure rate (AFR) gives the estimated probability that a device or component will fail during a full year of use. It is a relation between the mean time between failure (MTBF) and the hours that a number of devices are run per year. AF ...
(AFR) which are population statistics that can't predict the behavior of an individual unit. These are calculated by constantly running samples of the drive for a short period of time, analyzing the resultant wear and tear upon the physical components of the drive, and extrapolating to provide a reasonable estimate of its lifespan. Hard disk drive failures tend to follow the concept of the
bathtub curve The bathtub curve is widely used in reliability engineering and deterioration modeling. It describes a particular form of the hazard function which comprises three parts: *The first part is a decreasing failure rate, known as early failures. *Th ...
. Drives typically fail within a short time if there is a defect present from manufacturing. If a drive proves reliable for a period of a few months after installation, the drive has a significantly greater chance of remaining reliable. Therefore, even if a drive is subjected to several years of heavy daily use, it may not show any notable signs of wear unless closely inspected. On the other hand, a drive can fail at any time in many different situations. The most notorious cause of drive failure is a head crash, where the internal read-and-write head of the device, usually just hovering above the surface, touches a platter, or scratches the magnetic data-storage surface. A head crash usually incurs severe
data loss Data loss is an error condition in information systems in which information is destroyed by failures (like failed spindle motors or head crashes on hard drives) or neglect (like mishandling, careless handling or storage under unsuitable conditions) ...
, and
data recovery In computing, data recovery is a process of retrieving deleted, inaccessible, lost, corrupted, damaged, or formatted data from secondary storage, removable media or files, when the data stored in them cannot be accessed in a usual way. The dat ...
attempts may cause further damage if not done by a specialist with proper equipment. Drive platters are coated with an extremely thin layer of non-
electrostatic Electrostatics is a branch of physics that studies electric charges at rest ( static electricity). Since classical times, it has been known that some materials, such as amber, attract lightweight particles after rubbing. The Greek word for amb ...
lubricant, so that the read-and-write head will likely simply glance off the surface of the platter should a collision occur. However, this head hovers mere
nanometers 330px, Different lengths as in respect to the molecular scale. The nanometre (international spelling as used by the International Bureau of Weights and Measures; SI symbol: nm) or nanometer (American and British English spelling differences#-re ...
from the platter's surface which makes a collision an acknowledged risk. Another cause of failure is a faulty
air filter A particulate air filter is a device composed of fibrous, or porous materials which removes solid particulates such as dust, pollen, mold, and bacteria from the air. Filters containing an adsorbent or catalyst such as charcoal (carbon) may a ...
. The air filters on today's drives equalize the
atmospheric pressure Atmospheric pressure, also known as barometric pressure (after the barometer), is the pressure within the atmosphere of Earth. The standard atmosphere (symbol: atm) is a unit of pressure defined as , which is equivalent to 1013.25 millibars, ...
and moisture between the drive enclosure and its outside environment. If the filter fails to capture a dust particle, the particle can land on the platter, causing a head crash if the head happens to sweep over it. After a head crash, particles from the damaged platter and head media can cause one or more bad sectors. These, in addition to platter damage, will quickly render a drive useless. A drive also includes controller electronics, which occasionally fail. In such cases, it may be possible to recover all data by replacing the controller board. The phenomenon of disk failure is not limited only to drives, but also applies to other types of magnetic media. In the late 1990s,
Iomega Iomega (later LenovoEMC) produced external, portable, and networked data storage products. Established in the 1980s in Roy, Utah, United States, Iomega sold more than 410 million digital storage drives and disks, including the Zip drive floppy ...
's 100-megabyte Zip disks used in
Zip drive The Zip drive is a removable floppy disk storage system that was introduced by Iomega in late 1994. Considered medium-to-high-capacity at the time of its release, Zip disks were originally launched with capacities of 100  MB, then 250 ...
s were affected by the
click of death Click of death is a term that had become common in the late 1990s referring to the clicking sound in disk storage systems that signals a disk drive has failed, often catastrophically. The clicking sound itself arises from the unexpected moveme ...
, called so because the drives endlessly clicked when accessed, indicating the impending failure. 3.5-inch floppy disks can also fall victim to disk failure. If either the drive or the media is dirty, users may experience the
buzz of death A head crash is a hard-disk failure that occurs when a read–write head of a hard disk drive makes contact with its rotating platter, slashing its surface and permanently damaging its magnetic media. It is most often caused by a sudden se ...
when attempting to access the drive.


Signs of drive failure

Failure of a hard disk drive can be catastrophic or gradual. The former typically presents as a drive that can no longer be detected by
CMOS setup Nonvolatile BIOS memory refers to a small memory on PC motherboards that is used to store BIOS settings. It is traditionally called CMOS RAM because it uses a volatile, low-power complementary metal-oxide-semiconductor (CMOS) SRAM (such as ...
, or that fails to pass
BIOS POST In computing, BIOS (, ; Basic Input/Output System, also known as the System BIOS, ROM BIOS, BIOS ROM or PC BIOS) is firmware used to provide runtime services for operating systems and programs and to perform hardware initialization during the ...
so that the operating system never sees it. Gradual hard-drive failure can be harder to diagnose, because its symptoms, such as corrupted data and slowing down of the PC (caused by gradually failing areas of the hard drive requiring repeated read attempts before successful access), can be caused by many other computer issues, such as malware. A rising number of bad sectors can be a sign of a failing hard drive, but because the hard drive automatically adds them to its own growth defect table, they may not become evident to utilities such as ScanDisk unless the utility can catch them before the hard drive's defect management system does, or the backup sectors held in reserve by the
internal hard-drive defect management Internal hard-drive defect management is a system present in hard drives for handling of bad sector A bad sector in computing is a disk sector on a disk storage unit that is permanently damaged. Upon taking damage, all information stored on that sec ...
system run out (by which point the drive is on the point of failing outright). A cyclical repetitive pattern of seek activity such as rapid or slower seek-to-end noises (
click of death Click of death is a term that had become common in the late 1990s referring to the clicking sound in disk storage systems that signals a disk drive has failed, often catastrophically. The clicking sound itself arises from the unexpected moveme ...
) can be indicative of hard drive problems.


Landing zones and load/unload technology

During normal operation, heads in HDDs fly above the data recorded on the disks. Modern HDDs prevent power interruptions or other malfunctions from landing its heads in the data zone by either physically moving (''parking'') the heads to a special ''landing zone'' on the platters that is not used for data storage, or by physically locking the heads in a suspended (''unloaded'') position raised off the platters. Some early PC HDDs did not park the heads automatically when power was prematurely disconnected and the heads would land on data. In some other early units the user would run a program to manually park the heads.


Landing zones

A ''landing zone'' is an area of the platter usually near its inner diameter (ID), where no data is stored. This area is called the Contact Start/Stop (CSS) zone, or the landing zone. Disks are designed such that either a
spring Spring(s) may refer to: Common uses * Spring (season) Spring, also known as springtime, is one of the four temperate seasons, succeeding winter and preceding summer. There are various technical definitions of spring, but local usage of ...
or, more recently, rotational
inertia Inertia is the idea that an object will continue its current motion until some force causes its speed or direction to change. The term is properly understood as shorthand for "the principle of inertia" as described by Newton in his first law ...
in the platters is used to park the heads in the case of unexpected power loss. In this case, the spindle motor temporarily acts as a generator, providing power to the actuator. Spring tension from the head mounting constantly pushes the heads towards the platter. While the disk is spinning, the heads are supported by an air bearing and experience no physical contact or wear. In CSS drives the sliders carrying the head sensors (often also just called ''heads'') are designed to survive a number of landings and takeoffs from the media surface, though wear and tear on these microscopic components eventually takes its toll. Most manufacturers design the sliders to survive 50,000 contact cycles before the chance of damage on startup rises above 50%. However, the decay rate is not linear: when a disk is younger and has had fewer start-stop cycles, it has a better chance of surviving the next startup than an older, higher-mileage disk (as the head literally drags along the disk's surface until the air bearing is established). For example, the Seagate Barracuda 7200.10 series of desktop hard disk drives are rated to 50,000 start–stop cycles; in other words, no failures attributed to the head–platter interface were seen before at least 50,000 start–stop cycles during testing. Around 1995 IBM pioneered a technology where a landing zone on the disk is made by a precision laser process (''Laser Zone Texture'' = LZT) producing an array of smooth nanometer-scale "bumps" in a landing zone, thus vastly improving
stiction Stiction is the static friction that needs to be overcome to enable relative motion of stationary objects in contact. The term is a portmanteau of the words ''static'' and ''friction'', and is perhaps also influenced by the verb '' to stick''. Any ...
and wear performance. This technology is still in use today, predominantly in lower-capacity Seagate desktop drives, but has been phased out in 2.5" drives, as well as higher-capacity desktop, NAS, and enterprise drives in favor of load/unload ramps. In general, CSS technology can be prone to increased stiction (the tendency for the heads to stick to the platter surface), e.g. as a consequence of increased humidity. Excessive stiction can cause physical damage to the platter and slider or spindle motor.


Unloading

''Load/unload'' technology relies on the heads being lifted off the platters into a safe location, thus eliminating the risks of wear and
stiction Stiction is the static friction that needs to be overcome to enable relative motion of stationary objects in contact. The term is a portmanteau of the words ''static'' and ''friction'', and is perhaps also influenced by the verb '' to stick''. Any ...
altogether. The first HDD RAMAC and most early disk drives used complex mechanisms to load and unload the heads. Nearly all modern HDDs use ramp loading, first introduced by
Memorex Memorex Corp. began as a computer tape producer and expanded to become both a consumer media supplier and a major IBM plug compatible peripheral supplier. It was broken up and ceased to exist after 1996 other than as a consumer electronics bran ...
in 1967, to load/unload onto plastic "ramps" near the outer disk edge. Laptop drives adopted this due to the need for increased shock resistance, and then ultimately it was adopted on most desktop drives. Addressing shock robustness, IBM also created a technology for their
ThinkPad ThinkPad is a line of business-oriented laptop computers and tablets designed, developed and marketed by Lenovo, and formerly by IBM until 2005, when IBM's PC business was acquired by Lenovo. ThinkPads have a distinct black, boxy design la ...
line of laptop computers called the Active Protection System. When a sudden, sharp movement is detected by the built-in
accelerometer An accelerometer is a tool that measures proper acceleration. Proper acceleration is the acceleration (the rate of change of velocity) of a body in its own instantaneous rest frame; this is different from coordinate acceleration, which is acc ...
in the ThinkPad, internal hard disk heads automatically unload themselves to reduce the risk of any potential data loss or scratch defects.
Apple An apple is an edible fruit produced by an apple tree (''Malus domestica''). Apple trees are cultivated worldwide and are the most widely grown species in the genus ''Malus''. The tree originated in Central Asia, where its wild ancestor, ' ...
later also utilized this technology in their PowerBook,
iBook iBook is a line of laptop computers designed, manufactured, and sold by Apple Computer from 1999 to 2006. The line targeted entry-level, consumer and education markets, with lower specifications and prices than the PowerBook, Apple's higher-end ...
, MacBook Pro, and MacBook line, known as the Sudden Motion Sensor.
Sony , commonly stylized as SONY, is a Japanese multinational conglomerate corporation headquartered in Minato, Tokyo, Japan. As a major technology company, it operates as one of the world's largest manufacturers of consumer and professiona ...
, HP with their HP 3D DriveGuard, and
Toshiba , commonly known as Toshiba and stylized as TOSHIBA, is a Japanese multinational conglomerate corporation headquartered in Minato, Tokyo, Japan. Its diversified products and services include power, industrial and social infrastructure systems, ...
have released similar technology in their notebook computers.


Modes of failure

Hard drives may fail in a number of ways. Failure may be immediate and total, progressive, or limited. Data may be totally destroyed, or partially or totally recoverable. Earlier drives had a tendency toward developing
bad sector A bad sector in computing is a disk sector on a disk storage unit that is permanently damaged. Upon taking damage, all information stored on that sector is lost. When a bad sector is found and marked, the operating system like Windows or Linux will ...
s with use and wear; these bad sectors could be "mapped out" so they were not used and did not affect operation of a drive, and this was considered normal unless many bad sectors developed in a short period of time. Some early drives even had a table attached to a drive's case on which bad sectors were to be listed as they appeared. Later drives map out bad sectors automatically, in a way invisible to the user; a drive with remapped sectors may continue to be used, though performance may decrease as the drive must physically move the heads to the remapped sector. Statistics and logs available through S.M.A.R.T. (Self-Monitoring, Analysis, and Reporting Technology) provide information about the remapping. In modern HDDs, each drive ships with zero user-visible bad sectors, and any bad/reallocated sectors may predict the impending failure of a drive. Other failures, which may be either progressive or limited, are usually considered to be a reason to replace a drive; the value of data potentially at risk usually far outweighs the cost saved by continuing to use a drive which may be failing. Repeated but recoverable read or write errors, unusual noises, excessive and unusual heating, and other abnormalities, are warning signs. * ''Head crash'': a head may contact the rotating platter due to mechanical shock or other reason. At best this will cause irreversible damage and data loss where contact was made. In the worst case the debris scraped off the damaged area may contaminate all heads and platters, and destroy all data on all platters. If damage is initially only partial, continued rotation of the drive may extend the damage until it is total. * ''Bad sectors'': some magnetic sectors may become faulty without rendering the whole drive unusable. This may be a limited occurrence or a sign of imminent failure. A drive that has any reallocated sectors at all has a significantly increased chance of failing soon. * ''Stiction'': after a time the head may not "take off" when started up as it tends to stick to the platter, a phenomenon known as
stiction Stiction is the static friction that needs to be overcome to enable relative motion of stationary objects in contact. The term is a portmanteau of the words ''static'' and ''friction'', and is perhaps also influenced by the verb '' to stick''. Any ...
. This is usually due to unsuitable lubrication properties of the platter surface, a design or manufacturing defect rather than wear. This occasionally happened with some designs until the early 1990s. * ''Circuit failure'': components of the electronic circuitry may fail making the drive inoperable, often due to electrostatic discharge or user error. * ''Bearing and motor failure'': electric motors may fail or burn out, and bearings may wear enough to prevent proper operation. Since modern drives use fluid dynamic bearings, this is a relatively uncommon reason for modern hard drive failure. * ''Miscellaneous mechanical failures'': parts, particularly moving parts, of any mechanism can break or fail, preventing normal operation, with possible further damage caused by fragments.


Metrics of failures

Most major hard disk and motherboard vendors support S.M.A.R.T, which measures drive characteristics such as operating temperature, spin-up time, data error rates, etc. Certain trends and sudden changes in these parameters are thought to be associated with increased likelihood of drive failure and data loss. However, S.M.A.R.T. parameters alone may not be useful for predicting individual drive failures. While several S.M.A.R.T. parameters affect failure probability, a large fraction of failed drives do not produce predictive S.M.A.R.T. parameters. Unpredictable breakdown may occur at any time in normal use, with potential loss of all data. Recovery of some or even all data from a damaged drive is sometimes, but not always possible, and is normally costly. A 2007 study published by
Google Google LLC () is an American Multinational corporation, multinational technology company focusing on Search Engine, search engine technology, online advertising, cloud computing, software, computer software, quantum computing, e-commerce, ar ...
suggested very little correlation between failure rates and either high temperature or activity level. Indeed, the Google study indicated that "one of our key findings has been the lack of a consistent pattern of higher failure rates for higher temperature drives or for those drives at higher utilization levels.". Hard drives with S.M.A.R.T.-reported average temperatures below had higher failure rates than hard drives with the highest reported average temperature of , failure rates at least twice as high as the optimum S.M.A.R.T.-reported temperature range of to . The correlation between manufacturers, models and the failure rate was relatively strong. Statistics in this matter are kept highly secret by most entities; Google did not relate manufacturers' names with failure rates, though it has been revealed that Google uses Hitachi Deskstar drives in some of its servers. Google's 2007 study found, based on a large field sample of drives, that actual annualized failure rates (
AFRs The American Forces Network (AFN) is a government television and radio broadcast service the U.S. military provides to those stationed or assigned overseas. Headquartered at Fort George G. Meade, Maryland, AFN's broadcast operations, which i ...
) for individual drives ranged from 1.7% for first year drives to over 8.6% for three-year-old drives. A similar 2007 study at CMU on enterprise drives showed that measured MTBF was 3–4 times lower than the manufacturer's specification, with an estimated 3% mean AFR over 1–5 years based on replacement logs for a large sample of drives, and that hard drive failures were highly correlated in time. A 2007 study of latent sector errors (as opposed to the above studies of complete disk failures) showed that 3.45% of 1.5 million disks developed latent sector errors over 32 months (3.15% of nearline disks and 1.46% of enterprise class disks developed at least one latent sector error within twelve months of their ship date), with the annual sector error rate increasing between the first and second years. Enterprise drives showed less sector errors than consumer drives. Background scrubbing was found to be effective in correcting these errors. SCSI, SAS, and FC drives are more expensive than consumer-grade
SATA SATA (Serial AT Attachment) is a computer bus interface that connects host bus adapters to mass storage devices such as hard disk drives, optical drives, and solid-state drives. Serial ATA succeeded the earlier Parallel ATA (PATA) standard t ...
drives, and usually used in
server Server may refer to: Computing *Server (computing), a computer program or a device that provides functionality for other programs or devices, called clients Role * Waiting staff, those who work at a restaurant or a bar attending customers and su ...
s and
disk array A disk array is a disk storage system which contains multiple disk drives. It is differentiated from a disk enclosure, in that an array has cache memory and advanced functionality, like RAID, deduplication, encryption and virtualization. Compo ...
s, where SATA drives were sold to the home computer and desktop and near-line storage market and were perceived to be less reliable. This distinction is now becoming blurred. The
mean time between failures Mean time between failures (MTBF) is the predicted elapsed time between inherent failures of a mechanical or electronic system during normal system operation. MTBF can be calculated as the arithmetic mean (average) time between failures of a system ...
(MTBF) of SATA drives is usually specified to be about 1 million hours (some drives such as
Western Digital Raptor The Western Digital Raptor (often marketed as WD Raptor, 2.5" models known as VelociRaptor) is a discontinued series of high performance hard disk drives produced by Western Digital first marketed in 2003. The drive occupies a niche in the enthus ...
have rated 1.4 million hours MTBF), while SAS/FC drives are rated for upwards of 1.6 million hours. Modern helium-filled drives are completely sealed without a breather port, thus eliminating the risk of debris ingression, resulting in a typical MTBF of 2.5 million hours. However, independent research indicates that MTBF is not a reliable estimate of a drive's longevity (
service life A product's service life is its period of use in service. Several related terms describe more precisely a product's life, from the point of manufacture, storage, and distribution, and eventual use. Service life has been defined as "a product li ...
). MTBF is conducted in laboratory environments in test chambers and is an important metric to determine the quality of a disk drive, but is designed to only measure the relatively constant failure rate over the service life of the drive (the middle of the "
bathtub curve The bathtub curve is widely used in reliability engineering and deterioration modeling. It describes a particular form of the hazard function which comprises three parts: *The first part is a decreasing failure rate, known as early failures. *Th ...
") before final wear-out phase. "...disk manufacturers argue that the calculation f MTBFcorresponds to a user who buys a disk and keeps replacing the disk every five years- the planned lifetime of the disk." A more interpretable, but equivalent, metric to MTBF is
annualized failure rate Annualized failure rate (AFR) gives the estimated probability that a device or component will fail during a full year of use. It is a relation between the mean time between failure (MTBF) and the hours that a number of devices are run per year. AF ...
(AFR). AFR is the percentage of drive failures expected per year. Both AFR and MTBF tend to measure reliability only in the initial part of the life of a hard disk drive thereby understating the real probability of failure of a used drive. The cloud storage company Backblaze produces an annual report into hard drive reliability. However, the company states that it mainly uses commodity consumer drives, which are deployed in enterprise conditions, rather than in their representative conditions and for their intended use. Consumer drives are also not tested to work with enterprise
RAID Raid, RAID or Raids may refer to: Attack * Raid (military), a sudden attack behind the enemy's lines without the intention of holding ground * Corporate raid, a type of hostile takeover in business * Panty raid, a prankish raid by male college ...
cards of the kind used in a datacenter, and may not respond in the time a RAID controller expects; such cards will be identified as having failed when they have not. The result of tests of this kind may be relevant or irrelevant to different users, since they accurately represent the performance of consumer drives in the enterprise or under extreme stress, but may not accurately represent their performance in normal or intended use.


Example drive families with high failure rates

#IBM 3380 DASD, 1984 ca. #Computer Memories Inc. 20MB HDD for PC/AT, 1985 ca. #Fujitsu MPG3 and MPF3 series, 2002 ca. #IBM Deskstar 75GXP, 2001 ca. #Seagate ST3000DM001, 2012 ca.


Mitigation

In order to avoid the loss of data due to disk failure, common solutions include: *
Data backup In information technology, a backup, or data backup is a copy of computer data taken and stored elsewhere so that it may be used to restore the original after a data loss event. The verb form, referring to the process of doing so, is "back up", ...
, to allow restoration of data after a failure *
Data scrubbing Data scrubbing is an error correction technique that uses a background task to periodically inspect main memory or storage for errors, then corrects detected errors using redundant data in the form of different checksums or copies of data. Data ...
, to detect and repair latent corruption * Data redundancy, to allow systems to tolerate failures of individual drives *
Active hard-drive protection In computer hardware, active hard-drive protection refers to technology that attempts to avoid or reduce mechanical damage to hard disk drives by preparing the disk prior to impact. This approach is mainly used in laptop computers that are frequentl ...
, to protect laptop drives from external mechanical forces * S.M.A.R.T. (Self-Monitoring, Analysis, and Reporting Technology) included in hard-drives, to provide early warning of predictable failure modes *
Base isolation Seismic base isolation, also known as base isolation, or base isolation system, is one of the most popular means of protecting a structure against earthquake forces. It is a collection of structural elements which should substantially decoupl ...
used under server racks in data centers


Data recovery

Data from a failed drive can sometimes be partially or totally recovered if the platters' magnetic coating is not totally destroyed. Specialized companies carry out data recovery, at significant cost. It may be possible to recover data by opening the drives in a
clean room A cleanroom or clean room is an engineered space, which maintains a very low concentration of airborne particulates. It is well isolated, well-controlled from contamination, and actively cleansed. Such rooms are commonly needed for scientif ...
and using appropriate equipment to replace or revitalize failed components. If the electronics have failed, it is sometimes possible to replace the electronics board, though often drives of nominally exactly the same model manufactured at different times have different circuit boards that are incompatible. Moreover, electronics boards of modern drives usually contain drive-specific ''adaptation data'' required for accessing their system areas, so the related componentry needs to be either reprogrammed (if possible) or unsoldered and transferred between two electronics boards. Sometimes operation can be restored for long enough to recover data, perhaps requiring reconstruction techniques such as
file carving File carving is the process of reassembling computer files from fragments in the absence of filesystem metadata. Introduction and basic principles All filesystems contain some metadata that describes the actual file system. At a minimum, this in ...
. Risky techniques may be justifiable if the drive is otherwise dead. If a drive is started up once it may continue to run for a shorter or longer time but never start again, so as much data as possible is recovered as soon as the drive starts.


References


See also

* Solid-state drive#SSD reliability and failure modes *
Cascading failure A cascading failure is a failure in a system of interconnected parts in which the failure of one or few parts leads to the failure of other parts, growing progressively as a result of positive feedback. This can occur when a single part fails, in ...
**
Single point of failure A single point of failure (SPOF) is a part of a system that, if it fails, will stop the entire system from working. SPOFs are undesirable in any system with a goal of high availability or reliability, be it a business practice, software appl ...


External links

{{Wikibooks, Minimizing Hard Disk Drive Failure and Data Loss * Backblaze: Hard Drive Annual Failure Rates
2019Q2-2020

Failure Trends in a Large Disk Drive Population
– Google, Inc. February 2007
A Clean-Slate Look at Disk Scrubbing

Noises made by defective and failing hard disk drives

Hard disk drive anatomy: Logical and physical failures
Hard disk drives Technological failures