HOME

TheInfoList



OR:

A backup rotation scheme is a system of
backing up Reversing (also known as backing up) is the process of driving a vehicle in the reverse direction in order to maneuver. Rear view mirrors are somewhat standard equipment for this endeavor. Extremely large or luxury vehicles may have in addition ...
data to computer media (such as tapes) that minimizes, by re-use, the number of media used. The scheme determines how and when each piece of removable storage is used for a backup job and how long it is retained once it has backup data stored on it. Different techniques have evolved over time to balance
data retention Data retention defines the policies of persistent data and records management for meeting legal and business data archival requirements. Although sometimes interchangeable, it is not to be confused with the Data Protection Act 1998. The different ...
and restoration needs with the cost of extra data storage media. Such a scheme can be quite complicated if it takes incremental backups, multiple retention periods, and off-site storage into consideration.


Schemes


First in, first out

A first in, first out (FIFO) backup scheme saves new or modified files onto the "oldest" media in the set, i.e. the media that contain the oldest and thus least useful previously backed up data. Performing a daily backup onto a set of 14 media, the backup depth would be 14 days. Each day, the oldest media would be inserted when performing the backup. This is the simplest rotation scheme and is usually the first to come to mind. This scheme has the advantage that it retains the longest possible tail of daily backups. It can be used when archived data is unimportant (or is retained separately from the short-term backup data) and data before the rotation period is irrelevant. However, this scheme suffers from the possibility of data loss: suppose, an error is introduced into the data, but the problem is not identified until several generations of backups and revisions have taken place. Thus when the error is detected, all the backup files contain the error. It would then be useful to have at least one older version of the data, as it would not have the error.


Grandfather-father-son

Grandfather-father-son backup (GFS) is a common rotation scheme for backup media, in which there are three or more backup cycles, such as daily, weekly and monthly. The daily backups are rotated on a daily basis using a FIFO system as above. The weekly backups are similarly rotated on a weekly basis, and the monthly backup on a monthly basis. In addition, quarterly, half-yearly, and/or annual backups could also be separately retained. Often some of these backups are removed from the site for safekeeping and disaster recovery purposes.


Tower of Hanoi

The Tower of Hanoi rotation method is more complex. It is based on the mathematics of the
Tower of Hanoi The Tower of Hanoi (also called The problem of Benares Temple or Tower of Brahma or Lucas' Tower and sometimes pluralized as Towers, or simply pyramid puzzle) is a mathematical game or puzzle consisting of three rods and a number of disks of va ...
puzzle, using a recursive method to optimize the back-up cycle. Every tape corresponds to a disk in the puzzle, and every disk movement to a different peg corresponds with a backup to that tape. So the first tape is used every other day (1, 3, 5, 7, 9, ...), the second tape is used every fourth day (2, 6, 10, ...), the third tape is used every eighth day (4, 12, 20, ...). A set of ''n'' tapes (or other media) will allow backups for 2''n''−1 days before the last set is recycled. So, 3 tapes will give 4 days' worth of backups, and on the 4th day ''Set C'' will be overwritten; 4 tapes will give 8 days, and ''Set D'' is overwritten on the 9th day; 5 tapes will give 16 days, etc. Files can be restored from 1, 2, 4, 8, 16, ..., 2''n''−1 days ago. The following tables show which tapes are used on which days of various cycles. A disadvantage of the method is that half the backups are overwritten after only two days.


Three-tape Hanoi schedule


Four-tape Hanoi schedule


Five-tape Hanoi schedule


Extensions and example

Many variations are possible, and the concepts are readily extended to disc-based directories containing backups. Here are some options: * Save a base backup as set zero. * Save as many of the most recent backups as desired. * Save more than one of each set number, for greater coverage. Coverage automatically gets sparser the further back in time one goes, which approximates the likelihood of needing to do restores from past backups. And Tower of Hanoi has the huge advantage of freeing implementers from having to deal with managing hourly, daily, weekly, monthly, quarterly or annual management strategies. In general, backup set number ''set'' is used at ''seq'' = 2''set''−1 + j × 2''set'', ''j'' = 0, 1, 2, 3, 4, ..., where ''seq'' is the sequence or serial number of a backup (also the Tower of Hanoi move number). Here is an example showing coverage, including set 0, keeping at least the last 4 days, and recycling: * precious.20140515.seq.0 set 0 * precious.20150205.seq.256 set 9 * precious.20151026.seq.512 set 10 * precious.20160311.seq.640 set 8 * precious.20160516.seq.704 set 7 * precious.20160601.seq.720 set 5 * precious.20160609.seq.728 set 4 * precious.20160617.seq.736 set 6 * precious.20160618.seq.737.recycle set 1 * precious.20160619.seq.738 set 2 * precious.20160620.seq.739 set 1 * precious.20160621.seq.740 set 3 * precious.20160622.seq.741 set 1


Weighted random distribution

An alternative arrangement is to keep generations distributed across all points in time is by deleting (or overwriting) past generations (except the oldest and the most-recent-''n'' generations) when necessary in a weighted-random fashion. For each deletion, the weight assigned to each deletable generation corresponds to the probability of it being deleted. One acceptable
weight In science and engineering, the weight of an object is the force acting on the object due to gravity. Some standard textbooks define weight as a vector quantity, the gravitational force acting on the object. Others define weight as a scalar quan ...
is a constant
exponent Exponentiation is a mathematical operation, written as , involving two numbers, the '' base'' and the ''exponent'' or ''power'' , and pronounced as " (raised) to the (power of) ". When is a positive integer, exponentiation corresponds to re ...
(possibly the
square In Euclidean geometry, a square is a regular quadrilateral, which means that it has four equal sides and four equal angles (90- degree angles, π/2 radian angles, or right angles). It can also be defined as a rectangle with two equal-lengt ...
) of the
multiplicative inverse In mathematics, a multiplicative inverse or reciprocal for a number ''x'', denoted by 1/''x'' or ''x''−1, is a number which when multiplied by ''x'' yields the multiplicative identity, 1. The multiplicative inverse of a fraction ''a''/'' ...
of the duration (possibly expressed in the number of days) between the dates of the generation and the generation preceding it. Using a larger exponent leads to a more uniform distribution of generations, whereas a smaller exponent leads to a distribution with more recent and fewer older generations. This technique probabilistically ensures that past generations are always distributed across all points in time as is desired. The weighted random method only has an advantage over a more systematic approach, when backups are irregular or missed.


Incremented media method

This method has many variations and names. A set of numbered media is used until the end of the cycle. Then the cycle is repeated using media numbered the same as the previous cycle, but incremented by one. The lowest numbered tape from the previous cycle is retired and kept permanently. Thus one has access to every backup for one cycle and to one backup per cycle before that. This method has the advantage of ensuring even media wear, but requires a schedule to be precalculated.


See also

*
Off-site data protection In computing, off-site data protection, or vaulting, is the strategy of sending critical data out of the main location (''off'' the main ''site'') as part of a disaster recovery plan. Data is usually transported off-site using removable storage m ...
*
Legal hold A legal hold is a process that an organization uses to preserve all forms of potentially relevant information when litigation is pending or reasonably anticipated. The legal hold is initiated by a notice or communication from legal counsel to an ...


References

{{DEFAULTSORT:Backup Rotation Scheme Tape-based computer storage Backup