RAID explained

RAID (; "redundant array of inexpensive disks" or "redundant array of independent disks") is a data storage virtualization technology that combines multiple physical data storage components into one or more logical units for the purposes of data redundancy, performance improvement, or both. This is in contrast to the previous concept of highly reliable mainframe disk drives referred to as "single large expensive disk" (SLED).[1]

Data is distributed across the drives in one of several ways, referred to as RAID levels, depending on the required level of redundancy and performance. The different schemes, or data distribution layouts, are named by the word "RAID" followed by a number, for example RAID 0 or RAID 1. Each scheme, or RAID level, provides a different balance among the key goals: reliability, availability, performance, and capacity. RAID levels greater than RAID 0 provide protection against unrecoverable sector read errors, as well as against failures of whole physical drives.

History

The term "RAID" was invented by David Patterson, Garth Gibson, and Randy Katz at the University of California, Berkeley in 1987. In their June 1988 paper "A Case for Redundant Arrays of Inexpensive Disks (RAID)", presented at the SIGMOD Conference, they argued that the top-performing mainframe disk drives of the time could be beaten on performance by an array of the inexpensive drives that had been developed for the growing personal computer market. Although failures would rise in proportion to the number of drives, by configuring for redundancy, the reliability of an array could far exceed that of any large single drive.[2]

Although not yet using that terminology, the technologies of the five levels of RAID named in the June 1988 paper were used in various products prior to the paper's publication,[3] including the following:

Industry manufacturers later redefined the RAID acronym to stand for "redundant array of independent disks".[8] [9] [10] [11]

Overview

Many RAID levels employ an error protection scheme called "parity", a widely used method in information technology to provide fault tolerance in a given set of data. Most use simple XOR, but RAID 6 uses two separate parities based respectively on addition and multiplication in a particular Galois field or Reed–Solomon error correction.[12]

RAID can also provide data security with solid-state drives (SSDs) without the expense of an all-SSD system. For example, a fast SSD can be mirrored with a mechanical drive. For this configuration to provide a significant speed advantage, an appropriate controller is needed that uses the fast SSD for all read operations. Adaptec calls this "hybrid RAID".[13]

Standard levels

See main article: Standard RAID levels. Originally, there were five standard levels of RAID, but many variations have evolved, including several nested levels and many non-standard levels (mostly proprietary). RAID levels and their associated data formats are standardized by the Storage Networking Industry Association (SNIA) in the Common RAID Disk Drive Format (DDF) standard:[14] [15]

Nested (hybrid) RAID

See main article: Nested RAID levels.

In what was originally termed hybrid RAID,[22] many storage controllers allow RAID levels to be nested. The elements of a RAID may be either individual drives or arrays themselves. Arrays are rarely nested more than one level deep.

The final array is known as the top array. When the top array is RAID 0 (such as in RAID 1+0 and RAID 5+0), most vendors omit the "+" (yielding RAID 10 and RAID 50, respectively).

Non-standard levels

See main article: Non-standard RAID levels.

Many configurations other than the basic numbered RAID levels are possible, and many companies, organizations, and groups have created their own non-standard configurations, in many cases designed to meet the specialized needs of a small niche group. Such configurations include the following:

Implementations

The distribution of data across multiple drives can be managed either by dedicated computer hardware or by software. A software solution may be part of the operating system, part of the firmware and drivers supplied with a standard drive controller (so-called "hardware-assisted software RAID"), or it may reside entirely within the hardware RAID controller.

Hardware-based

See main article: RAID controller.

Hardware RAID controllers can be configured through card BIOS or Option ROM before an operating system is booted, and after the operating system is booted, proprietary configuration utilities are available from the manufacturer of each controller. Unlike the network interface controllers for Ethernet, which can usually be configured and serviced entirely through the common operating system paradigms like ifconfig in Unix, without a need for any third-party tools, each manufacturer of each RAID controller usually provides their own proprietary software tooling for each operating system that they deem to support, ensuring a vendor lock-in, and contributing to reliability issues.

For example, in FreeBSD, in order to access the configuration of Adaptec RAID controllers, users are required to enable Linux compatibility layer, and use the Linux tooling from Adaptec,[30] potentially compromising the stability, reliability and security of their setup, especially when taking the long-term view.[31]

Some other operating systems have implemented their own generic frameworks for interfacing with any RAID controller, and provide tools for monitoring RAID volume status, as well as facilitation of drive identification through LED blinking, alarm management and hot spare disk designations from within the operating system without having to reboot into card BIOS. For example, this was the approach taken by OpenBSD in 2005 with its bio(4) pseudo-device and the bioctl utility, which provide volume status, and allow LED/alarm/hotspare control, as well as the sensors (including the drive sensor) for health monitoring;[32] this approach has subsequently been adopted and extended by NetBSD in 2007 as well.[33]

Software-based

Software RAID implementations are provided by many modern operating systems. Software RAID can be implemented as:

Some advanced file systems are designed to organize data across multiple storage devices directly, without needing the help of a third-party logical volume manager:

Many operating systems provide RAID implementations, including the following:

If a boot drive fails, the system has to be sophisticated enough to be able to boot from the remaining drive or drives. For instance, consider a computer whose disk is configured as RAID 1 (mirrored drives); if the first drive in the array fails, then a first-stage boot loader might not be sophisticated enough to attempt loading the second-stage boot loader from the second drive as a fallback. The second-stage boot loader for FreeBSD is capable of loading a kernel from such an array.[61]

Firmware- and driver-based

Software-implemented RAID is not always compatible with the system's boot process, and it is generally impractical for desktop versions of Windows. However, hardware RAID controllers are expensive and proprietary. To fill this gap, inexpensive "RAID controllers" were introduced that do not contain a dedicated RAID controller chip, but simply a standard drive controller chip, or the chipset built-in RAID function, with proprietary firmware and drivers. During early bootup, the RAID is implemented by the firmware and, once the operating system has been more completely loaded, the drivers take over control. Consequently, such controllers may not work when driver support is not available for the host operating system.[62] An example is Intel Rapid Storage Technology, implemented on many consumer-level motherboards.[63] [64]

Because some minimal hardware support is involved, this implementation is also called "hardware-assisted software RAID",[65] [66] "hybrid model" RAID, or even "fake RAID".[67] If RAID 5 is supported, the hardware may provide a hardware XOR accelerator. An advantage of this model over the pure software RAID is that—if using a redundancy mode—the boot drive is protected from failure (due to the firmware) during the boot process even before the operating system's drivers take over.[68]

Integrity

Data scrubbing (referred to in some environments as patrol read) involves periodic reading and checking by the RAID controller of all the blocks in an array, including those not otherwise accessed. This detects bad blocks before use.[69] Data scrubbing checks for bad blocks on each storage device in an array, but also uses the redundancy of the array to recover bad blocks on a single drive and to reassign the recovered data to spare blocks elsewhere on the drive.[70]

Frequently, a RAID controller is configured to "drop" a component drive (that is, to assume a component drive has failed) if the drive has been unresponsive for eight seconds or so; this might cause the array controller to drop a good drive because that drive has not been given enough time to complete its internal error recovery procedure. Consequently, using consumer-marketed drives with RAID can be risky, and so-called "enterprise class" drives limit this error recovery time to reduce risk. Western Digital's desktop drives used to have a specific fix. A utility called WDTLER.exe limited a drive's error recovery time. The utility enabled TLER (time limited error recovery), which limits the error recovery time to seven seconds. Around September 2009, Western Digital disabled this feature in their desktop drives (such as the Caviar Black line), making such drives unsuitable for use in RAID configurations.[71] However, Western Digital enterprise class drives are shipped from the factory with TLER enabled. Similar technologies are used by Seagate, Samsung, and Hitachi. For non-RAID usage, an enterprise class drive with a short error recovery timeout that cannot be changed is therefore less suitable than a desktop drive. In late 2010, the Smartmontools program began supporting the configuration of ATA Error Recovery Control, allowing the tool to configure many desktop class hard drives for use in RAID setups.

While RAID may protect against physical drive failure, the data is still exposed to operator, software, hardware, and virus destruction. Many studies cite operator fault as a common source of malfunction,[72] [73] such as a server operator replacing the incorrect drive in a faulty RAID, and disabling the system (even temporarily) in the process.[74]

An array can be overwhelmed by catastrophic failure that exceeds its recovery capacity and the entire array is at risk of physical damage by fire, natural disaster, and human forces, however backups can be stored off site. An array is also vulnerable to controller failure because it is not always possible to migrate it to a new, different controller without data loss.[75]

Weaknesses

Correlated failures

In practice, the drives are often the same age (with similar wear) and subject to the same environment. Since many drive failures are due to mechanical issues (which are more likely on older drives), this violates the assumptions of independent, identical rate of failure amongst drives; failures are in fact statistically correlated. In practice, the chances for a second failure before the first has been recovered (causing data loss) are higher than the chances for random failures. In a study of about 100,000 drives, the probability of two drives in the same cluster failing within one hour was four times larger than predicted by the exponential statistical distribution—which characterizes processes in which events occur continuously and independently at a constant average rate. The probability of two failures in the same 10-hour period was twice as large as predicted by an exponential distribution.[76]

Unrecoverable read errors during rebuild

Unrecoverable read errors (URE) present as sector read failures, also known as latent sector errors (LSE). The associated media assessment measure, unrecoverable bit error (UBE) rate, is typically guaranteed to be less than one bit in 1015 for enterprise-class drives (SCSI, FC, SAS or SATA), and less than one bit in 1014 for desktop-class drives (IDE/ATA/PATA or SATA). Increasing drive capacities and large RAID 5 instances have led to the maximum error rates being insufficient to guarantee a successful recovery, due to the high likelihood of such an error occurring on one or more remaining drives during a RAID set rebuild.[77] When rebuilding, parity-based schemes such as RAID 5 are particularly prone to the effects of UREs as they affect not only the sector where they occur, but also reconstructed blocks using that sector for parity computation.[78]

Double-protection parity-based schemes, such as RAID 6, attempt to address this issue by providing redundancy that allows double-drive failures; as a downside, such schemes suffer from elevated write penalty—the number of times the storage medium must be accessed during a single write operation.[79] Schemes that duplicate (mirror) data in a drive-to-drive manner, such as RAID 1 and RAID 10, have a lower risk from UREs than those using parity computation or mirroring between striped sets.[80] Data scrubbing, as a background process, can be used to detect and recover from UREs, effectively reducing the risk of them happening during RAID rebuilds and causing double-drive failures. The recovery of UREs involves remapping of affected underlying disk sectors, utilizing the drive's sector remapping pool; in case of UREs detected during background scrubbing, data redundancy provided by a fully operational RAID set allows the missing data to be reconstructed and rewritten to a remapped sector.[81] [82]

Increasing rebuild time and failure probability

Drive capacity has grown at a much faster rate than transfer speed, and error rates have only fallen a little in comparison. Therefore, larger-capacity drives may take hours if not days to rebuild, during which time other drives may fail or yet undetected read errors may surface. The rebuild time is also limited if the entire array is still in operation at reduced capacity.[83] Given an array with only one redundant drive (which applies to RAID levels 3, 4 and 5, and to "classic" two-drive RAID 1), a second drive failure would cause complete failure of the array. Even though individual drives' mean time between failure (MTBF) have increased over time, this increase has not kept pace with the increased storage capacity of the drives. The time to rebuild the array after a single drive failure, as well as the chance of a second failure during a rebuild, have increased over time.[84]

Some commentators have declared that RAID 6 is only a "band aid" in this respect, because it only kicks the problem a little further down the road. However, according to the 2006 NetApp study of Berriman et al., the chance of failure decreases by a factor of about 3,800 (relative to RAID 5) for a proper implementation of RAID 6, even when using commodity drives. Nevertheless, if the currently observed technology trends remain unchanged, in 2019 a RAID 6 array will have the same chance of failure as its RAID 5 counterpart had in 2010.

Mirroring schemes such as RAID 10 have a bounded recovery time as they require the copy of a single failed drive, compared with parity schemes such as RAID 6, which require the copy of all blocks of the drives in an array set. Triple parity schemes, or triple mirroring, have been suggested as one approach to improve resilience to an additional drive failure during this large rebuild time.[85]

Atomicity

A system crash or other interruption of a write operation can result in states where the parity is inconsistent with the data due to non-atomicity of the write process, such that the parity cannot be used for recovery in the case of a disk failure. This is commonly termed the write hole which is a known data corruption issue in older and low-end RAIDs, caused by interrupted destaging of writes to disk.[86] The write hole can be addressed in a few ways:

Write hole is a little understood and rarely mentioned failure mode for redundant storage systems that do not utilize transactional features. Database researcher Jim Gray wrote "Update in Place is a Poison Apple" during the early days of relational database commercialization.[93]

Write-cache reliability

There are concerns about write-cache reliability, specifically regarding devices equipped with a write-back cache, which is a caching system that reports the data as written as soon as it is written to cache, as opposed to when it is written to the non-volatile medium. If the system experiences a power loss or other major failure, the data may be irrevocably lost from the cache before reaching the non-volatile storage. For this reason good write-back cache implementations include mechanisms, such as redundant battery power, to preserve cache contents across system failures (including power failures) and to flush the cache at system restart time.[94]

See also

External links

Notes and References

  1. Patterson . David . David Patterson (computer scientist) . Gibson . Garth A. . Garth A. Gibson . Katz . Randy . Randy Katz . 1988 . A Case for Redundant Arrays of Inexpensive Disks (RAID) . SIGMOD Conferences . 2024-01-03.
  2. The Story So Far . Frank . Hayes . Computerworld . November 17, 2003 . November 18, 2016 . Patterson recalled the beginnings of his RAID project in 1987. [...] 1988: David A. Patterson leads a team that defines RAID standards for improved performance, reliability and scalability..
  3. Web site: RAID: A Personal Recollection of How Storage Became a System . October 2010 . 2015-01-18 . Randy H. . Katz . IEEE Computer Society . eecs.umich.edu . We were not the first to think of the idea of replacing what Patterson described as a slow large expensive disk (SLED) with an array of inexpensive disks. For example, the concept of disk mirroring, pioneered by Tandem, was well known, and some storage products had already been constructed around arrays of small disks..
  4. Web site: HSC50/70 Hardware Technical Manual . 29, 32 . July 1986 . 2014-01-03 . . 2016-03-04 . https://web.archive.org/web/20160304032213/http://www.textfiles.com/bitsavers/pdf/dec/ci/EK-HS571-TM-001_HSC_hwTech.pdf . dead .
  5. See also The Connection Machine (1988)
  6. Web site: IBM 7030 Data Processing System: Reference Manual . 1960 . 2015-01-17 . . bitsavers.trailing-edge.com . PDF . 157 . Since a large number of bits are handled in parallel, it is practical to use error checking and correction (ECC) bits, and each 39 bit byte is composed of 32 data bits and seven ECC bits. The ECC bits accompany all data transferred to or from the high-speed disks, and, on reading, are used to correct a single bit error in a byte and detect double and most multiple errors in a byte..
  7. Web site: 2009-06-18 . IBM Stretch (aka IBM 7030 Data Processing System) . 2015-01-17 . brouhaha.com . A typical IBM 7030 Data Processing System might have been the following units: [...] IBM 353 Disk Storage Unit similar to IBM 1301 Disk File, but much faster. 2,097,152 (2^21) 72-bit words (64 data bits and 8 ECC bits), 125,000 words per second.
  8. "Originally referred to as Redundant Array of Inexpensive Disks, the term RAID was first published in the late 1980s by Patterson, Gibson, and Katz of the University of California at Berkeley. (The RAID Advisory Board has since substituted the term Inexpensive with Independent.)" Storage Area Network Fundamentals; Meeta Gupta; Cisco Press; ; Appendix A.
  9. Peter . Chen . Edward . Lee . Garth . Gibson . Randy . Katz . David . Patterson . RAID: High-Performance, Reliable Secondary Storage . ACM Computing Surveys . 26 . 2 . 145–185 . 1994 . 10.1145/176979.176981 . 10.1.1.41.3889 . 207178693.
  10. Book: Donald . L. . MCSA/MCSE 2006 JumpStart Computer and Network Basics . SYBEX . Glasgow . 2nd . 2003.
  11. Book: Redundant Arrays of Independent Disks from FOLDOC . Free On-line Dictionary of Computing . Imperial College Department of Computing . Howe . Denis . 2011-11-10.
  12. Dawkins, Bill and Jones, Arnold. "Common RAID Disk Data Format Specification" [Storage Networking Industry Association] Colorado Springs, 28 July 2006. Retrieved on 22 February 2011.
  13. Web site: Adaptec Hybrid RAID Solutions . Adaptec . Adaptec.com . 2012 . 2013-09-07.
  14. Web site: Common RAID Disk Drive Format (DDF) standard . SNIA . SNIA.org . 2012-08-26.
  15. Web site: SNIA Dictionary . SNIA . SNIA.org . 2010-08-24.
  16. Book: Tanenbaum, Andrew S. . Structured Computer Organization 6th ed. . 95.
  17. Book: Computer Architecture: A Quantitative Approach, 4th ed . John . Hennessy . David . Patterson . 2006 . 362 . 978-0123704900.
  18. Web site: FreeBSD Handbook, Chapter 20.5 GEOM: Modular Disk Transformation Framework. 2012-12-20.
  19. Web site: RAID-DP:NetApp Implementation of Double Parity RAID for Data Protection. NetApp Technical Report TR-3298. Jay. White. Chris. Lueth. May 2010. 2013-03-02.
  20. News: Why RAID 6 stops working in 2019 . https://web.archive.org/web/20100815215636/http://www.zdnet.com/blog/storage/why-raid-6-stops-working-in-2019/805 . dead . August 15, 2010 . . 22 February 2010.
  21. Web site: How to protect yourself from RAID-related Unrecoverable Read Errors (UREs). Techrepublic. . Scott . Lowe . 2009-11-16 . 2012-12-01.
  22. Book: Vijayan . S. . Selvamani . S. . Vijayan . S . Proceedings of the 1995 International Conference on Parallel Processing: Volume 1 . https://books.google.com/books?id=QliANH5G3_gC&q=%22hybrid+raid%22 . 1995 . . 978-0-8493-2615-8 . I–146ff . Dual-Crosshatch Disk Array: A Highly Reliable Hybrid-RAID Architecture . Google Books.
  23. Web site: Why is RAID 1+0 better than RAID 0+1? . aput.net . 2016-05-23.
  24. Web site: RAID 10 Vs RAID 01 (RAID 1+0 Vs RAID 0+1) Explained with Diagram . www.thegeekstuff.com . 2016-05-23.
  25. Web site: Comparing RAID 10 and RAID 01 SMB IT Journal . www.smbitjournal.com . 30 July 2014 . 2016-05-23.
  26. Jeffrey B. Layton:, Linux Magazine, January 6, 2011
  27. Web site: Performance, Tools & General Bone-Headed Questions . tldp.org . 2013-12-25.
  28. Web site: Main Page – Linux-raid . osdl.org . 2010-08-20 . 2010-08-24 . dead . https://web.archive.org/web/20080705104645/http://linux-raid.osdl.org/ . 2008-07-05 .
  29. Web site: Hdfs Raid . Hadoopblog.blogspot.com . 2009-08-28 . 2010-08-24.
  30. Web site: aac(4) — Adaptec AdvancedRAID Controller driver. BSD Cross Reference . FreeBSD. Scott . Long . Adaptec, Inc . Adaptec . 2000., Book: aac -- Adaptec AdvancedRAID Controller driver . FreeBSD Manual Pages . FreeBSD.
  31. Web site: 3.8: "Hackers of the Lost RAID". OpenBSD Release Songs. OpenBSD. 2005-11-01 . 2019-03-23.
  32. RAID management support coming in OpenBSD 3.8. Theo de . Raadt. Theo de Raadt. 2005-09-09. misc@. OpenBSD.
  33. MMath. Constantine A. . Murenin . 2010-05-21. 1.1. Motivation; 4. Sensor Drivers; 7.1. NetBSD envsys / sysmon. OpenBSD Hardware Sensors — Environmental Monitoring and Fan Control.. . UWSpace. 10012/5234. Document ID: ab71498b6b1a60ff817b29d56997a418..
  34. Web site: RAID over File System. 2014-07-22. 2013-11-09. https://web.archive.org/web/20131109055927/http://www.flexraid.com/faq-items/what-is-raid-over-file-system/. dead.
  35. Web site: ZFS Raidz Performance, Capacity and Integrity. calomel.org. 26 June 2017.
  36. Web site: ZFS -illumos . . 2014-09-15 . 2016-05-23 . 2019-03-15 . https://web.archive.org/web/20190315183042/https://wiki.illumos.org/display/illumos/ZFS . dead .
  37. Web site: Creating and Destroying ZFS Storage Pools – Oracle Solaris ZFS Administration Guide . . 2012-04-01 . 2014-07-27.
  38. Web site: 20.2. The Z File System (ZFS) . freebsd.org . 2014-07-27 . dead . https://web.archive.org/web/20140703043231/http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/filesystems-zfs.html . 2014-07-03 .
  39. Web site: Double Parity RAID-Z (raidz2) (Solaris ZFS Administration Guide) . . 2014-07-27.
  40. Web site: Triple Parity RAIDZ (raidz3) (Solaris ZFS Administration Guide) . . 2014-07-27.
  41. Web site: General Parallel File System (GPFS) Native RAID. Veera. Deenadhayalan. IBM. UseNix.org. 2011. 2014-09-28.
  42. Web site: Btrfs Wiki: Feature List. 2012-11-07. 2012-11-16.
  43. Web site: Btrfs Wiki: Changelog. 2012-10-01. 2012-11-14.
  44. Web site: 2015-04-22 . https://web.archive.org/web/20150422201638/http://linux-xfs.sgi.com/projects/xfs/papers/xfs_white/xfs_white_paper.html . Scalability and Performance in Modern File Systems . 2015-08-17 . Philip . Trautman . Jim . Mostek . linux-xfs.sgi.com.
  45. Web site: Linux RAID Setup – XFS . 2013-10-05 . 2015-08-17 . kernel.org.
  46. Web site: HPE Support document - HPE Support Center . Hewlett Packard Enterprise . support.hpe.com.
  47. Web site: Mac OS X: How to combine RAID sets in Disk Utility . 2010-01-04.
  48. Web site: Apple Mac OS X Server File Systems . 2008-04-23.
  49. Web site: Other World Computing Launches SoftRAID 8 Setting a New Standard for Reliability, Speed and Data Safeguards . TechPowerUp . 2024-03-20 . 2024-11-24 .
  50. Web site: FreeBSD System Manager's Manual page for GEOM(8) . 2009-03-19.
  51. Web site: freebsd-geom mailing list – new class / geom_raid5 . 6 July 2006 . 2009-03-19.
  52. Web site: FreeBSD Kernel Interfaces Manual for CCD(4) . 2009-03-19.
  53. Web site: The Software-RAID HowTo . 2008-11-10.
  54. Web site: mdadm(8) – Linux man page . Linux.Die.net . 2014-11-20.
  55. Web site: Windows Vista support for large-sector hard disk drives . 2007-05-29 . Microsoft . dead . https://web.archive.org/web/20070703092408/http://support.microsoft.com/kb/923332/ . 2007-07-03 . 2007-10-08.
  56. Web site: You cannot select or format a hard disk partition when you try to install Windows Vista, Windows 7 or Windows Server 2008 R2 . 14 September 2011 . . live . https://web.archive.org/web/20110303111057/http://support.microsoft.com/kb/927520/en-us . 3 March 2011 . 17 December 2009.
  57. Web site: Using Windows XP to Make RAID 5 Happen . 19 November 2004 . . 24 August 2010.
  58. Web site: Sinofsky . Steven . Virtualizing storage for scale, resiliency, and efficiency . Building Windows 8 blog. January 5, 2012. January 6, 2012. May 9, 2013. https://web.archive.org/web/20130509100721/http://blogs.msdn.com/b/b8/archive/2012/01/05/virtualizing-storage-for-scale-resiliency-and-efficiency.aspx. dead.
  59. Web site: NetBSD 1.4 Release Announcement . Perry . Metzger . The NetBSD Foundation . NetBSD.org . 1999-05-12 . 2013-01-30.
  60. Web site: OpenBSD softraid man page . 2018-02-03 . OpenBSD.org.
  61. Web site: FreeBSD Handbook . Chapter 19 GEOM: Modular Disk Transformation Framework . 2009-03-19.
  62. Web site: SATA RAID FAQ . Ata.wiki.kernel.org . 2011-04-08 . 2012-08-26.
  63. Web site: Red Hat Enterprise Linux – Storage Administrator Guide – RAID Types . redhat.com.
  64. Book: Charlie . Russel . Sharon . Crawford . Andrew . Edney . Working with Windows Small Business Server 2011 Essentials . 2011 . O'Reilly Media, Inc. . 978-0-7356-5670-3 . 90 . Google Books.
  65. Web site: Warren . Block . 19.5. Software RAID Devices . freebsd.org . 2014-07-27.
  66. Book: Ronald L. . Krutz . James . Conley . Wiley Pathways Network Security Fundamentals . 2007 . . 978-0-470-10192-6 . 422 . Google Books.
  67. Book: Smith, Gregory . PostgreSQL 9.0: High Performance . 2010 . Packt Publishing Ltd . 978-1-84951-031-8 . 31 . Google Books.
  68. Web site: Hardware RAID vs. Software RAID: Which Implementation is Best for my Application? Adaptec Whitepaper . adaptec.com.
  69. Ulf Troppens, Wolfgang Mueller-Friedt, Rainer Erkens, Rainer Wolafka, Nils Haustein. Storage Networks Explained: Basics and Application of Fibre Channel SAN, NAS, ISCSI, InfiniBand and FCoE. John Wiley and Sons, 2009. p.39
  70. Dell Computers, Background Patrol Read for Dell PowerEdge RAID Controllers, By Drew Habas and John Sieber, Reprinted from Dell Power Solutions, February 2006 http://www.dell.com/downloads/global/power/ps1q06-20050212-Habas.pdf
  71. Web site: Error Recovery Control with Smartmontools . 2009 . September 29, 2017 . dead . https://web.archive.org/web/20110928190045/http://www.csc.liv.ac.uk/~greg/projects/erc/ . September 28, 2011.
  72. Gray . Jim . A census of Tandem system availability between 1985 and 1990 . IEEE Transactions on Reliability . Oct 1990 . 39 . 4 . 409–418 . 10.1109/24.58719 . IEEE. 2955525 . https://web.archive.org/web/20190220114624/http://pdfs.semanticscholar.org/22a4/ddf4d609c6e9c8a0a0ea6187af4c3178a7ed.pdf . dead . 2019-02-20 .
  73. Murphy . Brendan . Gent . Ted . Measuring system and software reliability using an automated data collection process . Quality and Reliability Engineering International . 1995 . 11 . 5 . 341–353 . 10.1002/qre.4680110505.
  74. Patterson, D., Hennessy, J. (2009), 574.
  75. Web site: The RAID Migration Adventure . 10 July 2007 . 2010-03-10.
  76. http://www.usenix.org/events/fast07/tech/schroeder.html Disk Failures in the Real World: What Does an MTTF of 1,000,000 Hours Mean to You?
  77. Web site: Does RAID 6 stop working in 2019?. Robin. Harris. TechnoQWAN. StorageMojo.com. 2010-02-27. 2013-12-17.
  78. J.L. Hafner, V. Dheenadhayalan, K. Rao, and J.A. Tomlin. "Matrix methods for lost data reconstruction in erasure codes. USENIX Conference on File and Storage Technologies, Dec. 13–16, 2005.
  79. Web site: Understanding RAID Performance at Various Levels. Miller. Scott Alan. 2016-01-05. Recovery Zone. StorageCraft. 2016-07-22.
  80. Web site: RAID 5 versus RAID 10 (or even RAID 3, or RAID 4) . March 2, 2011 . October 30, 2014 . Art S. . Kagel . miracleas.com . dead . https://web.archive.org/web/20141103162704/http://www.miracleas.com/BAARF/RAID5_versus_RAID10.txt . November 3, 2014.
  81. Book: M. . Baker . M. . Shah . D.S.H. . Rosenthal . M. . Roussopoulos . P. . Maniatis . T. . Giuli . P . Bungale . Proceedings of the 1st ACM SIGOPS/EuroSys European Conference on Computer Systems 2006 . A fresh look at the reliability of long-term digital storage . April 2006. 221–234 . 10.1145/1217935.1217957 . 1595933220 . 7655425 .
  82. Book: http://research.cs.wisc.edu/adsl/Publications/latent-sigmetrics07.pdf . L.N. . Bairavasundaram . G.R. . Goodson . S. . Pasupathy . J. . Schindler . Proceedings of the 2007 ACM SIGMETRICS international conference on Measurement and modeling of computer systems . An analysis of latent sector errors in disk drives . June 12–16, 2007. 289–300 . 10.1145/1254882.1254917 . 9781595936394 . 14164251 .
  83. Patterson, D., Hennessy, J. (2009). Computer Organization and Design. New York: Morgan Kaufmann Publishers. pp 604–605.
  84. Web site: RAID's Days May Be Numbered . Newman . Henry . 2009-09-17 . 2010-09-07 . EnterpriseStorageForum.
  85. Web site: Triple-Parity RAID and Beyond. ACM Queue, Association for Computing Machinery . Adam . Leventhal . 2009-12-01 . 2012-11-30.
  86. Web site: "Write Hole" in RAID5, RAID6, RAID1, and Other Arrays. ZAR team. 15 February 2012.
  87. Web site: Danti . Gionatan. write hole: which RAID levels are affected? . Server Fault . en.
  88. Web site: ANNOUNCE: mdadm 3.4 - A tool for managing md Soft RAID under Linux [LWN.net]]. lwn.net .
  89. Web site: A journal for MD/RAID5 [LWN.net]]. lwn.net .
  90. Web site: Partial Parity Log . The Linux Kernel documentation.
  91. Web site: RAID-Z . Jeff Bonwick's Blog . Oracle Blogs . 2005-11-17 . 2015-02-01 . Jeff . Bonwick . dead . https://web.archive.org/web/20141216015058/https://blogs.oracle.com/bonwick/en_US/entry/raid_z . 2014-12-16 .
  92. Web site: Overstreet . Kent . bcachefs: Principles of Operation . 10 May 2023 . 18 Dec 2021.
  93. http://www.informatik.uni-trier.de/~ley/db/conf/vldb/Gray81.html Jim Gray: The Transaction Concept: Virtues and Limitations
  94. Web site: Definition of write-back cache at SNIA dictionary. www.snia.org.