In Section 2.1, we presented some research studies on storage devices such as magnetic tape and solid-state drives, and briefly introduced the features of these devices. What are the advantages and disadvantages of a distributed DBMS? iBigtable consists of a series of security protocols built on a specially designed data structure and BigTable. For example, storage systems such as Amazon S3, Google File System, and Hadoop Distributed File System all adopt similar data replication strategies, called the "conventional multi-replica replication strategy," in which a fixed number of replicas (normally three) are stored for all data to meet the reliability requirement. In addition to analyzing storage devices in the Cloud, research on Cloud storage and data reliability assurance also requires that the storage scheme of the Cloud be determined. With the development of science, the scientific approach to data has evolved from the empirical description stage, through the theoretical modelling stage and the computational simulation stage, to today's fourth paradigm, the data-intensive scientific discovery stage. Data reliability indicates the ability of the storage system to keep data consistent; hence, it is always one of the key metrics of a data storage/management system. Distributed storage usually adopts a distributed system structure, in which multiple storage servers share the storage load and location servers are used to locate and store information. The row keys of this BigTable are ordered lexicographically; a column key is obtained by concatenating the family and the qualifier fields. For example, the SciHmm [53] project optimizes time and monetary cost for the phylogenetic analysis problem. In one study [55], the data reliability of the system was measured by the data missing rate and the file missing rate, and the problem of maximizing data reliability with limited storage capacity was investigated. BigTable performance; the number of operations per tablet server. 
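The key structure described above (row keys ordered lexicographically, column keys formed by concatenating family and qualifier) can be illustrated with a small in-memory sketch. This is not BigTable's implementation: the class `SparseTable`, the helper `column_key`, and the `:` separator between family and qualifier are assumptions introduced purely for illustration.

```python
from bisect import bisect_left, insort


def column_key(family, qualifier):
    """Concatenate family and qualifier (the ':' separator is an assumption)."""
    return f"{family}:{qualifier}"


class SparseTable:
    """Hypothetical toy table: only cells that were written are stored."""

    def __init__(self):
        self._rows = {}          # row key -> {column key -> value}
        self._sorted_keys = []   # row keys kept in lexicographic order

    def put(self, row, family, qualifier, value):
        if row not in self._rows:
            self._rows[row] = {}
            insort(self._sorted_keys, row)  # keep row keys sorted on insert
        self._rows[row][column_key(family, qualifier)] = value

    def scan(self, start, end):
        """Yield (row_key, cells) for start <= row_key < end, in key order."""
        lo = bisect_left(self._sorted_keys, start)
        hi = bisect_left(self._sorted_keys, end)
        for key in self._sorted_keys[lo:hi]:
            yield key, self._rows[key]
```

Because the ordering is lexicographic rather than numeric, a row key such as `user10` sorts between `user1` and `user2`, which matters when designing row keys for range scans.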
Because scientific data sets are so large, knowledge of the storage format of scientific data in the cloud is very important. BigTable provides a flexible, high-performance solution for various products. Time stamps used to index different versions of the data in a cell are 64-bit integers. In an erasure coding–based data storage environment, the computation and time overheads of coding and decoding the data are so high that the overall benefit of the reduced storage cost is significantly weakened. Tablet servers manage a set of tablets, which includes handling read and write operations on loaded tablets and splitting oversized tablets into smaller ones. Figure 6.10 shows an example of BigTable, a sparse, distributed, multidimensional map for an email application. A BigTable example; the organization of an email application as a sparse, distributed, multidimensional map. BigTable serves a large number of projects at Google [13]. To describe the data reliability of replication-based systems, analytical data reliability models have been proposed and comprehensively studied [4,19,55,57,60]. Big Data is more than simply a performance issue to be solved by scaling up technology; it has also brought with it a paradigm shift in data processing and data management practices. Data reliability specifically, which refers to the reliability provided by the data storage services and systems for the stored data, can be defined as “the probability of the data surviving in the system for a given period of time” [2].
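To make the definition of data reliability concrete, the following sketch computes the probability that data survive a given period under deliberately simple assumptions that are not taken from the cited models: each replica fails independently at a constant rate, lost replicas are never repaired, and the data are lost only when every replica is lost. The function names and the parameter `failure_rate` are hypothetical.

```python
import math


def replica_survival(failure_rate, years):
    """Probability that one replica survives, assuming a constant failure rate."""
    return math.exp(-failure_rate * years)


def data_reliability(failure_rate, years, replicas=3):
    """Probability that at least one of `replicas` independent copies survives.

    Assumes independent failures and no repair of lost replicas.
    """
    p_replica_lost = 1.0 - replica_survival(failure_rate, years)
    return 1.0 - p_replica_lost ** replicas
```

Under these assumptions, adding replicas raises reliability with diminishing returns while storage consumption grows linearly, which is the trade-off a fixed three-replica strategy accepts for all data.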