Abstract:Replica management strategy is critical for the availability, reliability, and overall performance of the distributed storage system. Aimed at the shortage of the dynamic replication strategy based on heat files, a strategy of dynamically adjusting replica factor base on hot data block is proposed. According to the principle of temporal locality and the rule of data access, calculate the predicted access frequency of data block by assigning different weights to the historical visit period and the current period. Then, according to the fact that HDFS data access approximately conform to twenty-eight law to determine the hot data block decision threshold, thereby determining whether a single data block is hot, dynamically adjusting the replication factor of the data block based on the determination result. Finally, the dynamic replica adjustment strategy of hotspot data block is designed in three steps. Experimental results show that the proposed dynamic replica adjustment strategy based on hot data block has been improved in both data access efficiency and cluster storage resource utilization.