Song B.,Beihang University |
Xiao L.,Beihang University |
Xiao L.,National Engineering Research Center for Science and Technology Resources Sharing Service |
Qin G.,Beihang University |
And 5 more authors.
Communications in Computer and Information Science | Year: 2017
Satellite applications such as remote sensing application are overwhelmed with vast quantities of data. Nevertheless, the storage resources in the satellite are so limited that it should be used more efficient. The similarity between the remote sensing data is high, but the dissimilar parts of the data distribute irregularly. When using the traditional deduplication algorithm to split the file into chunks, a large amount of chunks are exactly similar but not the same, which results in the bad effect of data deduplication. We propose a deduplication algorithm based on data similarity and delta encoding to reduce the usage of storage resources. The data similarity analysis can find out the similar data. The delta encoding technology can reduce the usage of storage resources. Through experiments on remote sensing application data, we have achieved deduplication ratios up to 30:1, and analyzed how the chunksize affect the experiment results. © Springer Nature Singapore Pte Ltd. 2017.