Department of Electrical, Computer, and Biomedical Engineering Faculty Publications

Hardware accelerator for similarity based data dedupe

Dongyang Li, University of Rhode Island
Qingbo Wang, Western Digital
Cyril Guyot, Western Digital
Ashwin Narasimha, Western Digital
Dejan Vucinic, Western Digital
Zvonimir Bandic, Western Digital
Qing Yang, University of Rhode Island

Document Type

Conference Proceeding

Date of Original Version

9-10-2015

Abstract

Data deduplication has proven important in backup storage systems as large amount of identical or similar data chunks exist. Recent studies have shown the great potential of data deduplication in primary storage and storage caches. Deduplications in these environments require high speed processing not to drag down production performance. This paper presents a hardware accelerator for similarity based data deduplication. It implements three compute-intensive kernel modules to improve throughput and latency in dedupe systems: sketch computation for data blocks, index searching for reference block, and delta encoding over similar blocks. Adopting pipelined computation and parallel data lookup across multiple hardware modules, our HW design is capable of processing high throughput data traffic by working on multiple data units concurrently, thus enabling wire speed dedupe for data stream where similar blocks present. Using a PC host system connected to the FPGA-based accelerator through a PCIe Gen 2×4 interface, our experiments show that the similarity based data dedupe performs 30% better in data reduction ratio than conventional dedupe techniques that look at identical blocks only. By comparing the hardware implementation with its software counterpart, the experimental results show that our preliminary FPGA implementation with maximum clock speed of 250MHz achieves at least 6 times improvement in latency over the software implementation running on state-of-art servers.

Publication Title, e.g., Journal

Proceedings of the 2015 IEEE International Conference on Networking, Architecture and Storage, NAS 2015

Citation/Publisher Attribution

Li, Dongyang, Qingbo Wang, Cyril Guyot, Ashwin Narasimha, Dejan Vucinic, Zvonimir Bandic, and Qing Yang. "Hardware accelerator for similarity based data dedupe." Proceedings of the 2015 IEEE International Conference on Networking, Architecture and Storage, NAS 2015 (2015): 224-232. doi: 10.1109/NAS.2015.7255198.

Link to Full Text

COinS

DOI

https://doi.org/10.1109/NAS.2015.7255198

Department of Electrical, Computer, and Biomedical Engineering Faculty Publications

Hardware accelerator for similarity based data dedupe

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Citation/Publisher Attribution

DOI

Search

Browse

Author Corner

Department of Electrical, Computer, and Biomedical Engineering Faculty Publications

Hardware accelerator for similarity based data dedupe

Authors

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Citation/Publisher Attribution

Share

DOI

Search

Browse

Author Corner