DSpace Repository

Efficient Implementation Techniques for Block-Level Cloud Storage Systems

Show simple item record

dc.contributor.advisor Chiueh, Tzi-cker en_US
dc.contributor.author Simha, Dilip Nijagal en_US
dc.contributor.other Department of Computer Science. en_US
dc.date.accessioned 2017-09-20T16:52:29Z
dc.date.available 2017-09-20T16:52:29Z
dc.date.issued 2014-12-01 en_US
dc.identifier.uri http://hdl.handle.net/11401/77313 en_US
dc.description 260 pg. en_US
dc.description.abstract A fundamental building block for an IaaS (Infrastructure-as-a-Service) cloud service such as Amazon's EC2 is a storage virtualization system that provides block-level storage services to individual virtual machines over the network. This dissertation addresses four major problems in such a block-level cloud storage system, in the context of an end-to-end IaaS solution called ITRI Cloud OS. First, to effectively eliminate redundancies in stored data blocks, we propose a scalable block-level deduplication engine called <bold>Sungem</bold>, which uses both sampling and prefetching to minimize the performance overhead of fingerprint accesses, and features a storage block garbage collection algorithm whose run- time overhead is proportional only to the size of the delta between consecutive backup operations. Second, to efficiently flush meta-data updates associated with large-scale block-level storage management, we developed a novel storage system architecture called <bold>BOSC (Batching mOdifications with Sequential Commit)</bold>, which uses largely sequential writes to commit updates to disk and is thus able to sustain high-throughput and low-latency metadata updates that are largely random. Third, as part of the BOSC architecture, we invented a high-throughput low-latency disk logging system called <bold>Beluga</bold>, which fashions a carefully tuned disk write pipeline and makes it possible to provide, on an array of three commodity 7200 RPM SATA disks, close to 5 million fine-grained (64-byte) disk logging operations per second, which is close to the maximum possible bandwidth on a commodity disk, while keeping the latency of each logging operation under 1 msec. Finally, we devised a set of techniques for supporting software-defined storage service on a distributed and replicated storage architecture. Specifically, we developed a distributed storage QoS guarantee system called <bold>Cheetah</bold>, which is able to provide a bandwidth guarantee to each virtual disk attached to a virtual machine, while ensuring the loads on the distributed storage nodes be balanced, and the locality of the access stream associated with each virtual disk be preserved as much as possible. en_US
dc.description.sponsorship This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree. en_US
dc.format Monograph en_US
dc.format.medium Electronic Resource en_US
dc.language.iso en_US en_US
dc.publisher The Graduate School, Stony Brook University: Stony Brook, NY. en_US
dc.subject.lcsh Computer science en_US
dc.subject.other Beluga, BOSC, Cloud Storage, deduplication, Quality of Service, Sungem en_US
dc.title Efficient Implementation Techniques for Block-Level Cloud Storage Systems en_US
dc.type Dissertation en_US
dc.mimetype Application/PDF en_US
dc.contributor.committeemember Zadok, Erez en_US
dc.contributor.committeemember Porter, Donald en_US
dc.contributor.committeemember Aguilera, Marcos. en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account