Data blocking for partitioned data

dc.accession.numberT00713
dc.classification.ddc005.74 DEO
dc.contributor.advisorBhise, Minal
dc.contributor.authorDeore, Prajakta Balwant
dc.date.accessioned2019-03-19T09:30:53Z
dc.date.accessioned2025-06-28T10:21:09Z
dc.date.available2019-03-19T09:30:53Z
dc.date.issued2018
dc.degreeM. Tech
dc.description.abstractOver the last few years, the volume of data consumed and produced by applications has grown tremendously. This thesis aims to achieve faster query processing for such data. The work is divided into three phases: data partitioning, data blocking, and data skipping. Data partitioning identifies hot and cold partitions of the data and stores them as separate data blocks. Partitioned data is stored contiguously on the disk and verified. Data blocking stores the data blocks on disk such that all hot data blocks are stored together and all cold data blocks are stored together. Data skipping is performed to reduce disk seek time while accessing data from disk. Data partitioning and blocking are implemented on a column-oriented database system. Data blocking resulted in a significant reduction in the amount of data scanned and in query response time. Query execution time (QET) results are reported for three query categories: range queries, nested queries, and aggregate queries. On average across these three categories, QET became 55 times faster for partitioned data. For the same query categories, data blocking and skipping reduce the amount of data scanned by 97% on average and thereby accelerate queries.
dc.identifier.citationDeore, Prajakta Balwant (2018). Data Blocking for Partitioned Data. Dhirubhai Ambani Institute of Information and Communication Technology, x, 54 p. (Acc. No: T00713)
dc.identifier.urihttp://ir.daiict.ac.in/handle/123456789/747
dc.publisherDhirubhai Ambani Institute of Information and Communication Technology
dc.student.id201611024
dc.subjectColumnar storage
dc.subjectData blocking
dc.subjectData partitioning
dc.subjectData skipping
dc.subjectFrequent itemset mining
dc.subjectData processing
dc.subjectQuery processing
dc.titleData blocking for partitioned data
dc.typeDissertation

Files

Original bundle

Name: 201611024_Prajakta Balwant Deore.PDF
Size: 2.08 MB
Format: Adobe Portable Document Format
Description: 201611024