Hot and cold data identification using query aware hybrid partitioning

dc.accession.numberT00532
dc.classification.ddc004.5 KAN
dc.contributor.advisorBhise, Minal
dc.contributor.authorKanwar, Jai Jai
dc.date.accessioned2017-06-10T14:43:25Z
dc.date.accessioned2025-06-28T10:24:32Z
dc.date.available2017-06-10T14:43:25Z
dc.date.issued2015
dc.degreeM. Tech
dc.description.abstractThe price of main memory is reducing with time, which helps to store huge amount of data in the memory. OLTP applications have large database size. It is observed that some applications exhibit skewed access pattern i.e. not all the records are accessed every time. Older data is less likely to be accessed as compared to the recent data. The objective is to store data in such a way so that it makes optimal utilization of memory and helps in faster query execution. To identify this hot and cold data we have proposed a Query Aware approach using Hybrid Partitioning (QAA-HP) approach. For given query workload, QAA-HP identifies the hot schema and the hot data corresponding to it. The hot data and the cold data can be configured differently so that their directed queries are accelerated. Different configuration techniques like vertically partitioned table or binary tables, n-ary tables, horizontally partitioned tables are presented for this purpose. We have used TPC-C benchmark for our experiments, which is an OLTP workload. Initially tables are vertically partitioned for hot schema and then further partitioned horizontally for hot data. Metrics for performance analysis are designed based on Query Analysis and Query Execution Time. The results show that when taking 9% of the TPC-C data in clusters, 79% of the hottest query workload ���� is answered. The percentage of time gain ����% for hottest queries when run on hot clusters is observed to be 37% for cold runs and 31% for hot runs.
dc.identifier.citationKanwar, Jai Jai (2015). Hot and cold data identification using query aware hybrid partitioning. Dhirubhai Ambani Institute of Information and Communication Technology, viii, 61 p. (Acc.No: T00532)
dc.identifier.urihttp://ir.daiict.ac.in/handle/123456789/569
dc.publisherDhirubhai Ambani Institute of Information and Communication Technology
dc.student.id201311038
dc.subjectMemory
dc.subjectOLTP applications
dc.subjectQuery Aware approach
dc.subjectHybrid Partitioning
dc.subjectQAA-HP
dc.subjectPerformance analysis
dc.titleHot and cold data identification using query aware hybrid partitioning
dc.typeDissertation
dcterms.subjectcold data
dcterms.subjectdata partioning
dcterms.subjecthot data

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
201311038.pdf
Size:
1.7 MB
Format:
Adobe Portable Document Format