Repository logo

Smart Cube Predictions for Online Analytic Query Processing in Data Warehouses

dc.contributor.authorBelcin, Andrei
dc.contributor.supervisorViktor, Herna
dc.contributor.supervisorPaquet, Eric
dc.date.accessioned2021-04-01T12:44:51Z
dc.date.available2021-04-01T12:44:51Z
dc.date.issued2021-04-01en_US
dc.description.abstractA data warehouse (DW) is a transformation of many sources of transactional data integrated into a single collection that is non-volatile and time-variant that can provide decision support to managerial roles within an organization. For this application, the database server needs to process multiple users’ queries by joining various datasets and loading the result in main memory to begin calculations. In current systems, this process is reactionary to users’ input and can be undesirably slow. In previous studies, it was shown that a personalization scheme of a single user’s query patterns and loading the smaller subset into main memory the query response time significantly shortened the query response time. The LPCDA framework developed in this research handles multiple users’ query demands, and the query patterns are subject to change (so-called concept drift) and noise. To this end, the LPCDA framework detects changes in user behaviour and dynamically adapts the personalized smart cube definition for the group of users. Numerous data mart (DM)s, as components of the DW, are subject to intense aggregations to assist analytics at the request of automated systems and human users’ queries. Subsequently, there is a growing need to properly manage the supply of data into main memory that is in closest proximity to the CPU that computes the query in order to reduce the response time from the moment a query arrives at the DW server. As a result, this thesis proposes an end-to-end adaptive learning ensemble for resource allocation of cuboids within a a DM to achieve a relevant and timely constructed smart cube before the time in need, as a way of adopting the just-in-time inventory management strategy applied in other real-world scenarios. The algorithms comprising the ensemble involve predictive methodologies from Bayesian statistics, data mining, and machine learning, that reflect the changes in the data-generating process using a number of change detection algorithms. Therefore, given different operational constraints and data-specific considerations, the ensemble can, to an effective degree, determine the cuboids in the lattice of a DM to pre-construct into a smart cube ahead of users submitting their queries, thereby benefiting from a quicker response than static schema views or no action at all.en_US
dc.identifier.urihttp://hdl.handle.net/10393/41956
dc.identifier.urihttp://dx.doi.org/10.20381/ruor-26178
dc.language.isoenen_US
dc.publisherUniversité d'Ottawa / University of Ottawaen_US
dc.subjectMachine Learningen_US
dc.subjectConcept Driften_US
dc.subjectData Warehouseen_US
dc.subjectSmart Cubeen_US
dc.subjectOLAPen_US
dc.subjectPredictive Modellingen_US
dc.titleSmart Cube Predictions for Online Analytic Query Processing in Data Warehousesen_US
dc.typeThesisen_US
thesis.degree.disciplineGénie / Engineeringen_US
thesis.degree.levelMastersen_US
thesis.degree.nameMScen_US
uottawa.departmentScience informatique et génie électrique / Electrical Engineering and Computer Scienceen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
Belcin_Andrei_2021_thesis.pdf
Size:
6.17 MB
Format:
Adobe Portable Document Format
Description:
Three updates made to reflect feedback in email: file name, roman numerals begin at (ii), and the appendix is fully page numbered. The "Part 1" page after the table of contents is no longer considered "front matter" and I think the paging would be ok, otherwise this would look awkward when "Part II" comes along in the middle of the thesis with a different pagination style (?).

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail ImageThumbnail Image
Name:
license.txt
Size:
6.65 KB
Format:
Item-specific license agreed upon to submission
Description: