Clustering Models for Analyzing the Database Update Process

González-Rivera, Pablo I.

Files

Clustering Models for Analyzing the Database Update Process.pdf (1.67 MB)

Date

2025-12

Authors

González-Rivera, Pablo I.

Publisher

ITESO

Abstract

This work presents an empirical characterization of Oracle Datapatch execution during live patching by analyzing performance telemetry generated by the Oracle RDBMS. The main problem addressed is the lack of studies, datasets or methodologies describing how Datapatch behaves under real runtime conditions. The general objective of this work is to model this behavior using unsupervised learning techniques in order to identify recurring execution patterns.

To achieve this objective, a data acquisition strategy based solely on native, license-free instrumentation was developed using Statspack. A structured feature selection process was then applied using two complementary approaches: a statistical method that uses a supervised model to identify variables correlated with execution duration, and a semantic method that draws on Oracle documentation to construct interpretable performance indicators. Both variable sets were used to train and evaluate clustering models capable of grouping similar execution behaviors.

The results show that the proposed methodology can distinguish between three kinds of execution: stable executions, executions with internal pressure spikes that are handled efficiently, and executions dominated by I/O demand. These clusters provide insight into runtime behavior beyond elapsed-time measurement and demonstrate that Datapatch performance varies across identifiable operational states. The main contribution of this work is a reproducible methodology for understanding Datapatch execution behavior through data-driven analysis. Finally, the conclusions and potential research extensions are presented.
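
The two-stage idea summarized in the abstract (select variables related to execution duration, then cluster executions on them) can be illustrated with a minimal, standard-library-only sketch. It is not the thesis implementation: the feature names (`cpu_time`, `physical_reads`, `noise`), the synthetic numbers, the use of absolute Pearson correlation as a stand-in for the supervised selection model, and the choice of k-means with `k=2` are all assumptions made here for illustration.

```python
import math
import random

def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either series is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return 0.0 if sx == 0 or sy == 0 else cov / (sx * sy)

def rank_features(rows, durations, names):
    """Order feature names by |correlation| with duration (stand-in for a supervised model)."""
    scores = {
        name: abs(pearson([row[i] for row in rows], durations))
        for i, name in enumerate(names)
    }
    return sorted(names, key=lambda name: scores[name], reverse=True)

def standardize(points):
    """Z-score each column so no single metric dominates the distance."""
    cols = list(zip(*points))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
            for c, m in zip(cols, means)]
    return [tuple((v - m) / s for v, m, s in zip(p, means, stds)) for p in points]

def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's algorithm with a seeded random initialization."""
    rng = random.Random(seed)
    centroids = [tuple(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                  for p in points]
        updated = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            # Keep the old centroid if a cluster ends up empty.
            updated.append(tuple(sum(col) / len(members) for col in zip(*members))
                           if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return labels, centroids

# Synthetic, made-up telemetry: one row per hypothetical Datapatch execution.
names = ["cpu_time", "physical_reads", "noise"]  # hypothetical feature names
rows = [(10, 100, 5), (11, 110, 9), (12, 105, 1),
        (50, 900, 7), (52, 950, 3), (49, 920, 6)]
durations = [60, 62, 61, 300, 310, 305]  # seconds, invented for the example

top_features = rank_features(rows, durations, names)[:2]
selected = [tuple(row[names.index(f)] for f in top_features) for row in rows]
labels, centroids = kmeans(standardize(selected), k=2)
```

On this toy data the uninformative `noise` column is ranked last, and the six executions separate into a light group and a heavy group. Real Statspack telemetry would need many more executions, a justified choice of the number of clusters, and a cluster-quality check before any operational interpretation.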

Keywords

Database, Patching, Clustering

Citation

González-Rivera, P. I. (2025). Clustering models for analyzing the database update process. Master's degree project, Maestría en Ciencia de Datos. Tlaquepaque, Jalisco: ITESO.

URI

https://hdl.handle.net/11117/12022

Collections

DMAF - Trabajos de fin de Maestría en Ciencia de Datos
