In this tutorial you will learn how to train a machine learning model on Prometheus metrics to perform time series forecasting and predict anomalies.
We will use a Jupyter notebook to train a machine learning model on sample Prometheus time series metric data sets. We will be using the Prometheus API Client to collect and manipulate the time series data. Hence, we highly recommend you complete the Prometheus API Client Tutorial before starting this one!
Once the environment is ready to be used, you will see the links to access it in your workshop terminal.
During this tutorial you will be using a hosted OpenShift 4.7 environment that is created just for you. This environment is not shared with other users of the system.
Your environment will only be active for a one hour period. Keep this in mind before embarking on getting through the content.
Each time you start this tutorial, a new environment will be created on the fly.
Let's get started!
Feedback for this scenario
Did you like this scenario? or Do you think we can improve on something?
You can help us by providing your feedback/suggestions here.
More information about the applications we used
- Prometheus - A Cloud native application monitoring system.
- Jupyter Notebook - Open source web application for creating and sharing documents that contain live code, equations, visualizations and text.
- Prometheus Anomaly Detector - Framework to deploy a metric prediction model for detecting anomalies in prometheus metrics.
How to work with Prometheus Data in Python
Prometheus API Client Python - A Python side client to query metrics data from Prometheus.
We also have created a workshop that will walk you through using the Prometheus API Client:
Workshops to learn more about Openshift
- Foundations of OpenShift covers basic OpenShift concepts and common tasks.
- Building Applications on OpenShift has tutorials on using Quarkus, Node.js, Vert.x, etc. on OpenShift.
- Subsystems, Components, and Internals teaches about Linux container internals.
- OpenShift Playgrounds provide an open-ended environment on different OpenShift versions to allow you to experiment and learn on your own.
- Service Mesh Workshop with Istio introduces you to many service mesh concepts.
- Building Operators on OpenShift teaches you about Kubernetes operators, how to build operators on OpenShift, and provides a Kubernetes API refresher if you need it.
Workshops to learn more about Machine Learning on OpenShift
- JupyterHub on OpenShift has tutorials on deploying/using JupyterHub on OpenShift.
Playing with the SARIMA model
SARIMA model to forecast timeseries metrics
As we saw in the previous scenario, Prometheus metrics are time series data identified by metric name and key/value pairs. With the increased amount of metrics flowing in it is getting harder to see the signals within the noise. The current state of the art is to graph out metrics on dashboards and alert on thresholds. However, we can leverage machine learning algorithms to perform time series forecasting and predict an unusual behavior or pattern in the metrics. The predicted values from the model can be compared with the actual metric values and if they differ from the default threshold values, we can flag it as an anomaly.
What is Time Series Forecasting?
A time series is a sequence of observations taken sequentially in time. Time series adds an explicit order dependence between observations: a time dimension. This additional dimension is both a constraint and a structure that provides a source of additional information.
We have different goals depending on whether we are interested in understanding a dataset or making predictions. Making predictions about the future is called extrapolation in the classical statistical handling of time series data. More modern fields focus on the topic and refer to it as time series forecasting. Forecasting involves taking models fit on historical data and using them to predict future observations. An important distinction in forecasting is that the future is completely unavailable and must only be estimated from what has already happened. The skill of a time series forecasting model is determined by its performance at predicting the future. This is often at the expense of being able to explain why a specific prediction was made, confidence intervals and even better understanding the underlying causes behind the problem.
Components of Time Series
Some important components of time series are:
- Level - The baseline value for the series if it were a straight line
- Trend - The optional and often linear increasing or decreasing behavior of the series over time
- Seasonality - The optional repeating patterns or cycles of behavior over time
- Noise - The optional variability in the observations that cannot be explained by the model
All time series have a level, most have noise, and the trend and seasonality are optional. Assumptions can be made about these components both in behavior and in how they are combined, which allows them to be modeled using traditional statistical methods. These components may also be the most effective way to make predictions about future values, but not always
Concerns of Forecasting
When forecasting, it is important to understand your goal and ask lots of questions to help zoom in on the specifics of your predictive modeling problem. For example:
- How much data do you have available and are you able to gather it all together?
- What is the time horizon of predictions that is required? Short, medium or long term?
- Can forecasts be updated frequently over time or must they be made once and remain static?
- At what temporal frequency are forecasts required?
Time series data often requires cleaning, scaling, and even transformation.
- Frequency - Perhaps data is provided at a frequency that is too high to model or is unevenly spaced through time requiring resampling for use in some models
- Outliers - Perhaps there are corrupt or extreme outlier values that need to be identified and handled
- Missing - Perhaps there are gaps or missing data that need to be interpolated or imputed
SARIMA model for Time Series Forecasting
In this notebook, we will explore how to train a SARIMA (Seasonal AutoRegressive Integrated Moving Average) model for predicting anomalies on sample metric data. (S)ARIMA models are among the most widely used approaches for time series forecasting. In an AutoRegressive model the forecasts correspond to a linear combination of past values of the variable. In a Moving Average model the forecasts correspond to a linear combination of past forecast errors.
Basically, the ARIMA models combine these two approaches. Since they require the time series to be stationary, differencing (Integrating) the time series may be a necessary step, i.e. considering the time series of the differences instead of the original one.
The SARIMA model (Seasonal ARIMA) extends the ARIMA by adding a linear combination of seasonal past values and/or forecast errors.
Let us now proceed to the notebook and understand all the necessary steps involved while solving a machine learning problem!