What’s new in Prometheus 2.28?

by | 08.07.2021 | Changelog

Prometheus 2.28 is out. If you don’t know, Prometheus is an excellent open-source system monitoring and alerting toolkit. Let’s have a look at those features and have a look at the changelog.

Displaying Trace Examplers in the Graphic Interface

From the previous versions of 2.26 and 2.27, we can see the new support that Prometheus received for storing and receiving trace exemplars and sending exemplar data over Prometheus’s remote_write interface.

In this version, we see that the integration of exemplars with metrics is taking a step further with added support for displaying tracing exemplars in Prometheus’s graphing interface. From now on, whenever a PromQL query accesses series with exemplar data attached to them, we can now show these exemplars along with the returned PromQL query data by enabling a per graph “Show Exemplars” setting.

prometheus 2.28

Per Graph Show Exemplars (Source: Promlabs)

A latency-related graph allows us to quickly find trace exemplars for requests with high latency to investigate those requests in a tracing system further. Suppose we want to copy part of the exemplar metadata (such as the trace ID) to correlate it with another system in production. In that case, we can click on any exemplar to show its metadata more persistently under the graph.

An easy way for us to see this is to get a demo running where we can try out the exemplar display is by running Grafana’s “The New Stack” demo app:

# Clone the "The New Stack" demo app.
git clone git@github.com:grafana/tns

# Check out a predictable version of the repo.
git checkout ec5673d

# Use Prometheus 2.28 instead of 2.26.
cd production/docker-compose
sed -i 's/prometheus:v2.26/prometheus:v2.28/' docker-compose.yml

# Run components necessary to expose and store metrics and trace exemplars.
docker-compose up tempo db app loadgen prometheus

After this, we can head to  http://localhost:9090/ and run the query:

histogram_quantile(0.99, sum by(le, method) (rate(tns_request_duration_seconds_bucket[1m])))

We can click the “Show Exemplars” button to show trace exemplars.

Generic HTTP-Based Service Discovery

We have seen earlier that Prometheus has supported generic service discovery integrations for a long time via its file_sd discovery mechanism, which watches and reads a set of target files from disk. The only downside of the file_sd approach is that it requires sharing a filesystem between Prometheus and a sidecar process that generates the target files, which can be problematic in some environments. Users have asked for a network-based alternative for a long time.

A new HTTP-based generic discovery mechanism is now in Prometheus v2.28. This mechanism works almost like file_sd but loads target information from a specified URL rather than from a local file:

- job_name: http-sd
  http_sd_configs:
    - url: 'https://infra-db/prometheus-sd'

It also only supports JSON as the targets format, whereas file_sd supports both JSON and YAML.

This new HTTP-based discovery mechanism not only encourages people to build remote sidecar processes that can pull target information from various sources of truth. But developers will add native Prometheus HTTP discovery support directly into their infrastructure and service databases, getting rid of the need to run a separate process.

Defaulting to the New Expression Editor

During Prometheus 2.26, we saw the introduction of a shiny new PromQL editor with advanced auto-completion, inline linting, and syntax, highlighting that you had to explicitly enable it as an experimental feature. Since this new editor is so much more usable than the previous bare text input field, and since no major complaints came about it, Prometheus 2.28 now defaults to this new editor:

New Editor (Source: Promlabs)

This default is stored in the browser’s local storage and is only set to the new default value if no previously stored value is present for it yet. Thus if we have used Prometheus 2.26 or 2.27 in the meantime and now access 2.28 using the same URL, we may still have to enable the new editor explicitly.

Other Features and Enhancements

Promtool: We will now see an allowance of silencing output when importing or backfilling data.

Consul SD: It will now support reading tokens from the file.

Rules: Added a new .ExternalURL alert field templating variable containing the external URL of the Prometheus server.

Scrape: Added experimental body_size_limit scrape configuration setting to limit the allowed response body size for target scrapes.

Kubernetes SD: Added ingress class name label for ingress discovery.

UI: Showing of a startup screen with a progress bar when the TSDB is not ready yet.

SD: Addition of a target creation failure counter like prometheus_target_sync_failed_total and improvement of target creation failure handling.

TSDB: Improving validation of exemplar label set length.

TSDB: Added a prometheus_tsdb_clean_start metric indicating whether a TSDB lockfile from a previous run still existed upon startup.

Bug Fixes

UI: In the experimental PromQL editor, fixing autocompletion and parsing for unique float values and improving series metadata fetching.

TSDB: When merging chunks, split resulting chunks would contain more than the maximum of 120 samples.

SD: Fixed the computation of the prometheus_sd_discovered_targets metric when using multiple service discoveries.

Conclusion

A lot of new things came with Prometheus v2.28. You can try out this new version by clicking here.

Read more posts on CNCF here:

Join the Community

The DevOps Awareness Program

Subscribe to the newsletter

Join 100+ cloud native ethusiasts

#wearep3r

Join the community Slack

Discuss all things Kubernetes, DevOps and Cloud Native

More stories from our blog

How to Install Portainer on Remote Server ft. VSCode?

How to Install Portainer on Remote Server ft. VSCode?

Portainer is one of the most popular and trusted GUI for managing Docker, Swarms, ACIs and Kubernetes. The company boasts on its’ website for having 500K users, and there’s no doubt to the number looking at how easy it makes managing the tools. This post goes on the...

What’s new in Python-Tuf v0.18.0?

What’s new in Python-Tuf v0.18.0?

Python-Tuf v0.18.0 recently came, and it is quite a big update with major and minor changes. We will go through all of those changes, additions, fixes and removals in this document. Without further a due, let's start! What is Python-Tuf? The Update Framework (TUF) or...

What’s new in Envoyproxy v1.19.1?

What’s new in Envoyproxy v1.19.1?

Envoyproxy came with its new version a few days ago. Version 1.19.1 comes with very few updates. It provides a few minor behavioural changes and a few bug fixes to make the user experience smoother. In this article, we will cover all of the new changes. Let's start!...

What’s new in Jaeger v1.26.0?

What’s new in Jaeger v1.26.0?

Jaeger v1.26.0 recently came. It has a few changes in its backend. In this article, we will cover all of this in a straightforward way. We will see all of the fixes and the new features that the devs have added. Let's start! What is Jaeger? Jaeger is a graduated CNCF...

Prometheus: As Simple As Possible

Prometheus: As Simple As Possible

Distributed systems help an organisation absorb countless benefits but at the cost of complexity. With the rise of the adoption of container orchestrators like Kubernetes, a need for monitoring and alerting systems came. One such system is Prometheus which is famous...

Bootstrap K3S Data: For Beginners

Bootstrap K3S Data: For Beginners

For Kubernetes users, handling data management tasks and other analysis needs can become difficult with the inclusion of edge based devices. Internet of Things (IoT) as a whole is designed to complement online services for devices commonly used by people such as air...

What’s new in Ingress-Nginx Controller v1.0.0?

What’s new in Ingress-Nginx Controller v1.0.0?

Ingress-Nginx controller for Kubernetes came with its new release almost a month earlier. I know we are pretty late in documenting this but trust me, this update is pretty big. And in this article, we will see all of the new features and essential bug fixes and...

Getting gRPC Right: An Introduction and Review

Getting gRPC Right: An Introduction and Review

The question of APIs and their best implementation through online websites will always remain a tough nut to crack as the web undergoes scaled changes each year. It’s hard to think that the web was once draped by HTML and PHP alone until CSS and Javascript made...

What’s new in TikV v5.0.4?

What’s new in TikV v5.0.4?

TikV came up with its new release this month. It is a small one, but we can see a couple of improvements and some bug fixes along the way. In this article, we will see all of those and view the recent changes. Let's start! What is TikV? TiKV is a graduate project of...

Interested in what we do? Looking for help? Wanna talk about software strategy?