This event has ended. Visit the official site or create your own event on Sched.
Customize your schedule by session topic and skill level:  Session Topic - Refer to the "Type" filter list to the right to find a session based on topic. Talk Difficulty - Sessions are categorized as [B]eginner, [I]ntermediate or [A]dvanced at the end of each talk title.

Sign up or log in to bookmark your favorites and sync them to your phone or calendar.

A 06 [clear filter]
Wednesday, March 29

11:15 CEST

Alertmanager on Its Way to High Availability [A] - Frederic Branczyk, CoreOS
The latest releases of the Alertmanager introduced a high availability mode. Frederic will guide you through the contract of Prometheus and Alertmanager and why developing the high availability mode is unique to the problem the Alertmanager solves by giving an introduction into the principles used to implement it.

avatar for Frederic Branczyk

Frederic Branczyk

Software Engineer, CoreOS
Frederic is an engineer at CoreOS contributing to Prometheus and Kubernetes to build state of the art modern infrastructure and monitoring tools. He discovered his interest in monitoring tools and distributed systems in his previous jobs, where he used machine learning to detect anomalies... Read More →

Wednesday March 29, 2017 11:15 - 11:50 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany

12:00 CEST

Counting with Prometheus [I] - Brian Brazil, Robust Perception
Counters are one of the two core metric types in Prometheus, allowing for tracking of request rates, error ratios and other key measurements. Learn why are they designed the way they are, how client libraries implement them and how rate() works.

avatar for Brian Brazil

Brian Brazil

Developer, Robust Perception
Brian Brazil is a core developer of Prometheus, and the founder of Robust Perception. He has developed and maintains components and features across the Prometheus ecosystem including the Python and Java clients, and many exporters. He wrote many of the best practices and guidelines... Read More →

Wednesday March 29, 2017 12:00 - 12:35 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany

13:55 CEST

Binary Protocols - The Current State of Affairs [I] - Jason Smith, Container Solutions
With gRPC becoming increasingly popular, Protobuf is also drawing attention. Binary protocols are making a comeback in popularity. We will looks at the advantages and disadvantages of binary as well as how some of the protocols such as Flatbuffers, Capn'n Proto, Protobuf, etc differ.

avatar for Jason Smith

Jason Smith

Engineer, Container Solutions
Jason is a backend engineer who has 10 years of experience in building integrations, and yet he still doesn't hate SOAP. He is a devout student of the K.I.S.S. philosophy. He has become focused on microservices, and the best practices for developing in a distributed environment.

Wednesday March 29, 2017 13:55 - 14:30 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany
  gRPC, Developer

14:40 CEST

Alerting in Cloud Native Environments [I] - Fabian Reinartz, CoreOS
In a Cloud Native infrastructure, component failure is normal and expected. The loss of a single node or a dozen hard drives is automatically handled by the systems running a datacenter, removing the need to page someone at 4am.

This calls for an alerting system that understands service availability at a global scope, yet is still able to give detailed reports if and when there is a service-impacting incident. Prometheus achieves this by defining alerting conditions directly on time series data. The resulting alerts are grouped and aggregated into comprehensive and meaningful notifications.

Fabian will walk through the philosophy of time series based alerting, the Prometheus architecture behind it, and how practical anomaly detection can be implemented.

avatar for Fabian Reinartz

Fabian Reinartz

Software Engineer, Google
Fabian Reinartz is a software engineer at Google and one of the core developers of Prometheus, a monitoring system and time series database. Previously, he was a production engineer at SoundCloud and worked on information retrieval during his time at Saarland University.

Wednesday March 29, 2017 14:40 - 15:15 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany

15:35 CEST

Understanding and Extending Prometheus AlertManager [I] - Lee Calcote, SolarWinds
The AlertManager component handles alerts sent by client applications such as the Prometheus server, carefully de-duplicating, correlating, and routing their notifications to their appropriate receiver (e.g. email, webhook, etc.). Current behavior of this component is only to display actively firing alerts.

Contributing to Prometheus is no different than most other open source endeavors, which, like many projects, welcomes community contributions. Let’s gain better familiarity with the process by augmenting Prometheus’ AlertManager with a new “history” view.

avatar for Lee Calcote

Lee Calcote

Founder, Layer5
Lee Calcote is an innovative product and technology leader, passionate about developer platforms and management software for clouds, containers, functions and applications. Advanced and emerging technologies have been a consistent focus through Calcote’s tenure at SolarWinds, Seagate... Read More →

Wednesday March 29, 2017 15:35 - 16:10 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany

16:20 CEST

Grafana is Not Enough: DIY User Interfaces for Prometheus [I] - David Kaltschmidt, Weaveworks
This talk gives a quick overview of the currently available Prometheus UIs and shows ways to build your own interfaces to enable your workflows. Most popular among the UIs is Grafana, which works well with Prometheus and is lovely for dashboarding, but terrible for troubleshooting. What do you do if you want to slightly modify queries based on your dashboards? How can you explore the keys and values of your metric labels quickly? Having trouble remembering PromQL syntax? This talk presents small building UI blocks to get to your results faster and save the day.

avatar for David Kaltschmidt

David Kaltschmidt

Director of Engineering, Weaveworks
David has been building UIs for the networking industry for over a decade. He is currently with Weaveworks and heads their Berlin office.

Wednesday March 29, 2017 16:20 - 16:55 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany
Thursday, March 30

11:30 CEST

Integrating Long-Term Storage with Prometheus [A] - Julius Volz, Prometheus
While Prometheus's built-in storage does not support long-term data retention and durability, Prometheus provides nascent interfaces for integration with external long-term storage systems. This talk explains how these interfaces work, and how you can integrate with them to create custom long-term storage solutions. It will also showcase some real-world integration examples and highlight the key challenges in implementing them.

avatar for Julius Volz

Julius Volz

Founder, PromLabs
Julius co-founded the Prometheus monitoring system and originally led the project to success at SoundCloud and beyond. He now focuses on growing the Prometheus community, started PromCon, the first conference around Prometheus, and helps companies use and adapt Prometheus through... Read More →

Thursday March 30, 2017 11:30 - 12:05 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany
  Prometheus, Operations

12:15 CEST

Prometheus: The Unsung Heroes [I] - Alejandro Ramirez, Swisscom Directories AG
Prometheus is a fantastic tool, Prometheus/Kubernetes integration is amazing, PromQL is expressive…. We have all heard the superlatives and the articles, but this talk explores those tools of the Prometheus ecosystem that are not under the spotlight, but should not be discarded as mere footnotes to the saga.

We will discuss how mtail and the snmp_exporter allowed us to instrument our load balancer appliance and give us more insights into our performance in such a critical component of our system.


Alejandro Ramírez

Head of DevOps, Swisscom Directories AG
Alejandro Ramirez Local.ch (Swisscom Directories AG) Senior System Engineer Zürich, Switzerland Alejandro works at local.ch, one of the country’s highest traffic websites, ensuring that the platform runs smoothly while at the same empowering other engineers in the team to accelerate... Read More →

Thursday March 30, 2017 12:15 - 12:50 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany

14:00 CEST

Configuring Prometheus for High Performance [A] - Björn Rabenstein, SoundCloud Ltd.
A key to Prometheus's success is its operational simplicity. It takes only minutes to get a server with a meaningful configuration up and running from scratch. However, once you need to run your Prometheus server a bit hotter, you will quickly find yourself practicing the dark art of tuning Prometheus command line flags. This talk will shed some light on how to handle various scenarios: How to run Prometheus on a tiny machine or how to utilize a large amount of RAM. How to optimize for very high ingestion rates or for a lot of time series. How to get the most out of your old spinning disk or how to limit the wearout of your expensive SSD. There will also be a sneak preview of ongoing efforts to reduce the required amount of flag tuning.

avatar for Björn Rabenstein

Björn Rabenstein

Engineer, Grafana Labs
Björn is a Production Engineer at SoundCloud and a Prometheus developer. Previously, he was a Site Reliability Engineer at Google and a number cruncher for science.

Slides pdf

Thursday March 30, 2017 14:00 - 14:35 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany
  Prometheus, Operations

14:45 CEST

Ops for Developers - Monitor Your Java Application with Prometheus [I] - Alexander Schwartz, msg systems ag
Prometheus is ready-to-go to monitor your infrastructure and your containers. Capturing simple metrics from a Java application can be done automatically. But Developers need to go one step further to provide useful application and business metrics.
This talk shows how to collect meaningful metrics in Vert.x and Spring based Java applications. It applies Dropwizard Metrics to gather internal application statistics, and Hystrix to provide metrics of the external interfaces the application consumes. It also shows how to fill gaps by writing your own collector.
By providing end-to-end examples this talk enables developers to make their application ready for production.

avatar for Alexander Schwartz

Alexander Schwartz

Principal IT Consultant, msg systems ag
I help organisations and teams to make architecture decisions that match their needs. To build their software in a maintainable way, I empower them with best practices, tools and libraries. I lead by example to adopt true agile processes and to stay curious. Ideas spark when sharing... Read More →

Thursday March 30, 2017 14:45 - 15:20 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany

15:40 CEST

Monitor My Socks: Using Prometheus in a Polyglot Open Source Microservices Reference Architecture [I] - Phil Winder, Winder Research/Container Solution
This presentation describes how Prometheus was integrated into a polyglot microservices application. We will use the "Sock Shop", a cloud-native reference microservices architecture to demonstrate some of the best practices and pitfalls of attempting to unify monitoring in real life. Attendees will be able to use this application as a reference point, or as a real life starting point for their own applications.

Specifically, we will cover:
- Integrating Prometheus in Java/Go/Node.js/Haskell
- Best practices: what to monitor and how to structure the monitoring code
- Pitfalls: what goes wrong and why
- Demonstrations and descriptions how attendees can try it for themselves

avatar for Phil Winder

Phil Winder

Engineer, Winder Research / Container Solutions
Phil Winder is a multi-disciplinary freelance architect working towards the research and development of cutting-edge technology. Most recently he has been developing cloud-based full-stack microservice systems for a range of clients but has a significant past in machine learning and... Read More →

Thursday March 30, 2017 15:40 - 16:15 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany

16:25 CEST

Behind the Iron Curtain: Getting Metrics From the Browser into Prometheus [I] - Tom Wilkie, Weaveworks
Prometheus has so far been pigeon holed as a monitoring technology for the backend; well, no more! With the rise of single-page apps, gathering metrics from within the browser has never been more important - latency as experienced by the user is much more important than any measurements taken from individual backend services. And if you can’t track Javascript errors that could potentially lead to blank screens, you should feel bad.

This talk presents a method for tracking UI metrics and errors with Prometheus. By getting this data into Prometheus, you can reuse the dashboarding and alerting options that you've grown to love. And by using a common monitoring stack across frontend and backend, you can finally claim the throne of the king of fullstack!

avatar for Tom Wilkie

Tom Wilkie

VP Product, Grafana Labs
Tom is VP Product at Grafana Labs, a member of Prometheus team and one of the original authors of the Cortex and Loki projects. In his spare time he builds 3D printers and make craft beer.

Thursday March 30, 2017 16:25 - 17:00 CEST
A 06 Berlin Congress Center, Alexanderstraße 11, 10178 Berlin, Germany
Filter sessions
Apply filters to sessions.