Add observability to your django application - PyCon FR 2019

Nov 3, 20190 likes568 views

Bleemeo

Discover how to add very simply observabilty to your Django application with django-prometheus, Prometheus and Glouton.

Add Observability to your
Django application
Pierre Fersing & Lionel Porcheron

Who we are?
Pierre Fersing, CTO & co-founder Bleemeo
•Python Dev for +15 years
•DevOps for +10 years
•Toulouse DevOps Meetup Organizer
Lionel Porcheron, CEO & co-founder Bleemeo
•DevOps for +15 years (started my monitoring journey with nagios netsaint)
•Toulouse DevOps Meetup Leader, Capitole du Libre Leader, PyconFR 2017 Organizer

Bleemeo?
Observability& Monitoringas a service solution
Prometheus, Graphite, Statsd, compatible
2 Open Source projects:
◦ Glouton, universalmonitoring agent written in Go with
Prometheus, Statsd, Graphite,Nagios, Zabbixcompatibility
◦ SquirrelDB, a scalablePrometheus compatiblestorage
backend based on Cassandra

Old Monitoring Days
•Monitoryourserver as a blackbox
•Only monitor server & services (web server,
database)
•Only availability,not metrics
•Nagios& derivates

Microservices and modern era
•Increase architecture complexity
•Increase number of technical
components to monitor
•Moderns infrastructure base on
containers are dynamic
•Some components may come from third
parties

•Graphite change the way we were doing monitoring: metrics became central
•Graphite appeared in 2008
•Prometheus became de-facto standard for monitoring
•Prometheus was "invented" in 2012 at Soundcloud and is now a (graduated) CNCF project
•Ecosystem based on Prometheus: exporters, Grafana, software themselves (Kubernetes, Traefik
& many others)
Graphite... and Prometheus

s/monitoring/observability/
•No more monitoring as blackbox: we now know what is inside
•Exports tons of metrics for future usage
•Code need to be instrumented to provide business metrics
•New Buzzzword 😎

Observability Key Metrics
The RED Method
•(Request) Rate - the number of requests, per second, you services are serving.
•(Request) Errors - the number of failed requests per second.
•(Request) Duration - distributions of the amount of time each request takes.
The USE Method
•(Ressource) Utilization: as a percent over a time interval. eg, "one disk is running at 90% utilization".
•(Ressource) Saturation: as a queue length. eg, "the CPUs have an average run queue length of four".
•(Ressource) Errors: scalar counts. eg, "this network interface has had fifty late collisions".

Prometheus exporters
•Prometheus exporters export on web page metrics
(basic plain text page with a metric per line)
•Prometheus poll regularly those endpoints
•Prometheus exporters exist for almost everything:
https://prometheus.io/docs/instrumenting/exporters/

Instrument your code
•For Django application: django-prometheus
•Django Middleware for metrics
•Health checks and custom checks with
django-health-checks

Logs management
•ELK or Loki (log managementproject from Grafana Labs)

Tracing
•Jaeger or Zipkin
•Useful with microservices

Example
Small Django application showing quote of the day
Code is available on Bleemeo Labs github: https://github.com/bleemeolabs/quote

Questions?
👉 WANT TO TRY BLEEMEO? GET 30€ OF CREDITS WITH
PYCONFR2019 VOUCHER

This document provides an overview of a web services bootcamp session presented by Bill Buchan. The agenda covers using Domino to provide web services using LotusScript, Java servlets, and agents, as well as using Notes to consume web services using LotusScript, COM, and Stubby. The document introduces web services concepts and architectures. It discusses using LotusScript in Domino 7 and 8 to easily create web services and profile performance. It also covers more complex options like Java servlets which provide persistence but require more work. The session includes demonstrations of creating and testing a sample web service using a contacts database.

Web Storage & Web WorkersInbal Geffen

The document discusses HTML5 web storage and web workers. Web storage allows data to be stored locally within the browser and accessed across browser sessions. It provides alternatives to cookies for storing data on the client-side. Web workers allow JavaScript processes to run in background threads, improving performance for long-running tasks without blocking the user interface. Communication between the main thread and workers is done asynchronously using messages. Both features have varying browser support that must be checked before using.

Learn everything about IBM iNotes CustomizationIBM Connections Developers

Speaker: Eric Spencer, IBM Software Engineer, iNotes Development Learn how you can customize IBM iNotes and SmartCloud Notes web to adapt your corporate look and feel, modify the available functional areas, and add new capabilities. See the improvements made in recent releases, which allow for easier customization and greater tolerance during the upgrade process. I’ll step through examples, such as modifying the items on the action bar. With some HTML and JavaScript skills you can easily extend your IBM iNotes or SmartCloud Notes web mail client to make it your own!

Domino Tech School - Upgrading to Notes/Domino V10: Best PracticesChristoph Adler

Adobe Experience Manager Vision and RoadmapLoni Stark

The document discusses the evolution of digital experiences and connected devices. It notes that consumers expect personalized, relevant experiences in real-time across multiple channels. The document summarizes research showing growing adoption of devices like smartphones, tablets, smartwatches, connected homes/cars, and the challenges of managing diverse digital properties. It argues experiences will be driven by connection between the digital and physical worlds, with innovations in areas like personalized content delivery, content velocity, experience-driven commerce, cloud agility, and connected experiences across channels and devices.

Web Browser And Search Engine ! Batra Computer Centrejatin batra

MeetUp Monitoring with Prometheus and Grafana (September 2018)Lucas Jellema

This presentation introduces the concept of monitoring - focusing on why and how and finally on the tools to use. It introduces Prometheus (metrics gathering, processing, alerting), application instrumentation and Prometheus exporters and finally it introduces Grafana as a common companion for dashboarding, alerting and notifications. This presentations also introduces the handson workshop - for which materials are available from https://github.com/lucasjellema/monitoring-workshop-prometheus-grafana

Measuring CDN performance and why you're doing it wrongFastly

Integrating content delivery networks into your application infrastructure can offer many benefits, including major performance improvements for your applications. So understanding how CDNs perform — especially for your specific use cases — is vital. However, testing for measurement is complicated and nuanced, and results in metric overload and confusion. It's becoming increasingly important to understand measurement techniques, what they're telling you, and how to apply them to your actual content. In this session, we'll examine the challenges around measuring CDN performance and focus on the different methods for measurement. We'll discuss what to measure, important metrics to focus on, and different ways that numbers may mislead you. More specifically, we'll cover: Different techniques for measuring CDN performance Differentiating between network footprint and object delivery performance Choosing the right content to test Core metrics to focus on and how each impacts real traffic Understanding cache hit ratio, why it can be misleading, and how to measure for it

Monitoring microservice applications: An SRE’s perspectiveDevOpsProdigy

This document discusses monitoring challenges for microservice applications from a site reliability engineer's (SRE) perspective. It identifies 10 common problems including lack of service restart monitoring, error monitoring, health checks, API response time monitoring, and lack of application performance monitoring. It emphasizes the need to work with developers to implement proper monitoring and alerts for service restarts, errors, response times, and other metrics to effectively monitor microservice applications and catch issues early.

Slides: How to Select a PaaSAltoros

Tools. Techniques. Trouble?Testplant

DevOps for TYPO3 Teams and ProjectsFedir RYKHTIK

The document discusses DevOps practices for TYPO3 projects. It defines DevOps as the confluence of development and operations. It highlights the importance of communication between different roles like developers, system administrators, and integrators. It also provides examples of tools and techniques that can be used at different stages of a TYPO3 project to facilitate DevOps practices, such as automated testing, deployment automation, and content synchronization.

Adding Real-time Features to PHP ApplicationsRonny López

It's possible to introduce real-time features to PHP applications without deep modifications of the current codebase. Using WAMP you can build distributed systems out of application components which are loosely coupled and communicate in (soft) real-time. There is no need to learn a whole new language, with the implications it has. It also opens the door to write reactive, event-based, distributed architectures and to achieve easier scalability by distributing messages to multiple systems.

Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDogRedis Labs

Think you have big data? What about high availability requirements? At DataDog we process billions of data points every day including metrics and events, as we help the world monitor the their applications and infrastructure. Being the world’s monitoring system is a big responsibility, and thanks to Redis we are up to the task. Join us as we discuss how the DataDog team monitors and scales Redis to power our SaaS based monitoring offering. We will discuss our usage and deployment patterns, as well as dive into monitoring best practices for production Redis workloads

2019 hashiconf seattle_consul_iocPierre Souchay

Automation: The Good, The Bad and The Ugly with DevOpsGuys - AppD Summit EuropeAppDynamics

A cornerstone of the DevOps philosophy, investment in automation at all stages across the SDLC has increased over recent years. Automation promises velocity and reduced errors, helps foster repeatable processes, and removes the need for long hours on dull, repetitive tasks. So what’s not to like? The downside of automation is that unless applied at the right place in your SDLC it can make a bad process worse. Automation also raises questions around job security, the need for re-skilling in other areas, and tool sprawl if different teams each choose their preferred technology. This session will outline: -A short chronology of where automation has impacted the modern software stack -Where it makes the most sense to automate (by identifying your key constraints) -Best practices for adopting automation and how to identify where it’s working — and where it isn’t For more information, visit: www.appdynamics.com

DevOpsGuys - DevOps Automation - The Good, The Bad and The UglyDevOpsGroup

Monitoring federation open stack infrastructureFernando Lopez Aguilar

Performance Monitoring for the Cloud - Java2Days 2017Werner Keil

Performance Monitoring tools like Performance Co-Pilot (PCP) existed almost longer than the World Wide Web. It was developed in the early 90s by SGI. Parts were made available open source from 2000 on, which led to a further spread of the tool. In recent years an active community formed and a variety of new features and enhancements were added. PCP is now part of Red Hat and SuSE Linux Enterprise editions and included in many other Linux distributions. Versions for other Unix variants, OS X and Windows also exist. This session compares popular Open Source Monitoring Tools like Performance Co-Pilot, StatsD, Dropwizard Metrics, Prometeus, MicroProfile Metrics or StatsD. How they each support Containers or Virtualization, share data with IT monitoring systems like Nagios or Zabbix, or process analyze and visualize it via Carbon, Graphite or Grafana/ElasticSerch.

Prometheus (Microsoft, 2016)Brian Brazil

Leveraging Analytics for DevOpsMichael Floyd

stackconf 2021 | Prometheus in 2021 and beyondNETWAYS

Building Modern Digital Services on Scalable Private Government Infrastructur...Andrés Colón Pérez

These are a series of presentations and knowledge collected from the web to help knowledge sharing at the government of Puerto Rico, created with the hope of helping transform government culture by engaging key personnel in diverse areas of central government IT. We discussed design and development methodologies as well as implementation, network and server technologies that led to the successful launch of the most popular online service in PR.gov, in the hope that the knowledge is retained and used to prevent problems that have plagued digital services of the past. How did Puerto Rico build the New Good standing Certificate Online Service? How did it scale to handle millions of visitors while having 0 licensing costs? This is the technical overview of the design, philosophy and implementation. - Good standing certificate knowledge transfer presentation by Andrés Colón Note on attribution: some content such as logos and designs were used from the web. Rights remain with their original authors. Thanks for sharing with the world.

from ai.backend import python @ pycontw2018Chun-Yu Tseng

1. The document describes the role of an AI engineer at a computer vision startup called Umbo. It discusses developing and maintaining computer vision services, building machine learning pipelines, and lessons learned. 2. Key responsibilities of an AI engineer include developing and maintaining computer vision services, building machine learning pipelines to improve services and measure model performance, and debugging and refactoring code. 3. The author learned that domain knowledge is important, Python is a unifying language, and collaboration between researchers and engineers is necessary. The future may see more high-quality backend development to support machine learning services.

Monitorama 2015 Netflix Instance AnalysisBrendan Gregg

Monitorama 2015 talk by Brendan Gregg, Netflix. With our large and ever-changing cloud environment, it can be vital to debug instance-level performance quickly. There are many instance monitoring solutions, but few come close to meeting our requirements, so we've been building our own and open sourcing them. In this talk, I will discuss our real-world requirements for instance-level analysis and monitoring: not just the metrics and features we desire, but the methodologies we'd like to apply. I will also cover the new and novel solutions we have been developing ourselves to meet these needs and desires, which include use of advanced Linux performance technologies (eg, ftrace, perf_events), and on-demand self-service analysis (Vector).

Debugging Microservices - key challenges and techniques - Microservices Odesa...Lohika_Odessa_TechTalks

Microservice architecture is widespread our days. It comes with a lot of benefits and challenges to solve. Main goal of this talk is to go through troubleshooting and debugging in the distributed micro-service world. Topic would cover: main aspects of the logging,  monitoring, distributed tracing, debugging services on the cluster. About speaker: Andrеy Kolodnitskiy is Staff engineer in the Lohika and his primary focus is around distributed systems, microservices and JVM based languages. Majority of time engineers spend debugging and fixing the issues. This talk will be dedicated to best practicies and tools Andrеys team uses on its project which do help to find issues more efficiently.

2025-05-Q4-2024-Investor-Presentation.pptxSamuele Fogagnolo

IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...organizerofv

More Related Content

Similar to Add observability to your django application - PyCon FR 2019 (20)

MeetUp Monitoring with Prometheus and Grafana (September 2018)Lucas Jellema

Measuring CDN performance and why you're doing it wrongFastly

Monitoring microservice applications: An SRE’s perspectiveDevOpsProdigy

Slides: How to Select a PaaSAltoros

Tools. Techniques. Trouble?Testplant

DevOps for TYPO3 Teams and ProjectsFedir RYKHTIK

Adding Real-time Features to PHP ApplicationsRonny López

Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDogRedis Labs

2019 hashiconf seattle_consul_iocPierre Souchay

Automation: The Good, The Bad and The Ugly with DevOpsGuys - AppD Summit EuropeAppDynamics

DevOpsGuys - DevOps Automation - The Good, The Bad and The UglyDevOpsGroup

Monitoring federation open stack infrastructureFernando Lopez Aguilar

Performance Monitoring for the Cloud - Java2Days 2017Werner Keil

Prometheus (Microsoft, 2016)Brian Brazil

Leveraging Analytics for DevOpsMichael Floyd

stackconf 2021 | Prometheus in 2021 and beyondNETWAYS

Building Modern Digital Services on Scalable Private Government Infrastructur...Andrés Colón Pérez

from ai.backend import python @ pycontw2018Chun-Yu Tseng

Monitorama 2015 Netflix Instance AnalysisBrendan Gregg

Debugging Microservices - key challenges and techniques - Microservices Odesa...Lohika_Odessa_TechTalks

MeetUp Monitoring with Prometheus and Grafana (September 2018)Lucas Jellema

Measuring CDN performance and why you're doing it wrongFastly

Monitoring microservice applications: An SRE’s perspectiveDevOpsProdigy

Slides: How to Select a PaaSAltoros

Tools. Techniques. Trouble?Testplant

DevOps for TYPO3 Teams and ProjectsFedir RYKHTIK

Adding Real-time Features to PHP ApplicationsRonny López

Monitoring and Scaling Redis at DataDog - Ilan Rabinovitch, DataDogRedis Labs

2019 hashiconf seattle_consul_iocPierre Souchay

Automation: The Good, The Bad and The Ugly with DevOpsGuys - AppD Summit EuropeAppDynamics

DevOpsGuys - DevOps Automation - The Good, The Bad and The UglyDevOpsGroup

Monitoring federation open stack infrastructureFernando Lopez Aguilar

Performance Monitoring for the Cloud - Java2Days 2017Werner Keil

Prometheus (Microsoft, 2016)Brian Brazil

Leveraging Analytics for DevOpsMichael Floyd

stackconf 2021 | Prometheus in 2021 and beyondNETWAYS

Building Modern Digital Services on Scalable Private Government Infrastructur...Andrés Colón Pérez

from ai.backend import python @ pycontw2018Chun-Yu Tseng

Monitorama 2015 Netflix Instance AnalysisBrendan Gregg

Debugging Microservices - key challenges and techniques - Microservices Odesa...Lohika_Odessa_TechTalks

Recently uploaded (20)

2025-05-Q4-2024-Investor-Presentation.pptxSamuele Fogagnolo

IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...organizerofv

Linux Support for SMARC: How Toradex Empowers Embedded DevelopersToradex

Toradex brings robust Linux support to SMARC (Smart Mobility Architecture), ensuring high performance and long-term reliability for embedded applications. Here’s how: • Optimized Torizon OS & Yocto Support – Toradex provides Torizon OS, a Debian-based easy-to-use platform, and Yocto BSPs for customized Linux images on SMARC modules. • Seamless Integration with i.MX 8M Plus and i.MX 95 – Toradex SMARC solutions leverage NXP’s i.MX 8 M Plus and i.MX 95 SoCs, delivering power efficiency and AI-ready performance. • Secure and Reliable – With Secure Boot, over-the-air (OTA) updates, and LTS kernel support, Toradex ensures industrial-grade security and longevity. • Containerized Workflows for AI & IoT – Support for Docker, ROS, and real-time Linux enables scalable AI, ML, and IoT applications. • Strong Ecosystem & Developer Support – Toradex offers comprehensive documentation, developer tools, and dedicated support, accelerating time-to-market. With Toradex’s Linux support for SMARC, developers get a scalable, secure, and high-performance solution for industrial, medical, and AI-driven applications. Do you have a specific project or application in mind where you're considering SMARC? We can help with Free Compatibility Check and help you with quick time-to-market For more information: https://www.toradex.com/computer-on-modules/smarc-arm-family

What is Model Context Protocol(MCP) - The new technology for communication bw...Vishnu Singh Chundawat

The MCP (Model Context Protocol) is a framework designed to manage context and interaction within complex systems. This SlideShare presentation will provide a detailed overview of the MCP Model, its applications, and how it plays a crucial role in improving communication and decision-making in distributed systems. We will explore the key concepts behind the protocol, including the importance of context, data management, and how this model enhances system adaptability and responsiveness. Ideal for software developers, system architects, and IT professionals, this presentation will offer valuable insights into how the MCP Model can streamline workflows, improve efficiency, and create more intuitive systems for a wide range of use cases.

Procurement Insights Cost To Value Guide.pptxJon Hansen

Big Data Analytics Quick Research Guide by Arthur MorganArthur Morgan

This is a Quick Research Guide (QRG). QRGs include the following: - A brief, high-level overview of the QRG topic. - A milestone timeline for the QRG topic. - Links to various free online resource materials to provide a deeper dive into the QRG topic. - Conclusion and a recommendation for at least two books available in the SJPL system on the QRG topic. QRGs planned for the series: - Artificial Intelligence QRG - Quantum Computing QRG - Big Data Analytics QRG - Spacecraft Guidance, Navigation & Control QRG (coming 2026) - UK Home Computing & The Birth of ARM QRG (coming 2027) Any questions or comments? - Please contact Arthur Morgan at [email protected]. 100% human made.

Electronic_Mail_Attacks-1-35.pdf by xploitniftliyevhuseyn

Cyber Awareness overview for 2025 month of securityriccardosl1

Rusty Waters: Elevating Lakehouses Beyond Sparkcarlyakerly1

Spark is a powerhouse for large datasets, but when it comes to smaller data workloads, its overhead can sometimes slow things down. What if you could achieve high performance and efficiency without the need for Spark? At S&P Global Commodity Insights, having a complete view of global energy and commodities markets enables customers to make data-driven decisions with confidence and create long-term, sustainable value. 🌍 Explore delta-rs + CDC and how these open-source innovations power lightweight, high-performance data applications beyond Spark! 🚀

TrsLabs - Fintech Product & Business ConsultingTrs Labs

Hybrid Growth Mandate Model with TrsLabs Strategic Investments, Inorganic Growth, Business Model Pivoting are critical activities that business don't do/change everyday. In cases like this, it may benefit your business to choose a temporary external consultant. An unbiased plan driven by clearcut deliverables, market dynamics and without the influence of your internal office equations empower business leaders to make right choices. Getting things done within a budget within a timeframe is key to Growing Business - No matter whether you are a start-up or a big company Talk to us & Unlock the competitive advantage

Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxAnoop Ashok

tecnologias de las primeras civilizaciones.pdffjgm517

ThousandEyes Partner Innovation Updates for May 2025ThousandEyes

Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...BookNet Canada

Book industry standards are evolving rapidly. In the first part of this session, we’ll share an overview of key developments from 2024 and the early months of 2025. Then, BookNet’s resident standards expert, Tom Richardson, and CEO, Lauren Stewart, have a forward-looking conversation about what’s next. Link to recording, presentation slides, and accompanying resource: https://bnctechforum.ca/sessions/standardsgoals-for-2025-standards-certification-roundup/ Presented by BookNet Canada on May 6, 2025 with support from the Department of Canadian Heritage.

Drupalcamp Finland – Measuring Front-end Energy ConsumptionExove

AI and Data Privacy in 2025: Global TrendsInData Labs

In this infographic, we explore how businesses can implement effective governance frameworks to address AI data privacy. Understanding it is crucial for developing effective strategies that ensure compliance, safeguard customer trust, and leverage AI responsibly. Equip yourself with insights that can drive informed decision-making and position your organization for success in the future of data privacy. This infographic contains: -AI and data privacy: Key findings -Statistics on AI data privacy in the today’s world -Tips on how to overcome data privacy challenges -Benefits of AI data security investments. Keep up-to-date on how AI is reshaping privacy standards and what this entails for both individuals and organizations.

Generative Artificial Intelligence (GenAI) in BusinessDr. Tathagat Varma

Quantum Computing Quick Research Guide by Arthur MorganArthur Morgan

Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveScyllaDB

Want to learn practical tips for designing systems that can scale efficiently without compromising speed? Join us for a workshop where we’ll address these challenges head-on and explore how to architect low-latency systems using Rust. During this free interactive workshop oriented for developers, engineers, and architects, we’ll cover how Rust’s unique language features and the Tokio async runtime enable high-performance application development. As you explore key principles of designing low-latency systems with Rust, you will learn how to: - Create and compile a real-world app with Rust - Connect the application to ScyllaDB (NoSQL data store) - Negotiate tradeoffs related to data modeling and querying - Manage and monitor the database for consistently low latencies

Semantic Cultivators : The Critical Future Role to Enable AIartmondano

2025-05-Q4-2024-Investor-Presentation.pptxSamuele Fogagnolo

IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...organizerofv

Linux Support for SMARC: How Toradex Empowers Embedded DevelopersToradex

What is Model Context Protocol(MCP) - The new technology for communication bw...Vishnu Singh Chundawat

Procurement Insights Cost To Value Guide.pptxJon Hansen

Big Data Analytics Quick Research Guide by Arthur MorganArthur Morgan

Electronic_Mail_Attacks-1-35.pdf by xploitniftliyevhuseyn

Cyber Awareness overview for 2025 month of securityriccardosl1

Rusty Waters: Elevating Lakehouses Beyond Sparkcarlyakerly1

TrsLabs - Fintech Product & Business ConsultingTrs Labs

Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxAnoop Ashok

tecnologias de las primeras civilizaciones.pdffjgm517

ThousandEyes Partner Innovation Updates for May 2025ThousandEyes

Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...BookNet Canada

Drupalcamp Finland – Measuring Front-end Energy ConsumptionExove

AI and Data Privacy in 2025: Global TrendsInData Labs

Generative Artificial Intelligence (GenAI) in BusinessDr. Tathagat Varma

Quantum Computing Quick Research Guide by Arthur MorganArthur Morgan

Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveScyllaDB

Semantic Cultivators : The Critical Future Role to Enable AIartmondano

Add observability to your django application - PyCon FR 2019

1. Add Observability to your Django application Pierre Fersing & Lionel Porcheron

2. Who we are? Pierre Fersing, CTO & co-founder Bleemeo •Python Dev for +15 years •DevOps for +10 years •Toulouse DevOps Meetup Organizer Lionel Porcheron, CEO & co-founder Bleemeo •DevOps for +15 years (started my monitoring journey with nagios netsaint) •Toulouse DevOps Meetup Leader, Capitole du Libre Leader, PyconFR 2017 Organizer

3. Bleemeo? Observability& Monitoringas a service solution Prometheus, Graphite, Statsd, compatible 2 Open Source projects: ◦ Glouton, universalmonitoring agent written in Go with Prometheus, Statsd, Graphite,Nagios, Zabbixcompatibility ◦ SquirrelDB, a scalablePrometheus compatiblestorage backend based on Cassandra

4. Old Monitoring Days •Monitoryourserver as a blackbox •Only monitor server & services (web server, database) •Only availability,not metrics •Nagios& derivates

5. Microservices and modern era •Increase architecture complexity •Increase number of technical components to monitor •Moderns infrastructure base on containers are dynamic •Some components may come from third parties

6. •Graphite change the way we were doing monitoring: metrics became central •Graphite appeared in 2008 •Prometheus became de-facto standard for monitoring •Prometheus was "invented" in 2012 at Soundcloud and is now a (graduated) CNCF project •Ecosystem based on Prometheus: exporters, Grafana, software themselves (Kubernetes, Traefik & many others) Graphite... and Prometheus

7. s/monitoring/observability/ •No more monitoring as blackbox: we now know what is inside •Exports tons of metrics for future usage •Code need to be instrumented to provide business metrics •New Buzzzword 😎

8. Three pillars of observability

9. Observability Key Metrics The RED Method •(Request) Rate - the number of requests, per second, you services are serving. •(Request) Errors - the number of failed requests per second. •(Request) Duration - distributions of the amount of time each request takes. The USE Method •(Ressource) Utilization: as a percent over a time interval. eg, "one disk is running at 90% utilization". •(Ressource) Saturation: as a queue length. eg, "the CPUs have an average run queue length of four". •(Ressource) Errors: scalar counts. eg, "this network interface has had fifty late collisions".

10. Prometheus exporters •Prometheus exporters export on web page metrics (basic plain text page with a metric per line) •Prometheus poll regularly those endpoints •Prometheus exporters exist for almost everything: https://prometheus.io/docs/instrumenting/exporters/

11. Instrument your code •For Django application: django-prometheus •Django Middleware for metrics •Health checks and custom checks with django-health-checks

12. Logs management •ELK or Loki (log managementproject from Grafana Labs)

13. Tracing •Jaeger or Zipkin •Useful with microservices

14. Example Small Django application showing quote of the day Code is available on Bleemeo Labs github: https://github.com/bleemeolabs/quote

15. Questions? 👉 WANT TO TRY BLEEMEO? GET 30€ OF CREDITS WITH PYCONFR2019 VOUCHER

Add observability to your django application - PyCon FR 2019

Recommended

More Related Content

Similar to Add observability to your django application - PyCon FR 2019 (20)

Recently uploaded (20)

Add observability to your django application - PyCon FR 2019