Tes Engineering Blog

Musings of the Tes Engineering Team

Posts tagged with "monitoring"

  1. Watching the Watchmen

    26 February, 2020

    Monitoring lets our teams know when something unexpected is happening in our live environment. Our monitors keep watch, so we don't have to. However, systems change and so do teams. Information is forgotten, products and services may be handed over to new teams. The biggest concerns of a year ago may be yesterday's news. How should we keep our watchmen relevant? Taking ownership I work in the…

  2. Testing in Production is not for free - a real life tale

    05 September, 2019

    Here's a real life tale of how we used Application Monitoring, Observability and Graceful Degradation to be able to ship fast but also catch and fix mistakes without letting our users down. In it we take a look at safe failure states and complementing metrics with supporting data and how we use them to solve real issues. Let me take you back to 6th June. Everything was fine and normal on Tes.com…

  3. Exceptional Alerts - Instrumenting and Observing Part 3

    20 June, 2019

    Observing what happens when your users interact with your software keeps you from disaster, allowing your users to keep working and you to keep shipping. At Tes we capture what happens when our users interact with our services. We set expectations on outcomes. This means we know when our users can't reach their goals. It also means we can act fast to fix problems. In this blog post I'll show how…

  4. Instrumenting and Observing Microservices Part 2: Are your microservices working together?

    12 February, 2018

    If you want to be confident that your users are able to achieve their goals using your service there's more to do than monitoring the health of individual microservices. You need assurance that your set of microservices are working well together, and when they aren't, you need the information necessary to fix any problems as soon as you can. This blog follows one Tes team's mission to better…

  5. Instrumenting and Observing Microservices Part 1: What do you expect from your microservice?

    03 November, 2017

    A friend of mine tells a great story of a team avoiding a great deal of grief. All of their system health checks were green, but the live graph of purchases dropped to zero and stayed there. Despite the many positive system indicators, the team were able to see they had a problem and were able to react quickly to find and fix it. It turned out that user purchases was a key indicator of…

© Tes Engineering Team 2020 | All rights reserved