This is a simple summary of when we sat down in Betfair and questioned why we were still applying a highly manual, so-called ‘industry best practice’ process to all our production application releases regardless of how they were being deployed and went back to the whiteboard to see how we could make it better.
… like OS load, and Java Beans, and packet counts and pings.
The Evolution of TSDB
Here at Betfair, we’ve been enthusiastic users of OpenTSDB for a couple of years now. We use it to gather and store metrics across the entirety of our production estate, and the graphing that it can produce is frequently the first place that our engineers turn when trying to diagnose a fault.
Of course, now that people have seen how useful it can be, they want more from it, and they want it faster. And so we’re developing one particular component of it in new and (hopefully) exciting directions.