"Additionally we had been running with far too much delay in vacuuming which meant lower load on the system, but more idle time in maintenance."
So vacuum wasn't given the resources to keep up with their load, and it's not clear whether they were supplementing with manual vacuums during quiet times. Nor is it clear when they started reacting (PostgreSQL, as outlined in their documentation link, will start squawking about wraparound well before it shuts down), or the monitoring they had in place for whether autovacuum/vacuum was keeping up with the workload, or the number of locks their application was taking (locks can block autovacuum).
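For reference, a minimal check along these lines (the xid_age alias is just illustrative) shows how close each database is getting to wraparound:

    -- Distance of each database from transaction ID wraparound; autovacuum
    -- starts forced anti-wraparound vacuums once this passes
    -- autovacuum_freeze_max_age (200 million transactions by default).
    SELECT datname, age(datfrozenxid) AS xid_age
    FROM pg_database
    ORDER BY xid_age DESC;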
Adjusting the autovacuum settings per table also gives you finer control than the global postgresql.conf parameters, so you can better match the workload of specific tables.
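As a rough sketch (the table name and values are hypothetical, not Sentry's actual settings):

    -- Make autovacuum kick in earlier and run unthrottled on one hot table,
    -- overriding the global postgresql.conf defaults only for it.
    ALTER TABLE events SET (
        autovacuum_vacuum_scale_factor = 0.01,  -- vacuum after ~1% of rows change
        autovacuum_vacuum_cost_delay = 0        -- no cost-based throttling here
    );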
Partitioning would also have helped keep the actively written part of the table smaller; older data could be vacuumed once with VACUUM FREEZE, or simply deleted later. There are extensions to help make that easier:
https://github.com/keithf4/pg_partman
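For example (partition names hypothetical), a partition that no longer receives writes only needs to be frozen once, and retiring old data becomes a quick DROP rather than a bloat-generating DELETE:

    -- Freeze everything in a partition that is no longer written to, so later
    -- anti-wraparound vacuums have little or nothing left to do on it.
    VACUUM FREEZE events_2015_06;
    -- Old data can be retired wholesale instead of row by row.
    DROP TABLE events_2015_05;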
Josh Berkus of PgExperts has some great advice here:
https://dba.stackexchange.com/questions/21068/aggressive-aut...
So does Jim Nasby, here:
http://bluetreble.com/2014/10/postgres-mvcc-and-a-look-at-va...
Regarding monitoring, check_postgres.pl would have given them an earlier indication that their vacuuming settings needed adjustment, or that their application locking needed to change so it wasn't blocking autovacuum:
https://bucardo.org/check_postgres/check_postgres.pl.html#tx...
https://bucardo.org/check_postgres/check_postgres.pl.html#la...
https://bucardo.org/check_postgres/check_postgres.pl.html#lo...
It's easy to mention things after the fact, though, and it's good they got things up and running again for their customers.
"PostgreSQL ... will start squawking about wraparound ... or the monitoring they had in place"
If only there were a tool or service they could use to monitor log files for signs of problems. (I couldn't resist being a little snarky; the situation seems so perfect.)
All kidding aside, kudos to Sentry for being so publicly candid about their problems. We've seen so many companies avoid providing any technical information; about the best we'll hear in some of those cases is that the problem was "not terrorism related".
We actually knew about the problem with the delay and had been working to improve it. We were a couple of days away from failing over to the new hardware (safely), and unfortunately we didn't have any early warnings in the logs. I haven't yet looked into why.
We had it at 50ms on the previous setup, though I wish I knew why that value was used. Likely it was a default in the Chef cookbook we forked off of, or we read something at the time that convinced us it was a good idea.
"Additionally we had been running with far too much delay in vacuuming which meant lower load on the system, but more idle time in maintenance."
So, vacuum wasn't given the resources to keep up with their load, and it's not clear if they were supplementing with manual vacuums during quiet times. Nor was it clear when they started reacting, as PostgreSQL (also outlined in their documentation link), will start squawking about wraparound well before it shuts down, or the monitoring they had in place for whether or not autovacuum/vacuum was keeping up with the workload, or the number of locks their application was taking (locks can block autovacuum).
Adjusting the autovacuum settings by table will give you finer control over the postgresql.conf parameters to better match the workload for specific tables, as well.
Partitioning would have also helped make the actively written part of the table smaller, and older data could be vacuumed with VACUUM FREEZE, or deleted later. There are extensions to help make that easier.
https://github.com/keithf4/pg_partman
Josh Berkus of PgExperts has some great advice here:
https://dba.stackexchange.com/questions/21068/aggressive-aut...
So does Jim Nasby, here:
http://bluetreble.com/2014/10/postgres-mvcc-and-a-look-at-va...
Regarding monitoring, check_postgres.pl would give them an idea that their vacuuming settings needed adjustment sooner, or their application locking needed to be adjusted to not block autovacuum.
https://bucardo.org/check_postgres/check_postgres.pl.html#tx...
https://bucardo.org/check_postgres/check_postgres.pl.html#la...
https://bucardo.org/check_postgres/check_postgres.pl.html#lo...
It's easy to mention things after the fact, though, and it's good they got things up and running again for their customers.