Platform Signals

Archives

All the articles I've archived.

2026 ⁷

May ¹

Observability
The Four Golden Signals, Reimagined for AI Systems

What SREs already know about reliability — and what changes when the workload is an LLM.

30 May, 2026 · 12 min read · Sowmya Shree

January ⁶

Observability
Building Multi-Region Synthetic Monitoring with Grafana Open Source (For Free)

Synthetic monitoring is expensive because it's outsourced, not because it's hard. Here's how to build multi-region browser monitoring with Grafana open source tools for free.

11 Jan, 2026 · 4 min read · Sowmya Shree

Observability
The Four Layers of Truth: Monitoring Journeys, Not Just Servers

How to structure your observability stack across four layers — from synthetic journeys to distributed traces — to answer the only question that matters.

10 Jan, 2026 · 4 min read · Sowmya Shree

Glossary
The Primitive Shapes of Reliability (SRE Glossary)

The core concepts every Platform Engineer must know: SLIs, SLOs, Error Budgets, Toil, and Blameless Post-Mortems — distilled.

10 Jan, 2026 · 3 min read · Sowmya Shree

Platform
Terraform is Not Enough: Why "Infrastructure as Code" Drifts

The lie we tell ourselves about IaC — and how Configuration Drift silently undermines your Terraform state.

10 Jan, 2026 · 4 min read · Sowmya Shree

Practice
Reliability is a Feature, Not a Guardrail

Why "100% uptime" is the wrong goal, and how to build systems that embrace failure instead of fighting it.

10 Jan, 2026 · 4 min read · Sowmya Shree

Observability
The Millisecond Watchdog: Monitoring Rules for Low-Latency Trading

In low-latency trading, averages are lies. Here are the monitoring rules — covering market data, execution, and post-trade reconciliation — that actually matter.

1 Jan, 2026 · 4 min read · Sowmya Shree