Insights

Thoughts on persistence, grid modernization, and the iterative journey of energy transition.

7-Part Series

SRE for Power Grids

Your utility already practices Site Reliability Engineering. FLISR is failover. Reclosers are retry-with-backoff. The missing piece is making it systematic.

March 3, 2026 · 8 min read
Reliability Engineering

Part 1: Why Your Grid Is Already Running SRE

Utilities already practice core SRE concepts. FLISR is failover, reclosers are retry-with-backoff, storm drills are chaos engineering. The missing piece is making it systematic.

Read Part 1 →

March 11, 2026 · 9 min read
Reliability Engineering

Part 2: The Grid Is a Network

The power grid and the internet share the same architecture. BGP maps to FLISR, CDNs map to DERs, load balancers map to dispatch. The internet already solved the reliability problem.

Read Part 2 →

March 18, 2026 · 10 min read
Reliability Engineering

Part 3: Why N-1/N-2 Can't Keep Up

N-1/N-2 contingency planning produces binary pass/fail results in a probabilistic world. The 2003 Northeast blackout showed that every component can pass its contingency check while the system as a whole still fails.

Read Part 3 →

March 18, 2026 · 11 min read
Reliability Engineering

Part 4: Chaos Engineering for the Grid

Controlled failure injection for power systems. Storm replay, protection miscoordination drills, and DER disconnection tests that find weaknesses before they find you.

Read Part 4 →

March 18, 2026 · 12 min read
Reliability Engineering

Part 5: SRE Doesn't Replace IEEE 1366. It Makes It Better.

SRE and IEEE 1366 operate at different timescales and are complementary. Error budgets derived from SAIDI targets give operators a leading indicator months before the annual report.

Read Part 5 →

March 18, 2026 · 10 min read
Reliability Engineering

Part 6: The $10 Million SAIDI Improvement

A 30-minute SAIDI improvement avoids $10 million per year in outage costs. A typical SRE program costs $2 million to run. The math is not complicated. The hard part is believing it.

Read Part 6 →

March 18, 2026 · 9 min read
Reliability Engineering

Part 7: Building SRE Culture at a Utility

The technical case is strong. The economic case is compelling. But neither matters if the organization cannot adopt it. The transformation takes 18 months. Here is how to execute it.

Read Part 7 →