Sisyphean Power & Light

What's Inside the Repository

The Dynamic Network Model is organized into logical layers—from raw network topology to pre-built analysis notebooks. Each layer is documented, versioned, and designed to be used independently or composed together.

Network Topology & Electrical Model

The physical distribution system, ready for power flow simulation.

OpenDSS master file with 12 feeders, 147 line segments, 86 distribution transformers, and 23 switching devices
Bus & node coordinates for GIS visualization and spatial analysis
Conductor specifications per feeder section—impedance, ampacity, vintage, and material
Protection device settings for reclosers, fuses, and relay coordination zones
Capacitor bank and regulator placements with tap settings and control modes

OpenDSS GeoJSON CSV

Time-Series Load & Generation Profiles

Five years of synthetic but statistically faithful demand and DER generation data.

Hourly substation load for all 12 feeders (2020–2025), decomposed into residential, commercial, and industrial components
15-minute AMI meter data for a representative sample of 2,400 service points with realistic noise, gaps, and meter errors
Solar PV generation profiles for 680+ behind-the-meter installations, correlated to local irradiance data
Battery storage dispatch records for community-scale and behind-the-meter systems
EV charging load shapes at residential Level 2 and commercial DCFC stations, growing by adoption year

Parquet CSV 15-min Resolution

Outage & Reliability Records

A complete OMS-style outage history with the detail to train real predictive models.

3,200+ outage events with cause codes (vegetation, equipment failure, animal contact, weather, overload, unknown)
Timestamps: fault detected, crew dispatched, crew arrived, service restored—per IEEE 1366 conventions
Weather tags: temperature, wind speed, precipitation, and lightning strike proximity at time of event
Affected customers & CMI per event, linked to feeder and protective device
Major Event Day (MED) flags per IEEE 1366 2.5-beta method

CSV IEEE 1366 Cause-Coded

Asset Registry & Condition Data

The infrastructure metadata that powers asset management and predictive maintenance models.

Transformer inventory: kVA rating, installation year, manufacturer, oil/dry type, load history summary
Conductor & pole records: material, vintage, span lengths, vegetation clearance zones
Switching device metadata: recloser model, firmware version, SCADA-controlled vs. manual
Condition scores: synthetic health index (1–5) based on age, loading, failure history, and inspection results
Maintenance logs: inspection dates, work orders, and replacement records

CSV JSON Asset Health Index

Weather & Environmental Data

Localized weather history aligned to the outage and load records.

Hourly weather observations: temperature, humidity, wind speed/direction, precipitation, barometric pressure
Lightning strike data: synthetic strike records with proximity to feeders
Vegetation growth model: seasonal growth rates tied to trim cycle schedules
Heat wave and storm event flags for correlation analysis

CSV NOAA-Format Hourly

Scenario Configurations & Analysis Notebooks

Pre-built scenarios and starter notebooks so you can run analysis on day one.

Baseline scenario: SP&L as-is, 2025 system state
High DER scenario: 35% solar penetration + community storage on constrained feeders
EV adoption scenario: 20% residential EV penetration with clustered charging
Extreme weather scenario: 10-year storm event replay with cascading failures
Jupyter notebooks: load forecasting, outage prediction, hosting capacity, and voltage analysis starters

Jupyter Python OpenDSS-py

Repository Structure

sisyphean-power-and-light/ ├── network/ │ ├── master.dss # OpenDSS master file │ ├── lines.dss # Line segment definitions │ ├── transformers.dss # Distribution transformers │ ├── loads.dss # Load allocations by bus │ ├── capacitors.dss # Cap bank placements │ ├── regulators.dss # Voltage regulators │ ├── switches.dss # Reclosers & sectionalizers │ ├── pv_systems.dss # Behind-the-meter solar DER │ ├── storage.dss # Battery storage systems │ └── coordinates.csv # Bus XY for GIS mapping ├── timeseries/ │ ├── substation_load_hourly.parquet # 5-year feeder loads │ ├── ami_15min_sample.parquet # 2,400 meter sample │ ├── pv_generation.parquet # Solar output profiles │ ├── ev_charging.parquet # EV load shapes │ └── battery_dispatch.parquet # Storage charge/discharge ├── outages/ │ ├── outage_events.csv # 3,200+ cause-coded events │ ├── crew_dispatch.csv # Dispatch & restoration logs │ └── reliability_metrics.csv # Annual SAIFI/SAIDI/CAIDI ├── assets/ │ ├── transformers.csv # Inventory & condition scores │ ├── conductors.csv # Line & cable registry │ ├── poles.csv # Pole material, age, class │ ├── switches.csv # Protection device metadata │ └── maintenance_log.csv # Work orders & inspections ├── weather/ │ ├── hourly_observations.csv # Temp, wind, precip, humidity │ ├── lightning_strikes.csv # Proximity-to-feeder records │ └── storm_events.csv # Named events & MED flags ├── scenarios/ │ ├── baseline_2025.json # Current-state configuration │ ├── high_der_2030.json # 35% DER penetration │ ├── ev_adoption_2030.json # 20% residential EV │ └── extreme_weather.json # 10-year storm replay ├── notebooks/ │ ├── 01_load_forecasting.ipynb # LSTM & gradient-boost models │ ├── 02_outage_prediction.ipynb # Random forest classifier │ ├── 03_hosting_capacity.ipynb # Iterative power flow HCA │ ├── 04_voltage_analysis.ipynb # Volt-VAR optimization │ ├── 05_asset_health.ipynb # Predictive maintenance │ └── 06_flisr_simulation.ipynb # Fault isolation modeling └── README.md

The ML/AI Playground

Two things have kept power engineers from building their own ML tools: data locked behind NDAs and CEII restrictions, and a skills gap between understanding a grid problem and implementing an ML solution. SP&L eliminates the first barrier. AI-assisted development tools eliminate the second. Clone it, load it, train on it—today.

Step-by-Step Beginner & Advanced Guides

Hands-on tutorials for every use case below—8 beginner guides plus 8 advanced guides covering deep learning, reinforcement learning, survival analysis, and production techniques. Run the code, understand the approach, then use AI coding assistants to adapt it to your real-world problems.

View All 16 Guides →

Use Case 01

Outage Prediction

Train a classifier on 3,200+ outage events cross-referenced with weather, asset age, vegetation cycles, and time-of-year. The data includes the exact features a utility reliability engineer would use—but without the 18-month procurement cycle to access them.

Start here: Random forest on cause-coded outages. Graduate to temporal convolutional networks for sequence-aware prediction. Benchmark against SP&L's historical SAIFI to validate your model.

XGBoost Random Forest TCN scikit-learn

Beginner Guide → Advanced Guide →

Use Case 02

Load Forecasting

Five years of hourly feeder loads and 15-minute AMI data, decomposed by customer class. Add weather inputs and DER generation to build short-term (day-ahead) and long-term (capacity planning) forecasts that account for behind-the-meter solar cannibalizing net load.

Start here: LSTM or Prophet on substation-level load. Then disaggregate to the feeder level. Then tackle net load forecasting with solar as a confounding variable.

LSTM Prophet Transformer Models PyTorch

Beginner Guide → Advanced Guide →

Use Case 03

Hosting Capacity Analysis

Run iterative power flow studies on the full OpenDSS model to determine how much additional DER each feeder can accommodate before hitting thermal or voltage limits. The network model is pre-configured with existing PV, storage, and EV chargers so you start from a realistic baseline.

Start here: Systematic PV injection at each bus. Map voltage violations and thermal overloads. Compare traditional HCA to ML-accelerated screening methods.

OpenDSS-py Power Flow Voltage Analysis GIS Mapping

Beginner Guide → Advanced Guide →

Use Case 04

Predictive Asset Maintenance

Combine asset condition scores, maintenance logs, loading history, and weather exposure to predict which transformers, reclosers, and conductor segments are most likely to fail in the next 30–90 days. SP&L includes the exact features that utility asset management teams struggle to assemble from their own systems.

Start here: Survival analysis on transformer age + loading. Then layer in gradient-boosted models with weather features. Validate against SP&L's historical failure records.

Survival Analysis XGBoost Feature Engineering Anomaly Detection

Beginner Guide → Advanced Guide →

Use Case 05

FLISR & Restoration Optimization

Replay historical storm events against the network model to simulate how automated Fault Location, Isolation, and Service Restoration would have reduced customer minutes interrupted. Build switching sequence optimization algorithms on a system with realistic topology constraints.

Start here: Counterfactual CMI analysis on the 5 worst storm events. Then build a graph-based optimization model for automated switching. Quantify the avoided CMI to build a business case.

Graph Algorithms NetworkX Optimization Simulation

Beginner Guide → Advanced Guide →

Use Case 06

Volt-VAR Optimization

Develop and test voltage optimization strategies using the network model's capacitor banks, voltage regulators, and smart inverter settings. The time-series load and PV generation data lets you evaluate VVO performance across seasons, time-of-day, and DER penetration levels.

Start here: Baseline voltage profile analysis. Then implement rule-based VVO. Then build a reinforcement learning agent that learns optimal tap and VAR dispatch policies.

Reinforcement Learning OpenDSS-py Control Systems Gym

Beginner Guide → Advanced Guide →

Use Case 07

DER Scenario Planning

Stress-test SP&L's distribution system against aggressive DER adoption futures. The pre-built scenarios include high solar, high EV, and combined penetration configurations. Answer the question every distribution planner asks: "What happens to my system when DER doubles?"

Start here: Run the high-DER scenario and identify reverse power flow feeders. Then build a Monte Carlo simulation for uncertain adoption rates. Map investment triggers to penetration thresholds.

Monte Carlo Scenario Analysis Power Flow Planning

Beginner Guide → Advanced Guide →

Use Case 08

Anomaly Detection & Grid State Estimation

Use the AMI and SCADA-style data to build real-time anomaly detection for voltage excursions, phase imbalances, meter tampering patterns, and non-technical losses. The dataset includes realistic noise, data gaps, and meter errors that make anomaly detection genuinely challenging.

Start here: Autoencoders on AMI voltage time series. Then build an isolation forest for multi-dimensional SCADA anomaly detection. Validate against injected fault scenarios.

Autoencoders Isolation Forest State Estimation PyTorch

Beginner Guide → Advanced Guide →

Built for Engineers Who Build Things

The gap between "I understand this problem" and "I can build an ML model for it" has been too wide for too long—blocked by data access on one side and specialized tooling on the other. SP&L provides the data. AI-assisted development provides a co-pilot for the code. Your domain expertise does the rest.

Power Systems Engineers

You know which problems matter—which feeders are trouble, which assets are aging out, which load patterns signal something wrong. Now you can build the solutions yourself. AI-assisted development tools translate your engineering intuition into working code, and SP&L gives you a realistic system to build against without touching production data.

Data Scientists Entering Energy

You have the ML skills but not the domain context. SP&L comes with documented data dictionaries, industry-standard metrics (SAIFI, SAIDI, CAIDI), and notebooks that bridge the gap between general data science and power systems engineering.

Utility Innovation Teams

You need to prototype analytics use cases before pitching them internally. SP&L plus AI-assisted development lets you build a working demo in days, not months—on realistic data, validated against real engineering constraints, ready to show leadership without waiting for IT to provision access to production systems.

Academic Researchers

You need a common, open reference system that reviewers and collaborators can reproduce. SP&L provides a citable, versioned, realistic distribution network model with enough complexity to publish meaningful results.

Independent Consultants

You need to demonstrate feasibility and build proof-of-concept analyses for clients, but accessing their data takes months. SP&L gives you a realistic utility-scale reference system to develop and validate your methodologies before engagement, shortening sales cycles and establishing credibility.

AI/ML Engineers

You want a domain-specific playground that isn't MNIST or Kaggle tabular data. Power systems offer time-series, graph topology, spatial data, multi-objective optimization, and real-time control problems—all in one domain. SP&L packages them in a single repo.

The Thesis: Just Build Things Now

The power grid is the largest machine ever built, and it's being reinvented in real time. Solar, storage, EVs, and electrification are fundamentally changing how distribution systems operate. The utilities managing this transition need analytical tools built by people who understand the grid—and they need them faster than traditional vendor procurement or data science hiring pipelines can deliver.

For years, experienced power engineers have been sidelined from building these tools—not because they lack the ability to define what needs to be built, but because two barriers stood in the way. First, realistic grid data was locked behind NDAs, CEII restrictions, and vendor contracts. Second, translating engineering knowledge into working code required specialized programming skills that took years to develop. Both barriers are now gone. SP&L provides the open data. Modern AI-assisted development tools handle the coding scaffolding. The equation has changed.

An experienced distribution engineer with the SP&L dataset can now:

Build an outage prediction model trained on 3,200+ cause-coded events, focusing on feature engineering and domain judgment rather than wrestling with syntax
Develop a load forecasting pipeline on five years of realistic hourly data, iterating rapidly on model architectures without getting blocked by implementation details
Prototype a hosting capacity screening tool and test it against multiple DER scenarios in a single afternoon
Design a voltage optimization strategy on a system with real-world topology constraints, defining the control logic and reward functions while modern tools generate the implementation
Publish research on a common, reproducible reference system that others can build on

The best grid analytics won't come from ML specialists guessing at domain context. They'll be built by power engineers who understand the physics, the operations, and the failure modes. The domain expertise is the irreplaceable ingredient. The data is now open. The development tools are now accessible. What's left is what you're already good at—understanding the grid.

"The grid doesn't need more feasibility studies. It needs engineers who understand the problems shipping working solutions against realistic data. The tools to do that exist now. SP&L is the starting line."

A Fictional Utility. Real Engineering Problems.

SP&L at a Glance

What's Inside the Repository

Network Topology & Electrical Model

Time-Series Load & Generation Profiles

Outage & Reliability Records

Asset Registry & Condition Data

Weather & Environmental Data

Scenario Configurations & Analysis Notebooks

Repository Structure

The ML/AI Playground

Step-by-Step Beginner & Advanced Guides

Outage Prediction

Load Forecasting

Hosting Capacity Analysis

Predictive Asset Maintenance

FLISR & Restoration Optimization

Volt-VAR Optimization

DER Scenario Planning

Anomaly Detection & Grid State Estimation

Built for Engineers Who Build Things

Power Systems Engineers

Data Scientists Entering Energy

Utility Innovation Teams

Academic Researchers

Independent Consultants

AI/ML Engineers

The Thesis: Just Build Things Now

Start Building

Clone the Repository