The Provocative SRE Manifesto
Transforming Site Reliability Engineering from Reactive Firefighting to Proactive Innovation
Preamble
Site Reliability Engineering must evolve. The traditional SRE model, focused on infrastructure, black-box monitoring, and reactive firefighting, has delivered stability, but often at the cost of innovation, agility, and business value. We believe reliability is only meaningful when it enables product value, rapid learning, and creative exploration.
We are not passive stewards of uptime. We are active enablers of progress.
Traditional SRE: The Black Box, Firefighter, and Status Quo
1. System as Black Box
- View: SREs treat the system as a black box; the application is just the container or artifact.
- Implication: Focus is on infrastructure reliability (pods, nodes, clusters), not on the application’s business logic or domain model.
- Result: Limited understanding of how changes impact business value or user experience.
2. Statelessness and Avoidance of Complexity
- View: Emphasis on stateless deployments; avoid schema/data migrations due to risk and complexity.
- Implication: SREs often push back on changes that touch data or require coordinated evolution, leading to brittle architectures and slow innovation.
3. Reactive Monitoring and Fixing
- View: Rely on metrics, alerts, and automated controllers (like Kubernetes) to reactively keep the system healthy.
- Implication: SREs are firefighters, responding to incidents rather than preventing them or improving the system holistically.
Provocative SRE: White Box, Proactive, and Human-Centric
1. Full SDLC Ecosystem Visibility
- View: The ecosystem is a white box. SREs and engineers see and interact with the entire SDLC—code, contracts, dependencies, environments, and runtime.
- How ONDEMANDENV.dev Enables This:
  - Codified contracts make dependencies explicit and versioned
  - Environment versioning means every part of the system is traceable and reproducible
  - Console and visualization tools make the system’s structure and health transparent
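To illustrate what "explicit and versioned" dependencies can look like in practice, here is a minimal sketch. The `Contract` class, its fields, and the interface names are hypothetical and are not the ContractsLib API; the point is only that once dependencies are data, a missing or mismatched one can be detected before deployment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Contract:
    """Hypothetical codified contract: what a service produces and consumes."""
    service: str
    version: str
    produces: frozenset = frozenset()
    consumes: frozenset = frozenset()


def unresolved_dependencies(contracts):
    """Return, per service, the consumed interfaces that no contract produces."""
    produced = set()
    for contract in contracts:
        produced |= contract.produces

    missing = {}
    for contract in contracts:
        gap = set(contract.consumes) - produced
        if gap:
            missing[contract.service] = gap
    return missing


# Illustrative data: "orders" expects payments.api/v2, but only v1 is published.
orders = Contract(
    service="orders",
    version="1.4.0",
    produces=frozenset({"orders.api/v1"}),
    consumes=frozenset({"payments.api/v2"}),
)
payments = Contract(
    service="payments",
    version="2.1.0",
    produces=frozenset({"payments.api/v1"}),
)

gaps = unresolved_dependencies([orders, payments])
```

Here `gaps` reports that `orders` consumes an interface nobody produces, which is the kind of mismatch a black-box view would only surface at runtime.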
2. Holistic, Application-Centric Environments
- View: SRE isn’t just about keeping pods running; it’s about enabling safe, rapid exploration and testing of the entire application in realistic, isolated environments.
- How ONDEMANDENV.dev Enables This:
  - On-demand, production-like environments for any branch or feature
  - Easy environment cloning empowers engineers to test, break, and learn without fear
  - Supports full-stack testing, not just infrastructure smoke tests
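One small building block of per-branch environments is a stable, collision-resistant name for each one. The sketch below assumes a naming scheme of our own invention (`environment_name`, the 40-character limit, and the 6-character digest are illustrative choices, not how ONDEMANDENV.dev names environments).

```python
import hashlib
import re


def environment_name(app: str, branch: str, max_len: int = 40) -> str:
    """Derive a stable, DNS-safe environment name from an app and a branch."""
    # Collapse anything that is not a lowercase letter or digit into "-".
    slug = re.sub(r"[^a-z0-9]+", "-", branch.lower()).strip("-")
    # A short digest of the raw branch name keeps similar branches apart.
    digest = hashlib.sha256(branch.encode("utf-8")).hexdigest()[:6]
    base = f"{app}-{slug}"
    # Leave room for "-" plus the digest, then trim any dangling hyphen.
    base = base[: max_len - len(digest) - 1].rstrip("-")
    return f"{base}-{digest}"


name = environment_name("shop", "feature/Login_v2")
```

Because the name is a pure function of the app and branch, repeated pipeline runs for the same branch target the same environment, while different branches stay isolated from each other.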
3. Architecture Guided by DDD (Domain-Driven Design)
- View: Prevent accidental complexity by aligning architecture with business domains, not infrastructure artifacts.
- How ONDEMANDENV.dev Enables This:
  - Contracts and boundaries reflect business domains, not just technical layers
  - Environment modeling is based on DDD principles, so changes are meaningful and manageable
  - Minimizes risk by making dependencies and impacts explicit
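Making impacts explicit can be as simple as walking a declared dependency graph. The following sketch assumes a hypothetical map from each service to the services it depends on and computes everything a change could affect, directly or transitively; the service names are made up.

```python
from collections import deque


def impacted_by(change, depends_on):
    """Return every service that directly or transitively depends on `change`."""
    # Invert the edges: for each service, record who depends on it.
    dependents = {}
    for service, deps in depends_on.items():
        for dep in deps:
            dependents.setdefault(dep, set()).add(service)

    # Breadth-first walk over the inverted graph.
    impacted = set()
    queue = deque([change])
    while queue:
        current = queue.popleft()
        for service in dependents.get(current, ()):
            if service != change and service not in impacted:
                impacted.add(service)
                queue.append(service)
    return impacted


# Illustrative domain graph: service -> services it depends on.
graph = {
    "checkout": {"orders", "payments"},
    "orders": {"inventory"},
    "payments": set(),
    "inventory": set(),
    "reporting": {"orders"},
}

blast_radius = impacted_by("inventory", graph)
```

A change to `inventory` reaches `orders` directly and `checkout` and `reporting` through it, so those are the environments worth exercising before the change ships.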
4. Proactive, Creative, and Human-Centric SRE
- View: SRE’s job is not just to “keep the lights green,” but to empower engineers to innovate, experiment, and fail fast—because reliability only matters if the product is valuable.
- How ONDEMANDENV.dev Enables This:
  - Removes friction from testing and deploying changes
  - Encourages exploration and discovery by making environments cheap, safe, and fast to create and destroy
  - Reduces toil and rigid process, freeing engineers to focus on what matters
Key Differences: Side-by-Side
| Aspect | Traditional SRE (Passive) | Provocative SRE (ONDEMANDENV) |
|---|---|---|
| System View | Black box, infra-centric | White box, full SDLC, business-centric |
| Application Focus | Container/artifact as app | App as holistic domain, contracts & context |
| Data/Schema Evolution | Avoided, seen as risky | Supported, modeled, versioned, testable |
| Monitoring/Response | Metrics, alerts, firefighting | Proactive, exploratory, environment-centric |
| Developer Empowerment | Rigid, process-heavy, reactive | Creative, proactive, fail-fast, discoverable |
| Architecture | Infra-driven, accidental complexity | DDD-driven, minimized complexity |
| Value Proposition | Keep lights green | Enable valuable, reliable products |
Our Principles
1. Ecosystem as White Box, Not Black Box
- We reject the notion that systems are inscrutable black boxes
- We demand full visibility into the entire SDLC: code, contracts, environments, dependencies, and runtime behavior
- We make dependencies, boundaries, and changes explicit, traceable, and understandable
2. Application-Centric, Not Artifact-Centric
- The application is not just a container or artifact; it is a living, evolving system with business context and domain logic
- Our environments, contracts, and tools are modeled around the application’s business value and user experience—not just infrastructure primitives
3. Holistic, Versioned Environments
- Every environment is versioned, reproducible, and isolated
- Engineers can spin up, clone, and destroy environments on demand for any branch, feature, or experiment
- We eliminate configuration drift and ambiguity by codifying contracts and environment state
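One way to make "versioned" and "no drift" checkable is to fingerprint the declared environment state and compare it with what is actually running. This is a sketch under our own assumptions: the spec is a plain dictionary, and `environment_fingerprint` and `drifted_keys` are hypothetical helpers, not part of any ONDEMANDENV.dev tool.

```python
import hashlib
import json


def environment_fingerprint(spec: dict) -> str:
    """Hash a declarative environment spec into a short, stable version id."""
    # Sorted keys and fixed separators make the hash independent of key order.
    canonical = json.dumps(spec, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:12]


def drifted_keys(declared: dict, observed: dict) -> list:
    """List the top-level keys whose values differ between two specs."""
    keys = declared.keys() | observed.keys()
    return sorted(k for k in keys if declared.get(k) != observed.get(k))


# Illustrative specs: someone scaled the running environment by hand.
declared = {"image": "shop:1.4.0", "replicas": 2, "db_schema": "v7"}
observed = {"replicas": 3, "db_schema": "v7", "image": "shop:1.4.0"}

has_drifted = environment_fingerprint(declared) != environment_fingerprint(observed)
changed = drifted_keys(declared, observed)
```

The fingerprint gives each environment state an identity that can be recorded alongside a commit, and the key comparison points at what drifted when two identities disagree.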
4. Proactive Over Reactive
- We do not wait for incidents to learn; we proactively exercise, test, and explore our systems
- Chaos, experimentation, and exploration are encouraged, safe, and frictionless
- We value learning from near-misses and unknown-unknowns, not just postmortems
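Proactive exercise can start very small: inject a fault into a dependency inside an isolated environment and confirm the caller degrades the way you expect. The sketch below is a deterministic toy, with a hypothetical `FlakyDependency` wrapper and retry helper standing in for real chaos tooling.

```python
class FlakyDependency:
    """Wrap a callable and fail its first `failures` calls (an injected fault)."""

    def __init__(self, func, failures):
        self.func = func
        self.failures = failures
        self.calls = 0

    def __call__(self, *args, **kwargs):
        self.calls += 1
        if self.calls <= self.failures:
            raise ConnectionError("injected fault")
        return self.func(*args, **kwargs)


def call_with_retry(func, attempts=3, fallback=None):
    """Call `func` up to `attempts` times; return `fallback` if every try fails."""
    for _ in range(attempts):
        try:
            return func()
        except ConnectionError:
            continue
    return fallback


# Experiment 1: a transient outage shorter than the retry budget.
transient = FlakyDependency(lambda: "price: 42", failures=2)
recovered = call_with_retry(transient, attempts=3, fallback="price unavailable")

# Experiment 2: an outage longer than the retry budget.
outage = FlakyDependency(lambda: "price: 42", failures=5)
degraded = call_with_retry(outage, attempts=3, fallback="price unavailable")
```

Running both experiments before an incident shows where the retry budget ends and the fallback begins, which is exactly the kind of near-miss knowledge a postmortem would otherwise deliver too late.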
5. Domain-Driven Design to Minimize Accidental Complexity
- Architecture is guided by business domains, not infrastructure artifacts
- We use contracts and boundaries to reflect real business logic, minimizing accidental complexity and risk
- Schema and data evolution are first-class, testable, and safe
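Treating schema evolution as testable means rehearsing a migration against a disposable copy of the data before it touches the real thing. Here is a minimal sketch using Python's built-in `sqlite3` module; the `customers` table and the `migrate_split_name` migration are invented for illustration, and a real system would clone a full environment rather than an in-memory database.

```python
import sqlite3


def clone_database(source: sqlite3.Connection) -> sqlite3.Connection:
    """Copy a database into a fresh in-memory connection (an isolated clone)."""
    clone = sqlite3.connect(":memory:")
    source.backup(clone)
    return clone


def migrate_split_name(db: sqlite3.Connection) -> None:
    """Hypothetical migration: derive first_name/last_name from full_name."""
    db.execute("ALTER TABLE customers ADD COLUMN first_name TEXT")
    db.execute("ALTER TABLE customers ADD COLUMN last_name TEXT")
    rows = db.execute("SELECT id, full_name FROM customers").fetchall()
    for row_id, full_name in rows:
        first, _, last = full_name.partition(" ")
        db.execute(
            "UPDATE customers SET first_name = ?, last_name = ? WHERE id = ?",
            (first, last, row_id),
        )
    db.commit()


# Stand-in for the "real" database.
production = sqlite3.connect(":memory:")
production.execute(
    "CREATE TABLE customers (id INTEGER PRIMARY KEY, full_name TEXT NOT NULL)"
)
production.executemany(
    "INSERT INTO customers (full_name) VALUES (?)",
    [("Ada Lovelace",), ("Plato",)],
)
production.commit()

# Rehearse the migration on a clone; the original is left untouched.
sandbox = clone_database(production)
migrate_split_name(sandbox)
migrated = sandbox.execute(
    "SELECT first_name, last_name FROM customers ORDER BY id"
).fetchall()
```

The rehearsal immediately surfaces an edge case: a single-word name such as "Plato" ends up with an empty `last_name`, a decision better made in a sandbox than discovered in production.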
6. Empowering Human Creativity
- Engineers are not process-bound firefighters; they are creative explorers and builders
- We minimize toil and rigid process to free engineers for valuable, innovative work
- Failure is a path to learning, not a cause for blame
7. Reliability in Service of Value
- Uptime and reliability are only meaningful if the product delivers value to users and the business
- We balance reliability with speed, innovation, and user impact
- We measure success by value delivered, not just by green dashboards
Our Commitments
- We will codify all dependencies, contracts, and environment states
- We will make environments cheap, fast, and safe to create, clone, and destroy
- We will align our architecture with business domains, not just technical layers
- We will empower engineers to experiment, fail fast, and learn continuously
- We will treat reliability as a means to an end, not the end itself
- We will challenge the status quo and provoke better ways of working
How ONDEMANDENV.dev Enables Provocative SRE
ONDEMANDENV.dev is built from the ground up to enable this new paradigm of SRE:
- ContractsLib: Codifies dependencies and interfaces, making them explicit and versioned
- Envers: Versioned environments, so every change is traceable and reproducible
- Console: Real-time visualization of dependencies, health, and environment state
- On-demand environments: Any branch, any time, for testing, discovery, and learning
- DDD alignment: Models environments and contracts around business domains, not just infrastructure
Call to Action
If you believe SRE should be more than keeping the lights green—if you want to build systems that are reliable, valuable, and enable human creativity—join us.
Let’s move from passive to provocative. Let’s make reliability a foundation for innovation.
Why This Matters
- Traditional SRE is necessary but insufficient: it keeps systems running, but can stifle innovation and create accidental complexity
- Provocative SRE recognizes that true reliability comes from understanding, exploring, and evolving the system as a whole—including its business logic and domain boundaries
Adopting this model means more than managing infrastructure: it means championing a paradigm shift in SRE. Provocative SRE empowers engineers to be creative, proactive, and business-focused rather than reactive maintainers of infrastructure.
This manifesto is the philosophical foundation of ONDEMANDENV.dev, a platform that addresses distributed-system complexity through Application-Centric Infrastructure and enables microservice agility.