Books
- Release It!: Design and Deploy Production-Ready Software by Michael T. Nygard
  - Covers patterns for building and operating resilient systems
- Effective Feature Management: Releasing and Operating Software in the Age of Continuous Delivery by John Kodumal and Edith Harbaugh
  - Explores feature flagging and experimentation in production
- Site Reliability Engineering: How Google Runs Production Systems, edited by Betsy Beyer, Chris Jones, Jennifer Petoff, and Niall Richard Murphy
  - Provides insights into Google's approach to production testing and reliability
- Chaos Engineering: System Resiliency in Practice by Casey Rosenthal and Nora Jones
  - Discusses techniques for testing system resilience in production
Online Resources
- LaunchDarkly Blog - Production Testing
  - Offers articles and best practices for feature flagging and production testing
- Datadog Learning Center
  - Provides tutorials on monitoring and observability in production environments
- Google Cloud - Canary Deployments
  - Offers a detailed guide on implementing canary testing in production
- AWS Well-Architected Labs - Testing in Production
  - Provides hands-on labs for testing in AWS production environments
- Microsoft Azure - Blue-Green Deployment Pattern
  - Explains the blue-green deployment strategy for production testing
Tools
- LaunchDarkly
  - Feature management platform for controlled rollouts and A/B testing
- Datadog
  - Monitoring and analytics platform for production environments
- Chaos Monkey by Netflix
  - Tool that randomly terminates production instances to verify that services tolerate failures
- Spinnaker
  - Multi-cloud continuous delivery platform supporting canary deployments
- Istio
  - Service mesh that facilitates traffic management for production testing
These resources provide both theoretical knowledge and practical guidance for implementing effective production testing strategies.