The Wayback Machine - https://web.archive.org/web/20210820235402/https://github.com/topics/reliability-engineering
# reliability-engineering
Here are 118 public repositories matching this topic...
A curated list of Site Reliability and Production Engineering resources.
A curated list of Site Reliability and Production Engineering Tools
Serverless chaos monkey for AWS (runs on AWS Lambda) ☁️ 💥
Updated Jul 15, 2020 · JavaScript
Probabilistic Risk Analysis Tool (fault tree analysis, event tree analysis, etc.)
A curated list of awesome Site Reliability and Production Engineering resources.
The Chaos Toolkit core library
Updated Aug 19, 2021 · Python
GOV.UK PaaS - Cloud Foundry
A collection of SRE tools
An opinionated list of attributes and policies that need to be met in order to establish a stable software system.
A Terraform provider for Concourse
Updated Sep 24, 2020 · Ruby
Updated Jul 13, 2021 · Ruby
The k6 documentation website.
Updated Aug 20, 2021 · JavaScript
A collection of templates ported from the SRE Workbook
GSP is a container platform and curated suite of components helping government deploy, run, observe and secure their services
Terraform configuration to manage a Prometheus server running on AWS.
A service broker to provide Aiven Elasticsearch and InfluxDB services to Cloud Foundry users
A Go application for generating billing data from Cloud Foundry events
Administration tool for GOV.UK PaaS
Updated Aug 20, 2021 · TypeScript
Code for the paper "Deep Cox Mixtures for Survival Regression", Machine Learning for Healthcare Conference 2021
Updated Aug 17, 2021 · Python
A Concourse resource for creating and updating Grafana annotations
Bootstrap a VPC with BOSH and Concourse to run PaaS
Updated Aug 19, 2021 · Ruby
🔖 Daily-updated reading list for designing High Scalability 🍒 , High Availability 🔥 , High Stability 🗻 back-end systems - Pull requests are greatly welcome 👬 I hope you will find this project helpful 🍀 Please help me share it to more and more people ❤️ Thank you - 谢谢 - धन्यवाद - ধন্যবাদ - Спасибо - شكرا - Merci - Gracias - Danke - Cảm ơn! 🙇
A Cloud Foundry-compatible route service that enforces an IP safelist
Updated Mar 9, 2021 · Shell
A small, underdocumented Puppet module for hardening Ubuntu systems.
Updated Jul 9, 2020 · Puppet
Technical documentation for GOV.UK PaaS
Updated Aug 19, 2021 · JavaScript
Terraform configuration to manage a Prometheus server running on AWS.
Updated Aug 10, 2021 · HTML