2019-11-20 17:41:35 +01:00
# Awesome DevOps
2019-11-19 15:16:52 +01:00
2023-01-04 00:58:57 +01:00
[![Awesome DevOps ](http://awesome-devops.xyz/assets/banner.png )](https://github.com/wmariuss/awesome-devops)
2023-01-04 00:55:33 +01:00
2019-11-20 17:41:35 +01:00
[![Awesome ](https://awesome.re/badge.svg )](https://awesome.re)
2022-01-21 08:52:18 +01:00
[![Deploy ](https://github.com/wmariuss/awesome-devops/actions/workflows/deploy.yml/badge.svg )](https://github.com/wmariuss/awesome-devops/actions/workflows/deploy.yml)
2023-01-03 15:17:01 +01:00
[![Links validator ](https://github.com/wmariuss/awesome-devops/actions/workflows/links-validator.yml/badge.svg )](https://github.com/wmariuss/awesome-devops/actions/workflows/links-validator.yml)
2019-11-20 17:41:35 +01:00
2023-01-24 21:00:18 +01:00
> A curated list of platforms, tools, practices and resources to create, improve DevOps culture and SRE Team in the organization.
2023-01-24 20:47:44 +01:00
DevOps is the combination of cultural philosophies, practices, and tools that increases an organization’ s ability to deliver applications and services at high velocity: evolving and improving products at a faster pace than organizations using traditional software development and infrastructure management processes. This speed enables organizations to better serve their customers and compete more effectively in the market.
2019-11-20 17:41:35 +01:00
2023-01-04 00:46:56 +01:00
## Contents
- [Cloud Platforms ](#cloud-platforms )
- [Open Source Cloud Platforms ](#open-source-cloud-platforms )
- [Operating Systems ](#operating-systems )
- [Distributed Filesystems ](#distributed-filesystems )
- [Applications Platforms ](#applications-platforms )
- [Container Image Registry ](#container-image-registry )
- [Automation & Orchestration ](#automation--orchestration )
- [Continuous Integration & Delivery ](#continuous-integration--delivery )
- [Source Code Management ](#source-code-management )
- [Web Servers ](#web-servers )
- [SSL ](#ssl )
- [Databases ](#databases )
- [Observability and Monitoring ](#observability--monitoring )
- [Service Discovery & Service Mesh ](#service-discovery--service-mesh )
- [Chaos Engineering ](#chaos-engineering )
- [API Gateway ](#api-gateway )
- [Code review ](#code-review )
- [Distributed messaging ](#distributed-messaging )
- [Programming Languages ](#programming-languages )
- [Chat and ChatOps ](#chat-and-chatops )
- [Secret Management ](#secret-management )
- [Sharing ](#sharing )
- [VPN ](#vpn )
- [Resources ](#resources )
- [Books ](#books )
- [Conferences ](#conferences )
- [DevOps Roadmap ](#devops-roadmap )
2019-11-20 17:41:35 +01:00
---
## Cloud Platforms
*Public and Private Cloud Platforms.*
- [Amazon Web Services (AWS) ](https://aws.amazon.com/ ) - Cloud Computing Services.
- [Google Cloud Platform (GCP) ](https://cloud.google.com/ ) - Cloud Computing Services.
- [Azure ](https://azure.microsoft.com/ ) - Cloud Computing Platform & Services.
2023-01-04 00:46:56 +01:00
- [Alibaba Cloud ](https://us.alibabacloud.com/ ) - Integrated suite of cloud products and services.
- [Oracle Cloud ](https://www.oracle.com/cloud/ ) - Comprehensive and fully integrated stack of cloud applications and platform services.
- [DigitalOcean ](https://www.digitalocean.com/ ) - Helping developers easily build, test, manage, and scale applications of any size.
- [Scaleway ](https://www.scaleway.com/ ) - Single way to create, deploy and scale your infrastructure in the cloud.
- [Vultr ](https://www.vultr.com/ ) - Easily deploy cloud servers, bare metal, and storage worldwide.
- [VMware Cloud ](https://cloud.vmware.com/ ) - Run, manage, connect and protect all of your apps on any cloud.
- [IBM Cloud ](https://www.ibm.com/cloud ) - Tools, data & APIs to make AI real now.
- [Stackpath ](https://www.stackpath.com/ ) - Platform of computing infrastructure and services built at the edge of the cloud.
- [Linode ](https://www.linode.com/ ) - Accelerate innovation in the cloud, virtual computing must be more accessible, affordable, and simple.
- [Kinsta ](https://kinsta.com/application-hosting/ ) - Create and deploy web applications and databases in minutes.
2019-11-20 17:41:35 +01:00
## Open Source Cloud Platforms
*Private, Public and Hybrid open source Cloud Platforms.*
2023-01-04 00:46:56 +01:00
- [Openstack ](https://www.openstack.org/ ) - Open source software for creating private and public clouds.
- [Apache CloudStack ](https://cloudstack.apache.org/ ) - Designed to deploy and manage large networks of virtual machines.
- [OpenNebula ](https://opennebula.org/ ) - Build Private Clouds and manage Data Center virtualization based on KVM, LXD and VMware.
- [Eucalyptus ](https://www.eucalyptus.cloud/ ) - Building AWS-compatible private and hybrid clouds.
- [DC/OS ](https://dcos.io/ ) - Distributed operating system based on the Apache Mesos distributed systems kernel.
- [Apache Mesos ](http://mesos.apache.org/ ) - Program against your datacenter like it’ s a single pool of resources.
- [Localstack ](https://github.com/localstack/localstack ) - Fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline.
2019-11-20 17:41:35 +01:00
## Operating Systems
*Operating Systems - Server Platform.*
2023-01-04 00:46:56 +01:00
- [Ubuntu ](https://ubuntu.com/ ) - Enterprise Open Source and Linux.
- [Rocky Linux ](https://rockylinux.org/ ) - Open-source enterprise operating system designed to be 100% bug-for-bug compatible with Red Hat Enterprise Linux.
- [CoreOS ](http://coreos.com/ ) - The pioneering lightweight container host.
- [OSv ](http://osv.io/ ) - Versatile modular unikernel designed to run unmodified Linux applications securely on micro-VMs in the cloud.
- [Atomic ](http://www.projectatomic.io/ ) - Use immutable infrastructure to deploy and scale your containerized applications.
2019-11-21 14:08:17 +01:00
- [Photon ](https://github.com/vmware/photon ) - Linux container host optimized for cloud-native applications, cloud platforms, and VMware infrastructure.
2019-11-20 17:41:35 +01:00
## Distributed Filesystems
*Network distributed filesystems.*
2023-01-04 00:46:56 +01:00
- [Ceph ](https://ceph.io/en/ ) - Highly scalable object, block and file-based storage under one whole system.
- [Gluster ](https://www.gluster.org/ ) - Free and open source software scalable network filesystem.
- [LINBIT ](https://www.linbit.com/en/ ) - Create, remove, and replicate block storage devices for datacenter scale environments.
- [XtreemFS ](http://www.xtreemfs.org/ ) - Fault-tolerant distributed file system for all storage needs.
- [min.io ](https://min.io/ ) - High performance, distributed object storage system.
2019-11-20 17:41:35 +01:00
## Applications Platforms
2019-11-21 19:38:49 +01:00
*Applications management platforms, Containers platform and Containers management.*
2019-11-20 17:41:35 +01:00
2023-01-04 00:46:56 +01:00
- [Openshift ](https://www.openshift.com/ ) - The Kubernetes platform for big ideas.
- [Dokku ](https://dokku.com/ ) - Helps you build and manage the lifecycle of applications.
- [Flynn ](https://flynn.io/ ) - Open source platform (PaaS) for running applications in production.
- [Docker ](https://www.docker.com/ ) - Create, deploy, and run applications by using containers.
- [Docker Compose ](https://github.com/docker/compose ) - Define and run multi-container applications with Docker.
2019-11-20 17:41:35 +01:00
- [Docker Swarm ](https://github.com/docker/swarm ) - Docker-native clustering system.
2023-01-04 00:46:56 +01:00
- [Kubernetes ](https://kubernetes.io/ ) - Automating deployment, scaling, and management of containerized applications.
- [LXC ](https://linuxcontainers.org/ ) - Lets Linux users easily create and manage system or application containers.
- [Rancher ](https://rancher.com/ ) - Lets you deliver Kubernetes-as-a-Service.
- [OpenVz ](https://openvz.org/ ) - Container-based virtualization for Linux.
- [Singularity ](https://sylabs.io/singularity/ ) - Run the application from the local environment to the cloud.
- [AppScale ](https://github.com/AppScale/appscale ) - Easy-to-manage serverless platform for building and running scalable web and mobile applications.
- [Kata Containers ](https://katacontainers.io/ ) - Building lightweight virtual machines that seamlessly plug into the containers ecosystem.
2020-01-28 23:50:29 +01:00
- [K3S ](https://k3s.io/ ) - The certified Kubernetes distribution built for IoT and Edge computing.
2023-01-04 00:46:56 +01:00
- [Podman ](https://github.com/containers/podman ) - A tool for managing OCI containers and pods.
2023-01-19 10:05:00 +01:00
- [Linx ](https://linx.software ) - General-purpose low-code platform for building and hosting backend solutions.
2019-11-20 17:41:35 +01:00
## Container Image Registry
*Container Image registry.*
2023-01-04 00:46:56 +01:00
- [Quay ](https://www.projectquay.io/ ) - Container image registry that enables you to build, organize, distribute, and deploy containers.
2019-11-20 17:41:35 +01:00
- [Dockyard ](https://github.com/Huawei/dockyard ) - Container & Artifact Repository.
2023-01-04 00:46:56 +01:00
- [Harbor ](https://goharbor.io/ ) - An open source trusted cloud native registry project that stores, signs, and scans content.
2023-03-21 11:12:35 +01:00
- [GitHub Container Registry ](https://github.blog/2020-09-01-introducing-github-container-registry/ ) - Container registry free for public images.
2019-11-20 17:41:35 +01:00
## Automation & Orchestration
*Tools for automation, orchestration, deployment, provisioning and configuration management.*
2023-01-04 00:46:56 +01:00
- [Ansible ](https://www.ansible.com/ ) - Simple IT automation platform that makes your applications and systems easier to deploy.
- [Salt ](https://www.saltstack.com/ ) - Automate the management and configuration of any infrastructure or application at scale.
- [Puppet ](https://puppet.com/ ) - Unparalleled infrastructure automation and delivery.
- [Chef ](https://www.chef.io/ ) - Automate infrastructure and applications.
- [Juju ](https://jaas.ai/ ) - Simplifies how you configure, scale and operate today's complex software.
2019-11-20 17:41:35 +01:00
- [Rundeck ](https://www.rundeck.com/ ) - Runbook Automation For Modernizing Your Operations.
2023-01-04 00:46:56 +01:00
- [StackStorm ](https://stackstorm.com/ ) - Connects all your apps, services, and workflows. Automate DevOps your way.
- [Bosh ](https://www.cloudfoundry.org/bosh/ ) - Release engineering, deployment, and lifecycle management of complex distributed systems.
2019-11-20 17:41:35 +01:00
- [Cloudify ](https://cloudify.co/ ) - Connect, Control, & Automate from core to edge: unlimited locations, clouds and devices.
2023-01-04 00:46:56 +01:00
- [Tsuru ](https://tsuru.io/ ) - An extensible and open source Platform as a Service software.
- [Fabric ](http://www.fabfile.org/ ) - High level Python library designed to execute shell commands remotely over SSH.
2019-11-20 17:41:35 +01:00
- [Capistrano ](https://capistranorb.com/ ) - A remote server automation and deployment tool.
2023-01-04 00:46:56 +01:00
- [Mina ](http://nadarei.co/mina/ ) - Really fast deployer and server automation tool.
2019-11-20 17:41:35 +01:00
- [Terraform ](https://www.terraform.io/ ) - use Infrastructure as Code to provision and manage any cloud, infrastructure, or service.
2023-01-04 00:46:56 +01:00
- [Pulumi ](https://www.pulumi.com/ ) - Modern infrastructure as code platform that allows you to use familiar programming languages and tools to build, deploy, and manage cloud infrastructure.
- [Packer ](https://www.packer.io/ ) - Build Automated Machine Images.
2019-11-20 17:41:35 +01:00
- [Vagrant ](https://www.vagrantup.com/ ) - Development Environments Made Easy.
2023-01-04 00:46:56 +01:00
- [Foreman ](https://theforeman.org/ ) - Complete lifecycle management tool for physical and virtual servers.
- [Nomad ](https://learn.hashicorp.com/nomad ) - Deploy and Manage Any Containerized, Legacy, or Batch Application.
- [Marathon ](https://mesosphere.github.io/marathon/ ) - A production-grade container orchestration platform for DC/OS and Apache Mesos.
- [OctoDNS ](https://github.com/github/octodns ) - Managing DNS across multiple providers. DNS as code.
2020-01-15 18:19:54 +01:00
- [ManageIQ ](https://www.manageiq.org/ ) - Manage containers, virtual machines, networks, and storage from a single platform.
2020-09-25 00:07:25 +02:00
- [Ignite ](https://github.com/weaveworks/ignite ) - Open Source Virtual Machine (VM) manager with a container UX and built-in GitOps management.
2023-03-21 11:12:05 +01:00
- [Selefra ](https://github.com/selefra/selefra ) - An open-source policy-as-code software that provides analytics for multi-cloud and SaaS.
2022-01-28 15:42:21 +01:00
- [Spacelift ](https://spacelift.io/ ) - Flexible orchestration solution for IaC development.
2023-01-02 00:03:06 +01:00
- [Atlantis ](https://www.runatlantis.io/ ) - Terraform Pull Request Automation
- [KubeVela ](https://kubevela.io/ ) - Modern application delivery platform that makes deploying and operating applications across today's hybrid, multi-cloud environments easier, faster and more reliable.
2022-03-28 17:10:39 +02:00
- [Stacktape ](https://stacktape.com ) - Developer-friendly Infrastructure as a Code framework built on top of AWS.
2022-11-18 11:01:57 +01:00
- [Score ](https://score.dev ) - Open Source developer-centric and platform-agnostic workload specification.
2019-11-20 17:41:35 +01:00
## Continuous Integration & Delivery
2023-01-04 00:46:56 +01:00
*Continuous Integration, Continuous Delivery and Continuous Delivery. GitOps.*
2019-11-20 17:41:35 +01:00
- On premises
- [Buildbot ](http://buildbot.net/ ) - automate all aspects of the software development cycle.
- [Gitlab CI ](https://about.gitlab.com/product/continuous-integration/ ) - pipelines build, test, deploy, and monitor your code as part of a single, integrated workflow.
- [Jenkins ](http://jenkins-ci.org/ ) - automation server for building, deploying and automating any project.
- [Drone ](https://github.com/drone/drone ) - a Container-Native, Continuous Delivery Platform.
- [Concourse ](https://concourse-ci.org/ ) - pipeline-based continuous thing-doer.
2019-11-20 22:45:14 +01:00
- [Spinnaker ](https://www.spinnaker.io/ ) - fast, safe, repeatable deployments for every Enterprise.
2019-11-20 17:41:35 +01:00
- [goCD ](https://www.gocd.org/ ) - Delivery and Release Automation server.
- [Teamcity ](https://www.jetbrains.com/teamcity/ ) - enterprise-level CI and CD.
2019-11-20 22:45:14 +01:00
- [Bamboo ](https://www.atlassian.com/software/bamboo ) - tie automated builds, tests, and releases together in a single workflow.
2019-11-20 17:41:35 +01:00
- [Integrity ](http://integrity.github.io/ ) - Continuous Integration server.
- [Zuul ](https://zuul-ci.org/ ) - drives continuous integration, delivery, and deployment systems with a focus on project gating.
2020-09-25 00:07:25 +02:00
- [Argo ](https://argoproj.github.io/ ) - Open Source Kubernetes native workflows, events, CI and CD.
2019-11-20 17:41:35 +01:00
- [Strider ](https://strider-cd.github.io/ ) - Continuous Deployment/Continuous Integration platform.
2020-08-01 17:32:10 +02:00
- [Evergreen ](https://github.com/evergreen-ci/evergreen ) - A Distributed Continuous Integration System from MongoDB.
- [werf ](https://werf.io/ ) - Open Source CI/CD tool for building Docker images & deploying them to Kubernetes using a GitOps approach.
2020-09-25 00:07:25 +02:00
- [Flux ](https://github.com/fluxcd/flux ) - automatically ensures that the state of your Kubernetes cluster matches the configuration you’ ve supplied in Git.
- [Flagger ](https://github.com/weaveworks/flagger ) - progressive delivery Kubernetes operator (Canary, A/B Testing and Blue/Green deployments).
2021-02-03 18:41:49 +01:00
- [Tekton ](https://tekton.dev/ ) - powerful and flexible open-source framework for creating CI/CD systems.
- [PipeCD ](https://pipecd.dev/ ) - Continuous Delivery for Declarative Kubernetes, Serverless and Infrastructure Applications.
2021-11-07 11:37:33 +01:00
- [Gitploy ](https://www.gitploy.io/ ) - Build the deployment system around GitHub in minutes.
2020-08-01 17:32:10 +02:00
- Public Services
2019-11-20 17:41:35 +01:00
- [Travis CI ](https://travis-ci.org/ ) - easily sync your projects, you’ ll be testing your code in minutes.
- [Circle CI ](https://circleci.com/ ) - powerful CI/CD pipelines that keep code moving.
- [Bitrise ](https://www.bitrise.io/ ) - CI/CD for mobile applications.
- [Buildkite ](https://buildkite.com/ ) - run fast, secure, and scalable continuous integration pipelines on your own infrastructure.
2021-12-01 03:36:01 +01:00
- [Cirrus CI ](https://cirrus-ci.org/ ) - continuous integration system built for the era of cloud computing.
2021-02-03 18:41:49 +01:00
- [Codefresh ](https://codefresh.io/ ) - GitOps automation platform for Kubernetes apps.
2021-10-18 23:03:58 +02:00
- [Github actions ](https://github.com/features/actions ) - GitHub Actions makes it easy to automate all your software workflows, now with world-class CI/CD.
2023-01-04 00:46:56 +01:00
- [Kraken CI ](https://kraken.ci/ ) - Modern CI/CD, open-source, on-premise system that is highly scalable and focused on testing.
2022-07-13 23:05:35 +02:00
- [Earthly ](https://earthly.dev/ ) - Develop CI/CD pipelines locally and run them anywhere.
2019-11-20 17:41:35 +01:00
## Source Code Management
2023-01-04 00:46:56 +01:00
*Source Code management, Git-repository manager, Version Control. Some of them are included in Code review section.*
2019-11-20 17:41:35 +01:00
2023-01-24 20:47:44 +01:00
- [GitHub ](https://github.com/ ) - Helps developers store and manage their code, as well as track and control changes to their code.
2023-01-04 00:46:56 +01:00
- [Gitlab ](https://gitlab.com/ ) - Entire DevOps lifecycle in one application.
- [Bitbucket ](https://bitbucket.org/product/ ) - Gives teams one place to plan projects, collaborate on code, test, and deploy
- [Phabricator ](https://github.com/phacility/phabricator/ ) - A collection of web applications which help software companies build better software.
- [Gogs ](https://gogs.io/ ) - A painless self-hosted Git service.
- [Gitea ](https://gitea.io/ ) - A painless self-hosted Git service.
- [Gitblit ](https://github.com/gitblit/gitblit ) - Pure Java Git solution for managing, viewing, and serving Git repositories.
2019-11-20 17:41:35 +01:00
## Web Servers
*Web servers and reverse proxy.*
2023-01-04 00:46:56 +01:00
- [Nginx ](http://nginx.org/ ) - High performance load balancer, web server and reverse proxy.
- [Apache ](http://httpd.apache.org/ ) - Web server and reverse proxy.
- [Caddy ](https://caddyserver.com/ ) - Web server with automatic HTTPS.
- [Cherokee ](http://cherokee-project.com/ ) - Highly concurrent secured web applications.
- [Lighttpd ](http://www.lighttpd.net/ ) - Optimized for speed-critical environments while remaining standards-compliant, secure and flexible.
- [Uwsgi ](https://github.com/unbit/uwsgi/ ) - Application server container.
2019-11-20 17:41:35 +01:00
2019-11-21 14:08:17 +01:00
## SSL
2021-12-01 03:36:01 +01:00
*Tools for automating the management of SSL certificates.*
2019-11-21 14:08:17 +01:00
2023-01-04 00:46:56 +01:00
- [Certbot ](https://github.com/certbot/certbot ) - Automate using Let’ s Encrypt certificates on manually-managed websites to enable HTTPS.
- [Let’ s Encrypt ](https://letsencrypt.org/ ) - Free, automated, and open Certificate Authority.
2019-11-21 14:33:28 +01:00
- [Cert Manager ](https://github.com/jetstack/cert-manager ) - K8S add-on to automate the management and issuance of TLS certificates from various issuing sources.
2019-11-21 14:08:17 +01:00
2019-11-20 17:41:35 +01:00
## Databases
2021-12-01 03:36:01 +01:00
*Relational (SQL) and non-relational (NoSQL) databases.*
2019-11-20 17:41:35 +01:00
2021-12-01 03:36:01 +01:00
- Relational (SQL)
2023-01-04 00:46:56 +01:00
- [PostgreSQL ](https://www.postgresql.org/ ) - Powerful, open source object-relational database system.
- [MySQL ](https://www.mysql.com/ ) - Open-source relational database management system.
- [MariaDB ](https://mariadb.org/ ) - Fast, scalable and robust, with a rich ecosystem of storage engines, plugins and many other tools.
- [SQLite ](https://sqlite.org/ ) - Small, fast, self-contained, high-reliability, full-featured, SQL database engine.
2021-12-01 03:36:01 +01:00
- Non-relational (NoSQL)
2023-01-04 00:46:56 +01:00
- [Cassandra ](http://cassandra.apache.org/ ) - Manage massive amounts of data, fast, without losing sleep.
2023-02-01 11:58:14 +01:00
- [ScyllaDB ](https://www.scylladb.com/ ) - NoSQL data store using the seastar framework, compatible with Apache Cassandra
2023-01-04 00:46:56 +01:00
- [Apache HBase ](http://hbase.apache.org/ ) - Distributed, versioned, non-relational database.
- [Couchdb ](https://couchdb.apache.org/ ) - Database that completely embraces the web.
- [Elasticsearch ](https://www.elastic.co/products/elasticsearch ) - Distributed, RESTful search and analytics engine capable of addressing a growing number of use cases.
- [MongoDB ](https://www.mongodb.com/ ) - General purpose, document-based, distributed database built for modern applications.
- [Rethinkdb ](https://github.com/rethinkdb/rethinkdb ) - Open-source database for the realtime web.
2019-11-20 17:41:35 +01:00
- Key-Value
2023-01-04 00:46:56 +01:00
- [Couchbase ](https://www.couchbase.com/ ) - Distributed multi-model NoSQL document-oriented database that is optimized for interactive applications.
- [Leveldb ](https://github.com/google/leveldb ) - Fast key-value storage library.
- [Redis ](https://redis.io/ ) - In-memory data structure store, used as a database, cache and message broker.
- [RocksDB ](https://rocksdb.org/ ) - A library that provides an embeddable, persistent key-value store for fast storage.
- [Etcd ](https://github.com/etcd-io/etcd ) - Distributed reliable key-value store for the most critical data of a distributed system.
2019-11-20 17:41:35 +01:00
## Observability & Monitoring
*Observability, Monitoring, Metrics/Metrics collection and Alerting tools.*
2023-03-15 22:28:26 +01:00
- [Steampipe ](https://steampipe.io/ ) - The universal SQL interface for any cloud API, & cloud intelligence dashboards extensible w/ HCL+SQL.
2019-11-20 22:45:14 +01:00
- [Sensu ](https://sensu.io/ ) - Simple. Scalable. Multi-cloud monitoring.
2023-01-04 00:46:56 +01:00
- [Alerta ](https://github.com/alerta/alerta ) - Scalable, minimal configuration and visualization monitoring system.
- [Cabot ](https://github.com/arachnys/cabot ) - Self-hosted, easily-deployable monitoring and alerts service.
- [Amon ](https://github.com/amonapp/amon ) - Modern server monitoring platform.
- [Flapjack ](https://flapjack.io/ ) - Monitoring notification routing + event processing system.
- [Icinga ](https://icinga.com/ ) - Monitors availability and performance, gives you simple access to relevant data and raises alerts.
- [Monit ](https://mmonit.com/monit/#home ) - Managing and monitoring Unix systems.
- [Naemon ](http://www.naemon.org/ ) - Fast, stable and innovative while giving you a clear view of the state of your network and applications.
- [Nagios ](https://www.nagios.org/ ) - Computer-software application that monitors systems, networks and infrastructure.
- [Sentry ](https://sentry.io/welcome/ ) - Error monitoring that helps all software teams discover, triage, and prioritize errors in real-time.
- [Shinken ](http://www.shinken-monitoring.org/ ) - Monitoring framework.
- [Zabbix ](https://www.zabbix.com/ ) - Mature and effortless monitoring solution for network monitoring and application monitoring.
- [Glances ](https://github.com/nicolargo/glances ) - Monitoring information through a curses or Web based interface.
- [Healthchecks ](https://github.com/healthchecks/healthchecks ) - Cron monitoring tool.
- [Bolo ](http://bolo.niftylogic.com/ ) - Building distributed, scalable monitoring systems.
- [cAdvisor ](https://github.com/google/cadvisor ) - Analyzes resource usage and performance characteristics of running containers.
- [ElastiFlow ](https://github.com/robcowart/elastiflow ) - Network flow monitoring (Netflow, sFlow and IPFIX) with the Elastic Stack.
- [Co-Pilot ](https://pcp.io/ ) - System performance analysis toolkit.
2023-03-20 15:20:04 +01:00
- [Keep ](https://github.com/keephq/keep ) - Open source alerting CLI for developers.
2023-02-09 14:37:38 +01:00
- [Globalping CLI ](https://github.com/jsdelivr/globalping-cli ) - Run network commands like ping, traceroute and mtr from hundreds of global locations.
2019-11-20 17:41:35 +01:00
- Metrics/Metrics collection
2023-01-04 00:46:56 +01:00
- [Thundra Foresight ](https://www.thundra.io/foresight ) - Visibility into CI pipeline by spotting test failures in no time.
- [Prometheus ](https://prometheus.io/ ) - Power your metrics and alerting with a leading open-source monitoring solution.
- [Collectd ](https://github.com/collectd/collectd ) - The system statistics collection daemon.
- [Facette ](https://github.com/facette/facette ) - Time series data visualization software.
- [Grafana ](https://grafana.com/ ) - Analytics & monitoring solution for every database.
- [Graphite ](https://graphite.readthedocs.io/en/latest/ ) - Store numeric time-series data and render graphs of this data on demand.
- [Influxdata ](https://www.influxdata.com/ ) - Time series database.
- [Netdata ](https://www.netdata.cloud/ ) - Instantly diagnose slowdowns and anomalies in your infrastructure.
- [Freeboard ](https://github.com/Freeboard/freeboard ) - Real-time dashboard builder for IOT and other web mashups.
2019-11-20 17:41:35 +01:00
- Logs Management
2023-01-04 00:46:56 +01:00
- [Anthracite ](https://github.com/Dieterbe/anthracite ) - An event/change logging/management app.
- [Graylog ](https://github.com/Graylog2/graylog2-server ) - Free and open source log management.
- [Logstash ](https://www.elastic.co/products/logstash# ) - Collect, parse, transform logs.
- [Fluentd ](https://www.fluentd.org/ ) - Data collector for unified logging layer.
- [Flume ](https://flume.apache.org/ ) - Distributed, reliable, and available service for efficiently collecting, aggregating, and moving logs.
- [Heka ](https://hekad.readthedocs.io/en/latest/# ) - Stream processing software system.
- [Kibana ](https://www.elastic.co/products/kibana ) - Explore, visualize, discover data.
- [Loki ](https://github.com/grafana/loki ) - Horizontally-scalable, highly-available, multi-tenant log aggregation system inspired by Prometheus.
2019-11-21 14:08:17 +01:00
- Status
2023-01-04 00:46:56 +01:00
- [Cachet ](https://github.com/CachetHQ/Cachet ) - Beautiful and powerful open source status page system.
- [StatusPal ](https://statuspal.io/?utm_source=github.com&utm_medium=referral&utm_campaign=awesome-devops ) - Communicate incidents and maintenance effectively with a beautiful hosted status page.
2019-11-20 17:41:35 +01:00
## Service Discovery & Service Mesh
*Service Discovery, Service Mesh and Failure detection tools.*
2023-01-04 00:46:56 +01:00
- [Consul ](https://www.hashicorp.com/products/consul/ ) - Connect and secure any service.
- [Serf ](https://www.serf.io/ ) - Decentralized cluster membership, failure detection, and orchestration.
- [Doozerd ](https://github.com/ha/doozerd ) - A consistent distributed data store.
- [Zookeeper ](http://zookeeper.apache.org/ ) - Centralized service for configuration, naming, providing distributed synchronization, and more.
- [Etcd ](https://etcd.io/ ) - Distributed, reliable key-value store for the most critical data of a distributed system.
- [Istio ](https://istio.io/ ) - Connect, secure, control, and observe services.
- [Kong ](https://konghq.com/ ) - Deliver performance needed for microservices, service mesh, and cloud native deployments.
- [Linkerd ](https://github.com/linkerd/linkerd2 ) - Service mesh for Kubernetes and beyond.
2019-11-20 17:41:35 +01:00
2020-04-09 08:56:27 +02:00
## Chaos Engineering
*The discipline of experimenting on a distributed system in order to build confidence in the system's capability to withstand turbulent conditions in production.*
2023-01-04 00:46:56 +01:00
- [Chaos Toolkit ](https://github.com/chaostoolkit ) - The Open Source Platform for Chaos Engineering.
- [Chaos Monkey ](https://github.com/Netflix/chaosmonkey ) - A resiliency tool that helps applications tolerate random instance failures.
- [Toxiproxy ](https://github.com/Shopify/toxiproxy ) - Simulate network and system conditions for chaos and resiliency testing.
- [Pumba ](https://github.com/alexei-led/pumba ) - Chaos testing, network emulation and stress testing tool for containers.
2022-10-23 11:56:25 +02:00
- [Chaos Mesh ](https://github.com/chaos-mesh/chaos-mesh ) - A Chaos Engineering Platform for Kubernetes.
- [Litmus ](https://github.com/litmuschaos/litmus ) - Litmus enables teams to identify weaknesses in infrastructures.
2020-04-09 08:56:27 +02:00
2019-11-20 17:41:35 +01:00
## API Gateway
*API Gateway, Service Proxy and Service Management tools.*
2023-01-04 00:46:56 +01:00
- [API Umbrella ](https://github.com/NREL/api-umbrella ) - Proxy that sits in front of your APIs, API management platform.
2019-11-20 23:18:47 +01:00
- [Ambassador ](https://www.getambassador.io/ ) - Kubernetes-Native API Gateway built on the Envoy Proxy.
2023-01-04 00:46:56 +01:00
- [Kong ](https://konghq.com/ ) - Connect all your microservices and APIs with the industry’ s most performant, scalable and flexible API platform.
2019-11-20 23:18:47 +01:00
- [Tyk ](https://tyk.io/ ) - API and service management platform.
- [Cilium ](https://github.com/cilium/cilium ) - API aware networking and security using BPF and XDP.
2023-01-04 00:46:56 +01:00
- [Gloo ](https://github.com/solo-io/gloo ) - Feature-rich, Kubernetes-native ingress controller, and next-generation API gateway.
- [Envoy ](https://www.envoyproxy.io/ ) - Cloud-native high-performance edge/middle/service proxy.
- [Traefik ](https://traefik.io/ ) - Reverse proxy and load balancer for HTTP and TCP-based applications.
2019-11-20 17:41:35 +01:00
## Code review
2023-01-04 00:46:56 +01:00
*Code review. A few of the Source Code Management tools have built-in code review features.*
2019-11-20 17:41:35 +01:00
2023-01-04 00:46:56 +01:00
- [Gerrit ](https://www.gerritcodereview.com/ ) - Web-based team code collaboration tool.
- [Review Board ](https://www.reviewboard.org/ ) - Web-based collaborative code review tool.
2019-11-20 17:41:35 +01:00
## Distributed messaging
*Distributed messaging platforms and Queues software.*
2023-01-04 00:46:56 +01:00
- [Rabbitmq ](https://www.rabbitmq.com/ ) - Message broker.
- [Kafka ](http://kafka.apache.org/ ) - Building real-time data pipelines and streaming apps.
2019-11-20 23:18:47 +01:00
- [Activemq ](http://activemq.apache.org/ ) - Multi-Protocol messaging.
2023-01-04 00:46:56 +01:00
- [Beanstalkd ](https://beanstalkd.github.io/ ) - Simple, fast work queue.
- [NSQ ](https://nsq.io/ ) - Realtime distributed messaging platform.
- [Celery ](http://www.celeryproject.org/ ) - Asynchronous task queue/job queue based on distributed message passing.
- [Faktory ](https://github.com/contribsys/faktory ) - Repository for background jobs within your application.
- [Nats ](https://nats.io/ ) - Simple, secure and high performance open source messaging system.
- [RestMQ ](http://restmq.com/ ) - Message queue which uses HTTP as transport.
- [Dkron ](https://github.com/distribworks/dkron ) - Distributed, fault tolerant job scheduling system.
2022-04-08 00:59:28 +02:00
- [KubeMQ ](https://kubemq.io/ ) - Kubernetes-native messaging platform.
2019-11-20 17:41:35 +01:00
## Programming Languages
*Programming languages.*
2023-01-04 00:46:56 +01:00
- [Python ](https://www.python.org/ ) - Programming language that lets you work quickly and integrate systems more effectively.
- [Ruby ](https://www.ruby-lang.org/ ) - A dynamic, open source programming language with a focus on simplicity and productivity.
- [Go ](https://golang.org/ ) - An open source programming language that makes it easy to build simple, reliable, and efficient software.
2019-11-20 17:41:35 +01:00
## Chat and ChatOps
*Chat and ChatOps.*
2023-01-04 00:46:56 +01:00
- [Rocket ](https://rocket.chat/ ) - Open source team communication.
- [Mattermost ](https://mattermost.com/ ) - Messaging platform that enables secure team collaboration.
- [Zulip ](https://zulipchat.com/ ) - Real-time chat with an email threading model.
- [Riot ](https://about.riot.im/ ) - A universal secure chat app entirely under your control.
2019-11-20 17:41:35 +01:00
- ChatOps:
2023-01-04 00:46:56 +01:00
- [CloudBot ](https://github.com/CloudBotIRC/CloudBot ) - Simple, fast, expandable, open-source Python IRC Bot.
- [Hubot ](https://hubot.github.com/ ) - A customizable life embetterment robot.
2019-11-20 17:41:35 +01:00
## Secret Management
*Security as code, sensitive credentials and secrets need to be managed, security, maintained and rotated using automation.*
2023-01-04 00:46:56 +01:00
- [Sops ](https://github.com/mozilla/sops ) - Simple and flexible tool for managing secrets.
- [Vault ](https://www.hashicorp.com/products/vault/ ) - Manage secrets and protect sensitive data.
- [Keybase ](https://keybase.io/ ) - End-to-end encrypted chat and cloud storage system.
- [Vault Secrets Operator ](https://github.com/ricoberger/vault-secrets-operator ) - Create Kubernetes secrets from Vault for a secure GitOps based workflow.
- [Git Secret ](https://github.com/sobolevn/git-secret ) - A bash-tool to store your private data inside a git repository.
2023-03-29 22:34:49 +02:00
- [Infisical ](https://github.com/Infisical/infisical ) - Open source end-to-end encrypted secrets sync for teams and infrastructure.
2019-11-20 17:41:35 +01:00
## Sharing
*A collection of tools to help with sharing knowledge and telling the story.*
2023-01-04 00:46:56 +01:00
- [Gitbook ](https://github.com/GitbookIO/gitbook ) - Modern documentation format and toolchain using Git and Markdown.
- [Docusaurus ](https://github.com/facebook/docusaurus ) - Easy to maintain open source documentation websites.
- [Docsify ](https://github.com/docsifyjs/docsify/ ) - A magical documentation site generator.
- [MkDocs ](https://github.com/mkdocs/mkdocs/ ) - Project documentation with Markdown.
2019-11-20 17:41:35 +01:00
2019-11-21 08:58:37 +01:00
## VPN
*VPN, routing and firewall.*
2023-01-04 00:46:56 +01:00
- [OpenVPN ](https://openvpn.net/ ) - Flexible VPN solutions to secure your data communications, whether it's for Internet privacy.
- [Pritunl ](https://pritunl.com/ ) - Enterprise Distributed OpenVPN and IPsec Server.
- [VyOS ](https://vyos.io/ ) - Open source network OS that runs on a wide range of hardware, virtual machines, and cloud providers.
- [Algo ](https://github.com/trailofbits/algo ) - Set up a personal VPN in the cloud.
- [Streisand ](https://github.com/StreisandEffect/streisand ) - Sets up a new VPN service nearly automatically.
- [Freelan ](https://github.com/freelan-developers/freelan ) - A peer-to-peer, secure, easy-to-setup, multi-platform, open-source, highly-configurable VPN software.
- [Sshuttle ](https://github.com/sshuttle/sshuttle ) - Transparent proxy server that works as a poor man's VPN.
- [SoftEther ](https://www.softether.org/ ) - An Open-Source Free Cross-platform Multi-protocol VPN Program.
2022-05-12 07:15:12 +02:00
as an academic project from University of Tsukuba, under the Apache License 2.0.
2022-07-21 19:18:48 +02:00
- [Firezone ](https://www.firezone.dev/ ) - Self-hosted VPN server using WireGuard. Supports MFA, SSO, and has easy deployment options.
2019-11-21 08:58:37 +01:00
2019-11-20 17:41:35 +01:00
## Resources
### Books
2019-11-21 13:39:06 +01:00
*Books focused on DevOps, DevSecOps and Site Reliability Engineering.*
2019-11-20 17:41:35 +01:00
2019-11-21 11:37:45 +01:00
- [Effective DevOps: Building a Culture of Collaboration, Affinity, and Tooling at Scale ](http://shop.oreilly.com/product/0636920039846.do )
- [Continuous Delivery: Reliable Software Releases through Build, Test, and Deployment Automation ](https://www.oreilly.com/library/view/continuous-delivery-reliable/9780321670250/ )
2022-04-08 00:59:28 +02:00
- [Hands-On Security in DevOps ](https://www.packtpub.com/networking-and-servers/hands-security-devops )
2023-01-24 20:47:44 +01:00
- [Site Reliability Engineering ](https://sre.google/sre-book/table-of-contents/ )
- [The Site Reliability Workbook ](https://sre.google/workbook/table-of-contents/ )
- [Building Secure & Reliable Systems ](https://static.googleusercontent.com/media/sre.google/ro//static/pdf/building_secure_and_reliable_systems.pdf )
2019-11-21 11:37:45 +01:00
- [Infrastructure as Code: Managing Servers in the Cloud ](http://shop.oreilly.com/product/0636920039297.do )
2021-03-14 14:05:01 +01:00
- [The DevOps Handbook ](https://www.oreilly.com/library/view/the-devops-handbook/9781457191381/ )
2019-11-20 17:41:35 +01:00
### Conferences
- [DevOpsCon ](https://devopscon.io/ )
2019-11-20 22:45:14 +01:00
- [AWS re:Invent ](https://reinvent.awsevents.com/ )
2019-11-20 17:41:35 +01:00
- [DevSecOps ](https://www.devseccon.com/ )
- [ADDO ](https://www.alldaydevops.com/ )
- [DevOpsConnect ](https://www.devopsconnect.com/ )
2019-11-21 15:36:04 +01:00
- [@Scale ](https://atscaleconference.com/ )
2020-06-30 21:45:34 +02:00
- [devopsdays ](https://devopsdays.org/ )
2021-03-14 14:05:01 +01:00
- [DevOps Enterprise Summit ](https://events.itrevolution.com/ )
2019-11-20 17:41:35 +01:00
2023-01-04 00:46:56 +01:00
### DevOps Roadmap
2019-11-21 19:38:49 +01:00
Basic understanding and what you should know to become a *DevOps* Engineer, check the roadmap [here ](https://roadmap.sh/devops ).
2019-11-20 17:41:35 +01:00
## Contributing
2023-01-03 23:09:59 +01:00
Your contributions are always welcome! Please take a look at the [Contribution Guidelines ](https://github.com/wmariuss/awesome-devops/blob/main/docs/contribution.md ).