ODH Logo

Operations

In this section, you can find documentation pertaining to operations within the Operate First initiative.

GitOps and Operate First

In Operate First, operations span cluster operations as well as operations for managed services. We follow the GitOps operating model for OpenShift cluster and service management. Because this model hinges on Git as a single source of truth, we perform all operations for live deployments via GitHub pull-requests. As a result, practically anyone can submit changes to the state of the cluster. Various applications managed by the Operate First team can be updated in a similar manner.

Site Reliability Engineering

Operate First’s mission is to adapt industry best practices around Site Reliability Engineering (SRE) in an open and transparent manner. We consume practices as outlined in places such as the Google SRE book and implement them in our operating framework. Here you will find resources and documentation that describe our implementation.

Toolbox

Toolbox is a Linux utility that provides a containerized environment in which software can be installed and used. We have created an Operate First toolbox that includes tools such as Kustomize, SOPS, KSOPS, and more that can be utilized for workflows. You can find details for installing our toolbox here.