Overview
To monitor the health of your Gloo Mesh Enterprise components, you can leverage pre-built Grafana dashboards.
About
Grafana is an open source interactive data-visualization platform that allows you to see data via charts and graphs that are unified into one dashboard.
Operations dashboard
The operations dashboard lets you monitor the health of your Gloo Mesh Enterprise environment, such as the average translation and reconciliation time for the Gloo management server, or translation errors that occured. Built on top of Grafana and integrated with the Gloo Prometheus server, the dashboard is configured to visualize critical Gloo Mesh Enterprise metrics and alerts for you so that you can quickly see errors and the performance of Gloo Mesh Enterprise components.
With the operations dashboard, you have access to the following key features:
Gloo Mesh Enterprise metrics and alerts: Quickly view critical Gloo Mesh Enterprise metrics and alerts to determine the health of your Gloo Mesh Enterprise environment. The dashboard is organized into different sections that provide an overview of the Gloo management server’s and agent’s status, and the overall Gloo Mesh Enterprise resource consumption. Populated metrics and alerts are retrieved from the built-in Prometheus server.
For an overview of available metrics, see Gloo management server metrics. To find a detailed overview of the alerts that are automatically configured for Gloo Mesh Enterprise components, see Alerts.
Resource consumption: Monitor the CPU and memory usage of Gloo components, such as the Gloo agents and management server pods.
The operations dashboard is not automatically set up when you install Gloo Mesh Enterprise. To access the dashboard, you must follow these steps:
Cilium dashboard
When you add Cilium, Hubble, and eBPF-specific metrics to the Gloo telemetry pipeline, you can import the pre-built Cilium dashboard to Grafana. This dashboard provides the following key features that can help you monitor the health of your Cilium components and the workloads in your cluster:
- Cilium agent and operator resource consumption and latency: Gain insights into the state of the Cilium CNI components, such as the Cilium agent and the Cilium operator. For example, you can monitor the average CPU and memory consumption of the Cilium agent, the number of Cilium API calls, their latency and API return codes.
- Layer 3 BPF metrics: Monitor the system-wide memory consumption of eBPF maps, the number of system calls, their latency, and IP allocation operations.
- Network traffic: Track incoming and outgoing network packets that were successfully processed or dropped for Cilium-managed Kubernetes pods.
- Network policies: Monitor the number of requests that were allowed or prohibited by network policies, and the endpoints that are labeled by policy enforcement status.
- Cilium endpoints: Review the number of endpoints that Cilium monitors and their status.
The Cilium dashboard is not automatically set up when you install Gloo Mesh Enterprise. To access the dashboard, you must follow these steps:
The Cilium dashboard offers various graphs and data that provide a full picture of the health of the Cilium CNI and the network traffic in your cluster. To provide and visualize this data, it is recommended to add all Cilium metrics to the Gloo telemetry pipeline. Note that if you customized the Cilium metrics and enabled only certain Cilium metrics, some graphs in the dashboard might remain empty due to missing data.
OPA dashboard
When you import the pre-built OPA dashboard to Grafana, you gain access to the following key features that can help you monitor the health of your Open Policy Agent components:
- Performance metrics for the OPA engine: Monitor the average CPU and memory consumption of the OPA engine.
- Performance metrics for requests and responses: Track the average duration, size, and other key metrics for the requests and responses to and from the OPA engine.
- Status metrics: Monitor the health of your OPA environment, including whether bundles are successfully loaded. This way, you can better troubleshoot issues with policy enforcement that can arise when the bundles or OPA engine is not healthy.
The OPA dashboard is not automatically set up when you install Gloo Mesh Enterprise. To access the dashboard, you must follow these steps: