Aiven for OpenSearch® metrics available via Prometheus
Monitor and optimize your Aiven for OpenSearch service with metrics available via Prometheus. These metrics help track cluster health, replication status, and overall performance.
Prerequisites
- Enable Prometheus integration.
- Note the Prometheus username and password shown in the Integration endpoints section of the Aiven Console.
Access Prometheus metrics
View in browser
- Open your service's Overview page in the Aiven Console.
- In the Connection information section, click the Prometheus tab.
- Copy the Service URI.
- Paste the Service URI into your browser's address bar.
- When prompted, enter your Prometheus credentials.
- Click Login.
Retrieve via cURL
To retrieve metrics, run the following curl command:
curl --user 'USERNAME:PASSWORD' PROMETHEUS_URL/metrics
Replace USERNAME:PASSWORD with your Prometheus credentials and PROMETHEUS_URL
with the Service URI from the Connection information section.
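The same request can also be made from a script. The following is a minimal Python sketch, not an official Aiven client: the function names `fetch_metrics` and `parse_metrics` are hypothetical, the endpoint is assumed to accept HTTP basic authentication on the `/metrics` path exactly as in the curl command, and the response is assumed to use the standard Prometheus text exposition format. TLS settings are left at the Python defaults, which may need adjusting for your environment.

```python
import base64
import urllib.request


def fetch_metrics(service_uri, username, password, timeout=10):
    """Fetch the raw metrics text from SERVICE_URI/metrics using basic auth.

    service_uri, username, and password are the values from the Aiven Console.
    """
    token = base64.b64encode(f"{username}:{password}".encode()).decode()
    request = urllib.request.Request(
        service_uri.rstrip("/") + "/metrics",
        headers={"Authorization": f"Basic {token}"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")


def parse_metrics(text):
    """Parse Prometheus text format into (name, labels, value) tuples.

    Simplified: labels are kept as the raw string between the braces,
    and any trailing timestamp is ignored.
    """
    samples = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and HELP/TYPE comments
        if "{" in line:
            name, rest = line.split("{", 1)
            labels, _, tail = rest.rpartition("}")
        else:
            name, _, tail = line.partition(" ")
            labels = ""
        fields = tail.split()
        if not fields:
            continue  # malformed line without a value
        samples.append((name.strip(), labels, float(fields[0])))
    return samples


# Usage (placeholders as in the curl command above):
# text = fetch_metrics("PROMETHEUS_URL", "USERNAME", "PASSWORD")
# for name, labels, value in parse_metrics(text):
#     print(name, labels, value)
```

Keeping the fetch and parse steps separate lets you test the parsing logic against saved metrics output without network access.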
Host metrics
Host metrics provide insights into system-level performance, including CPU, memory, disk, and network usage.
CPU utilization
CPU utilization metrics report how CPU time is split across different kinds of work, together with system load averages and uptime.
| Metric | Description |
|---|---|
| cpu_usage_guest | CPU time spent running a virtual CPU for guest operating systems |
| cpu_usage_guest_nice | CPU time spent running low-priority virtual CPUs for guest operating systems, which can be interrupted by higher-priority tasks; measured in hundredths of a second |
| cpu_usage_idle | Time the CPU spends idle |
| cpu_usage_iowait | Time waiting for I/O to complete |
| cpu_usage_irq | Time servicing interrupts |
| cpu_usage_nice | Time running user-niced processes |
| cpu_usage_softirq | Time servicing softirqs |
| cpu_usage_steal | Time spent in other operating systems when running in a virtualized environment |
| cpu_usage_system | Time spent running system processes |
| cpu_usage_user | Time spent running user processes |
| system_load1 | System load average for the last minute |
| system_load15 | System load average for the last 15 minutes |
| system_load5 | System load average for the last 5 minutes |
| system_n_cpus | Number of CPU cores available |
| system_n_users | Number of users logged in |
| system_uptime | Time the system has been up and running |
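Load averages are easier to compare across plans when divided by the number of cores. The following Python sketch illustrates one way to combine `system_load1`, `system_load5`, `system_load15`, and `system_n_cpus`. The function name `load_per_core` is hypothetical, and the input is assumed to be a plain dictionary of metric name to value for a single host, which you would build yourself from the scraped metrics.

```python
def load_per_core(metrics):
    """Return load averages normalized by core count for one host.

    metrics: dict mapping metric names from the table above to float values
    (an assumed input shape, not something the endpoint returns directly).
    """
    cores = metrics.get("system_n_cpus", 0)
    if cores <= 0:
        raise ValueError("system_n_cpus must be a positive number")
    windows = {
        "1m": "system_load1",
        "5m": "system_load5",
        "15m": "system_load15",
    }
    # Divide each load average by the number of available cores.
    return {window: metrics[name] / cores for window, name in windows.items()}


example = {
    "system_load1": 3.0,
    "system_load5": 2.0,
    "system_load15": 1.0,
    "system_n_cpus": 4.0,
}
print(load_per_core(example))  # {'1m': 0.75, '5m': 0.5, '15m': 0.25}
```

As a common rule of thumb rather than an Aiven-defined threshold, a sustained per-core value close to or above 1.0 suggests the host has more runnable work than CPU capacity.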