Prometheus Operator
Healthy | v80.6.0 | SRE Team
Overview
Kubernetes-native deployment and management of Prometheus for automated metric collection.
The operator manages Prometheus instances, ServiceMonitors, and alerting rules through Kubernetes custom resources.
Source Repository: https://github.com/3xzyle/VALINA-AI
Key Features
- Automated Prometheus deployment
- ServiceMonitor management
- AlertManager integration
- Recording rules
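To illustrate the ServiceMonitor feature listed above, here is a minimal sketch of a ServiceMonitor manifest. The resource name `example-app`, the `app: example-app` selector label, and the `web` port name are hypothetical. The `release: valina-prometheus` label is taken from this deployment's labels, on the assumption (not confirmed on this page) that the Prometheus instance selects ServiceMonitors by that label.

```yaml
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: example-app              # hypothetical name
  namespace: monitoring
  labels:
    release: valina-prometheus   # assumption: matches the Prometheus serviceMonitorSelector
spec:
  selector:
    matchLabels:
      app: example-app           # hypothetical label on the target Service
  endpoints:
    - port: web                  # hypothetical named port on the target Service
      path: /metrics
      interval: 30s
```

Once an object like this is applied, the operator would be expected to translate it into a scrape job without any manual edits to the Prometheus configuration file.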
Architecture
Kubernetes operator pattern for declarative Prometheus configuration
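As a rough sketch of what "declarative Prometheus configuration" can look like, the following `Prometheus` custom resource describes a desired instance and leaves reconciliation to the operator. The resource name, the service account name, and the selector labels are assumptions for illustration; this page does not document the actual `Prometheus` object used in the cluster.

```yaml
apiVersion: monitoring.coreos.com/v1
kind: Prometheus
metadata:
  name: example-prometheus                 # hypothetical name
  namespace: monitoring
spec:
  replicas: 1
  serviceAccountName: example-prometheus   # hypothetical; needs RBAC to discover targets
  serviceMonitorSelector:
    matchLabels:
      release: valina-prometheus           # assumption: label used to pick up ServiceMonitors
  ruleSelector:
    matchLabels:
      release: valina-prometheus           # assumption: label used to pick up PrometheusRules
```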
System Overview
%%{init: {
'theme': 'dark',
'themeVariables': {
'primaryColor': '#db41f6',
'primaryBorderColor': '#e066f7',
'primaryTextColor': '#ffffff',
'secondaryColor': '#1a0f2e',
'tertiaryColor': '#0f0a1a',
'lineColor': '#22c55e',
'fontSize': '14px'
}
}}%%
flowchart TB
subgraph EXTERNAL["External"]
CLIENT[/"Clients"/]
INGRESS["Ingress"]
end
subgraph NS["monitoring Namespace"]
direction TB
subgraph SVC["Prometheus Operator"]
DEPLOY["Deployment<br/>1 replica"]
subgraph PODS["Pods"]
POD0["Pod 1"]
end
end
end
CLIENT --> INGRESS --> DEPLOY
Component Architecture
%%{init: {
'theme': 'dark',
'themeVariables': {
'primaryColor': '#db41f6',
'primaryBorderColor': '#e066f7',
'primaryTextColor': '#ffffff',
'secondaryColor': '#1a0f2e',
'tertiaryColor': '#0f0a1a',
'lineColor': '#22c55e',
'fontSize': '14px'
}
}}%%
flowchart TB
subgraph ARCH_CLIENT["Client Layer"]
ARCH_USER[/"User Interaction"/]
end
subgraph ARCH_CORE["Prometheus Operator Core"]
direction TB
subgraph ARCH_FEATURES["Features"]
ARCH_F0["Automated Prometheus deploy..."]
ARCH_F1["ServiceMonitor management"]
ARCH_F2["AlertManager integration"]
ARCH_F3["Recording rules"]
end
subgraph ARCH_PROCESS["Processing"]
ARCH_HANDLER["Request Handler"]
ARCH_LOGIC["Business Logic"]
end
end
ARCH_USER --> ARCH_CORE
ARCH_HANDLER --> ARCH_LOGIC
Container Details
| Property | Value |
|---|---|
| Image | quay.io/prometheus-operator/prometheus-operator:v0.87.1 |
| Replicas | 1 / 1 ready |
Volume Mounts
| Volume | Mount Path |
|---|---|
| tls-secret | /cert |
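The mount above could correspond to a pod template fragment along these lines. This is a sketch only: the container name, the `readOnly` flag, and the Secret name `example-tls-secret` are assumptions, since the page lists only the volume name and mount path.

```yaml
# Fragment of a Deployment pod template (spec.template.spec)
containers:
  - name: prometheus-operator            # hypothetical container name
    image: quay.io/prometheus-operator/prometheus-operator:v0.87.1
    volumeMounts:
      - name: tls-secret
        mountPath: /cert
        readOnly: true                   # assumption: not stated on this page
volumes:
  - name: tls-secret
    secret:
      secretName: example-tls-secret     # hypothetical Secret name
```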
Flows and Processes
Process Flow
%%{init: {
'theme': 'dark',
'themeVariables': {
'primaryColor': '#db41f6',
'primaryBorderColor': '#e066f7',
'primaryTextColor': '#ffffff',
'secondaryColor': '#1a0f2e',
'tertiaryColor': '#0f0a1a',
'lineColor': '#22c55e',
'fontSize': '14px'
}
}}%%
flowchart LR
subgraph FLOW_IN["Input"]
MON_METRICS["Metrics"]
MON_LOGS["Logs"]
MON_TRACES["Traces"]
end
subgraph FLOW_PROC["Processing"]
MON_COLLECT["Collect"]
MON_AGGREGATE["Aggregate"]
MON_ANALYZE["Prometheus Operator"]
MON_ALERT["Alert Rules"]
end
subgraph FLOW_OUT["Output"]
MON_DASHBOARD["Dashboard"]
MON_ALERTS["Alerts"]
MON_REPORTS["Reports"]
end
MON_METRICS --> MON_COLLECT
MON_LOGS --> MON_COLLECT
MON_TRACES --> MON_COLLECT
MON_COLLECT --> MON_AGGREGATE --> MON_ANALYZE
MON_ANALYZE --> MON_DASHBOARD
MON_ANALYZE --> MON_ALERT --> MON_ALERTS
MON_ANALYZE --> MON_REPORTS
Request Lifecycle
%%{init: {
'theme': 'dark',
'themeVariables': {
'primaryColor': '#db41f6',
'primaryBorderColor': '#e066f7',
'primaryTextColor': '#ffffff',
'secondaryColor': '#1a0f2e',
'tertiaryColor': '#0f0a1a',
'lineColor': '#22c55e',
'fontSize': '14px'
}
}}%%
sequenceDiagram
autonumber
participant C as Client
participant I as Ingress
participant S as Prometheus Operator
C->>+I: HTTP Request
I->>+S: Route to Service
Note over S: Health Check passed
S-->>-I: Response
I-->>-C: HTTP Response
Custom Flows
- CRD Watch
- Config Generation
- Prometheus Reload
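A recording rule is one kind of object that would exercise the three flows above. In a typical setup, the operator's CRD watch notices the new `PrometheusRule`, regenerates the rule files, and a reload of Prometheus follows; the exact mechanics in this deployment are not documented here. The rule name, metric name, and `release` label below are hypothetical.

```yaml
apiVersion: monitoring.coreos.com/v1
kind: PrometheusRule
metadata:
  name: example-recording-rules    # hypothetical name
  namespace: monitoring
  labels:
    release: valina-prometheus     # assumption: matches the Prometheus ruleSelector
spec:
  groups:
    - name: example.rules
      rules:
        # Precompute a per-job request rate so dashboards can query it cheaply.
        - record: job:http_requests:rate5m                    # hypothetical rule name
          expr: sum by (job) (rate(http_requests_total[5m]))  # hypothetical metric
```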
Integration and APIs
Exposed Ports
| Port | Service |
|---|---|
| https:443 | valina-prometheus-operator |
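For orientation, a Service exposing that port might look like the sketch below. The Service name, port number, and port name come from the table above. The `targetPort` and the selector labels are assumptions drawn from the Labels section of this page, not confirmed Service settings.

```yaml
apiVersion: v1
kind: Service
metadata:
  name: valina-prometheus-operator
  namespace: monitoring
spec:
  ports:
    - name: https
      port: 443
      targetPort: https                     # assumption: container port is named "https"
  selector:
    app: kube-prometheus-stack-operator     # assumption: taken from the Labels table
    release: valina-prometheus              # assumption: taken from the Labels table
```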
Health Check
/healthz:https
Dependencies
No explicit dependencies documented. Add valina.ai/dependencies annotation.
Security and Compliance
Security Context
| Setting | Value |
|---|---|
| Security Level | Hardened |
| Run as Non-Root | Yes |
| Read-Only Root FS | Yes |
| Service Account | valina-prometheus-operator |
Dropped Capabilities
ALL
Security Best Practices
Hardened Security
This service follows security best practices:
- Runs as non-root user
- Read-only root filesystem
- All capabilities dropped
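The three settings above map onto a container `securityContext` roughly as follows. This is a sketch of the equivalent manifest fragment, assuming all three are set at the container level; the page does not say whether `runAsNonRoot` is set on the pod or the container.

```yaml
# Container-level securityContext fragment (spec.template.spec.containers[].securityContext)
securityContext:
  runAsNonRoot: true             # "Run as Non-Root: Yes"
  readOnlyRootFilesystem: true   # "Read-Only Root FS: Yes"
  capabilities:
    drop:
      - ALL                      # "Dropped Capabilities: ALL"
```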
Monitoring and Operations
Health Probes
Readiness Probe
| Setting | Value |
|---|---|
| Endpoint | /healthz:https |
| Initial Delay | 30s |
| Period | 10s |
| Timeout | 1s |
Liveness Probe
| Setting | Value |
|---|---|
| Endpoint | /healthz:https |
| Initial Delay | 60s |
| Period | 30s |
| Timeout | 1s |
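Expressed as container probe configuration, the two tables above would look roughly like this. The path, named port, delays, periods, and timeouts come from the tables; `scheme: HTTPS` is an assumption inferred from the `https` port name and the mounted TLS secret.

```yaml
# Container probe fragment (spec.template.spec.containers[])
readinessProbe:
  httpGet:
    path: /healthz
    port: https          # named container port
    scheme: HTTPS        # assumption: not stated explicitly on this page
  initialDelaySeconds: 30
  periodSeconds: 10
  timeoutSeconds: 1
livenessProbe:
  httpGet:
    path: /healthz
    port: https
    scheme: HTTPS        # assumption, as above
  initialDelaySeconds: 60
  periodSeconds: 30
  timeoutSeconds: 1
```

A 1s timeout leaves little headroom if the operator is busy reconciling, so it may be worth watching for probe failures under load.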
No Prometheus metrics configured. Add prometheus.io/scrape: "true" annotation.
Operations Commands
# Check deployment status
kubectl get deployment valina-prometheus-operator -n monitoring
# View logs (selector uses the app.kubernetes.io/name value from the Labels table)
kubectl logs -l app.kubernetes.io/name=kube-prometheus-stack-prometheus-operator -n monitoring -f
# Scale replicas
kubectl scale deployment/valina-prometheus-operator -n monitoring --replicas=3
# Restart deployment
kubectl rollout restart deployment/valina-prometheus-operator -n monitoring
AI-Powered Diagnostics
Diagnostics
No issues detected
SLO Dashboard
| SLO | Target | Current | Error Budget | Status |
|---|---|---|---|---|
| Availability | 99.9% | 99.95% | 85% remaining | Meeting |
| Latency P99 | < 200ms | 142ms | 92% remaining | Meeting |
| Error Rate | < 0.1% | 0.02% | 98% remaining | Meeting |
Security Scan
Security Status: Passing
Last scanned: 2026-01-12 10:00 UTC | Scanner: Trivy v0.50.0
RBAC & Permissions
| Resource | Permissions | Scope |
|---|---|---|
| ServiceAccount | valina-prometheus-operator | Namespace |
| ClusterRole | view | Cluster-wide (read) |
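A binding consistent with the table above could be written as the sketch below. The ServiceAccount name, the `view` ClusterRole, and the `monitoring` namespace come from this page; the binding name is hypothetical, and the actual binding objects in the cluster are not shown here.

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: example-operator-view          # hypothetical binding name
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: view                           # built-in read-only role
subjects:
  - kind: ServiceAccount
    name: valina-prometheus-operator
    namespace: monitoring
```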
Pod Security Standards
| Policy | Status |
|---|---|
| Privileged | Denied |
| Run as Root | Blocked |
| Host Network | Blocked |
| Host PID | Blocked |
API Documentation
No OpenAPI Spec
Add API documentation via annotation: valina.ai/openapi-url: "https://..."
Runbooks
- Steps to diagnose and mitigate high CPU consumption
- Troubleshooting guide for latency spikes
- Investigation steps for frequent pod restarts
Configuration Reference
Labels
| Label | Value |
|---|---|
| app | kube-prometheus-stack-operator |
| app.kubernetes.io/component | prometheus-operator |
| app.kubernetes.io/instance | valina-prometheus |
| app.kubernetes.io/managed-by | Helm |
| app.kubernetes.io/name | kube-prometheus-stack-prometheus-operator |
| app.kubernetes.io/part-of | kube-prometheus-stack |
| app.kubernetes.io/version | 80.6.0 |
| chart | kube-prometheus-stack-80.6.0 |
| heritage | Helm |
| release | valina-prometheus |
Environment Variables
| Variable | Value |
|---|---|
| GOGC | 30 |
Annotations
| Annotation | Value |
|---|---|
| valina.ai/architecture | Kubernetes operator pattern for declarative Prometheus confi… |
| valina.ai/features | ServiceMonitor CRDs,AlertManager Config,Recording Rules,Targ… |
| valina.ai/flows | CRD Watch,Config Generation,Prometheus Reload |
| valina.ai/owner | @3xzyle |
| valina.ai/purpose | Prometheus Operator managing monitoring stack configuration … |
| valina.ai/repository | https://github.com/3xzyle/VALINA-AI |
Related Resources
- Service Index: View all documented services
- System Status: Live health dashboard
- Observability: Other services in this category
Auto-Generated Documentation
This documentation was auto-generated from Kubernetes deployment metadata.
Enhance this doc by adding annotations:
annotations:
  valina.ai/description: "Detailed service description"
  valina.ai/purpose: "Service purpose statement"
  valina.ai/features: "Feature 1, Feature 2, Feature 3"
  valina.ai/flows: "Auth Flow, Data Processing Flow"
  valina.ai/dependencies: "redis, postgres, auth-service"
  valina.ai/api-endpoints: "/api/v1/resource, /health"
  valina.ai/economics: "Ties to GRAT treasury via..."
  valina.ai/owner: "Platform Team"
  valina.ai/repository: "https://github.com/org/repo"
  valina.ai/changelog: "v1.2.0 Added feature X, v1.1.0 Initial release"
  valina.ai/roadmap: "SSO integration, Multi-region support"
  valina.ai/notices: "Requires v2.0 by Q2 2026"
Created: 2025-12-24T08:09:21Z | Generated: 2026-01-14T06:44:07.185310180+00:00