K8s Resource Optimizer

Automated Kubernetes resource recommendation tool that analyzes historical Prometheus metrics to generate right-sized CPU and memory recommendations for your deployments.

The Problem

Managing resource allocation for hundreds of Kubernetes pods across multiple namespaces is challenging. Over-provisioning leads to wasted cloud costs, while under-provisioning risks application performance. When faced with analyzing and optimizing around 400 pods based on actual usage patterns, manual analysis becomes impractical and time-consuming.

The Solution

This tool was created to automate the process of analyzing pod resource usage and generating data-driven recommendations. It integrates with Prometheus to analyze historical metrics, calculates optimal resource requests and limits based on statistical analysis (mean for requests, P95 for limits), and provides actionable recommendations through multiple output formats.

How It Works

The tool runs as a Kubernetes CronJob, deployed via Helm chart, that:

Connects to Prometheus to retrieve historical CPU and memory metrics for all running pods
Analyzes usage patterns over a configurable time period (default: 7 days)
Calculates recommendations using statistical analysis with configurable safety buffers
Generates YAML patches ready for deployment updates
Sends reports to Slack with detailed HTML tables and patch files

Key Features

Historical Analysis: Analyzes configurable time periods (default 7 days) of Prometheus metrics
Statistical Recommendations: Uses mean values for requests and P95 percentiles for limits
Multiple Output Formats: Generates both human-readable tables and machine-readable YAML patches
Slack Integration: Automated reports with HTML-formatted tables and downloadable patch files
Safety Buffers: Configurable buffer percentage to add headroom to recommendations

Installation

Prerequisites

Kubernetes cluster
Prometheus with container_cpu_usage_seconds_total and container_memory_working_set_bytes metrics

Deploy via Helm

Clone the repository:

git clone git@github.com:umairedu/kube-right-sizer.git
cd kube-right-sizer

Configure environment variables in helm/resource-right-sizing/production/values.yaml:

resource_sizing:
  plain:
    PROMETHEUS_URL: "http://prometheus-server.default.svc:9090"
    KUBERNETES_USE_IN_CLUSTER_CONFIG: "true"
    TARGET_NAMESPACE: ""  # Empty for all namespaces, or comma-separated list
    EXCLUDED_NAMESPACES: "kube-system"
    HOURS: "168"  # 7 days
    BUFFER_PERCENT: "20"
    OUTPUT_FORMAT: "both"
  
  secrets:
    SLACK_TOKEN: "your-slack-token"
    SLACK_CHANNEL: "your-channel"
    SLACK_VERIFY_SSL: "true"

Deploy the Helm chart:

helm install resource-right-sizing ./helm/resource-right-sizing \
  -f ./helm/resource-right-sizing/production/values.yaml \
  -n <your-namespace>

Local Development

Install dependencies:

pip install -r requirements.txt

Configure environment variables (copy env.sample to .env and update):

cp env.sample .env
# Edit .env with your configuration

Run locally:

python3 main.py

Configuration

All configuration is done via environment variables:

PROMETHEUS_URL: Prometheus server URL
KUBERNETES_USE_IN_CLUSTER_CONFIG: Use in-cluster config (true/false)
TARGET_NAMESPACE: Comma-separated list of namespaces to scan (empty for all)
EXCLUDED_NAMESPACES: Comma-separated list of namespaces to exclude
HOURS: Number of hours of historical data to analyze
BUFFER_PERCENT: Safety buffer percentage to add to recommendations
OUTPUT_FORMAT: Output format (table, yaml, or both)
SLACK_TOKEN: Slack bot token for notifications
SLACK_CHANNEL: Slack channel for notifications
SLACK_VERIFY_SSL: SSL verification for Slack API (true/false)

Example Output

CLI Output

The tool provides color-coded terminal output showing current vs recommended resources:

Slack Notifications

Slack message

HTML report

Project Structure

kube-right-sizer/
├── main.py                 # Main application entry point
├── config.py              # Configuration management (Pydantic)
├── services/
│   ├── kubernetes.py      # Kubernetes API interactions
│   ├── prometheus.py      # Prometheus metrics queries
│   └── slack.py           # Slack notification integration
├── helm/
│   └── resource-right-sizing/
│       ├── Chart.yaml
│       ├── production/
│       │   └── values.yaml
│       └── templates/
│           ├── cron_job.yaml
│           ├── rbac.yaml
│           └── secrets.yaml
├── Dockerfile
├── requirements.txt
└── env.sample

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.github/workflows		.github/workflows
helm/resource-right-sizing		helm/resource-right-sizing
screenshots		screenshots
services		services
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
build_push.sh		build_push.sh
config.py		config.py
env.sample		env.sample
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

K8s Resource Optimizer

The Problem

The Solution

How It Works

Key Features

Installation

Prerequisites

Deploy via Helm

Local Development

Configuration

Example Output

CLI Output

Slack Notifications

Slack message

HTML report

Project Structure

About

Uh oh!

Languages

umairedu/kube-right-sizer

Folders and files

Latest commit

History

Repository files navigation

K8s Resource Optimizer

The Problem

The Solution

How It Works

Key Features

Installation

Prerequisites

Deploy via Helm

Local Development

Configuration

Example Output

CLI Output

Slack Notifications

Slack message

HTML report

Project Structure

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Languages