GitHub - sbahmani/ktop: A powerful htop-like CLI tool for Kubernetes cluster resource monitoring - displays real-time CPU/memory requests, limits, and usage across all nodes with smart memory corruption handling and advanced sorting capabilities

📊 Overview

ktop is a powerful command-line tool for monitoring Kubernetes node resource allocation and usage. It provides a comprehensive view of CPU and memory requests, limits, actual usage, and capacity across all nodes in your cluster, similar to htop but for Kubernetes nodes.

Version: 1.3.0

✨ Features

Real-time Resource Monitoring: View CPU and memory requests, limits, usage, and capacity Smart Memory Corruption Handling: Automatically detects and fixes Kubernetes memory reporting bugs Flexible Sorting: Sort by any column (CPU/Memory requests, limits, usage, percentage, capacity, request percentage) Parallel Processing: Fast data collection with configurable parallel queries Multiple Output Formats: Table (default), CSV, JSON Watch Mode: Auto-refresh display at specified intervals Color-Coded Alerts: Visual indicators for resource usage levels

🟢 Green: 0-59% (Normal)
🟡 Yellow: 60-79% (Warning)
🔴 Red: 80%+ (Critical)

Node Health Status: Display node conditions (Ready, MemoryPressure, DiskPressure, PIDPressure, NetworkUnavailable)

🟢 Green: Ready (healthy node)
🔴 Red: NotReady (unhealthy node)
🟡 Yellow: Pressure conditions detected

Node Filtering: Include or exclude control-plane nodes Resource Totals: Summary row showing cluster-wide resource allocation Request Percentage Tracking: Monitor CPU and memory request utilization as percentage of node capacity Environment Variable Support: Configure defaults via environment variables Version Information: Built-in version tracking and display Enhanced Error Handling: Better error messages and retry logic

📋 Requirements

Kubernetes cluster (v1.19+)
kubectl configured with cluster access
metrics-server installed in the cluster
jq for JSON parsing
bc for calculations
bash 4.0+

🚀 Installation

    # Download the script
    curl -Lo ~/bin/ktop https://raw.githubusercontent.com/sbahmani/ktop/refs/heads/main/ktop.sh
    
    # Make it executable
    chmod +x ~/bin/ktop
    
    # Add to PATH (if not already)
    echo 'export PATH="$HOME/bin:$PATH"' >> ~/.bashrc
    source ~/.bashrc
    
    # Verify installation
    ktop --help

Install metrics-server (if not installed)

kubectl apply -f https://github.com/kubernetes-metrics/metrics-server/releases/latest/download/components.yaml

📖 Usage

# Display all worker nodes with default settings
ktop

# Use 12 parallel queries for faster execution
ktop -P 12

# Include control-plane nodes
ktop --all

# Sort by CPU usage (highest first)
ktop -S cpu-use

# Sort by memory percentage
ktop -S mem-pct

# Sort by CPU request percentage
ktop -S cpu-req-pct

# Sort by memory request percentage
ktop -S mem-req-pct

# Show detailed node conditions (Ready, MemoryPressure, etc.)
ktop --show-conditions

# Sort by node status
ktop -S status

# Watch mode - refresh every 5 seconds
ktop -w 5

# Show version information
ktop --version

# Use environment variables for configuration
export KTOP_PARALLEL=8
export KTOP_FORMAT=json
ktop

Command-Line Options

Option	Description	Example
`-h, --help`	Show help message	`ktop -h`
`-v, --version`	Show version information	`ktop --version`
`-P <num>`	Number of parallel kubectl queries (default: 4)	`ktop -P 8`
`-a, --all`	Include control-plane nodes	`ktop --all`
`-n, --no-color`	Disable color output	`ktop --no-color`
`-s, --no-sum`	Don't show summary totals	`ktop --no-sum`
`-w, --watch <sec>`	Auto-refresh every N seconds	`ktop -w 10`
`-o, --output <fmt>`	Output format: table, csv, json	`ktop -o csv`
`-S, --sort <field>`	Sort by field	`ktop -S cpu-pct`
`-r, --reverse`	Reverse sort order (ascending)	`ktop -S name -r`
`-c, --show-conditions`	Show detailed node conditions	`ktop --show-conditions`

Sort Fields

Field	Description
`name`	Node name (alphabetical)
`cpu-req`	CPU requests (default)
`cpu-lim`	CPU limits
`cpu-use`	CPU actual usage
`cpu-pct`	CPU usage percentage
`cpu-cap`	CPU capacity
`cpu-req-pct`	CPU request percentage of node capacity
`mem-req`	Memory requests
`mem-lim`	Memory limits
`mem-use`	Memory actual usage
`mem-pct`	Memory usage percentage
`mem-cap`	Memory capacity
`mem-req-pct`	Memory request percentage of node capacity
`status`	Node status/conditions

📊 Output Example

WORKER_NODE        STATUS      CPU_REQ  CPU_LIM  CPU_USE  CPU_%  CPU_CAP  CPU_REQ_% | MEM_REQ  MEM_LIM  MEM_USE  MEM_%  MEM_CAP  MEM_REQ_%
========================================================================================================================================
worker07           Ready       78573m   184732m  44905m   40%    111.5    70%       | 252.8Gi  301.6Gi  137.2Gi  27%    494.5Gi  51%
worker09           Ready       77925m   168720m  35093m   31%    111.5    70%       | 146.0Gi  187.8Gi  126.8Gi  25%    494.5Gi  30%
worker08           Ready       77075m   157220m  43008m   38%    111.5    69%       | 141.0Gi  176.0Gi  131.4Gi  26%    494.5Gi  29%
worker-gpu-02      Ready       72161m   208884m  1206m    0%     127.5    57%       | 489.2Gi  1047.4Gi 446.5Gi  44%    993.6Gi  49%
...
========================================================================================================================================
TOTAL (58)         -           3272.9   7225.0   1552.4   -      5018.8   -        | 8117.2Gi 12621.0Gi 7869.1Gi -     22998.8Gi -

With --show-conditions, the STATUS column shows detailed condition information:

WORKER_NODE        STATUS          CPU_REQ  ...
worker01           Ready           78573m   ...
worker02           Ready(Mem,Disk) 77925m   ...  # Node has MemoryPressure and DiskPressure
worker03           NotReady        77075m   ...

Output Columns

WORKER_NODE: Node name
CPU_REQ: CPU requests allocated to pods
CPU_LIM: CPU limits allocated to pods
CPU_USE: Actual CPU usage
CPU_%: CPU usage percentage of node capacity
CPU_CAP: Total CPU capacity (cores)
CPU_REQ_%: CPU request percentage of node capacity
MEM_REQ: Memory requests allocated to pods (Gi)
MEM_LIM: Memory limits allocated to pods (Gi)
MEM_USE: Actual memory usage (Gi)
MEM_%: Memory usage percentage of node capacity
MEM_CAP: Total memory capacity (Gi)
MEM_REQ_%: Memory request percentage of node capacity
STATUS: Node health status
- Ready: Node is healthy and ready to accept pods
- NotReady: Node is not ready (may be cordoned, draining, or unhealthy)
- Ready(Mem): Node is ready but has MemoryPressure
- Ready(Disk): Node is ready but has DiskPressure
- Ready(PID): Node is ready but has PIDPressure
- Ready(Net): Node is ready but NetworkUnavailable condition is True
- Multiple conditions can appear together, e.g., Ready(Mem,Disk)

Export and Analysis

# Export to CSV for spreadsheet analysis
ktop -o csv > node_resources_$(date +%Y%m%d).csv

# Export to JSON for programmatic processing
ktop -o json | jq '.nodes[] | select(.cpu_pct > 80)'

# Create a resource utilization report
ktop --all -o csv | awk -F',' 'NR>1 {print $1","$4","$9}'

🐛 Troubleshooting

Common Issues

"metrics-server is not installed or not working"

# Install metrics-server
kubectl apply -f https://github.com/kubernetes-metrics/metrics-server/releases/latest/download/components.yaml

# Verify metrics-server is running
kubectl get deployment metrics-server -n kube-system

Memory values showing as 0Gi or incorrect
- The script automatically detects and fixes Kubernetes memory reporting bugs
- For nodes showing corrupted values, the script calculates actual memory from pod specifications
Slow performance
- Increase parallel queries: ktop -P 16
- Reduce scope: exclude control-plane nodes (default behavior)
No color in output
- Check terminal support: echo $TERM
- Force color: TERM=xterm-256color ktop

🔧 Configuration

Environment Variables

# Set default parallel queries (1-50)
export KTOP_PARALLEL=12

# Set default output format (table, csv, json)
export KTOP_FORMAT=table

# Include control-plane nodes by default
export KTOP_ALL=true

# Disable color by default
export KTOP_NO_COLOR=true

# Don't show summary totals by default
export KTOP_NO_SUM=false

# Set default watch interval (0 = no watch)
export KTOP_WATCH=0

# Set default sort field
export KTOP_SORT=cpu-req

# Show node conditions by default
export KTOP_SHOW_CONDITIONS=true

# Enable verbose mode (shows configuration info)
export KTOP_VERBOSE=true

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.gitignore		.gitignore
README.md		README.md
ktop.sh		ktop.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

📊 Overview

✨ Features

📋 Requirements

🚀 Installation

Install metrics-server (if not installed)

📖 Usage

Command-Line Options

Sort Fields

📊 Output Example

Output Columns

Export and Analysis

🐛 Troubleshooting

Common Issues

🔧 Configuration

Environment Variables

About

Uh oh!

Releases 1

Packages

Languages

sbahmani/ktop

Folders and files

Latest commit

History

Repository files navigation

📊 Overview

✨ Features

📋 Requirements

🚀 Installation

Install metrics-server (if not installed)

📖 Usage

Command-Line Options

Sort Fields

📊 Output Example

Output Columns

Export and Analysis

🐛 Troubleshooting

Common Issues

🔧 Configuration

Environment Variables

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages