Highly customizable async caching framework for Rust designed for high-performance applications.
Protocol-agnostic async core + first-class HTTP support via hitbox-http. Pluggable backends from in-memory to distributed solutions such as Redis. Built on tower, works with any tokio-based service.
- **Stale Cache**: Serve stale data within a grace period while revalidating in the background
- **Dogpile Prevention**: Configurable concurrency limit with broadcast to prevent redundant upstream calls
- **Pluggable Backends**: Choose Moka, Redis, FeOxDB, or implement the Backend trait for your own storage
- **Composable Backends**: Combine backends into L1/L2/L3 tiers for optimal performance
- **Serialization**: Choose from bincode, rkyv, JSON, or RON formats
- **Compression**: Reduce storage size with zstd or gzip compression
- **Observability**: Track cache status, latency, backend I/O, and offload tasks
- **Predicate Trait**: Protocol-agnostic trait to control what gets cached
- **Extractor Trait**: Protocol-agnostic trait to generate cache keys from any request type
- **HTTP Predicates**: Control caching with rules based on any part of the request or response, including the body
- **HTTP Extractors**: Automatically generate cache keys from request components
- **Framework Integration**: Works with Axum, Reqwest, and any tower-based framework or reqwest-middleware client
- **YAML Configuration**: Define the entire caching setup in a configuration file
Stale cache enables serving outdated data while refreshing in the background. Each cache entry has a TTL (fresh period) and an optional stale window (grace period). During TTL, data is fresh. After TTL but within the stale window, data is stale but servable. Three policies control stale behavior: Return serves stale data immediately without revalidation, Revalidate blocks until fresh data is fetched, and OffloadRevalidate serves stale data immediately while refreshing in the background (Stale-While-Revalidate pattern). The OffloadManager handles background revalidation with task deduplication, configurable timeouts (none, cancel, or warn), and metrics tracking.
| Option | Description | Default |
|---|---|---|
| `max_concurrent_tasks` | Limit parallel background tasks | Unlimited |
| `timeout_policy` | `None`, `Cancel(duration)`, or `Warn(duration)` | `None` |
| `deduplicate` | Prevent duplicate revalidation for the same key | `true` |
Code example

```rust
use std::time::Duration;

// Cap parallel background tasks, bound revalidation time,
// and deduplicate refreshes for the same key
let offload = OffloadConfig::builder()
    .max_concurrent_tasks(10)
    .timeout(Duration::from_secs(30))
    .deduplicate(true)
    .build();
let manager = OffloadManager::new(offload);
```

Stale policy YAML configuration
```yaml
policy:
  Enabled:
    ttl: 60     # fresh for 60 seconds
    stale: 300  # serve stale for an additional 300 seconds
```

```yaml
policy:
  stale: OffloadRevalidate
```

Note: OffloadManager settings are configured via Rust code only.
When a cache entry expires or is missing, multiple simultaneous requests can trigger redundant upstream calls—this is the "dogpile" or "thundering herd" problem.
Hitbox uses a configurable concurrency limit per cache key:
- First N requests (where N = concurrency limit) proceed to upstream
- Additional requests subscribe to a broadcast channel and wait
- When any request completes, it broadcasts the result to all waiters
- Waiters receive the response without calling upstream
With concurrency: 1, only one request fetches from upstream while others wait. But if that single request is slow, all waiting requests become slow too. Setting concurrency: 2 or higher allows parallel fetches—the first to complete broadcasts to all waiters, reducing the impact of slow upstream responses.
Code example

```rust
use std::sync::Arc;

// Create concurrency manager for dogpile prevention
let concurrency_manager = Arc::new(BroadcastConcurrencyManager::<Response>::new());

// Configure policy with a concurrency limit
let policy = PolicyConfig::Enabled(EnabledCacheConfig {
    ttl: Some(60),
    stale: Some(300),
    concurrency: Some(1), // Only one request fetches, others wait
    ..Default::default()
});
```

YAML configuration
```yaml
policy:
  Enabled:
    ttl: 60
    stale: 300
    concurrency: 1  # Only one request fetches, others wait
```

Backends store cached data. Each backend implements the Backend trait with read, write, and remove operations. All backends support configurable serialization format (Bincode, JSON, RON, Rkyv), key format (Bitcode, UrlEncoded), compression (Gzip, Zstd), and custom naming for metrics. Implement the Backend trait to add your own storage.
| Backend | Type | Configuration |
|---|---|---|
| Moka | In-memory | max_capacity |
| Redis | Distributed | server (connection string) |
| FeOxDB | Embedded | path or in_memory() |
Code example

```rust
// Moka (in-memory)
let moka = MokaBackend::builder(10_000)
    .value_format(BincodeFormat)
    .compressor(ZstdCompressor::default())
    .build();

// Redis (distributed)
let redis = RedisBackend::builder()
    .server("redis://127.0.0.1/")
    .value_format(BincodeFormat)
    .build()?;

// FeOxDB (embedded persistent)
let feoxdb = FeOxDbBackend::builder()
    .path("/tmp/cache".into())
    .build()?;
```

YAML configuration
```yaml
# Moka (in-memory)
backend:
  type: Moka
  max_capacity: 10000
  key:
    format: Bitcode
  value:
    format: Bincode
    compression:
      type: Zstd
      level: 3
```

```yaml
# Redis (distributed)
backend:
  type: Redis
  connection_string: "redis://localhost:6379"
  key:
    format: Bitcode
  value:
    format: Bincode
```

```yaml
# FeOxDB (embedded persistent)
backend:
  type: FeOxDb
  path: "/tmp/cache.db"
  key:
    format: UrlEncoded
  value:
    format: Bincode
    compression:
      type: Gzip
      level: 6
```

Compose multiple backends into tiered cache hierarchies, similar to CPU cache levels. Each composition has configurable read, write, and refill policies. Read policies control how layers are queried: Sequential (L1 first, then L2), Race (first hit wins), or Parallel (both queried, prefer fresher). Write policies control how data propagates: Sequential (L1 then L2), OptimisticParallel (both in parallel, succeed if any succeeds), or Race (first success wins). Refill policies control L1 population on L2 hits: Always or Never. Compositions can be nested for L1/L2/L3 hierarchies.
| Policy Type | Options | Default |
|---|---|---|
| Read | Sequential, Race, Parallel | Sequential |
| Write | Sequential, OptimisticParallel, Race | OptimisticParallel |
| Refill | Always, Never | Never |
Code example

```rust
// L1 (Moka) + L2 (Redis) composition
let cache = moka.compose(redis, offload);

// With custom policies
let policy = CompositionPolicy::new()
    .read(RaceReadPolicy::new())
    .write(SequentialWritePolicy::new())
    .refill(RefillPolicy::Always);
let cache = moka.compose_with(redis, offload, policy);
```

YAML configuration
```yaml
# L1 (Moka) + L2 (Redis) composition
backend:
  type: Composition
  l1:
    type: Moka
    max_capacity: 10000
    key:
      format: Bitcode
    value:
      format: Bincode
  l2:
    type: Redis
    connection_string: "redis://localhost:6379"
    key:
      format: Bitcode
    value:
      format: Bincode
  policy:
    read: Sequential
    write: OptimisticParallel
    refill: Never
```

```yaml
# Nested L1/L2/L3 hierarchy
backend:
  type: Composition
  l1:
    type: Moka
    max_capacity: 1000
    key:
      format: Bitcode
    value:
      format: Bincode
  l2:
    type: Composition
    l1:
      type: Moka
      max_capacity: 10000
      key:
        format: Bitcode
      value:
        format: Bincode
    l2:
      type: Redis
      connection_string: "redis://localhost:6379"
      key:
        format: Bitcode
      value:
        format: Bincode
    policy:
      read: Sequential
      write: OptimisticParallel
      refill: Never
  policy:
    read: Race
    write: OptimisticParallel
    refill: Always
```

Hitbox supports multiple serialization formats via the Format trait. Bincode offers the best write throughput and excellent read performance. Rkyv provides zero-copy deserialization for maximum read speed but slower writes. RON balances performance with human-readable output. JSON is slowest but useful for debugging. Choose based on your read/write ratio and debugging needs.
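For instance, switching formats is a one-line builder change per backend. A minimal sketch: `BincodeFormat` appears in the backend examples above, while `JsonFormat` is an assumed name for the JSON format type.

```rust
// Bincode: best write throughput, excellent reads (type shown above)
let fast = MokaBackend::builder(10_000)
    .value_format(BincodeFormat)
    .build();

// JSON: slowest, but cached values stay human-readable for debugging.
// `JsonFormat` is an assumed type name, mirroring `BincodeFormat`.
let debuggable = MokaBackend::builder(10_000)
    .value_format(JsonFormat)
    .build();
```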
Reduce cache storage size with optional compression via the Compressor trait. Zstd provides excellent compression ratio with fast decompression (levels -7 to 22, default 3). Gzip offers wide compatibility (levels 0-9, default 6). Both require feature flags (zstd, gzip). Skip compression for small payloads where overhead exceeds savings.
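The same applies to compression. A sketch under similar assumptions: `ZstdCompressor` appears in the backend examples above, while `GzipCompressor` is an assumed type name; both require the corresponding feature flag.

```rust
// Zstd: strong compression ratio, fast decompression (default level 3)
let zstd_backend = MokaBackend::builder(10_000)
    .value_format(BincodeFormat)
    .compressor(ZstdCompressor::default())
    .build();

// Gzip: wider compatibility (default level 6).
// `GzipCompressor` is an assumed type name.
let gzip_backend = MokaBackend::builder(10_000)
    .value_format(BincodeFormat)
    .compressor(GzipCompressor::default())
    .build();
```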
Track cache performance with built-in metrics (via metrics crate):
- Cache status — hit, miss, and stale counters per backend
- Latency histograms — request duration and upstream call timing
- Backend I/O — reads, writes, bytes transferred, and errors per backend
- Offload tasks — spawned, completed, deduplicated, active, and timed out
Integrates with any metrics-compatible exporter (Prometheus, StatsD, etc.).
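Because the metrics go through the `metrics` crate facade, wiring up an exporter is independent of Hitbox. One possible setup with the `metrics-exporter-prometheus` crate (not a Hitbox API; shown as a sketch):

```rust
use metrics_exporter_prometheus::PrometheusBuilder;

#[tokio::main]
async fn main() {
    // Install a global Prometheus recorder with a scrape endpoint
    // (listens on 0.0.0.0:9000 unless configured otherwise)
    PrometheusBuilder::new()
        .install()
        .expect("failed to install Prometheus recorder");

    // ... build and run the cached service; hit/miss/latency
    // metrics then appear on the scrape endpoint
}
```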
The Predicate trait is a protocol-agnostic abstraction for controlling what gets cached. It defines a single async method that determines whether a given subject (request or response) should be cached.
```rust
pub trait Predicate: Debug {
    type Subject;
    async fn check(&self, subject: Self::Subject) -> PredicateResult<Self::Subject>;
}

pub enum PredicateResult<S> {
    Cacheable(S),
    NonCacheable(S),
}
```

Implement this trait to add caching rules for any protocol—GraphQL, gRPC, PostgreSQL wire protocol, or your own custom protocol. The Subject type represents your request or response type, and check returns whether it should be cached along with the (potentially modified) subject.
Example: Custom Protocol Predicate

```rust
use async_trait::async_trait;
use hitbox_core::{Predicate, PredicateResult};

#[derive(Debug)]
struct GraphQLQueryPredicate;

#[async_trait]
impl Predicate for GraphQLQueryPredicate {
    type Subject = GraphQLRequest;

    async fn check(&self, req: Self::Subject) -> PredicateResult<Self::Subject> {
        // Only cache queries, not mutations or subscriptions
        if req.operation_type() == OperationType::Query {
            PredicateResult::Cacheable(req)
        } else {
            PredicateResult::NonCacheable(req)
        }
    }
}
```

Hitbox provides a complete HTTP implementation in hitbox-http with predicates for method, path, headers, status codes, and body content—use it as a reference when implementing your own protocol support.
The Extractor trait is a protocol-agnostic abstraction for generating cache keys from requests. It defines a single async method that extracts key components from a subject.
```rust
pub trait Extractor: Debug {
    type Subject;
    async fn get(&self, subject: Self::Subject) -> KeyParts<Self::Subject>;
}
```

KeyParts accumulates extracted values that combine into a final cache key. Implement this trait to define how cache keys are built for any protocol. Multiple extractors can be chained together, each contributing parts to the final key.
Example: Custom Protocol Extractor

```rust
use async_trait::async_trait;
use hitbox_core::{Extractor, KeyParts};

#[derive(Debug)]
struct GraphQLOperationExtractor;

#[async_trait]
impl Extractor for GraphQLOperationExtractor {
    type Subject = GraphQLRequest;

    async fn get(&self, req: Self::Subject) -> KeyParts<Self::Subject> {
        // Read the key components before `req` is moved into KeyParts
        let operation = req.operation_name();
        let query_hash = hash(req.query());
        let variables_hash = hash(req.variables());
        KeyParts::new(req)
            .part("operation", operation)
            .part("query_hash", query_hash)
            .part("variables_hash", variables_hash)
    }
}
```

The HTTP implementation in hitbox-http provides extractors for method, path parameters, query strings, headers, and body content—serving as a reference for building protocol-specific extractors.
HTTP Predicates control what gets cached. Request predicates filter incoming requests by method, path, headers, query parameters, and body. Response predicates filter upstream responses by status code, headers, and body. Both support operations like equality, existence, list matching, containment, and regex. Body predicates additionally support size limits and JQ expressions for JSON filtering. Combine predicates with AND (chaining), OR, and NOT logic.
Code example

```rust
NeutralRequestPredicate::new()
    .method(Method::GET)
    .path("/api/authors/{id}".into())
    .header(header::Operation::Eq(
        "x-api-key".parse().unwrap(),
        "secret123".parse().unwrap(),
    ));

NeutralResponsePredicate::new()
    .status_code(StatusCode::OK);
```

YAML configuration
```yaml
request:
  - Method: GET
  - Path: "/api/authors/{id}"
  - Header:
      x-api-key: "secret123"
response:
  - Status: 200
  - Status: Success  # matches any 2xx
```

HTTP Extractors build cache keys from request components. They extract values from method, path parameters (using {param} patterns), headers, query parameters, and body content. Headers and query parameters support exact name matching or prefix-based selection, with optional regex extraction for partial values. Body extraction supports full-body hashing, JQ expressions for JSON (with a custom hash function), and regex with named capture groups. Extracted values can be transformed using hash (SHA256), lowercase, or uppercase. Extractors chain together, combining all extracted parts into a single cache key.
Code example

```rust
NeutralExtractor::new()
    .path("/v1/authors/{author_id}/books/{book_id}")
    .query("page".into())
    .header("Accept-Language".into())
```

YAML configuration
```yaml
extractors:
  - Path: "/v1/authors/{author_id}/books/{book_id}"
  - Query: page
  - Header: Accept-Language
```

A request to /v1/authors/123/books/456?page=1 with Accept-Language: en produces a cache key with author_id, book_id, page, and Accept-Language components.
Hitbox integrates with both server-side frameworks and HTTP clients.
Servers
| Framework | Support |
|---|---|
| Axum | Full support via tower layer |
Since Hitbox is built on tower, it integrates as a standard layer without framework-specific code. Any tower-compatible framework can use Hitbox.
Clients
| Client | Support |
|---|---|
| Reqwest | Full support via reqwest-middleware |
Cache responses from external APIs using hitbox-reqwest. Works with reqwest-middleware for seamless integration.
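A minimal client-side sketch: the reqwest-middleware plumbing below is standard, while `cache_middleware` stands in for the middleware value exported by hitbox-reqwest (its exact constructor is not shown here).

```rust
use reqwest_middleware::ClientBuilder;

// `cache_middleware` is a placeholder for the hitbox-reqwest
// middleware; build it from your backend and policy config
let client = ClientBuilder::new(reqwest::Client::new())
    .with(cache_middleware)
    .build();

// Repeated calls to the same endpoint can now be served from cache
let resp = client.get("https://api.example.com/authors/123").send().await?;
```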
Define your entire caching setup in a configuration file:
```yaml
backend:
  type: Moka
  max_capacity: 10000
  key:
    format: Bitcode
  value:
    format: Bincode
    compression:
      type: Zstd
      level: 3
request:
  - Method: GET
  - Path: "/api/authors/{author_id}"
response:
  - Status: Success
extractors:
  - Method:
  - Path: "/api/authors/{author_id}"
  - Query: page
policy:
  Enabled:
    ttl: 60
    stale: 300
policy:
  stale: OffloadRevalidate
  concurrency: 1
```

Change caching rules at runtime—no recompilation needed.
Every real-world system brings a combination of shared challenges and unique constraints. We tried using existing caching frameworks for our services, but each time they failed to fully match our requirements. As a result, we repeatedly ended up building custom caching mechanisms from scratch instead of relying on ready-made solutions.
Hitbox was created to break this cycle.
We think of Hitbox not as a library, but as a platform for caching, designed from day one to be easily extensible without enforcing a single backend, protocol, or caching strategy. New storage backends, new protocols such as GraphQL or even the PostgreSQL wire protocol, and new serialization or compression strategies are expected use cases, not special exceptions. You simply implement the required traits and go for it. Hitbox is built to be hacked, extended, bent, and reshaped.
A key principle of Hitbox is that every new integration automatically inherits the full set of advanced optimizations we built through real-world experience: dogpile-effect prevention, composable multi-layer caching (L1/L2/L3), offload caching, and more. Instead of re-implementing these mechanisms for every project, they come for free with the platform.
At the same time, Hitbox is not just an abstract foundation. It already provides a production-ready HTTP caching implementation based on tower::Service, covering the most common use case out of the box while also serving as a reference implementation for building additional integrations.
```toml
[dependencies]
hitbox = { version = "0.1", features = ["moka"] }
hitbox-tower = "0.1"
hitbox-http = "0.1"
```

```rust
use axum::{Router, routing::get, extract::Path};
use hitbox_moka::MokaBackend;
use hitbox_tower::Cache;
use hitbox_http::HttpEndpoint;
use hitbox::policy::{PolicyConfig, EnabledCacheConfig};

// Handlers
async fn get_users() -> &'static str { "users list" }
async fn get_user(Path(id): Path<String>) -> String { format!("user {id}") }

#[tokio::main]
async fn main() {
    // Create backend
    let backend = MokaBackend::builder(10_000).build();

    // Users list - long TTL (60s)
    let users_config = HttpEndpoint {
        policy: PolicyConfig::Enabled(EnabledCacheConfig {
            ttl: Some(60),
            stale: Some(30),
            ..Default::default()
        }),
    };

    // Single user - short TTL (10s)
    let user_config = HttpEndpoint {
        policy: PolicyConfig::Enabled(EnabledCacheConfig {
            ttl: Some(10),
            stale: Some(5),
            ..Default::default()
        }),
    };

    // Build cache layers
    let users_cache = Cache::builder()
        .backend(backend.clone())
        .config(users_config)
        .build();
    let user_cache = Cache::builder()
        .backend(backend)
        .config(user_config)
        .build();

    // Router with per-route cache layers
    let app = Router::new()
        .route("/api/users", get(get_users).layer(users_cache))
        .route("/api/users/{id}", get(get_user).layer(user_cache));
}
```

- Predicates - Control what gets cached
- Stale Cache - Configure TTL and background revalidation
- Composable Backends - Add Redis as L2
- Examples - More complete examples