Skip to main content
Version: 2.25.2


Aperture is an open source load management platform designed for classifying, rate limiting, queuing and prioritizing API traffic in cloud applications. Built upon a foundation of observability and a global control plane, it offers a comprehensive suite of load management capabilities. These capabilities enhance the reliability and performance of cloud applications while also optimizing resource utilization.

Aperture can seamlessly integrate with existing control points such as gateways, service meshes, and application middlewares. Moreover, it offers SDKs for developers who need to establish control points around specific features or code sections inside applications. The following diagram depicts the role of Aperture in a cloud application:

Unified Load Management (dark) Unified Load Management (light)

Aperture is available as a managed service, Aperture Cloud, or can be self-hosted within your infrastructure. Visit the Architecture page for more details.


To sign-up to Aperture Cloud, click here.

⚙️ Load management capabilities

  • ⏱️ Global Rate-Limiting: Safeguard APIs and features against excessive usage with Aperture's high-performance, distributed rate limiter. Identify individual users or entities by fine-grained labels. Create precise rate limiters controlling burst-capacity and fill-rate tailored to business-specific labels. Refer to the Rate Limiting guide for more details.
  • 📊 API Quota Management: Maintain compliance with external API quotas with a global token bucket and smart request queuing. This feature regulates requests aimed at external services, ensuring that the usage remains within prescribed rate limits and avoids penalties or additional costs. Refer to the API Quota Management guide for more details.
  • 🛡️ Adaptive Queuing: Enhance resource utilization and safeguard against abrupt service overloads with an intelligent queue at the entry point of services. This queue dynamically adjusts the rate of requests based on live service health, thereby mitigating potential service disruptions and ensuring optimal performance under all load conditions. Refer to the Service Load Management and Database Load Management guides for more details.
  • 🎯 Workload Prioritization: Safeguard crucial user experience pathways and ensure prioritized access to external APIs by strategically prioritizing workloads. With weighted fair queuing, Aperture aligns resource distribution with business value and urgency of requests. Workload prioritization applies to API Quota Management and Adaptive Queuing use cases.

✨ Get started

📖 Learn

The Concepts section provides detailed insights into essential elements of Aperture's system and policies, offering a comprehensive understanding of their key components.

Additional Support

Don't hesitate to engage with us for any queries or clarifications. Our team is here to assist and ensure that your experience with Aperture is smooth and beneficial.

💬 Consult with an expert | 👥 Join our Slack Community | ✉️ Email: