Features
Zero Config Scaling

Auto-Scaling

Your agents handle 10 requests or 10,000 — without you touching a thing.

How It Works

HostAgentes monitors every request in real-time. When traffic spikes, pre-warmed instances activate in under 10 seconds. When traffic subsides, excess instances scale down automatically.

<10s
Scale-up time
0
Configuration needed
0
Requests dropped

Scaling Triggers

Request rate
Exceeds 80% of capacity
Add instances
Response latency (p95)
Exceeds 2x baseline
Add instances
Queue depth
More than 10 waiting
Add instances
CPU utilization
Below 20% for 5 min
Remove instances

By Plan

Starter
  • Basic auto-scaling
  • Up to 50 req/min
  • ~30s scale-up
Pro
  • Full auto-scaling
  • Unlimited requests
  • ~10s scale-up
Scale
  • Priority scaling
  • Dedicated instances
  • ~5s scale-up

Frequently Asked Questions

How fast does auto-scaling respond to traffic spikes?
Pre-warmed instances activate in under 10 seconds on Pro and Scale plans. Starter plan scales in approximately 30 seconds. Zero requests are dropped during scale-up.
Do I need to configure auto-scaling?
No. Auto-scaling is enabled by default on every plan. HostAgentes monitors request rate, latency, and queue depth automatically and adjusts capacity in real-time.
Will I be charged for auto-scaling?
Auto-scaling is included in your plan. Starter has basic scaling (up to 50 req/min), Pro has full scaling with unlimited requests, and Scale offers dedicated instances with priority scaling.
What happens when traffic drops?
Instances automatically scale down when CPU utilization stays below 20% for 5 minutes. You never pay for idle capacity.
Can I set scaling limits?
Yes, on Pro and Scale plans. Set maximum instance counts, budget-based limits, or configure custom scaling rules through the dashboard.

Never worry about traffic spikes again

Auto-scaling included on every plan. Start with a free trial.

See Plans