Features
Zero Config Scaling
Auto-Scaling
Your agents handle 10 requests or 10,000 — without you touching a thing.
How It Works
HostAgentes monitors every request in real-time. When traffic spikes, pre-warmed instances activate in under 10 seconds. When traffic subsides, excess instances scale down automatically.
<10s
Scale-up time
0
Configuration needed
0
Requests dropped
Scaling Triggers
Request rate
Exceeds 80% of capacity
Add instances
Response latency (p95)
Exceeds 2x baseline
Add instances
Queue depth
More than 10 waiting
Add instances
CPU utilization
Below 20% for 5 min
Remove instances
By Plan
Starter
- Basic auto-scaling
- Up to 50 req/min
- ~30s scale-up
Pro
- Full auto-scaling
- Unlimited requests
- ~10s scale-up
Scale
- Priority scaling
- Dedicated instances
- ~5s scale-up
Frequently Asked Questions
How fast does auto-scaling respond to traffic spikes?
Pre-warmed instances activate in under 10 seconds on Pro and Scale plans. Starter plan scales in approximately 30 seconds. Zero requests are dropped during scale-up.
Do I need to configure auto-scaling?
No. Auto-scaling is enabled by default on every plan. HostAgentes monitors request rate, latency, and queue depth automatically and adjusts capacity in real-time.
Will I be charged for auto-scaling?
Auto-scaling is included in your plan. Starter has basic scaling (up to 50 req/min), Pro has full scaling with unlimited requests, and Scale offers dedicated instances with priority scaling.
What happens when traffic drops?
Instances automatically scale down when CPU utilization stays below 20% for 5 minutes. You never pay for idle capacity.
Can I set scaling limits?
Yes, on Pro and Scale plans. Set maximum instance counts, budget-based limits, or configure custom scaling rules through the dashboard.
Never worry about traffic spikes again
Auto-scaling included on every plan. Start with a free trial.
See Plans