- What is a good cost per API request?
- It varies widely by use case. Typical ranges are $0.0001-0.001 for simple CRUD operations, $0.001-0.10 for AI inference endpoints, and $0.01-1.00 for video processing. The real benchmark is whether your revenue per request exceeds your cost per request by a sufficient margin.
- How do I reduce cost per request?
- There are four levers: increase utilization (right-size instances), add caching (fewer origin requests), use reserved or spot instances (lower compute rates), and optimize code efficiency (fewer resources per request). Caching typically gives the highest ROI.
- Should I use serverless or dedicated servers?
- Serverless is usually the more cost-effective choice below roughly 1 million requests per month and for variable or unpredictable traffic. Dedicated servers or containers are more cost-effective at consistently high volume. The crossover point depends on your compute-to-memory ratio and idle time.
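
The arithmetic behind the caching and serverless answers can be sketched in a few lines of Python. This is a simplified model, not a pricing calculator: the function names and all dollar figures are hypothetical, and it assumes a flat per-request serverless price and a fixed monthly server price, ignoring free tiers, memory-based billing, and idle time.

```python
def effective_cost_per_request(origin_cost: float,
                               cache_hit_ratio: float,
                               cache_cost: float = 0.0) -> float:
    """Blended cost per request when a fraction of traffic is served from cache.

    origin_cost and cache_cost are per-request costs in dollars (assumed inputs).
    """
    if not 0.0 <= cache_hit_ratio <= 1.0:
        raise ValueError("cache_hit_ratio must be between 0 and 1")
    return cache_hit_ratio * cache_cost + (1.0 - cache_hit_ratio) * origin_cost


def breakeven_requests_per_month(dedicated_monthly_cost: float,
                                 serverless_cost_per_request: float) -> float:
    """Monthly volume at which fixed-price and pay-per-request costs are equal."""
    if serverless_cost_per_request <= 0:
        raise ValueError("serverless_cost_per_request must be positive")
    return dedicated_monthly_cost / serverless_cost_per_request


def cheaper_option(requests_per_month: float,
                   dedicated_monthly_cost: float,
                   serverless_cost_per_request: float) -> str:
    """Return which option costs less at the given volume under this simple model."""
    serverless_total = requests_per_month * serverless_cost_per_request
    return "serverless" if serverless_total < dedicated_monthly_cost else "dedicated"


if __name__ == "__main__":
    # Hypothetical figures for illustration only.
    blended = effective_cost_per_request(origin_cost=0.001, cache_hit_ratio=0.8)
    print(f"Blended cost per request: ${blended:.6f}")

    breakeven = breakeven_requests_per_month(
        dedicated_monthly_cost=50.0, serverless_cost_per_request=0.00005)
    print(f"Break-even volume: {breakeven:,.0f} requests/month")

    for volume in (100_000, 5_000_000):
        print(volume, cheaper_option(volume, 50.0, 0.00005))
```

Plugging in your own bill gives a first estimate of where the crossover sits. Real pricing adds terms this model leaves out, so treat the result as a starting point rather than a decision.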