Baseten

Inference

Production ML model deployment for custom and open models

Operational

All systems responding normally

Last checked 29/04/2026, 9:01:40 pm

346ms response

Uptime History: 100.00% uptime
2026-04-25 to Today

Uptime

100.00%

Avg Latency

399ms

P95 Latency

501ms

Fastest

283ms

Checks

150

Response Time

Last 60 checks
283ms min, 399ms avg, 570ms max

💰 Pricing

custom

A10G: $0.31/hr. A100: $2.87/hr. Dedicated deployment.
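
The hourly rates above can be turned into a rough budget figure. The sketch below is a minimal estimate that assumes flat per-hour billing for an always-on replica; the function name `estimate_cost` and the `replicas` parameter are illustrative, and real invoices may differ (minimums, autoscaling, or other billing granularity are not covered on this page).

```python
# Hourly rates as listed on this page (USD per GPU-hour).
HOURLY_RATE_USD = {"A10G": 0.31, "A100": 2.87}


def estimate_cost(gpu: str, hours: float, replicas: int = 1) -> float:
    """Estimate cost in USD, rounded to cents.

    Assumes flat per-hour billing; this is an illustrative helper,
    not an official pricing calculator.
    """
    if gpu not in HOURLY_RATE_USD:
        raise ValueError(f"unknown GPU type: {gpu!r}")
    if hours < 0 or replicas < 0:
        raise ValueError("hours and replicas must be non-negative")
    return round(HOURLY_RATE_USD[gpu] * hours * replicas, 2)
```

For example, `estimate_cost("A10G", 24 * 30)` gives the cost of one A10G replica running continuously for a 30-day month under these assumptions.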

⚡ Rate Limits

free
RPM: 10, TPM: 1,000

Free plan: 10 requests per minute (RPM), 1,000 tokens per minute (TPM). Paid plans: up to 1,000 RPM.
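
A client can stay under these limits by throttling itself before sending requests. The following is a minimal sketch of a sliding-window limiter, assuming TPM means tokens per minute and that both limits apply over a rolling 60-second window; how the service actually counts tokens or enforces the window is not stated here. The class name `MinuteBudget` and its methods are hypothetical.

```python
import time
from collections import deque


class MinuteBudget:
    """Client-side sliding-window limiter for requests and tokens per minute.

    Defaults mirror the free-plan figures on this page (10 RPM, 1,000 TPM).
    The rolling 60-second window is an assumption, not documented behaviour.
    """

    def __init__(self, rpm=10, tpm=1000, clock=time.monotonic):
        self.rpm = rpm
        self.tpm = tpm
        self.clock = clock
        self.events = deque()  # (timestamp, tokens) within the last 60 s

    def _prune(self, now):
        # Drop events that have left the 60-second window.
        while self.events and now - self.events[0][0] >= 60.0:
            self.events.popleft()

    def wait_time(self, tokens):
        """Return seconds to wait before a request of `tokens` tokens fits."""
        if tokens > self.tpm:
            raise ValueError("request exceeds the per-minute token budget")
        now = self.clock()
        self._prune(now)
        count = len(self.events)
        used = sum(t for _, t in self.events)
        wait = 0.0
        for ts, t in self.events:
            if count < self.rpm and used + tokens <= self.tpm:
                break
            # The oldest remaining event expires 60 s after it was recorded.
            wait = ts + 60.0 - now
            count -= 1
            used -= t
        return wait

    def record(self, tokens):
        """Record a request that was just sent."""
        self.events.append((self.clock(), tokens))
```

A caller would typically run `time.sleep(budget.wait_time(n))` and then `budget.record(n)` around each request. The `clock` parameter exists so the limiter can be tested without real delays.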

🤖 Models (1)

Model: Custom Deployment
Description: Deploy any model. A10G/A100 GPUs.
Task: llm
Context, Vision, Tools, JSON: not listed

Recent Checks

Showing last 15
Operational | 346ms | 29 Apr, 09:01 pm
Operational | 352ms | 29 Apr, 08:15 pm
Operational | 310ms | 29 Apr, 07:20 pm
Operational | 469ms | 29 Apr, 06:30 pm
Operational | 419ms | 29 Apr, 05:33 pm
Operational | 475ms | 29 Apr, 04:38 pm
Operational | 369ms | 29 Apr, 03:40 pm
Operational | 440ms | 29 Apr, 02:31 pm
Operational | 346ms | 29 Apr, 01:10 pm
Operational | 468ms | 29 Apr, 11:52 am
Operational | 336ms | 29 Apr, 11:07 am
Operational | 407ms | 29 Apr, 10:00 am
Operational | 337ms | 29 Apr, 09:30 am
Operational | 346ms | 29 Apr, 09:05 am
Operational | 454ms | 29 Apr, 08:39 am
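
Summary figures such as min, average, and P95 can be recomputed from raw check timings. The sketch below does so for the 15 recent checks listed above, using the nearest-rank percentile method; the page does not say which percentile method it uses, so that choice is an assumption, and the function names are illustrative. The results will not match the headline statistics, which cover a larger set of checks (150 overall, 60 for the response-time summary).

```python
import math
from statistics import mean

# Response times (ms) of the 15 most recent checks, newest first.
RECENT_MS = [346, 352, 310, 469, 419, 475, 369, 440,
             346, 468, 336, 407, 337, 346, 454]


def percentile_nearest_rank(samples, pct):
    """Return the pct-th percentile using the nearest-rank method."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < pct <= 100:
        raise ValueError("pct must be in (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(pct / 100 * len(ordered))  # 1-based rank
    return ordered[rank - 1]


def summarize(samples):
    """Return min, average, P95, and max latency for a list of timings."""
    return {
        "min": min(samples),
        "avg": mean(samples),
        "p95": percentile_nearest_rank(samples, 95),
        "max": max(samples),
    }
```

Calling `summarize(RECENT_MS)` returns a dictionary with the four figures for this 15-check window. With so few samples, P95 under nearest-rank simply equals the slowest check.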