onnxruntime · cpu inference · 2b params
POST /generate — submit job
GET /status/{job_id} — poll status
GET /result/{job_id} — fetch image
GET /queue — view all jobs
GET /health — health check