Self-Hosted GPU
Influgen can route part of its generation traffic to your own GPU infrastructure. This is useful when you want lower marginal cost, tighter infrastructure control, or regional routing that fits your setup better than a fully cloud-hosted path.
What self-hosting supports today
The built-in self-hosted adapter currently supports:
- text-to-image
- image-to-image
It does not support every advanced workflow. In particular, motion transfer is not part of the built-in self-hosted adapter path today.
Core behavior
The adapter maintains:
- endpoint health state
- request queueing
- concurrency limits
- circuit breaking
- cloud fallback
Important built-in defaults include:
- timeout: 90 seconds
- retry count: 2
- health TTL: 30 seconds
- failure threshold: 3
- circuit reset: 5 minutes
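To make the failure threshold and circuit reset defaults concrete, here is a minimal sketch of a circuit breaker that opens after 3 consecutive failures and allows traffic again after 5 minutes. It illustrates the general pattern only and is not Influgen's actual implementation; the class, constant, and method names are hypothetical, and the real adapter may behave differently (for example, by probing the endpoint before fully closing the circuit).

```python
import time
from dataclasses import dataclass
from typing import Callable, Optional

# Values mirror the documented defaults; the constant names are hypothetical.
FAILURE_THRESHOLD = 3
CIRCUIT_RESET_SECONDS = 5 * 60


@dataclass
class CircuitBreaker:
    """Opens after N consecutive failures; allows traffic again after a reset period."""
    failure_threshold: int = FAILURE_THRESHOLD
    reset_seconds: float = CIRCUIT_RESET_SECONDS
    clock: Callable[[], float] = time.monotonic  # injectable so it can be tested
    consecutive_failures: int = 0
    opened_at: Optional[float] = None

    def allow_request(self) -> bool:
        if self.opened_at is None:
            return True
        if self.clock() - self.opened_at >= self.reset_seconds:
            # Reset period elapsed: close the circuit and try the endpoint again.
            self.opened_at = None
            self.consecutive_failures = 0
            return True
        return False  # circuit is open; the caller would use cloud fallback

    def record_success(self) -> None:
        self.consecutive_failures = 0

    def record_failure(self) -> None:
        self.consecutive_failures += 1
        if self.consecutive_failures >= self.failure_threshold:
            self.opened_at = self.clock()
```

While `allow_request()` returns `False`, a caller would send the request down the cloud path instead of the self-hosted endpoint.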
[screenshot: Self-hosted admin overview showing routing percentage, endpoint health cards, and recent routing logs]
Admin endpoints
Self-hosted routing is administered through:
- GET /api/v1/admin/selfhosted
- PUT /api/v1/admin/selfhosted/routing
- POST /api/v1/admin/selfhosted/refresh
- POST /api/v1/admin/selfhosted/endpoints
- PATCH /api/v1/admin/selfhosted/endpoints/{id}
- DELETE /api/v1/admin/selfhosted/endpoints/{id}
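As a rough illustration of calling these endpoints, the sketch below builds two requests with Python's standard library. The host name, the bearer-token header, and the JSON body shape are assumptions made for the example; this page does not specify how admin requests are authenticated or what the payloads look like. Only the paths and the `enabled` and `routing_percentage` field names come from this page.

```python
import json
import urllib.request
from typing import Optional

BASE_URL = "https://influgen.example.com"  # assumption: placeholder host
ADMIN_TOKEN = "replace-me"                 # assumption: bearer-token auth is not confirmed here


def admin_request(method: str, path: str, body: Optional[dict] = None) -> urllib.request.Request:
    """Build (but do not send) a request against an admin endpoint."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    headers = {"Authorization": f"Bearer {ADMIN_TOKEN}"}
    if data is not None:
        headers["Content-Type"] = "application/json"
    return urllib.request.Request(BASE_URL + path, data=data, headers=headers, method=method)


# Read the current self-hosted state.
overview = admin_request("GET", "/api/v1/admin/selfhosted")

# Enable routing at a low percentage (body shape is an assumption).
set_routing = admin_request(
    "PUT",
    "/api/v1/admin/selfhosted/routing",
    {"enabled": True, "routing_percentage": 5},
)

# To actually send a request: urllib.request.urlopen(set_routing, timeout=30)
```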
Routing controls
The routing config currently lets admins set:
- enabled
- routing_percentage
routing_percentage must stay between 0 and 100.
Use a low percentage first, validate quality and uptime, then scale routing up gradually.
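One way to picture percentage-based routing is a per-request random draw against `routing_percentage`. The sketch below shows that idea together with the 0 to 100 range check. It is an assumption about how such routing could work, not a description of Influgen's internal selection logic, and the function names are hypothetical.

```python
import random


def validate_routing_percentage(value: float) -> float:
    """Reject values outside the documented 0-100 range."""
    if not 0 <= value <= 100:
        raise ValueError(f"routing_percentage must be between 0 and 100, got {value}")
    return value


def route_to_self_hosted(enabled: bool, routing_percentage: float, rng: random.Random) -> bool:
    """Return True if this request should go to the self-hosted path."""
    validate_routing_percentage(routing_percentage)
    if not enabled:
        return False
    # rng.random() is in [0.0, 1.0), so 0 never routes and 100 always routes.
    return rng.random() * 100 < routing_percentage
```

With this approach, a setting of 5 sends roughly one request in twenty to self-hosted endpoints, which keeps the blast radius small while quality and uptime are being validated.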
Endpoint management
Each endpoint can store:
- name
- URL
- API key
- status
- max concurrency
- capabilities
Influgen also tracks runtime telemetry such as:
- health
- last error
- current load
- average latency
- consecutive failures
- circuit-open timing
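The stored fields and runtime telemetry above could be modeled roughly as follows. This is an illustrative data structure only: the field names, the `"active"` status value, the millisecond latency unit, and the least-loaded selection rule are all assumptions, not Influgen's actual schema or scheduling logic.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Endpoint:
    # Stored configuration (names are illustrative).
    name: str
    url: str
    api_key: str
    status: str = "active"            # assumption: the status vocabulary is not documented here
    max_concurrency: int = 1
    capabilities: List[str] = field(default_factory=list)
    # Runtime telemetry.
    healthy: bool = True
    last_error: Optional[str] = None
    current_load: int = 0
    average_latency_ms: float = 0.0
    consecutive_failures: int = 0
    circuit_open_until: Optional[float] = None  # timestamp; None means the circuit is closed


def pick_endpoint(endpoints: List[Endpoint], capability: str, now: float) -> Optional[Endpoint]:
    """Pick the least-loaded usable endpoint, or None if the caller should fall back to cloud."""
    candidates = [
        e for e in endpoints
        if e.status == "active"
        and e.healthy
        and capability in e.capabilities
        and e.current_load < e.max_concurrency
        and (e.circuit_open_until is None or now >= e.circuit_open_until)
    ]
    if not candidates:
        return None
    # Prefer lower relative load, then lower average latency.
    return min(candidates, key=lambda e: (e.current_load / e.max_concurrency, e.average_latency_ms))
```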
Practical rollout advice
- Add one healthy endpoint.
- Turn routing on at a low percentage.
- Watch latency, failures, and quality.
- Keep cloud fallback enabled while confidence builds.
- Expand capacity only after the endpoint stays healthy under load.
When to build a custom adapter instead
If you want self-hosted support for advanced workflows such as motion transfer or more provider-specific tuning, the better path is a custom or community adapter. See Adapter Spec.