OpenAI-compatible inference on owned Macs

For smaller workloads, gpu-io.net can expose a managed API backed by local Apple Silicon. Bare-metal remains the primary product; API is added when the workload benefits from a managed endpoint.

API principles

  • OpenAI-compatible request format where practical.
  • Models listed only after license and commercial-use review.
  • No medical diagnosis, legal advice, or other high-risk production use without separate agreement.
  • Prompts are not logged by default for normal API operation.
curl https://api.gpu-io.net/v1/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"model":"qwen3.5-27b","messages":[{"role":"user","content":"Hello"}]}'