Serverless GPU API
for Python developers
Submit a .py script. We run it on a GPU. You pay per second of actual compute — nothing else. No instance to spin up, no Docker image to build, no idle time charged.
How it works
Write your Python script
Any standard Python script works. Install the packages you need with pip from inside the script itself. No Docker, no container, no environment setup.
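As a rough illustration of installing packages from inside the script, the sketch below checks which imports are missing and calls pip through the running interpreter. The helper names `missing_packages` and `ensure_packages` are hypothetical, and the sketch assumes the job environment allows pip to reach a package index.

```python
import importlib.util
import subprocess
import sys


def missing_packages(required):
    """Return the top-level import names from `required` that cannot be imported here."""
    return [name for name in required if importlib.util.find_spec(name) is None]


def ensure_packages(required):
    """Install missing packages with pip, using the interpreter that runs this script.

    Assumption: each pip package name matches its import name. That is not
    always true (for example, Pillow is imported as PIL), so adjust as needed.
    """
    missing = missing_packages(required)
    if missing:
        subprocess.check_call([sys.executable, "-m", "pip", "install", *missing])
    return missing
```

Calling `ensure_packages(["torch", "transformers"])` at the top of a script, before the real imports, would install only what is not already present.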
Submit via API or SDK
POST your script to our REST API or use the Python SDK. The job queues instantly and starts on the next available GPU.
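A submission over REST might look roughly like the sketch below, which uses only the standard library. The endpoint URL, the bearer-token header, and the `script` payload field are all placeholders, since this page does not document the actual API; check the API reference for the real names.

```python
import json
import urllib.request

# Hypothetical endpoint: the real URL, auth scheme, and payload fields are not given on this page.
API_URL = "https://api.example.com/v1/jobs"


def build_submit_request(script_source, api_key, url=API_URL):
    """Build (but do not send) a POST request carrying the script as JSON."""
    payload = json.dumps({"script": script_source}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
    )


def submit(script_source, api_key, url=API_URL):
    """Send the request and return the decoded JSON response body."""
    request = build_submit_request(script_source, api_key, url)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Keeping request construction separate from sending makes the payload easy to inspect or test without touching the network.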
Stream logs in real time
WebSocket log streaming lets you watch stdout and stderr as the job runs, or you can poll the status endpoint instead.
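For the polling route, a minimal loop could look like the sketch below. The status strings (`"succeeded"`, `"failed"`) and the `fetch_status` callable are assumptions made for illustration; the real status endpoint and its values are not described here.

```python
import time

# Assumed terminal status names; the real API may use different ones.
TERMINAL_STATES = {"succeeded", "failed"}


def wait_for_job(fetch_status, interval=2.0, timeout=600.0,
                 sleep=time.sleep, clock=time.monotonic):
    """Call `fetch_status()` until it returns a terminal state or the timeout passes.

    `fetch_status` is any zero-argument callable returning the current status
    string, for example a function that GETs the status endpoint.
    """
    deadline = clock() + timeout
    while True:
        status = fetch_status()
        if status in TERMINAL_STATES:
            return status
        if clock() >= deadline:
            raise TimeoutError(f"job still {status!r} after {timeout} seconds")
        sleep(interval)
```

Passing `sleep` and `clock` as parameters is a design choice that lets the loop be exercised in tests without real waiting.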
vs Traditional GPU Cloud
| Feature | GhostNexus (serverless) | Traditional GPU cloud |
|---|---|---|
| Instance management | None — fully managed | You manage pods/instances |
| Billing | Per second, zero idle cost | Per hour (billed even when idle) |
| Setup time | < 1 minute | 5–20 min (Docker, config) |
| What you submit | A single .py file | A container image you build |
| Log streaming | Real-time WebSocket | Varies by provider |
| CI/CD ready | GitHub Action built-in | Manual scripting |
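To make the billing row concrete, the sketch below compares the two models for a short job. The hourly rate is an invented figure used purely for arithmetic, not a quoted price, and the hourly model assumes the instance is billed for every started hour.

```python
import math


def per_second_cost(job_seconds, hourly_rate):
    """Cost when only the seconds of actual compute are billed."""
    return round(job_seconds * hourly_rate / 3600, 4)


def per_hour_cost(job_seconds, hourly_rate):
    """Cost when every started hour is billed in full (assumed hourly model)."""
    return round(math.ceil(job_seconds / 3600) * hourly_rate, 4)


# Hypothetical rate of 1.80 per GPU-hour, and a job that runs for 7 minutes.
RATE = 1.80
JOB_SECONDS = 7 * 60

print(per_second_cost(JOB_SECONDS, RATE))  # 420 s * 1.80 / 3600
print(per_hour_cost(JOB_SECONDS, RATE))    # one full hour at 1.80
```

Under these assumptions the gap is largest for short jobs and shrinks as the job length approaches a whole number of hours.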
Use cases
Model fine-tuning
Fine-tune LLaMA, Mistral, or any HuggingFace model. Submit the training script and get the saved checkpoint back.
Stable Diffusion batch
Generate hundreds of images in a single batch job. No per-image API cost — pay only for GPU time.
Batch inference
Run predictions on a dataset. Submit once, get all results. Ideal for nightly pipelines or data enrichment.
Research experiments
Run ablation studies or hyperparameter sweeps from CI/CD. Each experiment is a separate GPU job.
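One way to turn a hyperparameter sweep into separate jobs is to expand a grid into one configuration per job, as sketched below. The `submit_job` callable is a stand-in for whatever actually submits a job (API call or SDK method); its name and signature are hypothetical.

```python
import itertools


def expand_grid(grid):
    """Yield one dict per combination of values in `grid`.

    `grid` maps a hyperparameter name to the list of values to try.
    Keys are sorted so the order of combinations is deterministic.
    """
    keys = sorted(grid)
    for values in itertools.product(*(grid[key] for key in keys)):
        yield dict(zip(keys, values))


def submit_sweep(grid, submit_job):
    """Submit one job per configuration and collect whatever `submit_job` returns."""
    return [submit_job(config) for config in expand_grid(grid)]
```

For example, a grid with two learning rates and two batch sizes expands to four configurations, so `submit_sweep` would call `submit_job` four times.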
GDPR-sensitive workloads
Jobs run on EU-hosted infrastructure, and each job is isolated in its own Docker container. A signed DPA is available. Safe for medical, financial, or biometric data.
Real-time inference API
Trigger GPU jobs from your backend via REST. Results are available via webhook or polling within seconds of completion.
Built for CI/CD pipelines
The official GitHub Action lets you trigger GPU jobs directly from your workflow. Run a training job on every push to main, or trigger batch inference on a schedule.
Simple per-second pricing
No subscription. No minimum. No idle billing.
Run your first GPU job in 2 minutes
Sign up, get $5 in free credits, and submit your Python script. No Docker, no infra setup.