AI for Researchers Without the Cluster Queue: On-Demand GPUs

If you're a grad student, a researcher, or part of a lab in Brazil, you know the scene well: you finish setting up your experiment, submit the job to the university cluster, and... you wait. Hours. Sometimes days. The SLURM queue is full, someone reserved 8 GPUs for a week, and your few-hour training run is stuck behind all of it. When the GPU finally frees up, you discover the node's CUDA version doesn't match your library.

⚡ TL;DR

Shared HPC clusters are great for big, scheduled jobs but terrible for fast iteration. With dedicated on-demand GPUs on GPUBrazil, there's no queue, you spin up in seconds, you pick your own environment (JupyterLab/PyTorch in one click), you pay per hour, and you keep full control of your data under the LGPD. Turn it on, use it, turn it off.

Why the university cluster stalls your research

The institutional HPC cluster is a valuable resource, but it was designed for a world of batch jobs: you describe what you need, join the queue, and the scheduler fits you in when there's room. For a week-long simulation, that makes sense. For the reality of doing AI today — test an idea, tweak a hyperparameter, run again — it's enormous friction. The main problems:

Unpredictable queue: your 2-hour experiment might wait 2 days to start, breaking your iteration rhythm.
Rigid stack: driver, CUDA, Python, and library versions are set by the admin. Updating or using something different is a formal request — when it's possible at all.
Shared tenancy: in many cases your data sits on file systems shared with other groups, which is a concern when the dataset is clinical, legal, or personal.
No control over when: you don't decide when it runs; the scheduler does.

The practical result is that good research gets slow not because of the science, but because of the infrastructure.

The alternative: dedicated on-demand GPUs

GPUBrazil's pitch is simple: instead of competing for a shared resource, you rent a dedicated NVIDIA GPU by the hour, spin it up in seconds, and have the whole machine to yourself. No queue, no scheduler, no asking permission. When you're done, you shut it down and stop paying.

	HPC cluster (SLURM)	On-demand GPU
Time to start	Hours to days (queue)	Seconds
Environment	Fixed by admin	You choose
Data access	Shared	Dedicated instance
Cost	Institutional quota / contention	Per hour in reais (R$)
Teardown	Subject to time limits	Shut down whenever

Your environment, your way — in one click

Instead of filing a ticket with IT to request a specific PyTorch version, you launch a JupyterLab with GPU and CUDA template in one click and land in a working notebook. From there, install whatever you like:

# In the terminal of your GPUBrazil instance
nvidia-smi                       # confirm the GPU and available CUDA
pip install torch transformers datasets accelerate

python -c "import torch; print(torch.cuda.is_available())"
# True — ready to train

Because the environment is yours, you can pin exact versions in a requirements.txt or container image and get reproducible experiments: the same environment, the same way, every time — something hard to guarantee when you share a node with dozens of other researchers.

Cost control within a research budget

Research funding is finite and needs accounting. The per-hour model in reais helps with exactly that: you know, before you start, what you'll spend. An entry-level GPU like the RTX A4000 from R$1.80/h handles plenty of prototyping and light fine-tuning; for bigger experiments, you scale up to a more powerful GPU only during the run and drop back afterward. (Check current pricing in the console.)

A few practices that fit an academic budget well:

Prototype small: develop and debug on a cheap GPU; only move to the big one once the code is correct.
Shut down when done: no running job, no cost. No "left it on" surprises like a standing reservation.
Pay with Pix: no international-card hassle or FX swings — handy for institutional reimbursement.
Scale per experiment: run hyperparameter sweeps in parallel by spinning up several instances and tearing them all down at the end.

You keep control of sensitive data (LGPD)

For anyone working with medical records, clinical notes, legal texts, or participants' personal data, how the data is processed is not a detail — it's an ethical and legal requirement. Each GPUBrazil instance is single-tenant and dedicated to you, so your datasets stay on your instance and are never sent to a third-party API. That eases LGPD governance, ethics-board approval, and adherence to consent terms, and it keeps you in full control of what is processed and logged. To dig deeper, see data governance and the LGPD in practice.

💡 Use both worlds

This doesn't replace the university cluster — it complements it. Reserve the institutional HPC for long, scheduled jobs, and use the on-demand GPU for fast iteration, prototyping, and time-sensitive sensitive-data runs. The best of each.

A typical end-to-end workflow

In the Console, pick a suitable GPU and launch the JupyterLab template.
Upload your dataset and git clone your repository.
Run the experiment, watch the metrics, tweak, and repeat — no queue between runs.
Download checkpoints and results, then shut the instance down.

Want to go beyond training and adapt an open model to your domain on a small budget? See our guide to low-budget LLM fine-tuning for research, with LoRA/QLoRA on a single GPU.

Your next GPU is one click away, no queue

Spin up a dedicated GPU with JupyterLab in seconds.

Get Started Free →

Frequently asked questions

How can I skip the SLURM queue to get a GPU for research?

Instead of waiting for a slot on a shared university cluster, you spin up a dedicated on-demand GPU on GPUBrazil in seconds. There's no queue and no scheduling: the GPU is exclusively yours while the instance is on, and you tear it down when the experiment finishes, paying only for the hours you used.

Can I use my own Python environment and CUDA versions?

Yes. Unlike a cluster with a fixed stack, you choose the environment. There are one-click templates such as JupyterLab with CUDA and PyTorch, and you can install whatever libraries and versions your project needs, building reproducible environments for each experiment.

Does my sensitive research data stay under my control?

Yes. Each instance is single-tenant and dedicated to you, so prompts and clinical, legal, or personal datasets stay on your instance and are never sent to a third-party API. You keep full control of what is processed and logged, which eases LGPD governance and ethics-board approval.

Conclusion

Science shouldn't wait in line. With dedicated on-demand GPUs, you trade the cluster's unpredictable wait for immediate iteration, control your environment, keep full control of your data, and pay only for what you use — all within a research budget. It's infrastructure working for your research, not the other way around.