If you're a grad student, a researcher, or part of a lab in Brazil, you know the scene well: you finish setting up your experiment, submit the job to the university cluster, and... you wait. Hours. Sometimes days. The SLURM queue is full, someone reserved 8 GPUs for a week, and your few-hour training run is stuck behind all of it. When the GPU finally frees up, you discover the node's CUDA version doesn't match your library.

โšก TL;DR

Shared HPC clusters are great for big, scheduled jobs but terrible for fast iteration. With dedicated on-demand GPUs on GPUBrazil, there's no queue, you spin up in seconds, you pick your own environment (JupyterLab/PyTorch in one click), you pay per hour, and your data stays in Brazil under the LGPD. Turn it on, use it, turn it off.

Why the university cluster stalls your research

The institutional HPC cluster is a valuable resource, but it was designed for a world of batch jobs: you describe what you need, join the queue, and the scheduler fits you in when there's room. For a week-long simulation, that makes sense. For the reality of doing AI today โ€” test an idea, tweak a hyperparameter, run again โ€” it's enormous friction. The main problems:

  • Unpredictable queue: your 2-hour experiment might wait 2 days to start, breaking your iteration rhythm.
  • Rigid stack: driver, CUDA, Python, and library versions are set by the admin. Updating or using something different is a formal request โ€” when it's possible at all.
  • Shared tenancy: in many cases your data sits on file systems shared with other groups, which is a concern when the dataset is clinical, legal, or personal.
  • No control over when: you don't decide when it runs; the scheduler does.

The practical result is that good research gets slow not because of the science, but because of the infrastructure.

The alternative: dedicated on-demand GPUs

GPUBrazil's pitch is simple: instead of competing for a shared resource, you rent a dedicated NVIDIA GPU by the hour, spin it up in seconds, and have the whole machine to yourself. No queue, no scheduler, no asking permission. When you're done, you shut it down and stop paying.

HPC cluster (SLURM)On-demand GPU
Time to startHours to days (queue)Seconds
EnvironmentFixed by adminYou choose
Data accessSharedDedicated instance
CostInstitutional quota / contentionPer hour in reais (R$)
TeardownSubject to time limitsShut down whenever

Your environment, your way โ€” in one click

Instead of filing a ticket with IT to request a specific PyTorch version, you launch a JupyterLab with GPU and CUDA template in one click and land in a working notebook. From there, install whatever you like:

# In the terminal of your GPUBrazil instance
nvidia-smi                       # confirm the GPU and available CUDA
pip install torch transformers datasets accelerate

python -c "import torch; print(torch.cuda.is_available())"
# True โ€” ready to train

Because the environment is yours, you can pin exact versions in a requirements.txt or container image and get reproducible experiments: the same environment, the same way, every time โ€” something hard to guarantee when you share a node with dozens of other researchers.

Cost control within a research budget

Research funding is finite and needs accounting. The per-hour model in reais helps with exactly that: you know, before you start, what you'll spend. An entry-level GPU like the RTX A4000 from R$1.80/h handles plenty of prototyping and light fine-tuning; for bigger experiments, you scale up to a more powerful GPU only during the run and drop back afterward. (Check current pricing in the console.)

A few practices that fit an academic budget well:

  1. Prototype small: develop and debug on a cheap GPU; only move to the big one once the code is correct.
  2. Shut down when done: no running job, no cost. No "left it on" surprises like a standing reservation.
  3. Pay with Pix: no international-card hassle or FX swings โ€” handy for institutional reimbursement.
  4. Scale per experiment: run hyperparameter sweeps in parallel by spinning up several instances and tearing them all down at the end.

Sensitive data stays in Brazil (LGPD)

For anyone working with medical records, clinical notes, legal texts, or participants' personal data, where the data is processed is not a detail โ€” it's an ethical and legal requirement. GPUBrazil's GPUs are in Brazil, so your datasets don't cross a border. That eases LGPD compliance, ethics-board approval, and adherence to consent terms, and it reduces latency for you and your collaborators in-country. To dig deeper, see data sovereignty and the LGPD in practice.

๐Ÿ’ก Use both worlds

This doesn't replace the university cluster โ€” it complements it. Reserve the institutional HPC for long, scheduled jobs, and use the on-demand GPU for fast iteration, prototyping, and time-sensitive sensitive-data runs. The best of each.

A typical end-to-end workflow

  1. In the Console, pick a suitable GPU and launch the JupyterLab template.
  2. Upload your dataset and git clone your repository.
  3. Run the experiment, watch the metrics, tweak, and repeat โ€” no queue between runs.
  4. Download checkpoints and results, then shut the instance down.

Want to go beyond training and adapt an open model to your domain on a small budget? See our guide to low-budget LLM fine-tuning for research, with LoRA/QLoRA on a single GPU.

Your next GPU is one click away, no queue

Spin up a dedicated GPU with JupyterLab in seconds.

Get Started Free โ†’

Frequently asked questions

How can I skip the SLURM queue to get a GPU for research?

Instead of waiting for a slot on a shared university cluster, you spin up a dedicated on-demand GPU on GPUBrazil in seconds. There's no queue and no scheduling: the GPU is exclusively yours while the instance is on, and you tear it down when the experiment finishes, paying only for the hours you used.

Can I use my own Python environment and CUDA versions?

Yes. Unlike a cluster with a fixed stack, you choose the environment. There are one-click templates such as JupyterLab with CUDA and PyTorch, and you can install whatever libraries and versions your project needs, building reproducible environments for each experiment.

Does my sensitive research data stay in Brazil?

Yes. GPUBrazil's GPUs are located in Brazil, so prompts and clinical, legal, or personal datasets never leave the country. That eases LGPD compliance and ethics-board approval, and it also reduces latency for users in Brazil.

Conclusion

Science shouldn't wait in line. With dedicated on-demand GPUs, you trade the cluster's unpredictable wait for immediate iteration, control your environment, keep data in Brazil, and pay only for what you use โ€” all within a research budget. It's infrastructure working for your research, not the other way around.

Read next: Low-budget LLM fine-tuning for research ยท Data sovereignty and the LGPD ยท Open-source LLM comparison 2026