ES

NaN

Shared GPUs. Open-source models. Closed community.

Builders sharing GPUs to run the latest open AI models without token limits.

Limited spots
scroll

// what is nan

Builders sharing dedicated GPUs to run open models and burn tokens without limits.

Closed-model APIs are convenient but expensive and limited. Open models are free but running them seriously requires hardware almost no one has at home.

NaN solves this by sharing powerful GPUs among builders who actually use them, splitting the real cost of infrastructure.

A closed community by design, limited to the real capacity of shared GPUs. When it fills up, we open a waitlist.

The community lives on Discord. The website only manages access. Every member pays to be here. Only those who truly want to build make it in.

// inference cluster

Inference cluster for open AI models

A shared cluster with NVIDIA RTX PRO 6000 Blackwell, 96 GB VRAM, 256 GB DDR5 RAM and 48 dedicated threads per node. Four open models running at once, chosen by the community each quarter.

hardware

GPU
NVIDIA RTX PRO 6000 Blackwell
VRAM
96 GB GDDR7 ECC
RAM
256 GB DDR5 ECC
CPU
48 threads · Intel Xeon Gold 5412U
Storage
2× 960 GB NVMe RAID 1

stack

Inference
vLLM / SGLang
Proxy
LiteLLM
API
OpenAI-compatible

// current models

qwen3.6

llm

LLM · 35B-A3B MoE · FP8 · 128K context · Tool calling · Reasoning

qwen3-embedding

embedding

Embeddings · 8B · 4096 dims · 100+ languages · MMTEB 70.58

kokoro

tts

TTS · 82M params · 67 voices · <1s latency · ef_dora · em_alex

whisper

stt

STT · large-v3 · CTranslate2 · ~3.2% WER Spanish · auto language detection

  • Your code doesn't train any models
  • Data in EU, no prompt logs
  • No token caps
  • Personal API key for each member

// privacy

Pure inference. Zero logs.

NaN is inference infrastructure. There is no logging of prompts, model responses, nothing. Everything lives in your local session, on whatever client, agent, or custom app you use.

The API is pure inference. Nothing you do with it is logged except server-side metrics (tokens/s, RPM, etc.) for cluster maintenance and monitoring only.

Your code doesn't train any models. Your prompts aren't stored. Your data doesn't leave your session.

  • Zero logs: no prompt or response logging
  • Everything lives in your local client or agent session
  • Processing in the European Union
  • Your code doesn't train any models
  • Server metrics only for cluster maintenance (tokens/s, RPM)

// how models are chosen

Models chosen by the community. Refreshed every quarter.

  • Every 3 months, members vote on the new models that will run on the cluster.
  • Candidates are pre-selected technically: they must fit in 96 GB of VRAM.
  • Voting happens on Discord, open to all members.
  • The open models ecosystem moves fast. So does NaN.

// current models

qwen3.6

LLM · 35B-A3B MoE

qwen3-embedding

Embeddings · 8B

kokoro

TTS · 67 voices

whisper

STT · large-v3

// who's behind

The community is run by one person. Not a company.

Cristian Córdova

Cristian Córdova

@barckcode

Tenerife, Islas Canarias

Founder of Helmcode, working with startups on cloud infrastructure, AI tooling and open source.

Thanks to Helmcode we offer open model inference with unlimited token plans for enterprises, I share my experiments on social media and AI research.

I started NaN because I wanted to run serious open models without usage limits, and share and learn with people in the same boat: builders who actually burn tokens.

// pricing

One tier per region. No upsells. No trial.

NaN Member EU

🇪🇺
70€ /mo

VAT included

Limited spots

  • Access to shared cluster (96 GB VRAM)
  • Open models chosen by the community
  • Personal API key compatible with OpenAI
  • No token caps
  • Private Discord channels for members only
  • Voice in quarterly model voting
  • Month to month, no commitment
Join

Limited spots. When it fills up, waitlist.

NaN Member USA/Latam

🇺🇸
$75 /mo

Taxes included

Limited spots

  • Access to shared cluster (96 GB VRAM)
  • Open models chosen by the community
  • Personal API key compatible with OpenAI
  • No token caps
  • Private Discord channels for members only
  • Voice in quarterly model voting
  • Month to month, no commitment
Join

Limited spots. When it fills up, waitlist.

// join

A community of builders. Limited spots. Shared GPUs with unlimited usage.

We share dedicated GPUs to run open models and burn tokens without limits. Leave your email and we'll reach out when we open payments.

No spam. No newsletters. We only write when we open payments.

// faq

What you'll probably ask.