// what is nan
Builders sharing dedicated GPUs to run open models and burn tokens without limits.
Closed-model APIs are convenient but expensive and limited. Open models are free but running them seriously requires hardware almost no one has at home.
NaN solves this by sharing powerful GPUs among builders who actually use them, splitting the real cost of infrastructure.
A closed community by design, limited to the real capacity of shared GPUs. When it fills up, we open a waitlist.
The community lives on Discord. The website only manages access. Every member pays to be here. Only those who truly want to build make it in.
// inference cluster
Inference cluster for open AI models
A shared cluster with NVIDIA RTX PRO 6000 Blackwell, 96 GB VRAM, 256 GB DDR5 RAM and 48 dedicated threads per node. Four open models running at once, chosen by the community each quarter.
hardware
- GPU
- NVIDIA RTX PRO 6000 Blackwell
- VRAM
- 96 GB GDDR7 ECC
- RAM
- 256 GB DDR5 ECC
- CPU
- 48 threads · Intel Xeon Gold 5412U
- Storage
- 2× 960 GB NVMe RAID 1
stack
- Inference
- vLLM / SGLang
- Proxy
- LiteLLM
- API
- OpenAI-compatible
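Because the proxy speaks the OpenAI API, any OpenAI-compatible client works once you point it at the cluster. A minimal sketch in Python of what a chat completion request looks like; the base URL here is a placeholder, and the model ID is an assumption based on the current model list:

```python
import json

# Placeholder: the real base URL is provided to members with their API key.
BASE_URL = "https://nan.example/v1"

def build_chat_request(api_key: str, model: str, messages: list[dict]):
    """Build an OpenAI-compatible chat completion request: (url, headers, body)."""
    url = f"{BASE_URL}/chat/completions"
    headers = {
        "Authorization": f"Bearer {api_key}",  # your personal member key
        "Content-Type": "application/json",
    }
    body = json.dumps({"model": model, "messages": messages}).encode()
    return url, headers, body

url, headers, body = build_chat_request(
    "sk-your-personal-key",
    "qwen3.6",  # assumed model ID, taken from the current model list
    [{"role": "user", "content": "Hello from the cluster"}],
)
```

In practice you would not build requests by hand: the official OpenAI SDKs accept a custom base URL, so swapping a closed API for the cluster is a one-line config change.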
// current models
qwen3.6
LLM · 35B-A3B MoE · FP8 · 128K context · Tool calling · Reasoning
qwen3-embedding
Embeddings · 8B · 4096 dims · 100+ languages · MMTEB 70.58
kokoro
TTS · 82M params · 67 voices · <1s latency · ef_dora · em_alex
whisper
STT · large-v3 · CTranslate2 · ~3.2% WER Spanish · auto language detection
- Your code doesn't train any models
- Data in EU, no prompt logs
- No token caps
- Personal API key for each member
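A typical use of the embedding model above is semantic search: fetch 4096-dim vectors from the embeddings endpoint, then compare them with cosine similarity. A pure-Python sketch of the comparison step (the vectors themselves would come from the API):

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors, e.g. the
    4096-dim outputs of qwen3-embedding. Returns a value in [-1, 1]."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Toy 3-dim vectors for illustration; real embeddings have 4096 dims.
sim = cosine_similarity([1.0, 2.0, 0.0], [2.0, 4.0, 0.0])  # parallel -> 1.0
```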
// privacy
Pure inference. Zero logs.
NaN is inference infrastructure. No prompts, no model responses, nothing is logged. Everything lives in your local session, on whatever client, agent, or custom app you use.
The API is pure inference. The only thing recorded is server-side metrics (tokens/s, RPM, etc.), used solely for cluster maintenance and monitoring.
Your code doesn't train any models. Your prompts aren't stored. Your data doesn't leave your session.
- Zero logs: no prompt or response logging
- Everything lives in your local client or agent session
- Processing in the European Union
- Your code doesn't train any models
- Server metrics only for cluster maintenance (tokens/s, RPM)
// how models are chosen
Models chosen by the community. Refreshed every quarter.
- Every 3 months, members vote on the new models that will run on the cluster.
- Candidates are pre-selected technically: they must fit in 96 GB of VRAM.
- Voting happens on Discord, open to all members.
- The open models ecosystem moves fast. So does NaN.
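The "fits in 96 GB of VRAM" pre-selection can be sanity-checked with back-of-envelope arithmetic: weight memory is roughly parameter count times bytes per parameter, plus headroom for KV cache and runtime overhead. A rough sketch; the 16 GB overhead budget is an illustrative assumption, not a cluster spec:

```python
def fits_in_vram(params_billion: float, bytes_per_param: float,
                 overhead_gb: float = 16.0, vram_gb: float = 96.0) -> bool:
    """Rough check: model weights plus a fixed overhead budget (KV cache,
    activations, CUDA context) against available VRAM. 1e9 params at
    1 byte/param is ~1 GB of weights, so billions * bytes gives GB."""
    weights_gb = params_billion * bytes_per_param
    return weights_gb + overhead_gb <= vram_gb

# A 35B model in FP8 (~1 byte/param) needs ~35 GB for weights,
# leaving plenty of headroom; a 120B FP8 model would not fit.
print(fits_in_vram(35, 1.0), fits_in_vram(120, 1.0))
```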
// current models
qwen3.6
LLM · 35B-A3B MoE
qwen3-embedding
Embeddings · 8B
kokoro
TTS · 67 voices
whisper
STT · large-v3
// who's behind
The community is run by one person. Not a company.
Cristian Córdova
@barckcode · Tenerife, Islas Canarias
Founder of Helmcode, working with startups on cloud infrastructure, AI tooling and open source.
Through Helmcode we offer open-model inference with unlimited-token plans for enterprises, and I share my experiments and AI research on social media.
I started NaN because I wanted to run serious open models without usage limits, and share and learn with people in the same boat: builders who actually burn tokens.
// pricing
One tier per region. No upsells. No trial.
NaN Member EU
🇪🇺 VAT included
Limited spots
- Access to shared cluster (96 GB VRAM)
- Open models chosen by the community
- Personal API key compatible with OpenAI
- No token caps
- Private Discord channels for members only
- Voice in quarterly model voting
- Month to month, no commitment
Limited spots. When it fills up, waitlist.
NaN Member USA/Latam
🇺🇸 Taxes included
Limited spots
- Access to shared cluster (96 GB VRAM)
- Open models chosen by the community
- Personal API key compatible with OpenAI
- No token caps
- Private Discord channels for members only
- Voice in quarterly model voting
- Month to month, no commitment
Limited spots. When it fills up, waitlist.
// join
A community of builders. Limited spots. Shared GPUs with unlimited usage.
We share dedicated GPUs to run open models and burn tokens without limits. Leave your email and we'll reach out when we open payments.
// faq
What you'll probably ask.
- What is NaN? NaN is a closed community of builders who share access to dedicated GPUs to run open-source AI models. Every member pays to be here, and there's only room for those who will actually build.
- How does it work? Each community member shares a dedicated GPU (RTX PRO 6000 Blackwell, 96 GB VRAM). The GPUs run open models 24/7 on a shared cluster. Each member gets a personal OpenAI-compatible API key to use the models as they want, with no token caps.
- Are tokens really unlimited? Yes. Once you have cluster access, you can burn tokens without limits. The models you run depend on what the community votes to run each quarter. No quotas, no throttling, no surprises on the bill.
- How are models chosen? Every quarter, community members vote on which new models will run on the cluster. Candidates are pre-selected technically to ensure they fit within 96 GB of VRAM. Voting happens on Discord, open to all members.
- What happens with my data? Data is processed in the European Union, with no prompt logging. Your API key is personal and non-transferable. We don't share data between members or sell information to third parties. Your code doesn't train any models.
- Who is NaN for? For builders who want to run serious AI models without the overhead of managing infrastructure. If you're a startup, researcher, or maker who needs inference at scale without token limits, NaN is for you. It's not for casual experimentation.
- Why open models? Because we believe real innovation comes from tools you can audit, modify, and use without restrictions. Closed models are convenient but create dependency. With open models you have full control over your stack.
- Can I train or fine-tune models? No. NaN is for inference, not training. The GPUs are optimized to run existing models at high speed, not for fine-tuning. If you need training, there are other platforms for that.
- How do payments work? We currently accept credit/debit cards and cryptocurrency. Prices include taxes (VAT for the EU, local taxes for USA/Latam). You can cancel anytime, no commitment.
- How do I join? Leave your email on the waitlist and we'll reach out when we open payments. The community lives on Discord: once you join, you get access to all private channels, quarterly votes, and direct support from the team.