Ternative

The proof

A 2B model in a
7B+ fight.

Internal Benchmark v2 — 100 questions across 8 categories, semantic similarity scoring. Orchid lands third of twelve models, ahead of every open-weight system, including Qwen2.5-7B and Kimi k1.5.

Science 100% · Math 93.3% · Coding 93.3%

See full benchmarks →

Internal Benchmark v2 Orchid 1.0

Claude 3.5 Sonnet

89.5

GPT-4o

89.2

Orchid 1.0 · 2B

87.9

BitNet b1.58 · 2B

84.2

Kimi k1.5

82.2

Qwen2.5 · 7B

78.4

Semantic-similarity scoring is a relative comparison tool, not a substitute for standard NLP benchmarks.

Two open-source projects

A model, and the engine
that makes it run.

Model

Orchid 1.0

The first competitive LLM trained and aligned in Colombia. Aligned with ORPO for unbiased, multilingual responses on consumer hardware — no cloud dependency.

Explore the model →

−10+1 Engine

The inference engine for ternary-weight LLMs with runtime LoRA — “the llama.cpp of BitNet models.” It serves combinations no other stack can run correctly.

How it works →

The story

88 hours. One laptop.
No datacenter.

Every training stage ran on a single RTX 3050 laptop — 4 GB of VRAM, 16 GB of RAM, Windows 11. SFT, then two rounds of ORPO alignment, with memory tricks that made it possible to fine-tune a 2B model on hardware most people already own.

Read the full story →

training_run.log

# single RTX 3050 · 4 GB VRAM · no cloud
SFT-A   LoRA r=16   reasoning        ~1 h
SFT-B   LoRA r=16   5,500 samples    ~88 h
ORPO-2  LoRA r=8    2,038 pairs      ~26 h
ORPO-3  LoRA r=8    2,104 pairs      ~54 h
# total cloud GPUs used: 0

Out in the open

Weights, code, paper, DOI —
all public.

Hugging Face
Model card, GGUF weights & LoRA adapter — 4,127 downloads last month huggingface.co ↗ ternative engine
C++17 / CUDA inference engine — source, releases & build instructions github.com ↗ orchid-1.0 recipe
Training recipe, eval harness & reproduction scripts github.com ↗ Zenodo · Technical paper
DOI 10.5281/zenodo.20452163 — archived, citable research record zenodo.org ↗ FLOSS/fund
Support continued open development of Ternative & Orchid floss.fund ↗

Ranks #3.Behind only Claudeand GPT-4o.

A 2B model in a7B+ fight.

A model, and the enginethat makes it run.

Orchid 1.0

ternative

88 hours. One laptop.No datacenter.

Weights, code, paper, DOI —all public.

Ranks #3.
Behind only Claude
and GPT-4o.

A 2B model in a
7B+ fight.

A model, and the engine
that makes it run.

88 hours. One laptop.
No datacenter.

Weights, code, paper, DOI —
all public.