Orchid 1.0 is a 2-billion-parameter ternary-weight language model fine-tuned on a single 4 GB laptop GPU. On our internal benchmark it outscores every open-weight model we tested — including 7B–9B systems.
Internal Benchmark v2 — 100 questions across 8 categories, semantic similarity scoring. Orchid lands third of twelve models, ahead of every open-weight system, including Qwen2.5-7B and Kimi k1.5.
Science 100% · Math 93.3% · Coding 93.3%
See full benchmarks →
Semantic-similarity scoring is a relative comparison tool, not a substitute for standard NLP benchmarks.
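To make the scoring idea concrete, here is a minimal sketch of semantic-similarity scoring: each answer and its reference are turned into embedding vectors, compared by cosine similarity, and counted as a pass above a cutoff. The embedding model, the 0.8 threshold, and the helper names `cosine_similarity` and `category_score` are assumptions made for illustration; the benchmark's actual pipeline is not described on this page.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention used here: a zero vector matches nothing
    return dot / (norm_a * norm_b)

def category_score(pairs, threshold=0.8):
    """Percentage of (answer_vec, reference_vec) pairs at or above threshold.

    The 0.8 threshold is a hypothetical value, not the benchmark's setting.
    """
    if not pairs:
        raise ValueError("need at least one pair")
    passed = sum(
        1 for answer, reference in pairs
        if cosine_similarity(answer, reference) >= threshold
    )
    return 100.0 * passed / len(pairs)

# Toy embeddings (made up): the first answer is close to its reference,
# the second is orthogonal to it.
toy_pairs = [
    ([1.0, 0.0, 1.0], [1.0, 0.1, 0.9]),
    ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0]),
]
print(category_score(toy_pairs))
```

Because the result depends on the embedding model and the cutoff, scores like these are only meaningful when every model is graded with the same setup, which is why they work as a relative comparison.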
Model
The first competitive LLM trained and aligned in Colombia. Aligned with ORPO for unbiased, multilingual responses on consumer hardware — no cloud dependency.
Explore the model →
The inference engine for ternary-weight LLMs with runtime LoRA: “the llama.cpp of BitNet models.” It serves combinations no other stack can run correctly.
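As a rough illustration of what "ternary weights with runtime LoRA" means, the sketch below quantizes a small weight matrix to {-1, 0, 1} with one shared scale, then adds a low-rank LoRA correction at inference time instead of merging it into the weights. It assumes BitNet b1.58-style absmean quantization and the standard LoRA update `(alpha / rank) * B(Ax)`. The function names are hypothetical and this is not the engine's actual kernel code.

```python
def ternarize(weights, eps=1e-8):
    """Absmean quantization (assumed scheme): divide by mean |w|, round, clip to {-1, 0, 1}."""
    magnitudes = [abs(w) for row in weights for w in row]
    scale = sum(magnitudes) / len(magnitudes) + eps
    ternary = [[max(-1, min(1, round(w / scale))) for w in row] for row in weights]
    return ternary, scale

def matvec(matrix, vec):
    """Plain matrix-vector product on nested lists."""
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]

def forward(ternary, scale, x, lora_a=None, lora_b=None, alpha=1.0):
    """Base output from ternary weights, plus an optional unmerged LoRA term."""
    base = [scale * y for y in matvec(ternary, x)]
    if lora_a is None or lora_b is None:
        return base
    rank = len(lora_a)  # lora_a has shape (rank, in_features)
    delta = matvec(lora_b, matvec(lora_a, x))  # B (A x), computed at runtime
    return [b + (alpha / rank) * d for b, d in zip(base, delta)]

# Toy full-precision weights (made up) with shape (2, 3).
w = [[0.9, -0.05, -1.1],
     [0.4, 1.2, -0.7]]
w_ternary, w_scale = ternarize(w)

x = [1.0, 2.0, 3.0]
adapter_a = [[1.0, 0.0, 0.0]]   # rank-1 adapter, shape (1, 3)
adapter_b = [[0.5], [-0.5]]     # shape (2, 1)

print(w_ternary)
print(forward(w_ternary, w_scale, x))
print(forward(w_ternary, w_scale, x, adapter_a, adapter_b))
```

Keeping the adapter separate means the base weights stay ternary. Merging a full-precision LoRA delta into them would generally push values outside {-1, 0, 1}.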
How it works →
Every training stage ran on a single RTX 3050 laptop: 4 GB of VRAM, 16 GB of RAM, Windows 11. SFT, then two rounds of ORPO alignment, with memory tricks that made it possible to fine-tune a 2B model on hardware most people already own.
Read the full story →
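For readers unfamiliar with ORPO, the alignment step named above, here is a minimal sketch of its per-example loss as defined in the ORPO paper: the usual SFT loss on the preferred answer plus a weighted odds-ratio penalty that favors the preferred answer over the rejected one. The inputs are average per-token log-probabilities. The weight `lam=0.1` and the function names are illustrative assumptions, not Orchid's actual training configuration.

```python
import math

def log_odds(avg_logp):
    """log(p / (1 - p)), where p = exp(avg_logp) is a length-normalized likelihood.

    Requires avg_logp < 0 so that p < 1.
    """
    p = math.exp(avg_logp)
    return avg_logp - math.log1p(-p)

def orpo_loss(avg_logp_chosen, avg_logp_rejected, lam=0.1):
    """SFT loss on the chosen answer plus lam times the odds-ratio term."""
    sft = -avg_logp_chosen
    log_ratio = log_odds(avg_logp_chosen) - log_odds(avg_logp_rejected)
    # -log(sigmoid(z)) written as log(1 + exp(-z))
    odds_ratio_term = math.log1p(math.exp(-log_ratio))
    return sft + lam * odds_ratio_term

# Toy values (made up): the chosen answer is more likely than the rejected one.
print(orpo_loss(-0.5, -1.5))
# Same chosen likelihood, but the model cannot tell the two answers apart.
print(orpo_loss(-0.5, -0.5))
```

Because the preference term is computed from the policy's own likelihoods, ORPO needs no separate reference model held in memory, which is one reason it can suit small-VRAM setups.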