Building personal and general intelligence.
An AI research lab. Personal intelligence is the through-line — but we work across language models, architectures, reasoning, agents and the systems on top.
We don't pre-announce. We ship.
AVALON-2B — the first sub-3B language model that knows what it doesn't know.
1.88B parameters built on Qwen 3.5 2B. A five-token reflection vocabulary. A 22M-parameter MiniLM router at 90.5% accuracy. 82.5% Self-RAG token accuracy under LoRA fine-tune. Apache 2.0. Live on Hugging Face and Ollama.
In production, in flight, and in development.
Research → infrastructure → applied.
Research
Apache 2.0 papers and weights. AVALON-2B is live; PLMR preprint imminent; Hydra in active development.
Infrastructure
The cognitive memory layer for AI agents. SOC 2 Type II. Sub-200ms p95. Beats every adjacent system on LoCoMo.
Applied
Vertical and consumer products built on the same stack. Compliance-aware, citation-backed, on-device first.
The path to general intelligence runs through personal intelligence.
We're a research lab — first and foremost. That means we work on the open questions across language models, architectures, reasoning, agents and the systems on top of them, and we publish what we learn.
Personal intelligence is the through-line that ties our work together — persistent memory, self-reflection, per-agent cognition. It's the part of intelligence the current frontier mostly skips, and we think it's a credible bottom-up route to the general intelligence everyone is racing toward top-down.
But personal intelligence isn't the only thing we work on. Some of our research is on language-model reflection (AVALON-2B). Some is on byte-level architectures and routing (PLMR). Some is on systems we haven't named yet (Hydra). Some ships as open weights; some ships as commercial infrastructure (Hypersave) or applied products (Khyaa, the Nuro stack). — one lab, many bets.
from hypersave import Memory
mem = Memory(agent_id="researcher")
# every turn, the agent both remembers and reflects
mem.write("Sarah's youngest is heading to Stanford in August.",
sector="episodic")
ctx = mem.recall("What's top of mind for Sarah right now?")
# → cited, ranked, decay-weighted answer — not chunksHypersave SDK · TypeScript and Python · npm i @hypersave/sdk
Memory that thinks.
Hypersave is the cognitive memory layer for AI agents. Brain-inspired architecture: five cognitive sectors, each with its own Ebbinghaus decay curve. Knowledge graph + vector + keyword + RRF hybrid retrieval. Ten-stage query pipeline. Answer synthesis — not chunks.
Reported in good faith from internal benchmarks. Reproducible script will ship with the v1.1 release notes.
Sarah Chen joins at 15:00. Last note flagged inflation; today's CPI print came in 30 bps below consensus[1]. Recent touchpoints: Roth conversion (Oct 12), Stripe RSU vesting (Oct 4), 529 contribution limits (Sep 21)[2]. Open commitments: Q3 statement due Friday; estate-attorney intro by Nov 1[3].
Never walk into a client meeting cold.
Personal intelligence for US financial advisors. Pre-meeting briefs, on-device voice capture, natural-language Ask across your entire book — every answer cites its source.
- 200-word brief 15 minutes before each external meeting
- 1-button voice recorder — recap leaves your phone as text, not audio
- Ask the book in English; every claim has a source
- SEC marketing rule, Reg BI, Rule 204-2 retention aware
Built on the same research stack.
Seven things we hold ourselves to.
Stated up front so the next time you read a Nuro release, you can hold us to them.
and so we don't lose the plot01Open research, commercial products.
Open research, commercial products.
02Personal first.
Personal first.
03Compounding, not theatrical.
Compounding, not theatrical.
04Accountable.
Accountable.
05Useful before grand.
Useful before grand.
06For every mind.
For every mind.
07Patient about the destination.
Patient about the destination.
The labs that ship across the whole stack — open research, infrastructure, applied products — define the next decade of AI. The ones that pick one and stop don't.
From an idea about memory to a shipping research lab.
Hypersave research
Hypersave v1.0 GA
AVALON-2B released
Frontier-grade work, incrementally.
- Nov 2024
Nuro AI Labs incorporated
Companies House #16079959. London-registered private limited. - Q1 2025
Hypersave research kicks off
Brain-inspired cognitive memory architecture for AI agents — five sectors, Ebbinghaus decay curves, hybrid retrieval. - Q3 2025
Khyaa first design partners
Personal intelligence for US financial advisors. Free beta with named RIA design partners. - Q1 2026
Hypersave v1.0 GA
SOC 2 Type II. TypeScript + Python SDKs. 86% LoCoMo — beats every adjacent agent-memory system head-to-head. - Apr 2026
AVALON-2B released
First sub-3B Self-RAG language model. Apache 2.0. Live on Hugging Face and Ollama. - Next
PLMR preprint · AVALON-3 · Hydra
Continued open research. Continued commercial product. Same compounding.
A small lab. High signal.
Akhil Ponnada
Author on AVALON-2B and PLMR. Sets research direction and overall company strategy. MSc International Business Management, Heriot-Watt University; BBA, Amity University.
Naga Sri Arvapalli
Co-author on AVALON-2B; led the AVALON training pipeline. Owns the technical architecture across research, Hypersave infrastructure and the applied product stack.
Naveen Yelloji
Senior CXO-level operator with three decades of leadership across AI, media & entertainment, telecom, infrastructure and technology. IIM Ahmedabad alumnus; University of Hull.
Research
Pretraining, fine-tuning, evals. AVALON, PLMR, Hydra. Open by default.
Open roles →Infrastructure
Hypersave platform engineering. Distributed memory, latency at p95, SOC 2.
Open roles →Applied
Khyaa, Nuro Studio, Nuro Chat, Nuro One. Vertical and consumer surfaces.
Open roles →Operations
GTM, partnerships, finance. Help us turn frontier research into a durable lab.
Open roles →London. Quietly, deliberately.
Registered office at 128 City Road. Companies House #16079959. We're hiring across research, infrastructure and applied — remote-friendly, London-anchored.
London
What does "personal intelligence" actually mean?
How is this different from OpenAI, Anthropic or DeepMind?
Are your products open source?
Where are you based?
Who is the team?
Are you raising?
One email per release. That's it.
Papers, weights, and product updates as they ship. No marketing, no tracking, unsubscribe anytime.
Slow updates from the lab.
One email per release. No marketing, no tracking, unsubscribe anytime.
Personal intelligence. General intelligence.
Build with us. Read the research. Try the products. Or just say hello.
nuroailabs.com·@nurolabs·huggingface.co/nuroai