Stanford's Scaling Intelligence Lab has released OpenJarvis, an open-source framework under the Apache 2.0 license designed to run personal AI agents entirely on local devices. The project, led by PhD researchers Jon Saad-Falcon and Avanika Narayan alongside faculty co-authors Christopher Ré, Azalia Mirhoseini, and former Stanford president John Hennessy, directly challenges the cloud-by-default architecture that underpins nearly every consumer AI product today. The framework is structured around five composable primitives — Intelligence (model selection and routing), Engine (local inference runtime), Agents (orchestration logic), Tools and Memory (context and capability access), and Learning (closed-loop fine-tuning from on-device trace data) — and supports interoperability standards including <a href="/news/2026-03-14-agent-format-yaml-standard-portable-ai-agents">MCP and Google's A2A protocol</a>.
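To make the five-primitive decomposition concrete, here is a minimal sketch of how such primitives might compose into an agent loop. The class names mirror the article's terminology, but the API, routing heuristic, and stubbed inference are illustrative assumptions, not OpenJarvis's actual interfaces.

```python
from dataclasses import dataclass, field
from typing import Dict, List

# Hypothetical sketch only: class names follow the article's five
# primitives, but none of this reflects the real OpenJarvis API.

@dataclass
class Intelligence:
    """Model selection and routing: pick a model per query."""
    models: Dict[str, str]  # tier name -> model size (illustrative)

    def route(self, query: str) -> str:
        # Toy heuristic: longer queries go to the larger model.
        return "large" if len(query.split()) > 20 else "small"

@dataclass
class Engine:
    """Local inference runtime (stubbed out here)."""
    def generate(self, tier: str, prompt: str) -> str:
        return f"[{tier}] response to: {prompt}"

@dataclass
class Memory:
    """Tools and Memory: context the agent can read and extend."""
    history: List[str] = field(default_factory=list)

@dataclass
class Agent:
    """Orchestration logic tying the primitives together."""
    intelligence: Intelligence
    engine: Engine
    memory: Memory

    def run(self, query: str) -> str:
        tier = self.intelligence.route(query)
        reply = self.engine.generate(tier, query)
        self.memory.history.append(reply)
        return reply

agent = Agent(
    Intelligence(models={"small": "3B", "large": "8B"}),
    Engine(),
    Memory(),
)
print(agent.run("What's on my calendar today?"))
```

The point of the decomposition is that each primitive can be swapped independently: a different router, a different local runtime, or a different memory store, without rewriting the orchestration logic.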
The project is grounded in the lab's own "Intelligence Per Watt" study, which found that local language models can accurately handle 88.7% of single-turn chat and reasoning queries at interactive latencies, with efficiency improving 5.3x between 2023 and 2025. OpenJarvis's defining architectural choice is treating energy consumption, latency, FLOPs, and dollar cost as first-class evaluation constraints on equal footing with task accuracy — a direct response to the reality that battery-powered edge devices cannot tolerate the resource profiles that cloud deployments routinely assume. The framework also includes a <a href="/news/2026-03-15-nova-self-hosted-personal-ai-dpo-fine-tuning-autonomous-self-improvement">closed-loop learning harness</a> that optimizes across four surfaces: model weights, LM prompts, agentic logic, and the inference engine itself, filling a gap that cloud-hosted models structurally cannot address since their weights and runtimes are not exposed for local adaptation.
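The "resources as first-class constraints" idea can be sketched as a selection rule: a candidate model is only eligible if it meets hard budgets on energy, latency, and cost, and eligible candidates are then ranked by accuracy delivered per joule, in the spirit of the lab's intelligence-per-watt framing. The field names, budgets, and numbers below are illustrative assumptions, not OpenJarvis's actual evaluation schema.

```python
from dataclasses import dataclass

# Illustrative sketch: treat resource limits as hard constraints on
# equal footing with accuracy. All values here are made up.

@dataclass
class EvalResult:
    name: str
    accuracy: float    # task accuracy, 0..1
    energy_j: float    # joules per query
    latency_ms: float  # end-to-end latency per query
    cost_usd: float    # marginal dollar cost per query

def within_budget(r: EvalResult, max_energy_j: float = 50.0,
                  max_latency_ms: float = 2000.0,
                  max_cost_usd: float = 0.001) -> bool:
    # Resource budgets are pass/fail gates, not tie-breakers.
    return (r.energy_j <= max_energy_j
            and r.latency_ms <= max_latency_ms
            and r.cost_usd <= max_cost_usd)

def intelligence_per_watt(r: EvalResult) -> float:
    # Accuracy per joule: higher is better.
    return r.accuracy / r.energy_j

candidates = [
    EvalResult("local-3b",  accuracy=0.84, energy_j=12.0,
               latency_ms=600, cost_usd=0.0),
    EvalResult("local-8b",  accuracy=0.89, energy_j=35.0,
               latency_ms=1400, cost_usd=0.0),
    EvalResult("cloud-api", accuracy=0.93, energy_j=180.0,
               latency_ms=900, cost_usd=0.004),
]

feasible = [c for c in candidates if within_budget(c)]
best = max(feasible, key=intelligence_per_watt)
print(best.name)  # -> local-3b (cloud-api exceeds the energy and cost budgets)
```

Under this framing, the highest-accuracy candidate can lose outright: the cloud model is filtered before ranking ever happens, which is exactly the inversion the article describes relative to accuracy-only leaderboards.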
Hennessy's co-authorship is the project's most striking credibility signal. The 2017 Turing Award winner — recognized alongside UC Berkeley's David Patterson for pioneering the RISC architecture from which ARM, Apple Silicon, and Qualcomm Snapdragon chips directly descend — is arguably the intellectual architect of the hardware substrate that makes local AI viable at all. His current role as Chairman of Alphabet, whose cloud AI infrastructure OpenJarvis is explicitly designed to reduce dependence on, gives his endorsement of the local-first thesis unusual weight. The team frames the current moment as analogous to the mainframe-to-PC transition that began in the 1970s: the argument is not that local models match cloud models on raw capability, but that they are now efficient enough for the overwhelming majority of real-world use cases.
The team is seeding adoption through an ENERGY leaderboard challenge, offering a Mac Mini prize to drive real efficiency benchmarking across diverse hardware configurations. The goal is to fill what the researchers call the "missing stack" for local agent deployment — shared abstractions that today's bespoke, non-interoperable builds lack. The ambition is a reference architecture in the tradition of PyTorch: a foundation that personal AI agent projects can build on rather than reinvent from scratch.