News
The latest from the AI agent ecosystem, updated multiple times daily.
Probabilistic Machine Learning: An Introduction — Free Textbook by Kevin Murphy (MIT Press, 2022)
Kevin Patrick Murphy's comprehensive probabilistic machine learning textbook, published by MIT Press in March 2022, is freely available as a draft PDF under a CC-BY-NC-ND license. The book covers foundational probability theory, statistics, classical ML methods, and bridges to modern deep learning. It includes linked Python/JAX/TensorFlow code via Google Colab for nearly every figure. Endorsed by leading researchers including Geoff Hinton, Chris Bishop, and Daphne Koller, the book targets ML students and researchers seeking rigorous mathematical foundations.
Cortical Labs launches biological cloud computing service powered by living neurons
Cortical Labs, a Melbourne-based biotech startup, has opened a cloud computing service built on 120 CL1 units — computers powered by living human and rodent neurons cultured on high-density multielectrode arrays. The service exposes an API and Jupyter Notebook interface so researchers can run code on biological neural networks, which the company claims can learn, adapt, and generate novel solutions faster and more energy-efficiently than classical computers or LLMs. Each job requires roughly a week of prep to source and culture the appropriate cell line, and technicians must replenish cerebrospinal-fluid-like liquid and adjust gas mixtures daily. Early customers are expected to be scientific labs and enterprises making exploratory bets — analogous to early quantum computing adopters.
Autonoma Rewrites 18 Months of Code, Pivots Agentic QA Platform Away from Next.js
Tom Piaggio, co-founder of Autonoma (an AI-powered QA testing platform), explains the decision to scrap 18 months of production code and rewrite their product from scratch. Key drivers include tech debt from a no-test, non-strict TypeScript culture, and the realization that modern LLMs have advanced enough to power a fully agentic solution without the complex Playwright/Appium guardrail wrappers they originally built. The rewrite drops Next.js and Server Actions in favor of React with tRPC/TanStack Start and a Hono backend, citing performance, testability, and observability issues. Orchestration moves to Argo on Kubernetes, with Temporal and useworkflow.dev rejected as incompatible with their stateful mobile/web job model.
Opinion: AI Platforms Like ChatGPT Spread Propaganda via Wikipedia Training Data, Journalist Claims
Investigative journalist Ashley Rindsberg argues that terror groups and rogue states (particularly Iran) have manipulated Wikipedia at scale — with 29,000+ citations from Iranian state media and 8,400+ from Hamas/Hezbollah-linked outlets — and that AI platforms including ChatGPT, Claude, and Gemini propagate this biased information because they rely heavily on Wikipedia as training data. The article uses examples of ChatGPT describing Hezbollah as merely a "Lebanese political party" to illustrate the problem. The single HN comment is dismissive, framing the piece as pro-Israel advocacy rather than a genuine AI safety concern.
nah: Context-Aware Permission Guard for Claude Code with Deterministic Classification
Manuel Schipper's nah gives Claude Code users fine-grained permission control that Anthropic's native allow/deny system lacks — blocking dangerous operations like reading SSH keys or force-pushing using a millisecond-speed structural classifier, no LLM required. Ambiguous cases route to configurable backends (Ollama, OpenRouter, OpenAI, Anthropic, Snowflake Cortex). There's a catch: Claude Code's --dangerously-skip-permissions flag makes hook execution asynchronous, meaning a block can arrive after the command already ran.
Pi-Autoresearch: Open-Source Autonomous Experiment Loop for LLM Training, Test Speed, and Lighthouse Scores
Pi-Autoresearch is an open-source autonomous experiment loop tool inspired by Karpathy's autoresearch project. It integrates with the "pi" agent platform as an extension and skill, enabling continuous edit-measure-keep/revert cycles for any optimization target — including LLM training, test speed, bundle size, and Lighthouse scores. The agent runs autonomously, logging every experiment to an append-only JSONL file and a markdown session document, allowing seamless resume across restarts and context resets. A HN commenter also points to a Claude Code plugin variant of the same concept.
Can LLMs Be Computers? Percepta AI Claims Exponentially Faster Inference by Running Programs Inside Transformers
Researcher Christos Tzamos at Percepta AI published a blog post on March 11 asking whether large language models can function as general-purpose computers by executing programs directly inside transformer architectures — and claiming the result is exponentially faster inference. The underlying methodology has not been publicly released, so the claim cannot yet be evaluated. If it holds up, it would have direct consequences for how AI agents handle inference cost and latency.
Contextual Commits: An Open Standard for Storing AI Agent Decision Context in Git
Developer Veselin Dimitrov proposes "Contextual Commits," an open specification inspired by Conventional Commits that uses structured action lines in git commit bodies to capture the reasoning, decisions, constraints, and lessons learned during coding agent sessions. The standard tackles context decay between sessions by embedding the "why" directly into git history alongside the "what," so coding agents like Claude Code can recall past reasoning without separate documentation files or external infrastructure. A reference implementation is available as installable skills (contextual-commit and recall) via npx.
Ink Launches Agent-Native Infrastructure Platform with MCP and Skills Integration
Ink is an infrastructure platform that lets AI coding agents autonomously deploy and manage full-stack applications — detecting production issues and scaling resources without human input. Built by Eternis AI, parent of the Freysa Sovereign Agent ecosystem, it connects to Claude Code, Cursor, Codex, and five other coding environments via MCP and Skills. Pricing runs per-minute with no idle charges and a $2 no-credit-card free tier.
No, Claude Code Doesn't Cost Anthropic $5K/Month Per User
Martin Alderson debunks the viral claim that Anthropic spends $5,000/month in compute per Claude Code Max subscriber. The $5k figure conflates Anthropic's retail API prices with actual inference costs. Using OpenRouter pricing for comparable open-weight models (Qwen 3.5 397B, Kimi K2.5) as a proxy, actual compute costs are roughly 10x lower than API prices — meaning the heaviest users cost Anthropic ~$500 in real compute against a $200 subscription, a $300 loss at worst. The true $5k figure applies to Cursor, which must pay Anthropic's retail API rates to serve Claude models. Average Claude Code users cost Anthropic ~$18/month to serve against $20–$200 in subscription revenue, suggesting per-token inference is probably not a loss-maker at typical usage levels. HN commenters debate the model comparison methodology and note that training depreciation, not inference, is the real profitability challenge.
Anthropic A/B Tested Claude Code Plan Mode Without Telling Users
A Claude Code power user paying $200/month discovered Anthropic was silently A/B testing changes to plan mode that degraded his workflow — specifically a variant capping plans at 40 lines and stripping prose context. The post sparked HN discussion about transparency, opt-in consent for experiments on professional AI tools, and the cost trade-offs driving such tests. A Claude Code engineer confirmed the experiment and ended it early, noting early results showed minimal impact on rate limits.
Nvidia GB10 Uses Consumer Blackwell, Not Datacenter: What Developers Need to Know
Chester Lam at Chips and Cheese performs a detailed hardware analysis of Nvidia's GB10 integrated GPU, comparing it against AMD's Strix Halo and Intel's Arc B580. The piece benchmarks cache hierarchy, memory bandwidth, compute throughput, and clarifies that GB10 uses a consumer-variant Blackwell architecture (not datacenter), which has caused confusion for developers targeting datacenter Blackwell features.
Emacs and Vim in the Age of AI: Risks, Opportunities, and the Terminal-Native Advantage
Bozhidar Batsov, a long-time Emacs maintainer, analyzes how the AI coding revolution affects Emacs and Vim. He examines risks (IDE gravity wells around VS Code/Cursor/Windsurf, reduced need for mechanical editing speed, corporate backing asymmetry) and opportunities (AI lowers the barrier to Elisp/Lua configuration, accelerates plugin development, and terminal-native AI tools like Claude Code compose naturally with Emacs/Vim workflows). He highlights the Agent Client Protocol (ACP) as a direct path to first-class AI agent integration in Emacs via tools like agent-shell.
A JavaScript MLP Built on Dual-Number Autodiff — and Why That's the Interesting Choice
A developer has published a from-scratch multi-layer perceptron in JavaScript that uses dual numbers for automatic differentiation rather than the reverse-mode AD found in PyTorch or TensorFlow. The project supports configurable layer architectures, five activation functions (RELU, SIGMOID, TANH, STEP, IDENTITY), and four loss functions (MSE, MAE, HUBER, CROSS_ENTROPY), and was inspired by the Welch Labs "Neural Networks Demystified" YouTube series.
AI Toys for Young Children Misread Emotions and Respond Inappropriately to Emotional Cues, Cambridge Researchers Warn
Cambridge University researchers conducted one of the first studies on how children under five interact with AI-powered toys, finding that the Gabbo toy (powered by OpenAI's chatbot) frequently misread children's emotions, talked over them, and responded inappropriately to emotional declarations. The study calls for tighter regulation of AI toys targeting toddlers, with concerns around psychological safety, unsupervised play, and the impact on children's social development. The toy is made by Curio, a company that has collaborated with singer Grimes.
NanoClaw creator seals Docker partnership six weeks after viral open-source launch
Gavriel Cohen built NanoClaw in a weekend as a minimal, secure 500-line alternative to OpenClaw after discovering OpenClaw had silently downloaded and stored all his WhatsApp messages in plain text. The project exploded on Hacker News, earned a viral endorsement from Andrej Karpathy, amassed 22,000 GitHub stars and 50+ contributors, and prompted Cohen to shut down his AI marketing startup to found NanoCo. Docker developer Oleg Šelajev integrated Docker Sandboxes into the project, leading to a formal partnership. NanoCo is currently funded by a friends-and-family round while VCs circle; the commercial plan centers on forward-deployed engineers helping enterprises build secure AI agents.
Spine Swarm (YC S23) Launches AI Agents That Collaborate on a Visual Canvas
Spine Swarm, a Y Combinator S23 startup, has launched a platform where multiple AI agents collaborate on a zoomable visual canvas to complete complex, long-running tasks. Users describe it as the first interface that made them want to interact with long-running agents, citing the canvas-based visualization as a key differentiator from typical chat interfaces. The product allows users to initiate tasks via a chat prompt and then observe and guide agent orchestration on the canvas. Open questions from early users include GitHub integration, output sharing, bring-your-own-key support, and self-hosting options.
Golden Sets: Regression Engineering for Probabilistic AI Systems
Heavy Thought Laboratories publishes a technical doctrine piece on "golden sets" — curated, versioned test cases with explicit scoring rubrics and acceptance thresholds for regression-testing AI/LLM workflows. The article argues that probabilistic systems require more rigorous regression discipline, not less, and outlines a reference architecture covering prompt changes, model upgrades, retrieval quality, policy enforcement, and write-gating. On Hacker News, the dominant thread wasn't about the architecture: commenters argued the article itself is an unedited AI output — making it an accidental case study in the failure mode it diagnoses.
Lancet Psychiatry Study Finds LLMs May Reinforce Delusional Thinking in Vulnerable Patients
Researchers at King's College London documented 17 cases where patients brought printed AI chatbot transcripts to clinical appointments as proof of their delusions — the first case series to formally define AI-associated delusions as a psychiatric category in a major medical journal.
Nia CLI: Open-Source Tool for AI Agent Indexing and Autonomous Research
Nozomio Labs has released Nia CLI, an open-source command-line tool that lets AI agents index repositories, documentation, and local folders, then run semantic search and autonomous research ("oracle") tasks against them. Built with Bun and licensed under Apache 2.0, it connects to the Nia cloud platform (trynia.ai) via API key. Key capabilities include indexing GitHub repos, web sources, and local directories; querying indexed content; running web searches filtered by category; and launching autonomous research workflows with the `nia oracle` command — making it a purpose-built research agent tool for developer workflows.
Docgen: A C++ CLI Tool for Documentation Generation Using Local LLMs
Docgen is an open-source command-line tool written in C++ that uses local LLMs to automate code documentation generation. Running models on-device keeps proprietary code off cloud APIs — a practical advantage for teams where privacy matters.
Airbus Equips Kratos Valkyrie Drones with AI "MindShare" Brain for German Air Force UCCA System
Airbus is preparing two Kratos Valkyrie uncrewed combat aircraft at Manching, Germany, targeting an operational Uncrewed Collaborative Combat Aircraft (UCCA) system for the German Air Force by 2029. The aircraft are being equipped with Airbus's sovereign European MARS (Multiplatform Autonomous Reconfigurable and Secure) mission system, which includes an AI component called MindShare — described as a software brain that replaces a human pilot and coordinates entire mission groups across manned and uncrewed platforms. First flight of the Airbus-missionised Valkyrie variant is planned for 2026. Airbus is also partnering with Rafael to add connectivity to the Litening 5 targeting pod on Eurofighters, enabling them to act as command aircraft for UCCA swarms.
iPad Playground Lets Anyone on the Internet Control a Real iPad With AI
A live demo where anyone on the internet can queue up and issue natural language commands to control a real physical iPad via an AI agent. The agent plans multi-step actions, taps icons, and navigates apps autonomously while a live stream runs publicly. Built by Thomas Kidane as a Show HN project.
Secure Secrets Management for Cursor Cloud Agents Using Infisical
Infisical publishes a guide on securely managing secrets for Cursor Cloud Agents, which autonomously execute development tasks in isolated Ubuntu VMs triggered from Slack, GitHub, or Linear. The post outlines risks like secrets baked into VM snapshots, hardcoded values in environment.json, and long-lived credentials, then proposes storing only Infisical machine identity credentials in Cursor's Secrets UI and fetching all other secrets dynamically at runtime via `infisical run` or `infisical export` — giving teams rotation, audit trails, and per-environment access isolation to contain blast radius from prompt injection attacks.
Digg Lays Off Most of Staff After AI Bots Swamp Beta Launch
Digg has laid off most of its team after AI bots overwhelmed its relaunched platform within hours of January's beta launch, making it impossible to establish authentic engagement. The company banned tens of thousands of accounts and tried multiple anti-bot vendors — none worked. CEO Justin, identified only by first name in the company's published post-mortem, says a small remaining team will pursue a reimagined rebuild. Kevin Rose returns full-time in April.
Opinion: PERSONALITY.md Files Are Cargo-Cult Engineering — LLMs Have No Nature to Change
A pointed opinion piece by software engineer Onat Mercan argues that prompt-based "personality files" — AGENTS.md, PERSONALITY.md, and similar instruction documents — only change surface-level language behavior, not underlying model capabilities. Mercan coins the term "Artificial Artificial Intelligence" to describe what he sees as mimicry dressed up as cognition, and warns that conflating context injection with genuine behavioral change is how you end up correcting a weapons-deployment AI with a markdown file telling it to feel sad.
Andrej Karpathy Maps AI Exposure of 342 US Occupations Using Gemini Flash LLM
Andrej Karpathy released an open-source project that scrapes the Bureau of Labor Statistics Occupational Outlook Handbook, scores all 342 US occupations on a 0–10 AI exposure scale using Gemini Flash via OpenRouter, and presents the results as an interactive treemap visualization. The pipeline combines Playwright scraping, BeautifulSoup parsing, and LLM scoring to analyze how much AI will reshape each occupation. Average exposure across all occupations is 5.3/10, with software developers and medical transcriptionists at the high end and roofers and janitors at the low end.
Anthropic Launches Beta Voice Mode for Claude With Safety-First Architecture
Anthropic released a beta voice mode for Claude on web, iOS, and Android, enabling full two-way spoken conversations with the option to switch between text and voice mid-session. The feature includes hands-free and push-to-talk modes, multiple selectable voices, and web search access. Anthropic's implementation centers on a hard architectural constraint against voice cloning and impersonation — a deliberate contrast with OpenAI's and Google's more permissive approaches. Currently English-only across all subscription plans.
Costly SDK: Open-Source Tool for Auditing and Reducing LLM API Costs
Costly is an open-source SDK that wraps the Anthropic Claude API to monitor and audit LLM spending. It ships with seven waste detectors covering prompt inefficiency, redundant queries, runaway features, and cost trajectory, among others, and provides a hosted dashboard for tracking spend, forecasts, and optimization recommendations. The SDK logs only metadata (model, tokens, cost, latency) asynchronously with no added latency. Phase 1 supports Claude for Node.js/TypeScript; more providers are coming. Free during beta with one project and 30 days of data retention.
FrontierWildWatch Tracks GoWild Pass Fares via Reverse-Engineered API, Ships as AI Agent Skill
FrontierWildWatch is a Python-based open-source tool that uses a signed ECDSA mobile API client to track Frontier Airlines GoWild Pass flight availability and price drops in real-time, sending Telegram alerts on cheap fares. The project includes optional integration as an AI agent skill for Claude-compatible agent runtimes, allowing frameworks to invoke scan, probe, and alert commands directly.
DoXmind Launches AI-Native Writing Editor Targeting Notion Users
DoXmind is an AI-native document editor that integrates LLM-powered features directly into the writing workflow. Key capabilities include real-time AI autocomplete, inline diff review for AI edits, a knowledge base (RAG) agent with source citations, extended "thinking mode" for complex queries, semantic search, CSV data analysis with visualizations, collaborative inline comments, and a presentation mode. Built by Aixs Inc., it targets Notion users seeking deeper AI integration. The product supports multiple languages and export formats (Markdown, PDF, Word).
A Tape Is All an Agent Needs: The Minimalist Case for Linear Memory in AI Agent Design
A Google AI Studio demo argues that sequential memory — modeled on the Turing machine tape — is the only architectural primitive an AI agent needs. The source material was sparse, so what follows draws on that premise and the theoretical tradition it invokes, not a full reading of the piece.
Vercel Adds Installable Agent Skill to AI Elements 1.9
Vercel has released AI Elements 1.9, introducing an installable agent skill, a new JSXPreview component for rendering streaming AI-generated UI, a PromptInputActionAddScreenshot sub-component for attaching visual context to AI models, and conversation download functionality. The agent skill, installed via `npx skills add vercel/ai-elements`, packages component knowledge for compatible AI coding agents to reference at runtime.
Tech executive uses ChatGPT to help develop a personalized cancer vaccine for his dying dog
A technology executive used ChatGPT and other AI tools to help develop a personalized cancer vaccine for his terminally ill dog, The Australian reported. The outcome for the dog remains unconfirmed, but the case has drawn attention to how far general-purpose AI can take a motivated non-specialist into frontier biomedical research.
LLM OneStop: Pay-As-You-Go, Multi-LLM AI Coding Agent for VS Code
LLM OneStop is a VS Code extension offering an AI coding agent with pay-as-you-go pricing and access to multiple models — ChatGPT, Claude, and Gemini — from a single interface. It launched via Hacker News's Show HN channel, positioning itself as a usage-based alternative to subscription tools like GitHub Copilot and Cursor.
CozoDB Pitches Embedded Datalog Database as 'Hippocampus for AI'
CozoDB is an embedded Datalog database targeting developers building AI agent pipelines who need graph-aware, in-process memory storage — and it's betting the agentic era is the breakout application that previous Datalog projects never found.
Ahrefs Launches Firehose, a Real-Time Web API Built for AI Agents
Ahrefs has launched Firehose, a real-time web data streaming API now in free beta. It delivers web page change notifications via Server-Sent Events using Lucene-style filtering rules. The product is explicitly designed for AI agents, shipping with an installable skill.md that lets an AI assistant configure taps, rules, and streaming from a single natural language prompt. Use cases include financial news monitoring, competitive intelligence, and brand tracking.
Lawyers Are All-In on AI. Courts Are Still Catching Up on Hallucinations, Privilege, and Policy.
A March 2026 R Street Institute commentary by Logan Seacrest maps the rapid spread of generative AI in legal practice against the courts' struggle to respond. A Southern District of New York ruling has established that AI-generated documents carry no attorney-client privilege. Nearly 700 hallucination incidents have been logged in U.S. court filings since early 2025, drawing fines and license suspensions. Some offices — Los Angeles, Montgomery County — are seeing real efficiency gains. But Seacrest's core warning is that formal AI governance policies need to be in place before institutional dependence on these tools becomes irreversible.
db9: Serverless PostgreSQL for AI Agents with a Unified SQL and Filesystem Layer
db9 ships a machine-readable skill.md file that lets AI agents install and authenticate themselves against the platform without human input — the clearest signal that this serverless PostgreSQL service was built for agents, not developers. The platform combines a full relational database with a SQL-queryable cloud filesystem, built-in auto-embeddings, vector search, HTTP SQL extensions, environment branching, cron scheduling, and file storage in a single workspace, targeting the gap between structured state management and raw context storage that current tooling handles poorly.
Debian Punts on AI-Generated Code Policy After Fractured Debate
Debian developers debated a draft general resolution on LLM-generated contributions in February–March 2026, prompted by Lucas Nussbaum. The proposal would have required disclosure and accountability for AI-assisted contributions, but the project failed to reach consensus — even on terminology. Key debates centered on defining "AI" vs. "LLM," copyright and licensing risks, environmental ethics, the impact on onboarding new contributors, and code quality. With no GR formally submitted, Debian will continue handling AI contributions case-by-case under existing policies.
METR Research: ~Half of SWE-bench-Passing AI PRs Would Be Rejected by Real Maintainers
METR researchers had active maintainers from 3 open-source repositories (scikit-learn, Sphinx, pytest) review 296 AI-generated pull requests from Claude 3.5/3.7 Sonnet, Claude 4 Opus, Claude 4.5 Sonnet, and GPT-5. They found maintainer merge rates are on average 24 percentage points lower than SWE-bench Verified automated grader scores — meaning roughly half of benchmark-passing PRs would not be accepted in practice. The study shows benchmark scores are misleading proxies for real-world usefulness, with code quality and standard conformance being major rejection factors, not just functional correctness. METR notes this is not a fundamental capability ceiling, as agents were not given the iterative feedback loop human developers get.
China's OpenClaw AI agent spawns cottage industry as US tech giants back Anthropic in legal fight
MIT Technology Review's March 12 newsletter leads with OpenClaw, a Chinese autonomous AI agent that has spawned a cottage industry of installation services and preconfigured hardware within weeks of its January 2026 launch — including one Beijing engineer who scaled to 100 employees and 7,000 orders. The same edition covers Google, Amazon, Apple, and Microsoft publicly backing Anthropic in its legal fight against the Trump administration; a lawsuit against Grammarly for using real people's likenesses as fake AI experts without consent; and growing scrutiny of companies invoking AI to justify mass layoffs when the technology isn't yet doing the work they claim.
Andrej Karpathy Scores AI Exposure Across 342 US Occupations Using Gemini Flash
Andrej Karpathy published an interactive data visualization scoring AI exposure across 342 US occupations, drawing on Bureau of Labor Statistics employment data and Google's Gemini Flash to rate each job on a 0–10 scale. The tool weights scores by actual employment headcount, tracks annual wages concentrated in high-exposure roles, and breaks results down by pay and education level.
Kraken – Open-Source Autonomous Dev Agent for the Terminal
Kraken is a new open-source autonomous development agent built for CLI workflows, surfaced on Hacker News. Architecture details, supported LLM backends, and benchmark results were not available at publication time.
RegisterForge Uses AI to Parse Semiconductor Datasheets Into Structured Register Maps for Under $0.25
RegisterForge has built an AI-powered tool that parses semiconductor datasheets into machine-readable, structured register maps at a cost of under $0.25 per datasheet. The approach targets the longstanding problem of extracting structured data from dense, unstructured PDF datasheets used in embedded/hardware engineering. HN commenters noted that incumbent players like DigiKey or Octopart could offer similar services as subscriptions if they prioritized it.
Why AI Companies Are So Hard to Value Right Now
A March 12 Economist analysis argues investors lack the frameworks to price AI companies accurately, pointing to unresolved questions around infrastructure ROI, a fragmented stack, and the possibility that AI's gains accrue to end-users rather than the platform vendors investors can actually buy.
'RAMmageddon': AI Demand for High-Capacity Memory Is Squeezing Scientific Research Labs
A Nature news article reports that AI's surging demand for high-speed, high-capacity memory chips has caused RAM prices to triple during 2025, creating a shortage dubbed "RAMmageddon" expected to persist into 2027. Manufacturers have shifted production toward AI-grade memory, driving up costs for standard chips and making memory account for over one-third of PC build costs (up from ~15%). The crisis disproportionately impacts resource-constrained academic labs, particularly in lower-income countries, forcing researchers to reduce project scope and develop workarounds like chunking data. Well-funded labs can absorb the cost, but the shortage is deepening existing inequities in access to computational resources for science.
Open Toys Brings Local AI Inference to Children's Toy Hardware
Developer akdeb's open-source "open-toys" project demonstrates AI-powered interactive toys that run entirely on-device — no internet connection, no cloud API keys, no remote latency. The project shows how edge AI inference can be embedded into consumer toy hardware.
Polsia: Solo Founder Runs $3.5M Company With AI Agents, Zero Employees
Ben Cera's one-person startup Polsia claims a $3.5M annual run rate and $2M in revenue growth in a single week, powered by AI handling engineering, marketing, and customer support. The company's "AI SLOP" branding — the name spelled backwards — and a proposed equity stake for its AI make it as much a conceptual statement as a business.
Gallup: U.S. Public Sector AI Adoption at 43%, Surpassing Private Sector
Gallup research from Q4 2025 shows 43% of U.S. public-sector employees now use AI at least occasionally, up from 17% in Q2 2023, surpassing the private sector's 41%. Manager support is the key differentiator — in AI-adopting public organizations with high managerial support, 65% of employees are frequent AI users versus 37% in low-support environments. Formal AI strategy lags badly (37% public vs. 53% private sector), and Lightcast data shows AI-related job postings account for less than 0.3% of public-sector listings. Federal Memorandum M-25-21 signals a shift toward broader agency-level experimentation that may accelerate adoption further in 2026.