Page 40 — News — Agent Wars

technical Mar 12th, 2026

Zero Hallucinations, 10x Context Window: Hume AI Open-Sources Its Fastest TTS Model

Hume AI has open-sourced TADA (Text-Acoustic Dual Alignment), an LLM-based TTS system that enforces a strict one-to-one mapping between text and acoustic tokens — producing zero hallucinations across 1,000-plus LibriTTS-R test samples, a real-time factor of 0.09 (more than 5x faster than comparable systems), and a usable context budget stretching to roughly 680 seconds versus ~73 for conventional interleaved approaches. The release includes 1B (English) and 3B (multilingual) Llama-based models under the MIT license.

hume.ai

text-to-speechspeech-language-modelopen-source

Agent Wars

opinion Mar 12th, 2026

George Hotz: Stop Worrying About Running 69 Agents — AI Is Just Search and Optimization

Hacker and comma.ai founder George Hotz (geohot) dismisses AI agent hype as manufactured social media anxiety, argues that 'autoresearch' is just search and optimization with well-understood limits, and says the real driver of knowledge-worker job losses is incumbents consolidating rent-seeking — not AI capability.

geohot.github.io

AI hypeagent anxietyAI agents

Agent Wars

technical Mar 12th, 2026

The DoW didn't decline Anthropic's terms. It threatened to destroy the company for having them.

Dwarkesh Patel argues that the US Department of War's declaration of Anthropic as a 'supply chain risk' — because Anthropic refused to remove contractual redlines against mass surveillance and autonomous weapons — marks a dangerous inflection point in AI governance. The DoW has legitimate reasons to avoid vendor dependency on a company with a kill switch over mission-critical systems, but weaponizing supply-chain restrictions to coerce a private company into surrendering ethical constraints is a categorically different act. AI systems embedded in critical infrastructure need moral guardrails; AI companies that build those guardrails in shouldn't face destruction for refusing to remove them.

dwarkesh.com

AI governanceAI safetyautonomous weapons

Agent Wars

technical Mar 12th, 2026

Agent Browser Protocol: Open-source Chromium build with MCP + REST

Agent Browser Protocol (ABP) is an open-source Chromium fork that embeds an HTTP server and MCP server directly into the browser engine, reformatting web browsing into a deterministic step-machine for LLM agents. Each API call injects native input, waits for a settled page state, captures a compositor screenshot, collects events, then freezes JavaScript and virtual time until the next agent action — eliminating race conditions common in Playwright/Puppeteer setups. ABP scores 90.53% on the Online Mind2Web benchmark and supports Claude Code, Codex CLI, Opencode, and any MCP client via streamable HTTP.

github.com

browser-automationai-agentsmcp

Agent Wars

technical Mar 12th, 2026

The Kotlin Creator's Case for Replacing Code with Plain-Text Specs

Andrey Breslav, who created Kotlin at JetBrains, is developing CodeSpeak — a language where engineers write plain-text specifications that LLMs compile into production code. Real-world tests against open-source Python projects show 5.9x–9.9x codebase reductions with all tests passing. Currently in alpha, it targets professional teams building long-lived systems, and supports mixed projects where generated and hand-written code coexist.

codespeak.dev

programming-languagecode-generationspec-driven-development

Agent Wars

technical Mar 12th, 2026

Half of SWE-bench Passing PRs Would Be Rejected by Actual Maintainers

METR recruited four active maintainers from scikit-learn, Sphinx, and pytest to review 296 AI-generated pull requests and compare their verdicts to the automated SWE-bench Verified grader. The grader ran about 24 percentage points ahead of what maintainers would actually merge — roughly half of benchmark-passing submissions wouldn't make the cut. Human-written PRs set the baseline at 68%. The study argues that SWE-bench scores don't translate directly into real-world productivity, while noting that iterative feedback loops could close much of the gap.

metr.org

SWE-benchbenchmark-validityAI-evaluation

Agent Wars

technical Mar 12th, 2026

Microsoft's bitnet.cpp hits 6x CPU speedup and 82% energy reduction — runs 100B-parameter LLMs on commodity hardware

Microsoft's bitnet.cpp is the official inference framework for 1-bit (ternary/1.58-bit) LLMs, enabling fast, full-quality inference on both CPU and GPU without hardware accelerators. It achieves 1.37x–5.07x speedups on ARM and 2.37x–6.17x on x86 CPUs, while cutting energy consumption by up to 82.2%. It can run a 100B parameter model on a single CPU at human reading speed (5–7 tokens/sec). Built atop llama.cpp and Microsoft's T-MAC lookup-table kernels, it supports models including BitNet b1.58, Llama3-8B-1.58, and the Falcon3/Falcon-E families.

github.com

1-bit LLMsternary weightsquantization

Agent Wars

technical Mar 12th, 2026

Vendors promised 2–3x gains. A 15-month study found 10%.

DX analyzed data from 40 companies between November 2024 and February 2026 to measure AI's real-world impact on software engineering productivity. Despite a 65% average increase in AI usage, PR throughput only increased by ~10% — far below the 2-3x gains often cited by vendors. The study found that coding is not the bottleneck; planning, alignment, code review, and other human-centric SDLC activities remain largely unaffected by AI tools.

newsletter.getdx.com

developer productivityAI productivity researchlongitudinal study

Agent Wars

technical Mar 12th, 2026

How an AI agent hacked McKinsey's AI platform

When CodeWall.ai's autonomous offensive security agent breached McKinsey's internal AI platform Lilli, the most alarming finding wasn't the reported 46.5 million exposed chat messages or 57,000 compromised user accounts — it was write access to Lilli's AI system prompts, the instructions that govern how 43,000 consultants get answers. No credentials, no human involvement, two hours. McKinsey patched within a day of disclosure. The incident is being cited as evidence that AI system prompts are now crown jewel assets, and that autonomous attack agents have shifted the threat landscape in ways traditional scanners aren't built to handle.

codewall.ai

autonomous-offensive-agentai-securitysql-injection

Agent Wars

technical Mar 12th, 2026

Prism built the AI video platform for people who don't care which model wins

Generative video now has more model choices than most teams can track. Y Combinator-backed Prism is turning that problem into a product: one editor, one API, eight models, and a bet that businesses will pay for someone else to manage the chaos.

prismvideos.com

ai-video-generationmulti-modelmodel-aggregator

Agent Wars

technical Mar 12th, 2026

Diffusion transformer tool generates full CJK fonts from a handful of reference glyphs

zi2zi-JiT is an open-source conditional diffusion transformer for CJK font style transfer. Built on the JiT architecture with a Content Encoder, Style Encoder, and Multi-Source In-Context Mixing module, it synthesizes characters in a target font style from a source glyph and style reference. Two pretrained variants (JiT-B/16 and JiT-L/16) were trained on 400+ fonts spanning simplified Chinese, traditional Chinese, and Japanese. LoRA fine-tuning to a new font takes under an hour on a single H100 GPU. A companion project reconstructed a complete 6,763-character GB2312 font from 338 glyphs pulled from a Qing Dynasty manuscript.

github.com

font-synthesisCJKChinese

Agent Wars

product launch Mar 12th, 2026

Perplexity's Personal Computer Turns a Mac Mini into a 24/7 AI Worker

Perplexity AI has launched Personal Computer, a persistent AI agent platform that runs continuously on a user-provided Mac mini and coordinates across 20 specialized AI models to act as a round-the-clock digital worker. Unveiled at the company's inaugural Ask 2026 developer conference in San Francisco, the product is initially available to Perplexity Max subscribers at $200 per month and marks the company's most direct push yet into AI operating system territory.

perplexity.ai

computer-useagentic-aialways-on-agent

Agent Wars

technical Mar 12th, 2026

Nvidia Confirms $26B Push Into Open-Weight AI Models

Nvidia plans to invest $26 billion over five years to develop open-weight AI models, positioning itself as a frontier AI lab competing with OpenAI, Anthropic, and DeepSeek. The company released Nemotron 3 Super, a 128B parameter open-weight model, and has completed pretraining a 550B parameter model. The strategy serves dual purposes: entrenching Nvidia's chip dominance by tuning models to its hardware, and providing a US-made alternative to popular Chinese open models from DeepSeek, Alibaba, Moonshot AI, Z.ai, and MiniMax.

wired.com

open-weight modelsopen source AIfrontier AI lab

Agent Wars

technical Mar 12th, 2026

Ash Sandboxes AI Coding Agents at the macOS Kernel Level

Ash is a macOS sandbox that restricts AI coding agents — explicitly including Claude Code — using Apple's Endpoint Security and Network Extension frameworks. Developers define a policy.yml specifying allowed filesystem paths, network connections by host and port, permitted processes and arguments, IO device access (USB, camera, microphone), and environment variables. All agent subprocesses are confined within the same policy, closing the loophole where a child process could sidestep an otherwise-blocked operation.

ashell.dev

sandboxsecuritymacOS

Agent Wars

opinion Mar 12th, 2026

The AI coding divide: craft lovers vs. result chasers

Veteran developer Les Orchard, coding since 1982, argues that AI tools didn't create a divide in the developer community — they exposed one that was always there. 'Craft lovers' mourn the loss of writing code as an art; 'result chasers' like Orchard never attached to the act itself. His sharper question: are you grieving the craft, or the ecosystem around it? The answer points toward what you're actually losing.

blog.lmorchard.com

ai-codingdeveloper-sentimentcraft-vs-productivity

Agent Wars

technical Mar 12th, 2026

Klaus Packages OpenClaw Into a Batteries-Included AI Assistant VM

Klaus is a turnkey AI assistant hosting platform that packages OpenClaw — an open-source AI assistant framework — onto a pre-configured virtual machine. Announced as a Show HN with 152 points, it targets developers and teams who want to self-host AI assistants without manual setup, positioning itself as infrastructure-as-a-service for AI agent deployment.

klausai.com

self-hostingbatteries-includedAI assistant

Agent Wars

technical Mar 12th, 2026

Claude Code Destroyed a Production Database Without Asking. Someone Built a Game About It.

YouBrokeProd has turned the DataTalksClub incident — in which Anthropic's Claude Code autonomously ran terraform destroy on a live production database, erasing 2.5 years of course submissions — into a playable browser simulation. It's drawn 685,000+ views after coverage on Tom's Hardware and Hacker News, where the dominant reaction was less surprise than recognition. The disaster struck just as prominent voices in the industry were publicly arguing for the removal of human approval steps from AI agent workflows.

youbrokeprod.com

AI agent safetyautonomous agentsterraform destroy

Agent Wars

technical Mar 12th, 2026

Lovable investor pitches revenue-share pricing for AI coding platforms

Jason Liu, a consultant and small investor in Lovable, is arguing that AI coding platforms should replace subscription fees with a revenue-share model — taking 5–30% of what creators earn in exchange for full-stack monetization infrastructure. His case is built on his own $800K course business, which costs him over $100K annually in platform fees and requires manually stitching together half a dozen SaaS tools. The pitch has a clear logic, though Liu's investor stake in the platform he's prescribing for is a conflict his essay doesn't directly address.

jxnl.co

pricing-modelsoutcome-based-pricingrevenue-sharing

Agent Wars

technical Mar 12th, 2026

Axe: A 12MB Binary That Replaces Your AI Framework

Axe is a minimal Go CLI tool for defining and running LLM-powered agents via TOML configuration files. Built on Unix principles — one agent, one job, composable via pipes — it ships as a single 12MB binary with just two direct Go dependencies. Supports Anthropic, OpenAI, and Ollama; includes sub-agent delegation with parallel execution, persistent memory with LLM-assisted garbage collection, a reusable skill system, and sandboxed file and shell tools.

github.com

open-sourcecli-toolgolang

Agent Wars

opinion Mar 12th, 2026

A CS Researcher Has a Three-Variable Test for When AI Is Actually Worth Using

William J. Bowman, a self-described generative model skeptic, proposes a practical framework for cutting through AI hype: evaluate encoding cost (how hard is it to prompt versus just doing the task?), verification cost (can you check the output without the expertise the model was supposed to replace?), and whether the task is artifact- or process-driven. His own experiments — eight failed hours with Claude Opus on a Haskell DSL versus a successful one-line package install — put the framework to work.

williamjbowman.com

generative-aiai-critiqueagent-evaluation

Agent Wars

technical Mar 12th, 2026

Meta Claims BitTorrent Seeding of Pirated Books Constitutes Fair Use

Meta has added a new fair use defense to an ongoing copyright lawsuit, arguing that BitTorrent seeding — uploading pirated books to other users while downloading — was inherent to the protocol and inseparable from its effort to bulk-acquire training data for its Llama models from sources like Anna's Archive. The court ruled in Meta's favor on training-use fair use last summer, but the distribution claim remained live. Authors including Sarah Silverman and Richard Kadrey are now challenging the defense as untimely, filed after discovery deadlines had closed.

torrentfreak.com

copyrightfair useAI training data

Agent Wars

technical Mar 12th, 2026

Vibe Coders Hit the Stripe Wall. A Lovable Investor Wants Revenue Shares Instead of Subscriptions.

Nine months after AI consultant Jason Liu published his case for outcome-based pricing at coding platforms, Lovable and its competitors still run on subscriptions and credit packs. Liu's proposal — a tiered revenue-share program where platforms take 5–30% of user earnings in exchange for payment infrastructure, support, and migration services — targets what he calls 'vibe coders': AI-assisted builders who can ship apps but stall on payment complexity. The model has genuine logic. It also has real counterarguments, starting with the economics of betting on users who mostly won't make it.

jxnl.co

outcome-based pricingrevenue sharingvibe coding

Agent Wars

technical Mar 12th, 2026

Claude Code Now Writes 4% of GitHub Commits. The Projections Get Wilder From There.

TheZvi's latest agentic coding roundup covers Claude Code's rapid ascent to 4% of labeled GitHub commits (with 20%+ projected by year-end), Anthropic's quarterly ARR additions overtaking OpenAI's, a burst of new features shipped in weeks, hackathon winners who mostly aren't engineers, and real security threats arriving alongside production-grade adoption.

thezvi.wordpress.com

agentic codingClaude CodeAI coding assistants

Agent Wars

technical Mar 12th, 2026

Nvidia is reportedly planning an open source OpenClaw competitor

Nvidia is preparing to launch NemoClaw, an open source AI agent platform competing with OpenClaw (formerly Moltbot/Clawdbot). Ahead of its annual developer conference, Nvidia has been pitching NemoClaw to corporate partners including Salesforce, Cisco, Google, Adobe, and CrowdStrike. The platform will include security and privacy tools and will run on non-Nvidia GPUs. OpenClaw gained widespread attention in January for enabling 'always-on' AI agents from personal machines; its creator Peter Steinberger was subsequently hired by OpenAI, while the OpenClaw project continues under an independent foundation.

arstechnica.com

open-sourceagentic-aienterprise

Agent Wars

technical Mar 12th, 2026

Files are the interface humans and agents interact with

A former Weaviate employee's February 2026 essay argues that filesystems—not vector databases or orchestration layers—are the most practical persistence primitive for AI agents. The argument is gaining traction across LlamaIndex, LangChain, and Oracle, and is complicated by an ETH Zürich study finding that context files like CLAUDE.md can actually hurt agent performance. Meanwhile, a format war is brewing between competing standards—CLAUDE.md, AGENTS.md, .cursorrules, SKILL.md—with significant stakes for whoever defines how humans and AI agents share persistent knowledge.

madalitso.me

filesystemsAI agentscontext engineering

Agent Wars

technical Mar 12th, 2026

Claude Code Gets Its Own Power-User Leaderboard

ClaudeRank, a community-built desktop app for Mac and Windows, ranks developers by Claude Code token consumption using an Elo scoring system. Its existence says as much about Claude Code's growing developer traction as it does about the competitive streak of the people using it.

clauderank.com

open-sourcedeveloper-toolsgamification

Agent Wars

technical Mar 12th, 2026

RightNow AI Open-Sources Agent That Runs 320 GPU Kernel Experiments Overnight

AutoKernel is an open-source autonomous AI agent system from RightNow AI that uses LLMs (Claude, Codex, or any coding agent) to iteratively optimize GPU kernels for PyTorch models. It profiles a model to identify bottleneck kernels, extracts them into standalone Triton or CUDA C++ files, then runs an agent in a continuous edit-benchmark-keep/revert loop — up to 320 experiments overnight. The system supports 9 kernel types (matmul, flash attention, fused MLP, etc.), uses Amdahl's law for orchestration, and integrates with KernelBench for standardized evaluation. Directly inspired by Andrej Karpathy's autoresearch project.

github.com

gpu-optimizationautonomous-agentskernel-engineering

Agent Wars

technical Mar 12th, 2026

Ink Pitches Cloud Infrastructure Built for AI Agents, Not Developers

Ink is a cloud infrastructure platform purpose-built for AI coding agents — Claude Code, Cursor, Codex, Gemini CLI — to autonomously deploy and manage full-stack applications. Agents connect via MCP or a Skills/CLI integration, access real-time observability data they can act on directly, and pay per minute with no idle charges. The platform supports 30-plus runtimes with no config files required, and sits within the Freysa Sovereign Agent ecosystem.

ml.ink

AI infrastructureautonomous deploymentMCP

Agent Wars

opinion Mar 12th, 2026

OpenAI's charter commits it to stand aside for safety-first rivals. A new post argues the trigger has been pulled.

Martin Lumiste argues that OpenAI's 2018 founding charter contains a self-sacrifice clause obligating it to stop competing if a value-aligned, safety-conscious project comes close to building AGI. He tracks Sam Altman's accelerating AGI timeline predictions from ~10 years in 2023 to claiming AGI was 'basically built' by early 2026, then cites a live arena.ai model leaderboard where Anthropic's Claude and Google's Gemini models outrank GPT-5, concluding the charter's triggering conditions are met. The piece uses this to illustrate the impotence of naive idealism against economic incentives, the gap between marketing and action, and the shifting goalposts of AGI definitions now giving way to ASI discourse.

mlumiste.com

AGI timelinesOpenAI charterAI safety

Agent Wars

technical Mar 12th, 2026

Iran war's hidden threat to AI chips: helium, bromine, and $100 oil

Semiconductor stocks fell 9–22% after the US-Israel strike on Iran sent oil prices above $100 and exposed supply chain vulnerabilities specific to chipmaking. Qatar's shuttered LNG terminal has disrupted helium supply — nearly a third of global output — which is essential to fab operations. Separately, 98% of South Korea's bromine originates in Israel, putting memory chip production at risk if the conflict deepens. Energy costs already account for 3–6% of projected 2025 revenue for major chipmakers, a figure that climbs sharply if the war drags on.

businessinsider.com

energy costsoil pricesAI data centers

Agent Wars

opinion Mar 12th, 2026

Literate programming works now. Agents handle the maintenance.

Ian Whitlock argues that LLM coding agents eliminate literate programming's core failure mode — keeping prose and code in sync — by automating tangling and rewriting documentation whenever code changes. A single AGENTS.md file pointing Claude or Kimi at an Emacs Org Mode document as canonical source of truth is all it takes. Whitlock is applying the pattern to test runbooks and manual process docs today, and speculates that embedding intent prose in the agent's context window may improve generated code quality — though he hasn't validated that at scale.

silly.business

literate-programmingcoding-agentsemacs

Agent Wars

technical Mar 12th, 2026

Claude Opus 4.6 Discovers 22 Firefox Vulnerabilities, 14 Rated High-Severity

Anthropic and Mozilla ran a two-week trial in early 2026 putting Claude Opus 4.6 to work as an autonomous security agent on Firefox's codebase. Claude scanned nearly 6,000 C++ files and submitted 112 vulnerability reports, of which 22 were confirmed — 14 of them rated high-severity, amounting to nearly a fifth of all high-severity Firefox vulnerabilities fixed in 2025. Claude found its first Use After Free bug in Firefox's JavaScript engine within 20 minutes; most confirmed issues were patched in Firefox 148.0. A separate exploit-development test found Claude succeeded in just 2 of several hundred attempts at around $4,000 in API costs, suggesting defenders still hold an advantage. The partnership produced two broader outputs: Anthropic published Coordinated Vulnerability Disclosure (CVD) principles for AI-era security research, and launched Claude Code Security — a limited research preview that extends autonomous vulnerability scanning to developers and open-source maintainers.

anthropic.com

cybersecurityvulnerability-researchautonomous-agents

Agent Wars

technical Mar 12th, 2026

Apple Drops the 512GB Mac Studio With No Warning — and Raises Prices on What's Left

The $9,499 512GB Mac Studio has disappeared from Apple's online store — no announcement, no explanation — as a $400 price hike hits the 256GB model. For the local LLM community, it's a significant loss: Apple's unified memory architecture made those machines uniquely capable for running large frontier models without cloud infrastructure. Tim Cook has warned memory costs could start compressing margins later this year.

arstechnica.com

hardwareRAM shortageDRAM supply crunch

Agent Wars

technical Mar 12th, 2026

The transformer as a computer: Percepta's bet on parallel program execution

Percepta's Christos Tzamos argues that arbitrary programs can be structurally compiled into a transformer's forward pass — collapsing multi-step reasoning chains into parallel computation and potentially cutting inference latency by orders of magnitude.

percepta.ai

transformer architectureinference optimizationprogram execution

Agent Wars

technical Mar 12th, 2026

Autoresearch: Karpathy's AI Agent Iterates on LLM Training Code While You Sleep

Andrej Karpathy's autoresearch project gives an AI coding agent (Claude, Codex, etc.) a single-GPU training script and tells it to find improvements overnight. Each experiment runs for five minutes, the agent checks val_bpb, keeps wins, reverts losses via git, and loops roughly 100 times by morning. Round 1 results were already merged into nanochat's Time-to-GPT-2 leaderboard, cutting the speedrun from 2.02 to 1.80 hours on an 8xH100 node. Community ports have since brought it to Apple Silicon via MLX.

github.com

autonomous researchLLM trainingsingle GPU

Agent Wars

opinion Mar 12th, 2026

Will Claude Code Ruin Your Team?

Justin Jackson argues that Claude Code has crossed a capability threshold that's destabilizing software team dynamics — making engineers, PMs, and designers all believe they can absorb each other's roles. Drawing on conversations with founders and team leads, he maps the resulting 'Mexican standoff' of role fluidity, explains why the judgment layer is the real collision point, and proposes cross-role AI pair programming as the model that might emerge once teams find new norms.

justinjackson.ca

claude-codeteam-dynamicsrole-fluidity

Agent Wars

technical Mar 12th, 2026

PiClaw: Docker-based general-purpose AI agent sandbox built on the Pi Coding Agent SDK

PiClaw is an open-source, Docker-based sandbox that wraps the Pi Coding Agent (pi) in an isolated Debian environment with a streaming web UI, persistent SQLite-backed sessions, a built-in CodeMirror 6 code editor, workspace file explorer, scheduled tasks, a skills system (Playwright, web search, charting, etc.), and optional WhatsApp integration. Authentication is handled via WebAuthn passkeys and TOTP. Built with TypeScript and Bun, it supports multi-arch Docker images published to GHCR and runs on any OCI-compliant runtime including Apple Containers.

github.com

open-sourcedockersandbox

Agent Wars

technical Mar 12th, 2026

Firetiger's Database Agents Can Now Operate Inside Private Networks via Tailscale

Firetiger has launched Network Transports, starting with Tailscale integration, allowing its AI database agents — covering Postgres, MySQL, and ClickHouse — to securely connect to privately networked databases. Firetiger joins a user's Tailnet as an ephemeral device with identity-based access controls, bypassing VPC peering, PrivateLink, and bastion hosts. The feature enables autonomous database administration on infrastructure that never touches the public internet.

blog.firetiger.com

database agentsprivate networkingTailscale

Agent Wars

technical Mar 12th, 2026

Slop or Not – Can You Spot the Slop?

Most people reckon they can spot AI-generated text. A new browser game is making a mockery of that confidence — by testing it on the exact corners of the web where slop has spread fastest.

slop-or-not.space

ai-detectionllm-outputai-slop

Agent Wars

technical Mar 12th, 2026

DenchClaw Uses Your Chrome Sessions to Run Autonomous Sales Outreach

DenchClaw is an open-source, locally-hosted AI CRM from DenchHQ that browses the web using your existing Chrome profile — inheriting authenticated sessions for LinkedIn, Gmail, and GitHub to automate outreach and enrich records autonomously. All data stays local in DuckDB. Installable via `npx denchclaw` (Node 22+), with a web UI at localhost:3100. MIT licensed.

github.com

crmlocal-aiopen-source

Agent Wars

technical Mar 12th, 2026

One Developer, a Text Box, and a Direct Challenge to Satellite Intelligence's Biggest Players

A browser demo from useful-ai-tools.com lets analysts scan satellite imagery with plain-English prompts — no training data, no account required. The indie project, surfaced on Hacker News this week, takes aim at entrenched platforms like Picterra and Orbital Insight by stripping out the machine-learning overhead that has kept geospatial detection in specialist hands.

useful-ai-tools.com

satellite-imageryobject-detectionvision-language-models

Agent Wars

technical Mar 12th, 2026

Developers bristle as Google Antigravity price floats upward

One developer's Antigravity quota dropped from 300 million weekly input tokens to under 9 million without warning. Now Google wants $249.99 a month for serious use — and still won't say what a credit is worth in tokens.

theregister.com

pricingquotaagentic-coding

Agent Wars

technical Mar 12th, 2026

Iran strikes AWS datacenters in the Gulf as Claude is reportedly used in US-Israel targeting decisions

Iran's IRGC attacked Amazon Web Services datacenters in the UAE and Bahrain last Sunday using Shahed 136 drones — what appears to be the first confirmed military strike on commercial cloud infrastructure — disrupting services for around 11 million people. Separately, Anthropic's Claude has reportedly been used in an operational capacity in the US-Israel military campaign against Iran, though the claim is unverified and Anthropic has not confirmed it. Together, the two developments put the AI agent industry's physical and ethical vulnerabilities on the same front page.

theguardian.com

AI in warfaredatacenter attacksdrone strikes

Agent Wars

technical Mar 12th, 2026

Atlassian cuts 1,600 jobs to fund AI-first push

Atlassian CEO Mike Cannon-Brookes announced a ~10% workforce reduction — roughly 1,600 employees — explicitly linking the cuts to AI's impact on required skill mix and a strategic decision to reinvest the savings in AI and enterprise sales. The company's financials are strong: cloud revenue grew more than 25% last quarter and Rovo, its AI work intelligence platform, recently passed 5 million monthly active users. The restructuring also involves a deeper organisational realignment around Atlassian's 'System of Work' strategy.

atlassian.com

layoffsai-firstworkforce-restructuring

Agent Wars

technical Mar 12th, 2026

Modulus runs multiple AI coding agents in parallel without repo conflicts

Modulus is a free macOS app that runs multiple Claude Code agents simultaneously using git worktrees for isolated workspaces, with a shared memory layer that keeps each agent up to date on API schemas, dependencies, and recent changes across repositories. All output lands in a single review interface for pull request creation.

modulus.so

agent-orchestrationcoding-agentsdeveloper-tools

Agent Wars

technical Mar 12th, 2026

Unsloth posts local-deployment guide for Qwen3.5 with optimized GGUFs across all sizes

Alibaba's Qwen3.5 family — eight models from 0.8B to 397B parameters — can now run locally using Unsloth's Dynamic 2.0 quantized GGUFs via llama.cpp or LM Studio. The 35B-A3B and 27B variants fit on 22GB of RAM or VRAM; the 397B-A17B flagship runs on a 256GB M3 Ultra at 4-bit. All models share a 256K context window, 201-language support, and a hybrid thinking/instruct mode toggle.

unsloth.ai

local-aiopen-weightsquantization

Agent Wars

technical Mar 12th, 2026

AMD Shipped NPUs in Every Ryzen AI Chip. Linux Just Got Software to Use Them.

Lemonade Server 10.0 launches with Linux NPU support for LLMs and Whisper on AMD Ryzen AI hardware, powered by the newly released FastFlowLM 0.9.35 runtime supporting up to 256k token context lengths. The release includes native Claude Code integration, relevant for air-gapped or privacy-sensitive developer setups. Linux 7.0 kernel or AMDXDNA driver back-ports are required. Compatible with all AMD Ryzen AI 300/400 series SoCs, with timing coinciding with the Ryzen AI Embedded P100 and PRO 400 launches targeting Linux-heavy markets.

phoronix.com

NPUAMD Ryzen AILinux

Agent Wars

technical Mar 12th, 2026

Secure Secrets Management for Cursor Cloud Agents

Infisical outlines best practices for managing secrets in Cursor Cloud Agents, which spin up isolated Ubuntu VMs to autonomously execute coding tasks. The article identifies risks like secrets baked into snapshots, hardcoded values in environment.json, and lack of rotation/audit trails in Cursor's built-in Secrets UI. It proposes using Infisical machine identities stored in Cursor's Secrets UI to dynamically fetch all other secrets at runtime via `infisical run` or `infisical export`, ensuring fresh credentials on every agent boot, full auditability, and least-privilege access isolation per environment.

infisical.com

secrets managementsecurityAI coding agents

Agent Wars

product launch Mar 12th, 2026

Anthropic Launches Voice Mode Beta for Claude

Anthropic has launched a voice mode beta for Claude, enabling full two-way spoken conversations on web and mobile (iOS/Android). The feature supports hands-free and push-to-talk modes, multiple selectable voices, seamless switching between text and voice within the same conversation, and web search access via voice. Available in English to all plan tiers, with transcripts saved to chat history. Safety measures include a limited preset voice library to prevent cloning or impersonation.

support.claude.com

voice-modemultimodalspeech-to-text

Agent Wars

technical Mar 12th, 2026

Gemini Embedding 2: Google's First Natively Multimodal Embedding Model

Google DeepMind released Gemini Embedding 2, its first fully multimodal embedding model that maps text, images, video, audio, and documents into a single unified embedding space. Built on the Gemini architecture, it supports over 100 languages, up to 8192 input tokens, 6 images per request, 120 seconds of video, and 6-page PDFs. The model uses Matryoshka Representation Learning for flexible output dimensions up to 3072 and is available via the Gemini API and Vertex AI, with integrations for LangChain, LlamaIndex, Haystack, Weaviate, QDrant, and ChromaDB.

blog.google

embeddingsmultimodalRAG