🛠️ Project Spotlight: Forge
The Pitch: Running multi-step agent workflows on small, self-hosted LLMs (like 7B–8B parameter models) is notoriously flaky. Forge acts as an open-source "guardrails and context-management stack." Sitting as a proxy between your client and local server, it rescues malformed tool calls, enforces required steps, and nudges models to retry when they fail. On internal benchmarks, it boosted a Ministral 8B model to an 86.5% overall success rate.
🗣️ Inside the Hacker News Discussion
The comment section largely focused on the trade-offs of using automated "harnesses" or wrappers around smaller local models, debating latency, accuracy, and engineering philosophies.
1. The "Latency vs. Accuracy" Trade-off
A major point of skepticism came from users who primarily rely on cutting-edge cloud models (like OpenAI or Anthropic). One user questioned whether Forge's layers of guardrails, wrappers, and retry-loops introduce crippling latency to local setups.
- The Creator's Response: The author behind Forge (zmbll) clarified that the actual code overhead is practically zero (around 5 milliseconds per Python function). The real "latency" comes in when a workflow actually has to retry a prompt. However, as the creator pointed out, spending extra time on automated LLM retries is simply the difference between a workflow failing instantly versus eventually succeeding.
2. The "Thousand Monkeys on Typewriters" Debate
Can a small, somewhat prone-to-error model achieve SOTA (State of the Art) results if you just put it in a retry-loop forever?
- Some users argued that if token costs aren't an issue, forcing a small model to re-evaluate itself is a highly viable strategy.
- Others countered that "giving a junior developer unlimited time doesn't mean they reach SOTA quality," noting that even massive models struggle with complex problems, regardless of retries.
- This led to a humorous framing of local LLMs guided by Forge as "a thousand unusually smart monkeys who speak major human languages... but sometimes make bizarre mistakes and have to backtrack." The creator joked that a core metric to measure this is ETTWS—Estimated Time To Working Solution (which another user quickly dubbed Estimated Time to William Shakespeare).
3. Context Hygiene and Alternative Harnesses
Several developers chimed in to share their own homegrown approaches to keeping small local models on track, like running local Gemma models on older hardware (like an RTX 2060).
- A user detailed their personal harness design, which focuses on strict programmatic validation of tool arguments before execution, and physically rewinding the conversation history to inject failure reasons if the model hallucinates.
- The Forge creator noted they share a similar philosophy. A key feature of Forge is "context hygiene"—collapsing the tool-call history directly into the context window to prevent the local model from getting confused by its own past bloated mistakes.
Housekeeping Note: Early on, users pointed out that the paper/readme link on the original post was broken. The author quickly provided the correct repo link: https://github.com/antoinezambelli/forge. (And in true HN fashion, the thread eventually drifted into an unrelated tangent about 1980s Texas Instruments Lisp machines).
Remove-AI-Watermarks – CLI and library for removing AI watermarks from images
Remove-AI-Watermarks: open-source tool to strip both visible and invisible AI watermarks and provenance data from images
A new GitHub project (wiltodelta/remove-ai-watermarks; ~1k stars) claims to remove Google Gemini’s “sparkle” logo overlay, defeat invisible watermarks like SynthID v1/v2, StableSignature, and TreeRing, and strip metadata that drives “Made with AI” labels on social platforms. It targets outputs from Gemini/Nano Banana, DALL·E/ChatGPT, Stable Diffusion, Firefly, Midjourney, and more, and also offers a free web front end (raiw.cc).
Highlights
- Visible watermarks: Reverses Gemini’s alpha-blended sparkle logo via known alpha maps and NCC-based detection to locate scale/position; cleans artifacts with inpainting. Claims ~0.05s/image, CPU-only.
- Invisible watermarks: Uses a diffusion “regeneration” pipeline (now SDXL at ~1024px) to break frequency/latent marks like SynthID v2; earlier SD-1.5 path removed after proving ineffective on v2.
- Metadata/provenance: Strips C2PA Content Credentials, EXIF/XMP (including the XMP DigitalSourceType that triggers “Made with AI” labels), and PNG text chunks, while preserving standard fields.
- Extras: “Smart Face Protection” blends original faces back post-diffusion to avoid distortion; “Analog Humanizer” adds grain and chromatic aberration to evade AI-image classifiers.
- Scope: Notes a pixel-level watermark in ChatGPT Images 2.0 with no public detector yet; says SDXL pipeline defeats SynthID on Gemini 3 Pro outputs.
Why it matters
- Directly undermines provenance efforts (C2PA) and platform labeling, escalating the arms race between watermarking and removal.
- Raises ethical/legal questions around misuse, research disclosure, and the viability of current watermark schemes.
- Expect debate on robustness of watermark tech, platform countermeasures (stronger signing, hardware roots of trust), and the implications of open-sourcing such tools.
Here is a daily digest summary of the Hacker News discussion regarding the Remove-AI-Watermarks submission:
The Hacker News Digest: Removing AI Watermarks
Today’s most actively debated submission centers on a new open-source tool designed to strip both visible (Gemini’s logo) and invisible (SynthID, StableSignature) AI watermarks, as well as C2PA provenance metadata from images.
While the tool itself represents a significant blow to current AI-labeling efforts, the Hacker News discussion quickly moved past the code and into deep debates regarding digital rights management (DRM), the "hacker ethos," and the underlying philosophical implications for truth in media.
Here are the primary themes from the discussion:
1. The DRM and Piracy Parallel
A massive portion of the thread compared the AI watermarking "arms race" to the historical battle between digital piracy and DRM (Digital Rights Management).
- Over several nested threads, commenters debated who ultimately "won" the piracy wars. Some argued that giant corporations (Hollywood, academic publishers) always win through sheer financial attrition.
- Others contended that DRM historically fails to stop dedicated pirates, instead only punishing legitimate consumers.
- A common consensus emerged that piracy only wanes when legal alternatives (like the early days of Netflix and Spotify) provide overwhelming convenience—a convenience users noted is now dying due to streaming fragmentation and platform "enshittification."
2. Fighting the System vs. Implicit Acceptance
An interesting philosophical debate sparked over whether building watermark-removal tools is a valid reflection of the "hacker ethos."
- One user argued that engaging in this arms race implicitly accepts the dystopian "barcode/tracking" system that tech giants are trying to implement. They suggested hackers should simply abandon corporate APIs altogether and focus on running open-source, open-weight models locally.
- Others strongly disagreed, comparing watermark removal to ad-blocking. They argued that using an ad-blocker doesn't mean a user "accepts" corporate tracking; rather, it is a direct, necessary tool to fight back against it.
3. The Death of Photographic Truth (and the "Machine Gun" Analogy)
The thread took a deep dive into the epistemological impact of AI imagery.
- The "Moral Panic" Camp: Some users argued that "pixels were never the truth anyway," noting that photos could always be manipulated. They view the current anxiety over AI fakes as a media-driven moral panic, suggesting society will simply have to revert to "pre-photography" concepts of establishing trust and truth.
- The "Scale Matters" Camp: Others pushed back vehemently, arguing that scale, speed, and access fundamentally change the game. Using an analogy of "knives versus machine guns," one commenter pointed out that while photorealistic manipulation used to require immense skill and time, anyone can now generate endless fakes instantly.
- Furthermore, users pointed out that previous verification methods (like reverse-image searching to find an original, un-doctored photo) are rendered useless when AI generates an image entirely from scratch. This dynamic, they warned, allows bad actors to effortlessly manufacture propaganda while simultaneously dismissing entirely legitimate journalism and video evidence as AI-generated "fake news."
4. The Classic Hacker News Tangent
In true Hacker News fashion, an offhand analogy about the limits of what "hobbyist hackers" can achieve against massive corporate budgets devolved into a deeply pedantic, multi-paragraph debate about whether a determined individual could theoretically acquire an ultracentrifuge to build a backyard nuclear weapon.
Gemini CLI will stop working from June 18, 2026
Google folds Gemini CLI into Antigravity CLI, consumer deprecation hits June 18
-
What’s new: Google is retiring Gemini CLI for most users and consolidating terminal tooling under Antigravity CLI, part of its new agent‑first Antigravity 2.0 platform. The CLI is rebuilt in Go for speed, adds built‑in async orchestration for multi‑agent tasks, and shares a unified server‑side agent harness with the desktop app so core agent upgrades land everywhere at once.
-
Feature carryover (not full parity at launch): Agent Skills, Hooks, Subagents, and Extensions (now “Antigravity plugins”). Google says common workflows—quick Q&A, project scaffolding, infra provisioning—still work, but some Gemini CLI features may lag during the transition.
-
Why it matters: Signals Google’s bet on multi‑agent workflows and a single backend across terminal and desktop. Expect faster iteration on agent capabilities, but also a tighter coupling to Google’s server‑side harness.
-
Key dates:
- Available now: Antigravity CLI.
- June 18, 2026: Gemini CLI and Gemini Code Assist IDE extensions stop serving requests for Google AI Pro/Ultra and the individual/free tier. For Gemini Code Assist for GitHub, no new org installs from that date; existing requests stop in the following weeks.
-
Enterprise carve‑out: Organizations on Gemini Code Assist Standard/Enterprise (or via Google Cloud) keep access to Gemini CLI and IDE extensions, with ongoing model updates. Gemini CLI will remain usable with paid Gemini and Gemini Enterprise Agent Platform API keys. Enterprises can adopt Antigravity CLI today with existing Google Cloud projects.
-
Migration notes: Docs are live; video walkthroughs coming. Extensions need to move to Antigravity plugins; expect some breakage until feature parity lands. Google is taking feedback in the Antigravity CLI forum.
Bottom line: If you’re on consumer/pro tiers, plan a migration before June 18; enterprises can transition at their own pace while maintaining current setups.
Hacker News Daily Digest: Google Axing Gemini CLI for ‘Antigravity’
The News in Brief
Google is officially retiring the Gemini CLI for consumer and individual tiers by June 18, 2026, folding its terminal tooling into a new Go-based "Antigravity CLI." The move consolidates Google’s agent-first platform, bringing built-in async orchestration and a unified backend for both terminal and desktop. While enterprise customers are shielded from the deprecation and can migrate at their own pace, consumer and Pro tier users must transition to Antigravity plugins. Not all features will have 1-to-1 parity at launch.
The Hacker News Conversation
The reaction on Hacker News was largely cynical, combining classic "Killed by Google" grievances with deep confusion over the company's branding strategy.
Here are the main takeaways from the discussion:
- The "Killed by Google" Fatigue: The loudest sentiment in the thread was exhaustion with Google’s product lifecycle. Commenters heavily criticized the company for abandoning tools, comparing this move to the infamous Google Messaging graveyard (Wave, Hangouts, Duo, Allo) and past developer tools like Polymer. As one user pointed out, developers are increasingly hesitant to invest time in adopting and learning Google workflows when they are likely to be killed or drastically retooled a year later.
- Branding Confusion & Mockery: The shift from the globally recognizable "Gemini" name to "Antigravity"—which now serves as the platform/harness, while Gemini remains the underlying model—drew widespread criticism. Users found the naming scheme chaotic, comparing it to Microsoft's scattershot branding circa 2010. Some joked that "Antigravity" feels less like a coding superpower and more like a "vomit comet" in freefall.
- Open Source "Slop" and Repo Drama: While the original Gemini CLI was open source (Apache 2), several users noted that its GitHub repo had devolved into a dumpster fire of AI-generated spam issues and pull requests, completely hamstringing actual development. While a Googler in the thread hinted that Antigravity CLI might be open-sourced, the community remains highly skeptical that Google will follow through.
- Coding Performance & The Anthropic Threat: Several developers noted that Gemini CLI's coding capabilities already felt subpar compared to Claude Code, Codex, or Kimi. This sparked a debate on Google's AI strategy: some users speculate that Google's massive recent investment in Anthropic ($40B) signals they are conceding the "coding agent" space to Claude. However, Google defenders pointed out that Gemini is a generalist model forced to optimize for massive horizontal integration (Docs, Gmail, GCP), making it tough to compete with purpose-built coding models.
- Corporate Bloat & Margin Debates: The sudden deprecation also spurred a tangent on tech industry profit margins. Users debated whether Google's decisions are driven by internal political jockeying for promotions and bloated headcounts, rather than actual customer needs, citing Google's Q1 margins as a driver for ruthless product consolidation.
The Bottom Line
For Hacker News readers, this announcement is less about the technical merits of the new Go-based Antigravity CLI and more about Google's chronic inability to maintain a stable, predictable product strategy for developers. If you are on the consumer tier, the clock is ticking to migrate, but the community sentiment suggests many might just jump ship to Claude Code or Cursor instead.
Mistral AI acquires Emmi AI
Mistral AI acquires Emmi AI to build a full-stack platform for industrial engineering
- Deal: Mistral AI is buying Linz-based Emmi AI, a 30+ person team focused on “Physics AI” for engineering. The Emmi team joins Mistral’s Science and Applied AI groups in May 2026.
- What Emmi does: AI models that accelerate physical simulation and engineering workflows across energy, automotive, semiconductors, and aerospace—aiming at real-time simulations and sophisticated digital twins.
- Tech receipts: Emmi’s AB-UPT scaled neural surrogates for CFD to 100M+ mesh cells with mesh-free inference and physics-consistent predictions; NeuralDEM (for particulate flows) is open source. Past work spans power grid stabilization, injection molding, and automotive safety testing.
- Strategy: Combines Mistral’s platform with Emmi’s domain models to create a vertically integrated “AI for engineering” stack—positioning Mistral as a transformation partner for manufacturers in high-stakes sectors.
- Europe footprint: Accelerated investment and hiring in Austria, Germany, and Lithuania; Linz becomes an official Mistral office alongside Paris, London, Amsterdam, Munich, San Francisco, and Singapore.
- Funding context: Emmi raised a €15M seed in 2025, reportedly Austria’s largest seed round at the time.
- Why it matters: Signals European consolidation around AI-for-physics, moving beyond general-purpose LLMs toward domain-specific stacks that could cut simulation costs and speed up R&D.
- What to watch: Head-to-head benchmarks vs. traditional solvers, integration with existing CAE/HPC toolchains, validation for safety-critical use, and on-prem options for IP-sensitive customers.
Hacker News Daily Digest: Mistral’s Industrial Pivot & The "Sovereign EU AI" Play
Today’s top story highlights European AI champion Mistral acquiring Linz-based startup Emmi AI to build a full-stack platform for industrial engineering and "Physics AI." The move aims to bring real-time physical simulations and digital twins to sectors like aerospace, energy, and semiconductors.
Over in the Hacker News comments, the discussion quickly moved past the acquisition itself and into a broader debate about Mistral’s overarching business strategy, its deep ties to European industrial giants, and its fading presence in the consumer AI hype cycle.
Here is a summary of what the HN community is saying:
1. The "Sovereign EU AI" & B2B Strategy
A dominant theme in the thread is that Mistral is no longer trying to compete head-to-head with the "Big 3" (OpenAI, Anthropic, Google) in the consumer/B2C space. Instead, commenters point out that Mistral is playing a highly lucrative, behind-the-scenes game:
- Government & Defense: Users note that Mistral is leaning hard into European data sovereignty. Rather than chasing public benchmark leaderboards, they are optimizing for EU procurement rules, structured on-premize deployments, and defense contracts where hosting your own keys is mandatory.
- Enterprise Consulting: Developers observed that Mistral’s business model is looking increasingly like high-end ML consulting designed for massive European legacy companies, governments (like their Luxembourg partnership), and institutions that require strict data privacy.
2. The ASML Connection
Much of the thread focused on ASML, the Dutch semiconductor manufacturing giant, which is a major investor in Mistral.
- Some commenters initially questioned why ASML would invest in an LLM company.
- Others, including users claiming secondhand knowledge from ASML employees, clarified that this is a deeply strategic play. ASML is ostensibly using Mistral's infrastructure to train models on highly proprietary data to power complex R&D and operations. The Emmi AI acquisition directly supports this hardware/physics-oriented direction.
3. Demystifying Emmi AI’s "Physics AI"
While a few users were skeptical of the buzzwords surrounding Emmi AI, one commenter clearly explained the practical value of the tech. They noted that Emmi has built transformer-based mold flow simulators. In traditional manufacturing (like plastic injection molding), physics simulators are notoriously slow. By using AI to instantly predict how materials will fill a cavity or react to different geometries, engineers can drastically speed up the R&D and physical testing phases.
4. Falling Developer Mindshare vs. Enterprise Success
There was a spirited debate about Mistral's current relevance to everyday coders:
- The Critics: Several users admitted they had "completely forgotten" about Mistral, arguing that for daily coding tasks, Anthropic, OpenAI, and even Chinese open-source models (like Qwen) have largely outpaced them.
- The Fans: Despite this, some developers praised Mistral's specific tools, giving a shout-out to their "Vibe" CLI tool for being a highly ergonomic and effective terminal UI for coding.
- The Conclusion: The consensus seems to be that while Mistral might be losing the public mindshare battle among indie developers, they are quietly becoming the undisputed #1 player for corporate AI rollouts inside Germany, France, and the broader EU enterprise market.