Posts

Show HN: Viberails – Easy AI Audit and Control https://ift.tt/0uEqYWb

Show HN: Viberails – Easy AI Audit and Control
Hello HN. I'm Maxime, founder at LimaCharlie ( https://limacharlie.io ), a Hyperscaler for SecOps (access the building blocks you need to build security operations, like AWS does for IT). We’ve engineered a new product on our platform that solves a timely issue by acting as a guardrail between your AI and the world: Viberails ( https://ift.tt/UOMvxCt ).
This won't be new to folks here, but we identified 4 challenges teams face right now with AI tools:
1. Auditing what the tools are doing.
2. Controlling tool calls (and their impact on the world).
3. Centralized management.
4. Easy access to the above.
To expand: Audit logs are the bread and butter for security, but this hasn't really caught up in AI tooling yet. Being able to look back and say "what actually happened" after the fact is extremely valuable during an incident and for compliance purposes. Tool calls are how LLMs interact with the world; we should be able to exerci...

Show HN: EpsteIn – Search the Epstein files for your LinkedIn connections https://ift.tt/wRBNsD1

Show HN: EpsteIn – Search the Epstein files for your LinkedIn connections https://ift.tt/NIpWRZA February 5, 2026 at 12:54AM

Show HN: Tabstack Research – An API for verified web research (by Mozilla) https://ift.tt/80FMW32

Show HN: Tabstack Research – An API for verified web research (by Mozilla)
Hi HN, my team and I are building Tabstack to handle the web layer for AI agents. Today we are sharing Tabstack Research, an API for multi-step web discovery and synthesis. https://ift.tt/57vLmzk
In many agent systems, there is a clear distinction between extracting structured data from a single page and answering a question that requires reading across many sources. The first case is fairly well served today. The second usually is not.
Most teams handle research by combining search, scraping, and summarization. This becomes brittle and expensive at scale. You end up managing browser orchestration, moving large amounts of raw text just to extract a few claims, and writing custom logic to check if a question was actually answered.
We built Tabstack Research to move this reasoning loop into the infrastructure layer. You send a goal, and the system:
- Decomposes it into targeted sub-questions to hit different data ...

Show HN: Nomad Tracker – a local-first iOS app to track visas and tax residency https://ift.tt/1cQykbK

Show HN: Nomad Tracker – a local-first iOS app to track visas and tax residency
Hi HN, I’m a full-stack developer (formerly iOS) and I just launched Nomad Tracker, a native iOS app to help digital nomads track physical presence across countries for visa limits and tax residency.
Key idea: everything runs on-device. No accounts, no cloud sync, no analytics.
Features:
- Calendar-based day tracking per country.
- Schengen 90/180 and other visa “runways”.
- Fiscal residency day counts and alerts.
- Optional background location logging (battery-efficient, never overwrites manual data).
- Photo import using metadata only (no image access).
- On-device “Fiscal Oracle” using Apple’s Foundation Models to ask questions about your own data.
I created this because other apps felt limiting and didn’t do what I needed. This app is visual, user-focused, and designed to make tracking easy and clear. Happy to answer questions or discuss the technical tradeoffs. https://ift.tt/H8veVPa February 3, 2026 a...
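For readers unfamiliar with the Schengen 90/180 rule mentioned in the feature list, here is a minimal Python sketch of the day count. It is not taken from Nomad Tracker; the function names (`days_used`, `days_remaining`, `date_range`) and the sample trips are assumptions made for illustration. It assumes the common reading of the rule: at most 90 days of presence within the 180-day window ending on the day being checked, inclusive.

```python
from datetime import date, timedelta
from typing import Iterable, Set

WINDOW_DAYS = 180    # length of the rolling look-back window
MAX_STAY_DAYS = 90   # days of presence allowed inside that window


def date_range(start: date, end: date) -> Set[date]:
    """Return every calendar day from start to end, both inclusive."""
    return {start + timedelta(days=i) for i in range((end - start).days + 1)}


def days_used(presence: Iterable[date], on: date) -> int:
    """Count presence days in the 180-day window that ends on `on` (inclusive)."""
    window_start = on - timedelta(days=WINDOW_DAYS - 1)
    return sum(1 for d in set(presence) if window_start <= d <= on)


def days_remaining(presence: Iterable[date], on: date) -> int:
    """Days still available on `on`; a negative value means an overstay."""
    return MAX_STAY_DAYS - days_used(presence, on)


# Hypothetical trips: 30 days in January and 20 days in March 2026.
trips = date_range(date(2026, 1, 1), date(2026, 1, 30)) | date_range(
    date(2026, 3, 1), date(2026, 3, 20)
)
print(days_remaining(trips, date(2026, 4, 1)))
```

Because the window rolls forward one day at a time, early days of an old trip drop out of the count as the check date advances, which is what produces the "runway" the app describes.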

Show HN: Stigmergy pattern for multi-agent LLMs (80% fewer API calls) https://ift.tt/Hh6VYcf

Show HN: Stigmergy pattern for multi-agent LLMs (80% fewer API calls) https://ift.tt/PlBgDwk February 3, 2026 at 11:01PM

Show HN: kiln.bot - Orchestrate Claude Code from GitHub https://ift.tt/8FgcDJW

Show HN: kiln.bot - Orchestrate Claude Code from GitHub
Hey everybody! "Kiln" orchestrates Claude Code instances on your local machine using GitHub projects as its control panel. https://kiln.bot https://ift.tt/ZV4tNUz
If you're around Stage 6-7 on the Gas Town scale, you may have 3-15 terminal windows open. You're out of screen real estate and the markdown files are piling up. TUIs and specialized IDEs are meant to help, but they're more things to manage.
Kiln simply polls GitHub projects. When you move issues from one column to another, Kiln invokes the Claude Code CLI to run the corresponding /command. Claude creates the worktrees, researches the codebase, then creates and implements the plan, storing it in GitHub Issues.
It's meant to be simple, nothing new:
- Use your existing Claude subscription (no auth trickery, runs locally)
- All context and state is on GitHub (no markdown mess, no local DBs, easy recovery)
- Poll instead of webhooks/events (no external attack...
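The polling idea described in the post can be sketched roughly as follows. This is not Kiln's code: the column names, the slash commands in `COLUMN_COMMANDS`, and both function names are hypothetical. The sketch leaves out fetching the board from GitHub and invoking the Claude Code CLI, and shows only how two successive snapshots of the board could be compared to decide which command to run.

```python
from typing import Dict, List, Tuple

# Hypothetical mapping from project column to the slash command it triggers.
COLUMN_COMMANDS: Dict[str, str] = {
    "Research": "/research",
    "Plan": "/plan",
    "Implement": "/implement",
}


def detect_moves(
    previous: Dict[int, str], current: Dict[int, str]
) -> List[Tuple[int, str]]:
    """Return (issue_number, new_column) for issues whose column changed.

    Issues that appear for the first time are treated as moved.
    """
    moves = [
        (issue, column)
        for issue, column in current.items()
        if previous.get(issue) != column
    ]
    return sorted(moves)


def commands_for(moves: List[Tuple[int, str]]) -> List[Tuple[int, str]]:
    """Map each move to its command, skipping columns with no command."""
    return [
        (issue, COLUMN_COMMANDS[column])
        for issue, column in moves
        if column in COLUMN_COMMANDS
    ]


# Two snapshots taken on consecutive polls (made-up data).
previous = {1: "Backlog", 2: "Research"}
current = {1: "Research", 2: "Research", 3: "Done"}
print(commands_for(detect_moves(previous, current)))
```

Comparing snapshots keeps the loop stateless apart from the last board state, which fits the post's point that all context lives on GitHub.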

Show HN: I built "AI Wattpad" to eval LLMs on fiction https://ift.tt/BdjOfCQ

Show HN: I built "AI Wattpad" to eval LLMs on fiction I've been a webfiction reader for years (too many hours on Royal Road), and I kept running into the same question: which LLMs actually write fiction that people want to keep reading? That's why I built Narrator ( https://ift.tt/tKsFvHk ) – a platform where LLMs generate serialized fiction and get ranked by real reader engagement. Turns out this is surprisingly hard to answer. Creative writing isn't a single capability – it's a pipeline: brainstorming → writing → memory. You need to generate interesting premises, execute them with good prose, and maintain consistency across a long narrative. Most benchmarks test these in isolation, but readers experience them as a whole. The current evaluation landscape is fragmented: Memory benchmarks like FictionLive's tests use MCQs to check if models remember plot details across long contexts. Useful, but memory is necessary for good fiction, not sufficient. A model ...