Posts

Showing posts from January, 2026

Show HN: Agent Tinman – Autonomous failure discovery for LLM systems https://ift.tt/swIJ8mg

Show HN: Agent Tinman – Autonomous failure discovery for LLM systems Hey HN, I built Tinman because finding LLM failures in production is a pain in the ass. Traditional testing checks what you've already thought of. Tinman tries to find what you haven't. It's an autonomous research agent that: - Generates hypotheses about potential failure modes - Designs and runs experiments to test them - Classifies failures (reasoning errors, tool use, context issues, etc.) - Proposes interventions and validates them via simulation The core loop runs continuously. Each cycle informs the next. Why now: With tools like OpenClaw/ClawdBot giving agents real system access, the failure surface is way bigger than "bad chatbot response." Tinman has a gateway adapter that connects to OpenClaw's WebSocket stream for real-time analysis as requests flow through. Three modes: - LAB: unrestricted research against dev - SHADOW: observe production, flag issues - PRODUCTION: human approval ...

Show HN: An extensible pub/sub messaging server for edge applications https://ift.tt/gbt4uqe

Show HN: An extensible pub/sub messaging server for edge applications hi there! i’ve been working on a project called Narwhal, and I wanted to share it with the community to get some valuable feedback. what is it? Narwhal is a lightweight Pub/Sub server and protocol designed specifically for edge applications. while there are great tools out there like NATS or MQTT, i wanted to build something that prioritizes customization and extensibility. my goal was to create a system where developers can easily adapt the routing logic or message handling pipeline to fit specific edge use cases, without fighting the server's defaults. why Rust? i chose Rust because i needed a low memory footprint to run efficiently on edge devices (like Raspberry Pis or small gateways), and also because I have a personal vendetta against Garbage Collection pauses. :) current status: it is currently in Alpha. it works for basic pub/sub patterns, but I’d like to start working on persistence support soon (so mess...

Show HN: Daily Cat https://ift.tt/pkHjuah

Show HN: Daily Cat Seeing HTTP Cats on the home page remind me to share a small project I made a couple months ago. It displays a different cat photo from Unsplash every day and will send you notifications if you opt-in. https://daily.cat/ January 31, 2026 at 03:40AM

Show HN: A Local OS for LLMs. MIT License. Zero Hallucinations. Infinite Memory https://ift.tt/HTgGRl4

Show HN: A Local OS for LLMs. MIT License. Zero Hallucinations. Infinite Memory The problem with LLMs isn't intelligence; it's amnesia and dishonesty. Hey HN, I’ve spent the last few months building Remember-Me, an open-source "Sovereign Brain" stack designed to run entirely offline on consumer hardware. The core thesis is simple: Don't rent your cognition. Most RAG (Retrieval Augmented Generation) implementations are just "grep for embeddings." They are messy, imprecise, and prone to hallucination. I wanted to solve the "Context integrity" problem at the architectural layer. The Tech Stack (How it works): QDMA (Quantum Dream Memory Architecture): instead of a flat vector DB, it uses a hierarchical projection engine. It separates "Hot" (Recall) from "Cold" (Storage) memory, allowing for effectively infinite context window management via compression. CSNP (Context Switching Neural Protocol) - The Hallucination Killer: This is...

Show HN: We added memory to Claude Code. It's powerful now https://ift.tt/U8LM0Sj

Show HN: We added memory to Claude Code. It's powerful now https://ift.tt/UdiwYve January 30, 2026 at 10:53PM

Show HN: Craft – Claude Code running on a VM with all your workplace docs https://ift.tt/EciBVjA

Show HN: Craft – Claude Code running on a VM with all your workplace docs I’ve found coding agents to be great at 1/ finding everything they need across large codebases using only bash commands (grep, glob, ls, etc.) and 2/ building new things based on their findings (duh). What if, instead of a codebase, the files were all your workplace docs? There was a `Google_Drive` folder, a `Linear` folder, a `Slack` folder, and so on. Over the last week, we put together Craft to test this out. It’s an interface to a coding agent (OpenCode for model flexibility) running on a virtual machine with: 1. your company's complete knowledge base represented as directories/files (kept in-sync) 2. free reign to write and execute python/javascript 3. ability to create and render artifacts to the user Demo: https://www.youtube.com/watch?v=Hvjn76YSIRY Github: https://ift.tt/x93e5wL... It turns out OpenCode does a very good job with docs. Workplace apps also have a natural structure (Slack channels about ...

Show HN: SHDL – A minimal hardware description language built from logic gates https://ift.tt/4jAWa8n

Show HN: SHDL – A minimal hardware description language built from logic gates Hi, everyone! I built SHDL (Simple Hardware Description Language) as an experiment in stripping hardware description down to its absolute fundamentals. In SHDL, there are no arithmetic operators, no implicit bit widths, and no high-level constructs. You build everything explicitly from logic gates and wires, and then compose larger components hierarchically. The goal is not synthesis or performance, but understanding: what digital systems actually look like when abstractions are removed. SHDL is accompanied by PySHDL, a Python interface that lets you load circuits, poke inputs, step the simulation, and observe outputs. Under the hood, SHDL compiles circuits to C for fast execution, but the language itself remains intentionally small and transparent. This is not meant to replace Verilog or VHDL. It’s aimed at: - learning digital logic from first principles - experimenting with HDL and language design - teachi...

Show HN: Lightbox – Flight recorder for AI agents (record, replay, verify) https://ift.tt/shB8O4q

Show HN: Lightbox – Flight recorder for AI agents (record, replay, verify) I built Lightbox because I kept running into the same problem: an agent would fail in production, and I had no way to know what actually happened. Logs were scattered, the LLM’s “I called the tool” wasn’t trustworthy, and re-running wasn’t deterministic. This week, tons of Clawdbot incidents have driven the point home. Agents with full system access can expose API keys and chat histories. Prompt injection is now a major security concern. When agents can touch your filesystem, execute code, and browse the web…you probably need a tamper-proof record of exactly what actions it took, especially when a malicious prompt or compromised webpage could hijack the agent mid-session. Lightbox is a small Python library that records every tool call an agent makes (inputs, outputs, timing) into an append-only log with cryptographic hashes. You can replay runs with mocked responses, diff executions across versions, and verify t...

Show HN: Cosmic AI Workflows – Chain AI agents to automate multi-step projects https://ift.tt/gIbnMrQ

Show HN: Cosmic AI Workflows – Chain AI agents to automate multi-step projects Hi, I'm Tony, founder of Cosmic (AI-powered headless CMS and application development platform). We kept running into the same problem: create a blog post with the help of an AI agent, use the output for another prompt to create social posts, then manually post to X, LinkedIn, Facebook. Every single time. So we built AI Workflows — chain multiple agents together and let them run autonomously, with each step receiving outputs from previous steps. Three agent types you can chain: - Code Agents: Build features in GitHub with commits and pull requests. - Content Agents: Generate CMS content with context injection from previous steps. - Computer Use Agents: Automate browser workflows and record demos. How it works: 1. Define steps with agent type, prompt, and configuration 2. Steps run sequentially or in parallel (configurable) 3. Context passes automatically between steps 4. Trigger manually, on a schedule (c...

Show HN: LemonSlice – Give your voice agents a face https://ift.tt/ki4IDaE

Show HN: LemonSlice – Give your voice agents a face Hey HN, we're the co-founders of LemonSlice ( https://lemonslice.com ). We train interactive avatar video models. Our API lets you upload a photo and immediately jump into a FaceTime-style call with that character. Here's a demo: https://ift.tt/uY5lrWa Chatbots are everywhere. Voice AI has recently taken off. But we believe video avatars will be the most common form factor for conversational AI. Most people would rather watch something than read it. The problem is that generating video in real-time is hard, and overcoming the uncanny valley is even harder. We haven’t broken the uncanny valley yet. Nobody has. But we’re getting close and our photorealistic avatars are currently best-in-class (judge for yourself: https://ift.tt/A5OK3Xf ). Plus, we're the only avatar model that can do animals and heavily stylized cartoons. Try it: https://ift.tt/FA0IpKk . Warning! Talking to this little guy may improve your mood. Today we...

Show HN: Ourguide – OS wide task guidance system that shows you where to click https://ift.tt/gN9Lh0T

Show HN: Ourguide – OS wide task guidance system that shows you where to click Hey! I'm eshaan and I'm building Ourguide -an on-screen task guidance system that can show you where to click step-by-step when you need help. I started building this because whenever I didn’t know how to do something on my computer, I found myself constantly tabbing between chatbots and the app, pasting screenshots, and asking “what do I do next?” Ourguide solves this with two modes. In Guide mode, the app overlays your screen and highlights the specific element to click next, eliminating the need to leave your current window. There is also Ask mode, which is a vision-integrated chat that captures your screen context—which you can toggle on and off anytime -so you can ask, "How do I fix this error?" without having to explain what "this" is. It’s an Electron app that works OS-wide, is vision-based, and isn't restricted to the browser. Figuring out how to show the user where to...

Show HN: TetrisBench – Gemini Flash reaches 66% win rate on Tetris against Opus https://ift.tt/Ef26FTj

Show HN: TetrisBench – Gemini Flash reaches 66% win rate on Tetris against Opus https://ift.tt/OkD1pKB January 27, 2026 at 12:12AM

Show HN: Postgres and ClickHouse as a unified data stack https://ift.tt/3zwUOys

Show HN: Postgres and ClickHouse as a unified data stack Hello HN, this is Sai and Kaushik from ClickHouse. Today we are launching a Postgres managed service that is natively integrated with ClickHouse. It is built together with Ubicloud (YC W24). TL;DR: NVMe-backed Postgres + built-in CDC into ClickHouse + pg_clickhouse so you can keep your app Postgres-first while running analytics in ClickHouse. Try it (private preview): https://ift.tt/n9UtO6i Blog w/ live demo: https://ift.tt/9rbvxOD Problem Across many fast-growing companies using Postgres, performance and scalability commonly emerge as challenges as they grow. This is for both transactional and analytical workloads. On the OLTP side, common issues include slower ingestion (especially updates, upserts), slower vacuums, long-running transactions incurring WAL spikes, among others. In most cases, these problems stem from limited disk IOPS and suboptimal disk latency. Without the need to provision or cap IOPS, Postgres could do far m...

Show HN: I Created a Tool to Convert YouTube Videos into 2000 Word SEO Blog https://ift.tt/ketCIcG

Show HN: I Created a Tool to Convert YouTube Videos into 2000 Word SEO Blog https://landkit.pro/youtube-to-blog January 25, 2026 at 11:16PM

Show HN: CertRadar – Find every certificate ever issued for your domain https://ift.tt/ArzW3JF

Show HN: CertRadar – Find every certificate ever issued for your domain https://certradar.net/ January 25, 2026 at 11:21PM

Show HN: Remote workers find your crew https://ift.tt/bVXyEor

Show HN: Remote workers find your crew Working from home? Are you a remote employee that "misses" going to the office? Well let's be clear on what you actually miss. No one misses that feeling of having to go and be there 8 hours. But many people miss friends. They miss being part of a crew. Going to lunch, hearing about other people's lives in person not over zoom. Join a co-working space you say? Yes. We have. It's like walking into a library and trying to talk to random people and getting nothing back. Zero part of a crew feeling. https://ift.tt/LqiUWAb This app helps you find a crew and meet up for work and get that crew feeling. This is my first time using cloudflare workers for a webapp. The free plan is amazing! You get so much compare to anything else out there in terms of limits. The sqlite database they give you is just fine, I don't miss psql. January 24, 2026 at 11:54PM

Show HN: Teemux – Zero-config log multiplexer with built-in MCP server https://ift.tt/NEPbKkV

Show HN: Teemux – Zero-config log multiplexer with built-in MCP server I started to use AI agents for coding and quickly ran into a frustrating limitation – there is no easy way to share my development environment logs with AI agents. So that's what is Teemux. A simple CLI program that aggregates logs, makes them available to you as a developer (in a pretty UI), and makes them available to your AI coding agents using MCP. There is one implementation detail that I geek out about: It is zero config and has built-in leader nomination for running the web server and MCP server. When you start one `teemux` instance, it starts web server, .. when you start second and third instances, they join the first server and start merging logs. If you were to kill the first instance, a new leader is nominated. This design allows to seamless add/remove nodes that share logs (a process that historically would have taken a central log aggregator). A super quick demo: npx teemux -- curl -N https://ift.t...

Show HN: MermaidTUI - Deterministic Unicode/ASCII diagrams in the terminal https://ift.tt/hOJPdbj

Show HN: MermaidTUI - Deterministic Unicode/ASCII diagrams in the terminal Hi HN, I built mermaidtui, a lightweight TypeScript engine that renders Mermaid flowcharts directly in your terminal as clean Unicode or ASCII boxes. Visualizing Mermaid diagrams usually requires a heavy setup: a headless browser (Puppeteer/Playwright), SVG-to-image conversion, or a web preview. That's fine for documentation sites, but it's overkill for TUI apps, CI logs, or quick terminal previews. The Solution is a small engine (<= 1000 LOC) that uses a deterministic grid-based layout to render diagrams using box-drawing characters. Key Features - Intelligent Routing: It uses corner characters (┌, ┐, └, ┘) for orthogonal paths. - Topological Layering: Attempts a readable, structured layout. - Support for Chained Edges: A --> B --> C works out of the box. - Zero Heavy Dependencies: No Mermaid internals, no Chromium, just pure TypeScript/JavaScript. With commander for the CLI, not the MermaidTUI...

Show HN: Synesthesia, make noise music with a colorpicker https://ift.tt/BE5DnvL

Show HN: Synesthesia, make noise music with a colorpicker This is a (silly, little) app which lets you make noise music using a color picker as an instrument. When you click on a specific point in the color picker, a bit of JavaScript maps the binary representation of the clicked-on color's hex-code to a "chord" in the 24 tone-equal-temperament scale. That chord is then played back using a throttled audio generation method which was implemented via Tone.js. NOTE! Turn the volume way down before using the site. It is noise music. :) https://visualnoise.ca January 22, 2026 at 11:22AM

Show HN: I'm tired of my LLM bullshitting. So I fixed it https://ift.tt/n59ijxC

Show HN: See the carbon impact of your cloud as you code https://ift.tt/saFq2LV

Show HN: See the carbon impact of your cloud as you code Hey folks, I’m Hassan, one of the co-founders of Infracost ( https://ift.tt/hdPaKM5 ). Infracost helps engineers see and reduce the cloud cost of each infrastructure change before they merge their code. The way Infracost works is we gather pricing data from Amazon Web Services, Microsoft Azure and Google Cloud. What we call a ‘Pricing Service’, which now holds around 9 million live price points (!!). Then we map these prices to infrastructure code. Once the mapping is done, it enables us to show the cost impact of a code change before it is merged, directly in GitHub, GitLab etc. Kind of like a checkout-screen for cloud infrastructure. We’ve been building since 2020 (we were part of YC W21 batch), and iterating on the product, building out a team etc. However, back in 2020 one of our users asked if we can also show the carbon impact alongside costs. It has been itching my brain since then. The biggest challenge has always been th...

Show HN: Xv6OS – A modified MIT xv6 with GUI https://ift.tt/AURVxtd

Show HN: Xv6OS – A modified MIT xv6 with GUI I've been working on a hobby project to transform the traditional xv6 teaching OS into a graphical environment. Key Technical Features: GUI Subsystem: I implemented a kernel-level window manager and drawing primitives. Mouse Support: Integrated a PS/2 mouse driver for navigation. Custom Toolchain: I used Python scripts (Pillow) and Go to convert PNG assets and TTF fonts into C arrays for the kernel. Userland: Includes a terminal, file explorer, text editor, and a Floppy Bird game. The project is built for i386 using a monolithic kernel design. You can find the full source code and build instructions here: https://ift.tt/cyuCjt5 January 20, 2026 at 10:46PM

Show HN: Trinity – a native macOS Neovim app with Finder-style projects https://ift.tt/XH5SjBo

Show HN: Trinity – a native macOS Neovim app with Finder-style projects Hi HN, I built Trinity, a native macOS app that wraps Neovim with a project-centric UI. The goal was to keep Neovim itself untouched, but provide a more Mac-native workflow: – Finder-style project browser – Multiple projects/windows – Markdown preview, image/pdf viewer – Native menus, shortcuts, and windowing – Minimal UI, no GPU effects or terminal emulation It’s distributed directly (signed + notarized PKG) and uses Sparkle for incremental updates. This started as a personal tool after bouncing between terminal Neovim and heavier editors. Curious to hear feedback from other Neovim users, especially on what feels right or wrong in a GUI wrapper. Site: https://ift.tt/ZLIqStp Direct download: https://ift.tt/NUdfD58... https://ift.tt/ZLIqStp January 20, 2026 at 11:14PM

Show HN: Homunculus – A self-rewriting Claude Code plugin https://ift.tt/8yf5htj

Show HN: Homunculus – A self-rewriting Claude Code plugin Homunculus is a Claude Code plugin that watches how you work and writes new capabilities into itself. If you keep doing something repeatedly—checking docs before API calls, running the same debug flow, formatting PRs a certain way—it notices and offers to automate it. Accept, and it writes a new markdown file into its own structure. The plugin literally changes based on what you do. It can create: Commands (explicit shortcuts) Skills (context-triggered behaviors) Subagents (specialists for specific problem domains) Hooks (event-driven, like "run tests when these files change") What actually works (v0.1): Commands are deterministic. Skills are probabilistic—they fire when Claude decides they're relevant, maybe 50-80% of the time. It's an experiment in making LLM tooling adaptive rather than static. State stored in .claude/homunculus/. Each project gets its own instance. https://ift.tt/aWbNYtn January 19, 2026 at...

Show HN: Mist – a lightweight, self-hosted PaaS https://ift.tt/ZeoMPC7

Show HN: Mist – a lightweight, self-hosted PaaS Hi HN, We’re building Mist, a lightweight, self-hosted PaaS for running and managing applications on your own servers. We started Mist because we wanted a middle ground between raw VPS management and heavy, all-in-one platforms. Existing PaaS solutions often felt too complex or resource-intensive for small servers, homelabs, and side projects. We wanted something that keeps the PaaS experience but stays simple and predictable. Our goals with Mist are: - A simple PaaS to deploy and manage apps on your own infrastructure - HTTPS, routing, and app access handled automatically - Low resource usage so it runs well on small VPSes - Self-hosted and transparent, with minimal magic Mist focuses on being an opinionated but lightweight layer on top of your server. It doesn’t try to hide everything behind abstractions, but it does aim to remove the repetitive operational work that comes with managing apps by hand. Mist is still early, and this is whe...

Show HN: Docker.how – Docker command cheat sheet https://ift.tt/J74rGLM

Show HN: Docker.how – Docker command cheat sheet https://docker.how/ January 18, 2026 at 01:47AM

Show HN: Minikv – Distributed key-value and object store in Rust (Raft, S3 API) https://ift.tt/jwpvqbA

Show HN: Minikv – Distributed key-value and object store in Rust (Raft, S3 API) Hi HN, I’m releasing minikv, a distributed key-value and object store in Rust. What is minikv? minikv is an open-source, distributed storage engine built for learning, experimentation, and self-hosted setups. It combines a strongly-consistent key-value database (Raft), S3-compatible object storage, and basic multi-tenancy. I started minikv as a learning project about distributed systems, and it grew into something production-ready and fun to extend. Features/highlights: - Raft consensus with automatic failover and sharding - S3-compatible HTTP API (plus REST/gRPC APIs) - Pluggable storage backends: in-memory, RocksDB, Sled - Multi-tenant: per-tenant namespaces, role-based access, quotas, and audit - Metrics (Prometheus), TLS, JWT-based API keys - Easy to deploy (single binary, works with Docker/Kubernetes) Quick demo (single node): git clone https://ift.tt/0Dhk1MB cd minikv cargo run --release -- --config c...

Show HN: What if your menu bar was a keyboard-controlled command center? https://ift.tt/EhwZ9R4

Show HN: What if your menu bar was a keyboard-controlled command center? Hey Hacker News The ones that know me here know that I am a productivity geek. After DockFlow to manage my Dock and ExtraDock, which gives me more space to manage my apps and files, I decided to tackle the macOS big boss: the menu bar. I spend ~40% of my day context-switching between apps — Zoom meetings, Slack channels, Code projects, and Figma designs. My macOS menu bar has too many useless icons I almost never use. So I thought to myself, how can I use this area to improve my workflows? Most solutions (Bartender, Ice) require screen recording permissions, and did not really solve my issues. I wanted custom menus in the apps, not the ones that the developers decided for me. After a few iterations and exploring different solutions, ExtraBar was created. Instead of just hiding icons, what if the menu bar became a keyboard-controlled command center that has the actions I need? No permissions. No telemetry. Just loc...

Show HN: 1Code – Open-source Cursor-like UI for Claude Code https://ift.tt/Luh3bv0

Show HN: 1Code – Open-source Cursor-like UI for Claude Code Hi, we're Sergey and Serafim. We've been building dev tools at 21st.dev and recently open-sourced 1Code ( https://1code.dev ), a local UI for Claude Code. Here's a video of the product: https://www.youtube.com/watch?v=Sgk9Z-nAjC0 Claude Code has been our go-to for 4 months. When Opus 4.5 dropped, parallel agents stopped needing so much babysitting. We started trusting it with more: building features end to end, adding tests, refactors. Stuff you'd normally hand off to a developer. We started running 3-4 at once. Then the CLI became annoying: too many terminals, hard to track what's where, diffs scattered everywhere. So we built 1Code.dev, an app to run your Claude Code agents in parallel that works on Mac and Web. On Mac: run locally, with or without worktrees. On Web: run in remote sandboxes with live previews of your app, mobile included, so you can check on agents from anywhere. Running multiple Claude C...

Show HN: I built an 11MB offline PDF editor because mobile Acrobat is 500MB https://ift.tt/ASF3nK0

Show HN: I built an 11MB offline PDF editor because mobile Acrobat is 500MB https://revpdf.com/ January 16, 2026 at 12:30AM

Show HN: OpenWork – an open-source alternative to Claude Cowork https://ift.tt/eKLY5Hl

Show HN: OpenWork – an open-source alternative to Claude Cowork hi hn, i built openwork, an open-source, local-first system inspired by claude cowork. it’s a native desktop app that runs on top of opencode (opencode.ai). it’s basically an alternative gui for opencode, which (at least until now) has been more focused on technical folks. the original seed for openwork was simple: i have a home server, and i wanted my wife and i to be able to run privileged workflows. things like controlling home assistant, or deploying custom web apps (e.g. our customs recipe app recipes.benjaminshafii.com), legal torrents, without living in a terminal. our initial setup was running the opencode web server directly and sharing credentials to it. that worked, but i found the web ui unreliable and very unfriendly for non-technical users. the goal with openwork is to bring the kind of workflows i’m used to running in the cli into a gui, while keeping a very deep extensibility mindset. ideally this grows int...

Show HN: Webctl – Browser automation for agents based on CLI instead of MCP https://ift.tt/pIsYWxf

Show HN: Webctl – Browser automation for agents based on CLI instead of MCP https://ift.tt/bpm5HfB January 14, 2026 at 08:04PM

Show HN: Repomance: A Tinder style app for GitHub repo discovery https://ift.tt/ePRYjxb

Show HN: Repomance: A Tinder style app for GitHub repo discovery Hi everyone, Repomance is an app for discovering curated and trending repositories. Swipe to star them directly using your GitHub account. It is currently available on iOS, iPadOS, and macOS. I plan to develop an Android version once the app reaches 100 users. Repomance is open source: https://ift.tt/Mqe9zG0 All feedback is welcome, hope you enjoy using it. https://ift.tt/6LGBVdU January 15, 2026 at 12:24AM

Show HN: Sparrow-1 – Audio-native model for human-level turn-taking without ASR https://ift.tt/KeXlId5

Show HN: Sparrow-1 – Audio-native model for human-level turn-taking without ASR For the past year I've been working to rethink how AI manages timing in conversation at Tavus. I've spent a lot of time listening to conversations. Today we're announcing the release of Sparrow-1, the most advanced conversational flow model in the world. Some technical details: - Predicts conversational floor ownership, not speech endpoints - Audio-native streaming model, no ASR dependency - Human-timed responses without silence-based delays - Zero interruptions at sub-100ms median latency - In benchmarks Sparrow-1 beats all existing models at real world turn-taking baselines I wrote more about the work here: https://ift.tt/fFRHxa4... https://ift.tt/u7ViAlg January 14, 2026 at 11:31PM

Show HN: Self-host Reddit – 2.38B posts, works offline, yours forever https://ift.tt/zZrVkYc

Show HN: Self-host Reddit – 2.38B posts, works offline, yours forever Reddit's API is effectively dead for archival. Third-party apps are gone. Reddit has threatened to cut off access to the Pushshift dataset multiple times. But 3.28TB of Reddit history exists as a torrent right now, and I built a tool to turn it into something you can browse on your own hardware. The key point: This doesn't touch Reddit's servers. Ever. Download the Pushshift dataset, run my tool locally, get a fully browsable archive. Works on an air-gapped machine. Works on a Raspberry Pi serving your LAN. Works on a USB drive you hand to someone. What it does: Takes compressed data dumps from Reddit (.zst), Voat (SQL), and Ruqqus (.7z) and generates static HTML. No JavaScript, no external requests, no tracking. Open index.html and browse. Want search? Run the optional Docker stack with PostgreSQL – still entirely on your machine. API & AI Integration: Full REST API with 30+ endpoints – posts, commen...

Show HN: Data from a mixed-brand LiFePO₄ battery bank https://ift.tt/qzxjf0v

Show HN: Data from a mixed-brand LiFePO₄ battery bank Hi HN — I’m sharing an empirical, long-term dataset from a DIY energy-storage project that ended up testing a common assumption in battery design. Conventional advice says never mix battery brands. That guidance is well-founded for series strings, but there’s surprisingly little data on purely parallel configurations. I built a 12 V, 500 Ah LiFePO₄ battery bank (1S5P) using mixed-brand cells and instrumented it for continuous monitoring over 73+ days, including high-frequency voltage sampling. The goal was to see whether cell-level differences actually manifest over time in a parallel topology. What the data shows No progressive voltage divergence across the observation period Voltage spread remained within ~10–15 mV Measured Peukert exponent ≈ 1.00 Thermal effects were small relative to instrumentation noise In practice, the parallel architecture appears to force electrical convergence when interconnect resistance is low. I’ve been...

Show HN: Fall asleep by watching JavaScript load https://ift.tt/NvZhuCz

Show HN: Fall asleep by watching JavaScript load https://ift.tt/UVLeQhy January 13, 2026 at 12:06AM

Show HN: Sidecar – AI Social Manager (Analyzes past hits to write new posts) https://ift.tt/MorfnIy

Show HN: Sidecar – AI Social Manager (Analyzes past hits to write new posts) Hi HN, I built Sidecar ( https://sidecar.bz ) because I was having issues maintaining a social media presence for my last startup. I would spend a lot of time trying to create content, but I often froze up or burned out, and the marketing died. How it works: Instead of guessing what to write, Sidecar connects to your existing accounts (Threads, Bluesky, Mastodon, Facebook, Instagram) and analyzes your past posts to see what actually worked. It uses that data to generate weeks of new, text-based content that mimics your successful posts, which you can then bulk schedule in one go. I’d love to hear what you think of Sidecar. You can use code HNLAUNCH for a free month if you want to test the ai features. https://ift.tt/QAGS1uJ January 12, 2026 at 10:48PM

Show HN: Worldview, persistent strategic context for Claude Code https://ift.tt/PqfiMDQ

Show HN: Worldview, persistent strategic context for Claude Code https://ift.tt/8GVbDUF January 12, 2026 at 01:29AM

Show HN: AI Vibe Coding Hackathon https://ift.tt/pYzZvbK

Show HN: AI Vibe Coding Hackathon https://ift.tt/OxKlHq6 January 12, 2026 at 01:37AM

Show HN: A MCP for controlling terminal UI apps built with bubbletea and ratatui https://ift.tt/PVHEaRJ

Show HN: A MCP for controlling terminal UI apps built with bubbletea and ratatui so you can start vibe-coding your ad-hoc terminal dashboard. With session replay and mouse click support built-in. https://ift.tt/o2ErYPL January 12, 2026 at 01:24AM

Show HN: Interactive California Budget (by Claude Code) https://ift.tt/3QTiW4H

Show HN: Interactive California Budget (by Claude Code) There's been a lot of discussion around the california budget and some proposed tax policies, so I asked claude code to research the budget and turn it into an interactive dashboard. Using async subagents claude was able to research ~a dozen budget line items at once across multiple years, adding lots of helpful context and graphs to someone like me who was starting with little familiarity. It still struggles with frontend changes, but for research this probably 20-40x's my throughput. Let me know any additional data or visualizations that would be interesting to add! https://ift.tt/wMnzjlV January 11, 2026 at 10:43PM

Show HN: Play poker with LLMs, or watch them play against each other https://ift.tt/CYQHvZs

Show HN: Play poker with LLMs, or watch them play against each other I was curious to see how some of the latest models behaved and played no limit texas holdem. I built this website which allows you to: Spectate: Watch different models play against each other. Play: Create your own table and play hands against the agents directly. https://llmholdem.com/ January 11, 2026 at 12:57AM

Show HN: Marten – Elegant Go web framework (nothing in the way) https://ift.tt/tNk1jbS

Show HN: Marten – Elegant Go web framework (nothing in the way) https://ift.tt/7zxbXK5 January 11, 2026 at 02:40AM

Show HN: I used Claude Code to discover connections between 100 books https://ift.tt/856rjJa

Show HN: I used Claude Code to discover connections between 100 books I think LLMs are overused to summarise and underused to help us read deeper. I built a system for Claude Code to browse 100 non-fiction books and find interesting connections between them. I started out with a pipeline in stages, chaining together LLM calls to build up a context of the library. I was mainly getting back the insight that I was baking into the prompts, and the results weren't particularly surprising. On a whim, I gave CC access to my debug CLI tools and found that it wiped the floor with that approach. It gave actually interesting results and required very little orchestration in comparison. One of my favourite trail of excerpts goes from Jobs’ reality distortion field to Theranos’ fake demos, to Thiel on startup cults, to Hoffer on mass movement charlatans ( https://ift.tt/4e2ApMw ). A fun tendency is that Claude kept getting distracted by topics of secrecy, conspiracy, and hidden systems - as if ...

Show HN: 15 Years of StarCraft II Balance Changes Visualized Interactively https://ift.tt/XDdqUYg

Show HN: 15 Years of StarCraft II Balance Changes Visualized Interactively Hi HN! "Never perfect. Perfection goal that changes. Never stops moving. Can chase, cannot catch." - Abathur ( https://www.youtube.com/watch?v=pw_GN3v-0Ls ) StarCraft 2 is one of the most balanced games ever - thanks to Blizzard’s pursuit of perfection. It has been over 15 years since the release of Wings of Liberty and over 10 years since the last installment, Legacy of the Void. Yet, balance updates continue to appear, changing how the game plays. Thanks to that, StarCraft is still alive and well! I decided to create an interactive visualization of all balance changes, both by patch and by unit, with smooth transitions. I had this idea quite a few years ago, yet LLMs made it possible - otherwise, I wouldn't have had the time to code or to collect all changes from hundreds of patches (not all have balance updates). It took way more time than expected - both dealing with parsing data and dealing wi...

Show HN: Various shape regularization algorithms https://ift.tt/pbZN8o6

Show HN: Various shape regularization algorithms Shape regularization is a technique used in computational geometry to clean up noisy or imprecise geometric data by aligning segments to common orientations and adjusting their positions to create cleaner, more regular shapes. I needed a Python implementation so started with the examples implemented in CGAL then added a couple more for snap and joint regularization and metric regularization. https://ift.tt/UI6RQ3h January 9, 2026 at 07:43AM

Show HN: CLIs Are All You Need for Agents https://ift.tt/2fWiGVI

Show HN: CLIs Are All You Need for Agents Fun agent I've been playing with - the idea is it only has access to a bash tool, and it's directed to create CLIs for use (with additional direction to make the CLIs composable, follow the Unix philosophy, etc). It persists these CLIs and knowledge about them get injected into the system prompt dynamically, so each time it runs it gets access to a larger and larger toolset of composable CLIs. One interesting dynamic that's emerged from this is I've started using these CLIs myself since they're the same interface for the agent or for me, and it's turned into kind of non-chat channel to interact with the agent. One example - I'll add tasks throughout the day myself using the `tasks` CLI it made, then when I interact with the agent it'll run `tasks list` and see everything I've added, or use it to prioritize/update things for me. Later on when I run `tasks list` myself I see all the updates/priorities it set. h...

Show HN: macOS menu bar app to track Claude usage in real time https://ift.tt/EWLnDge

Show HN: macOS menu bar app to track Claude usage in real time I built a macOS menu bar app to track Claude usage in real time via API after hitting limits mid-flow too often. Signed and notarised by Apple. Open source. https://ift.tt/e7HtOmE https://ift.tt/fi3JLmW https://ift.tt/e7HtOmE January 8, 2026 at 11:54PM

Show HN: TierHive – Hourly-billed NAT VPS with private /24 subnets https://ift.tt/OBZ51ci

Show HN: TierHive – Hourly-billed NAT VPS with private /24 subnets This idea has been floating in my head for about 10 years. Some of you might remember LowEndSpirit.com back before it became a forum, I started that. I've been obsessed with making tiny, cheap VPS actually useful ever since. TierHive is my attempt to make 128MB VPS great again :) It's a NAT VPS (KVM) platform with true hourly billing. Spin up a server, use it for 3 hours, delete it, pay for 3 hours. No monthly commitments, no minimums beyond a $5 top-up. The tradeoff is NAT (no dedicated IPv4), but I've tried to make that less painful: - Every account gets a /24 private subnet with full DHCP management. - Every server gets auto ssh port forwarding and a few TCP/UDP ports - Built-in HAProxy with Let's Encrypt SSL, load balancing, and auto-failover - WireGuard mesh between locations (Canada, Germany, UK currently) - PXE/iPXE boot support for custom installs - Email relay with DKIM/SPF - Recipe system for o...

Show HN: A to Z – A word game I built from a childhood road trip memory https://ift.tt/RVFWE8s

Show HN: A to Z – A word game I built from a childhood road trip memory Long time lurker here. My family had a pen-and-paper game we'd play on long drives to visit my great-grandmother. After she passed, I spent the holidays recreating it: https://a26z.fun How it works: Find 15 words from a category (like "Stone Fruits," "US States," or "Dog Breeds") as fast as you can. Once you meet the 15 word minimum, you can play for as long as you want. Each letter shows how many target words start with it (A¹ = one word starts with A, N² = two words start with N) That small ² in the bottom-right? Multi-word answers allowed. For "US States" with N², both "NEW YORK" and "NORTH DAKOTA" count Unlimited guesses, 2 hints, and a shuffle button to reorder by frequency. Example: Category: US States | Letters: A¹ M¹ N² S² Answers: ALABAMA, MONTANA, NEW MEXICO, SOUTH DAKOTA If you're into Connections or Strands, this scratches a similar itc...

Show HN: Seapie – a Python debugger where breakpoints drop into a REPL https://ift.tt/NTMJkc0

Show HN: Seapie – a Python debugger where breakpoints drop into a REPL https://ift.tt/q2R5nA6 January 7, 2026 at 11:28PM

Show HN: Free and local browser tool for designing gear models for 3D printing https://ift.tt/rGTRsDf

Show HN: Free and local browser tool for designing gear models for 3D printing Just build a local tool for designing gears that kinda looks and works nice https://ift.tt/IC3pfwS January 7, 2026 at 02:12PM

Show HN: Dimensions – Terminal Tab Manager https://ift.tt/YCUb5pa

Show HN: Dimensions – Terminal Tab Manager A terminal TUI that leverage tmux to make managing terminal tabs easier and more friendly. https://ift.tt/hbZTcai January 6, 2026 at 10:18PM

Show HN: Unicode cursive font generator that checks cross-platform compatibility https://ift.tt/WGmBi8R

Show HN: Unicode cursive font generator that checks cross-platform compatibility Hi HN, Unicode “cursive” and script-style fonts are widely used on social platforms, but many of them silently break depending on where they’re pasted — some render as tofu, some get filtered, and others display inconsistently across platforms. I built a small web tool that explores this problem from a compatibility-first angle: Instead of just converting text into cursive Unicode characters, the tool: • Generates multiple cursive / script variants based on Unicode blocks • Evaluates how safe each variant is across major platforms (Instagram, TikTok, Discord, etc.) • Explains why certain Unicode characters are flagged or unstable on specific platforms • Helps users avoid styles that look fine in one app but break in another Under the hood, it’s essentially mapping Unicode script characters and classifying them based on known platform filtering and rendering behaviors, rather than assuming “Unicode = univer...

Show HN: Calgebra – Set algebra for calendars in Python https://ift.tt/N1PDjA8

Show HN: Calgebra – Set algebra for calendars in Python https://ift.tt/jB6LcTG January 4, 2026 at 11:33PM

Show HN: Hover – IDE style hover documentation on any webpage https://ift.tt/uKCdAWQ

Show HN: Hover – IDE style hover documentation on any webpage I thought it would be interesting to have ID style hover docs outside the IDE. Hover is a Chrome extension that gives you IDE style hover tooltips on any webpage: documentation sites, ChatGPT, Claude, etc. How it works: - When a code block comes into view, the extension detects tokens and sends the code to an LLM (via OpenRouter or custom endpoint) - The LLM generates documentation for tokens worth documenting, which gets cached - On hover, the cached documentation is displayed instantly A few things I wanted to get right: - Website permissions are granular and use Chrome's permission system, so the extension only runs where you allow it - Custom endpoints let you skip OpenRouter entirely – if you're at a company with its own infra, you can point it at AWS Bedrock, Google AI Studio, or whatever you have Built with TypeScript, Vite, and the Chrome extension APIs. Coming to the Chrome Web Store soon. Would love feedbac...

Show HN: ZELF – A modular ELF64 packer with 22 vintage and modern codecs https://ift.tt/dL60KNy

Show HN: ZELF – A modular ELF64 packer with 22 vintage and modern codecs https://ift.tt/xhjXQSo January 4, 2026 at 12:59AM

Show HN: Vibe Coding a static site on a $25 Walmart Phone https://ift.tt/dbKnkip

Show HN: Vibe Coding a static site on a $25 Walmart Phone Hi! I took a cheap $25 walmart phone and put a static server on it? Why? Just for a fun weekend project. I used Claude Code for most of the setup. I had a blast. It's running termux, andronix, nginx, cloudflared and even a prometheus node exporter. Here's the site: https://ift.tt/AYPDHdE https://ift.tt/BPkpoIq January 4, 2026 at 01:09AM

Show HN: A New Year gift for Python devs–My self-healing project's DNA analyzer https://ift.tt/PyE8nYN

Show HN: A New Year gift for Python devs–My self-healing project's DNA analyzer I built a system that maps its own "DNA" using AST to enable self-healing capabilities. Instead of a standard release, I’ve hidden the core mapping engine inside a New Year gift file in the repo for those who like to explore code directly. It’s not just a script; it’s the architectural vision behind Ultra Meta. Check the HAPPY_NEW_YEAR.md file for the source https://ift.tt/qgexitp January 4, 2026 at 12:50AM

Show HN: Turbo – Python Web Framework https://ift.tt/rz0H2qK

Show HN: Turbo – Python Web Framework https://ift.tt/QAuqRs9 January 3, 2026 at 10:45PM

Show HN: Jot - Offline, source available notetaking/assistant app https://ift.tt/ZV2v58r

Show HN: Jot - Offline, source available notetaking/assistant app https://ift.tt/1fytq7e January 3, 2026 at 04:22AM

Show HN: Go-Highway – Portable SIMD for Go https://ift.tt/YmVpi0q

Show HN: Go-Highway – Portable SIMD for Go Go 1.26 adds native SIMD via GOEXPERIMENT=simd. This library provides a portability layer so the same code runs on AVX2, AVX-512, or falls back to scalar. Inspired by Google's Highway C++ library. Includes vectorized math (exp, log, sin, tanh, sigmoid, erf) since those come up a lot in ML/scientific code and the stdlib doesn't have SIMD versions. algo.SigmoidTransform(input, output) Requires go1.26rc1. Feedback welcome. https://ift.tt/CzcTL0B January 3, 2026 at 04:06AM

Show HN: Fluxer – open-source Discord-like chat https://ift.tt/eZF79qm

Show HN: Fluxer – open-source Discord-like chat Hey HN, and happy new year! I'm Hampus Kraft [1], a 22-year-old software developer nearing completion of my BSc in Computer Engineering at KTH Royal Institute of Technology in Sweden. I've been working on Fluxer on and off for about 5 years, but recently decided to work on it full-time and see how far it could take me. Fluxer is an open source [2] communication platform for friends, groups, and communities (text, voice, and video). It aims for "modern chat app" feature coverage with a familiar UX, while being developed in the open and staying FOSS (AGPLv3). The codebase is largely written in TypeScript and Erlang. Try it now (no email or password required): https://ift.tt/rCg7aNH – this creates an "unclaimed account" (date of birth only) so you can explore the platform. Unclaimed accounts can create/join communities but have some limitations. You can claim your account with email + password later if you want. I...

Show HN: I mapped System Design concepts to AI Prompts to stop bad code https://ift.tt/yuQMVGU

Show HN: I mapped System Design concepts to AI Prompts to stop bad code https://ift.tt/SX28PBu January 3, 2026 at 12:15AM

Show HN: Feature detection exploration in Lidar DEMs via differential decomp https://ift.tt/dl8ygqQ

Show HN: Feature detection exploration in Lidar DEMs via differential decomp I'm not a geospatial expert — I work in AI/ML. This started when I was exploring LiDAR data with agentic assitince and noticed that different signal decomposition methods revealed different terrain features. The core idea: if you systematically combine decomposition methods (Gaussian, bilateral, wavelet, morphological, etc.) with different upsampling techniques, each combination has characteristic "failure modes" that selectively preserve or eliminate certain features. The differences between outputs become feature-specific filters. The framework tests 25 decomposition × 19 upsampling methods across parameter ranges — about 40,000 combinations total. The visualization grid makes it easy to compare which methods work for what. Built in Cursor with Opus 4.5, NumPy, SciPy, scikit-image, PyWavelets, and OpenCV. Apache 2.0 licensed. I'd appreciate feedback from anyone who actually works with eleva...

Show HN: VectorDBZ, a desktop GUI for vector databases https://ift.tt/JMgUzOX

Show HN: VectorDBZ, a desktop GUI for vector databases Hi HN, I built VectorDBZ, a cross-platform desktop app for exploring and analyzing vector databases like Qdrant, Weaviate, Milvus, and ChromaDB. It lets you browse collections, inspect vectors and metadata, run similarity searches, and visualize embeddings without writing custom scripts. GitHub (downloads and issues): https://ift.tt/Hv0BOko Feedback welcome. If it’s useful, starring the repo helps keep me motivated. Thanks. https://ift.tt/Hv0BOko January 1, 2026 at 08:55PM