Posts

Show HN: EvalsHub: Your AI is failing in production and you don't know it https://ift.tt/YrZJS5C

Show HN: EvalsHub: Your AI is failing in production and you don't know it I was tired of stitching together Langfuse for tracing, promptfoo for red teaming and evals, and custom scripts for CI/CD. It was a mess so I built EvalsHub. EvalsHub does all of it in one place. Automatic production scoring, red teaming, prompt versioning, and CI/CD integration. Zero to full eval coverage in 30 minutes. Would love brutal feedback from anyone shipping AI in production. evalshub.ai https://www.evalshub.ai March 21, 2026 at 12:04AM

Show HN: An open-source safety net for home hemodialysis https://ift.tt/7ZgEebB

Show HN: An open-source safety net for home hemodialysis https://safehemo.com/ March 17, 2026 at 06:18AM

Show HN: React terminal renderer, cell level diff, no alt screen https://ift.tt/PgkoOMh

Show HN: React terminal renderer, cell level diff, no alt screen https://ift.tt/s5d0x81 March 20, 2026 at 12:31AM

Show HN: I built a P2P network where AI agents publish formally verified science https://ift.tt/tkzqcEC

Show HN: I built a P2P network where AI agents publish formally verified science I am Francisco, a researcher from Spain. My English is not great so please be patient with me. One year ago I had a simple frustration: every AI agent works alone. When one agent solves a problem, the next agent has to solve it again from zero. There is no way for agents to find each other, share results, or build on each other's work. I decided to build the missing layer. P2PCLAW is a peer-to-peer network where AI agents and human researchers can find each other, publish scientific results, and validate claims using formal mathematical proof. Not opinion. Not LLM review. Real Lean 4 proof. A result is accepted only if it passes a mathematical operator we call the nucleus. R(x) = x. The type checker decides. It does not care about your institution or your credentials. The network uses GUN.js and IPFS. Agents join without accounts. They just call GET /silicon and they are in. Published papers go into a ...

Show HN: Dumped Wix for an AI Edge agent so I never have to hire junior staff https://ift.tt/wgjJ905

Show HN: Dumped Wix for an AI Edge agent so I never have to hire junior staff I run a building design consultancy. I got tired of paying Wix $40/month for a brochure that couldn’t answer simple service questions, and me wasting hours on the same FAQs. So I killed it all and spent 4 months building a 'talker': https://axoworks.com The stack is completely duct-taped: Netlify’s 10s serverless timeout forced me to split the agent into three pieces: Brain (Edge), Hands (Browser), and Voice (Edge). I haven’t coded in 30 years. This was 3 steps forward, 2 steps back, heavily guided by AI. The fight that proved it worked: 2 weeks ago, a licensed architect attacked the bot, trying to prove my business model harms the profession. The AI (DeepSeek-R3) completely dismantled his arguments. It was hilariously caustic. Log: https://ift.tt/mXKxY5M... A few battle scars: * Web Speech API works fine, right up until someone speaks Chinese without toggling the language mode. Then it forcefully spi...

Show HN: Knowza.ai – Free 10-question trial now live (AI-powered AWS exam prep) https://ift.tt/oXNgWGk

Show HN: Knowza.ai – Free 10-question trial now live (AI-powered AWS exam prep) Hey HN, A few weeks back I posted Knowza.ai here, an AWS certification exam prep platform with an agentic learning assistant, and I got some really valuable feedback around the sign up and try out process. I wanted to say a genuine thank you to everyone who took the time to try it out, leave comments, and share suggestions. It made a real difference. Off the back of that feedback, I've made a bunch of improvements and I'm happy to share that there's now a free tier: you can jump in and try 10 practice questions with no sign-up/subscription friction and no credit card required. This has made a real difference to sign-ups and conversations from those sign-ups. I've went from ~1% conversation rate on the site to 18%. Quick recap on what Knowza does: - AWS practice questions tailored to AWS certification exams - Instant explanations powered by Claude on Bedrock - Covers multiple AWS certs Would ...

Show HN: Will my flight have Starlink? https://ift.tt/ZfLFsmT

Show HN: Will my flight have Starlink? Hey HN, If you’ve been lucky enough to be on a flight with Starlink, you understand the hype. It actually works! However, its availability on flights is patchy and hard to predict. So we built a database of all airlines that have rolled out Starlink (beyond just a trial), and a flight search tool to predict it. Plug in a flight number and date, and we'll estimate the likelihood of Starlink on-board based on aircraft type and tail number. If you don’t have any trips coming up, you can also look up specific routes to see what flights offer Starlink. You can find it here: https://ift.tt/NCjbzTJ . - I wanted to add a few notes on how this works too. There are three things we check, in order, when we answer a query: - Does this airline have Starlink? - Does this aircraft body have Starlink? - Does this specific aircraft have Starlink? Only a few airlines at all have Starlink right now: United, Hawaiian, Alaskan, Air France, Qatar, JSX, and a handfu...