Show HN: Gemma 4 Multimodal Fine-Tuner for Apple Silicon https://ift.tt/l6BkI4n

Show HN: Gemma 4 Multimodal Fine-Tuner for Apple Silicon About six months ago, I started working on a project to fine-tune Whisper locally on my M2 Ultra Mac Studio with a limited compute budget. I got into it. The problem I had at the time was I had 15,000 hours of audio data in Google Cloud Storage, and there was no way I could fit all the audio onto my local machine, so I built a system to stream data from my GCS to my machine during training. Gemma 3n came out, so I added that. Kinda went nuts, tbh. Then I put it on the shelf. When Gemma 4 came out a few days ago, I dusted it off, cleaned it up, broke out the Gemma part from the Whisper fine-tuning and added support for Gemma 4. I'm presenting it for you here today to play with, fork and improve upon. One thing I have learned so far: It's very easy to OOM when you fine-tune on longer sequences! My local Mac Studio has 64GB RAM, so I run out of memory constantly. Anywho, given how much interest there is in Gemma 4, and frankly, the fact that you can't really do audio fine-tuning with MLX, that's really the reason this exists (in addition to my personal interest). I would have preferred to use MLX and not have had to make this, but here we are. Welcome to my little side quest. And so I made this. I hope you have as much fun using it as I had fun making it. -Matt https://ift.tt/IMuKU9H April 8, 2026 at 01:07AM

Search This Blog

News updates

Show HN: Gemma 4 Multimodal Fine-Tuner for Apple Silicon https://ift.tt/l6BkI4n

Comments

Post a Comment

Popular posts from this blog

Show HN: Smol machines – subsecond coldstart, portable virtual machines https://ift.tt/gnYXFml

Show HN: web-pinentry: a pinentry program that leverages matrix and http https://ift.tt/qBfREVw

Show HN: I benchmarked Gemma 4 E2B – the 2B model beat the 12B on multi-turn https://ift.tt/izHYx7V