Agreed (not sure what you mean by UI-based hosting).

oMLX does the caching I need to fit models whose size is close to total system memory, and it handles most of the work of finding usable models. After months of cobbling together various solutions, I now just use oMLX, often from Xcode. I can tell the difference between Gemma-4 (local/free) and Claude (paid) only on the largest tasks.
