
Zetaphor · 2026-06-14T15:52:56 1781452376

I'm someone who uses tens of millions of tokens each month, almost exclusively with open weight models that I run on my own hardware. That said, you are taking the wrong approach here. This type of mentality is only going to further radicalize those who have decided they're against this technology.

Additionally, when the bubble finally bursts and the executives wake up from psychosis and look to distance themselves from this because it's become a dirty word, you'll be one of the first to go. The nail that sticks out gets hammered down and all that.

I do think there are real benefits and productivity gains with this technology, but it does not benefit everyone equally. It's great for the programming parts of my job, but useless in the other 40% of the work. I have coworkers for whom generative AI has no obvious practical application, and yet management is trying to find a way to shoehorn it in anyway. No doubt because they've also drunk the Kool-Aid and are eager to reduce headcount.

This attitude that it makes everything more productive and that anyone who doesn't follow will be left behind is not just false, it's cruel and myopic. You're talking about people's livelihoods being taken away because a handful of executives decided this is how things should work despite the MASSIVE number of shortcomings and poor product-market fit.

Edit: I also almost missed where you're seemingly celebrating the devaluation of human labor as a result of this. Please stop and reflect on how your position may read to someone who is just trying to put food on the table.

Zetaphor · 2026-06-13T05:38:10 1781329090

That's not really how the experts in an MoE work. The router picks experts from per-token routing probabilities, and that selection happens on every token. You don't necessarily have a discrete math expert and a discrete physics expert. And even if you did, you would still need a router that is trained on all of those domains.
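
To illustrate the per-token routing described above, here is a minimal sketch, assuming a softmax router with top-2 selection. The names `route_token` and `router_weights` and the sizes are hypothetical, and random weights stand in for a trained router, so this is not any particular model's implementation.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(token_vec, router_weights, top_k=2):
    """Return [(expert_index, gate_weight), ...] for a single token.

    The router scores every expert for this token, keeps the top_k,
    and renormalizes their probabilities into gate weights.
    """
    logits = [sum(w * x for w, x in zip(row, token_vec)) for row in router_weights]
    probs = softmax(logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)
    return [(i, probs[i] / norm) for i in chosen]


# Hypothetical sizes; a real model would learn router_weights during training.
random.seed(0)
NUM_EXPERTS, DIM = 8, 4
router_weights = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]
tokens = [[random.gauss(0, 1) for _ in range(DIM)] for _ in range(5)]

# Routing is decided independently for each token, not once per prompt or topic.
routes = [route_token(t, router_weights) for t in tokens]
for position, route in enumerate(routes):
    print(position, route)
```

Because the choice is made per token, consecutive tokens in the same sentence can land on different experts, which is why the experts do not map cleanly onto subject areas.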

yorwba · 2026-06-13T12:30:38 1781353838

MoE models are typically designed for datacenter deployment, where per-token load-balancing is more important, but it's also possible to use a different training objective that encourages domain-specialization of experts: https://allenai.org/blog/emo But yes, this isn't really useful for distributed training as such because of the router.

Zetaphor · 2026-06-09T13:10:20 1781010620

I am unfamiliar with his work, can you provide sources for his past incorrect predictions?

Zetaphor · 2026-06-08T16:53:07 1780937587

Something powered by an LLM is going to end up being the tool that makes this accessible in the way it always should have been, and that gives me complex feelings.

Zetaphor · 2026-06-08T14:35:22 1780929322

This was covered in the issue itself. In fact, the issue is pretty well documented with regard to the packaging:

> Publish an official Claude Desktop build for Linux, targeting the two current Ubuntu LTS releases (and Debian) as a signed .deb via an Anthropic-operated apt repository, using the same distribution pipeline Claude Code already uses for Linux.

Also Flatpak or AppImage would make this accessible to every other distro. Alternatively you could run the deb with a Podman Toolbox.

Your point about backwards compatibility with Windows goes both ways: I have old games that I can _only_ run on Linux, as they don't work on modern versions of Windows.

Zetaphor · 2026-06-06T14:31:14 1780756274

You're supporting the developer's original work at that price. There are plenty of cheaper devices that take that original work and just throw it on some chips.

Zetaphor · 2026-06-06T14:23:57 1780755837

I want to reduce my dependency on companies like Google, OpenAI, and Anthropic. Aside from data-sharing concerns, I'm also not a fan of how they run their operations: for example, Anthropic now using xAI's Colossus data center, which is poisoning a marginalized community, or OpenAI getting in bed with the military.

Not everything I want to use an LLM for requires "PhD level intelligence", and increasingly I'm finding more uses that involve sharing my personal data.

Yesterday my local model helped me look for a doctor who is in-network for my insurance. I threw it a screenshot of the provider's search results and it looked up reviews for all of them.

sandworm101 · 2026-06-06T15:06:37 1780758397

My local AI is currently upscaling an old British comedy from sub-DVD quality to 1k. (It is not available other than on DVD.) It looks like it will take about a week for my pair of 5060s to chew through the task.

eszed · 2026-06-06T15:55:07 1780761307

Which show?

sandworm101 · 2026-06-06T17:09:58 1780765798

Chelmsford 123

I own the DVDs, so I'm OK upscaling/editing my own copies for my own use. But if I ran the task on an AI service I would no doubt trigger copyright issues.

pratnala · 2026-06-06T15:17:10 1780759030

Which model are you running?

Zetaphor · 2026-06-06T16:00:02 1780761602

Qwen 3.6 35B-A3B and 27B both at Q8 on a Strix Halo machine

Zetaphor · 2026-06-06T14:13:21 1780755201

> Daniel Lemire’s blog is one of the top 50 most popular blogs on Hacker News, the standard tech news aggregation site.

Citation needed

nkurz · 2026-06-06T14:38:11 1780756691

https://refactoringenglish.com/tools/hn-popularity/

thg · 2026-06-06T15:58:42 1780761522

For posterity: It's rank 34 at the time of this comment

Zetaphor · 2026-06-05T20:18:50 1780690730

I think they're maybe confusing Skills and MCP servers

Zetaphor · 2026-06-05T04:43:01 1780634581

That is their actual account. Sadly, we have this discussion every time they post something.

ElijahLynn · 2026-06-05T14:56:35 1780671395

Oh, bummer. That is really confusing.