alexwwang · 2026-06-13T23:50:15 1781394615

I think it’s just a tool, and it’s up to you to use it to reach more Chinese people and convince them of your idea. Don’t blame a tool; make proper use of it to build a better world.

paulddraper · 2026-06-14T00:01:21 1781395281

alexwwang · 2026-06-14T03:21:38 1781407298

If you knew what Chinese people are suffering mentally, you might understand why I say so. Criticizing a model is not a smart way to oppose a system.

alexwwang · 2026-06-13T13:23:50 1781357030

It is now available in the coding plan.

alexwwang · 2026-06-13T11:50:13 1781351413

I wonder whether these local models can really solve problems, especially for users who aren’t experts in a given programming language. Beyond inline autocompletion and implementing small units, I am not sure these models are capable of designing and writing tech specs that really work.

alexwwang · 2026-06-13T05:11:06 1781327466

Yes. This ban is really not a good idea. As the US is gradually isolated by its government’s policy, the world becomes more and more dangerous. What’s worse, the traditional value of openness to competition that Americans have held for centuries seems to be getting replaced step by step. It’s absolutely a tragedy.

alexwwang · 2026-06-13T04:57:50 1781326670

People always exaggerate the things they don’t understand.

gobdovan · 2026-06-13T08:21:25 1781338885

Sometimes they exaggerate the things they understand.

alexwwang · 2026-06-13T09:13:36 1781342016

Yes. But they are a minority.

gobdovan · 2026-06-13T09:35:46 1781343346

was referencing Anthropic

alexwwang · 2026-06-13T11:35:03 1781350503

I knew it as soon as you said it. But politicians don’t fully understand what they face.

alexwwang · 2026-06-13T03:50:03 1781322603

I hope so. But how? Who is going to fund these projects, and how do you coordinate with all sides? This is complex. The only thing I am sure of is that open source AI won’t lack users.

alexwwang · 2026-05-29T23:49:06 1780098546

I agree. MCP might be useless in a personal scenario, but it absolutely serves as service infrastructure in organizations. It is another form of API for capabilities that are not yet wrapped in a REST API. Once they are wrapped in MCP, it seems unnecessary to wrap them again in a REST API or CLI in the near future. So these MCP services survive. The only thing that matters is how to import these MCP services into the agent context on demand, or in other words, by the gradual disclosure principle.
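
To make the "on demand" idea concrete, here is a minimal sketch of gradual disclosure. It does not use any real MCP SDK; `ToolEntry`, `ToolRegistry`, `index`, and `disclose` are hypothetical names invented for illustration. The assumption is that the agent first sees only one-line summaries, and a full tool schema enters the context only when the agent asks for that tool.

```python
from dataclasses import dataclass, field


@dataclass
class ToolEntry:
    """Hypothetical record for one tool exposed by some service."""
    name: str
    summary: str       # one-line description, always visible to the agent
    full_schema: dict  # detailed parameters, disclosed only on demand


@dataclass
class ToolRegistry:
    """Hypothetical registry; not part of any real MCP library."""
    entries: dict = field(default_factory=dict)
    loaded: set = field(default_factory=set)

    def register(self, entry: ToolEntry) -> None:
        self.entries[entry.name] = entry

    def index(self) -> list:
        """Cheap listing that can be placed in the context up front."""
        return [f"{e.name}: {e.summary}" for e in self.entries.values()]

    def disclose(self, name: str) -> dict:
        """Return the full schema only when the agent asks for this tool."""
        entry = self.entries[name]
        self.loaded.add(name)
        return entry.full_schema

    def context_size(self) -> int:
        """Rough proxy: how many full schemas are currently in context."""
        return len(self.loaded)


registry = ToolRegistry()
registry.register(ToolEntry("search_tickets", "Search the ticket system",
                            {"query": "string"}))
registry.register(ToolEntry("deploy", "Deploy a service",
                            {"service": "string", "env": "string"}))

print(registry.index())             # summaries only
print(registry.disclose("deploy"))  # full schema, loaded on request
```

The point of the design is that the up-front cost grows with the number of summaries, not with the total size of every schema.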

jimbokun · 2026-05-29T23:57:19 1780099039

Unless you also want humans to interact with your tools.

That’s covered in the article: a human can modify the commands generated by the agent, or vice versa, to debug problems or transfer knowledge.

alexwwang · 2026-05-30T00:35:17 1780101317

This, IMO, is another scenario. MCP is designed for, and acts as, a part of automated toolchains. These are two different types of needs. But in the case you mentioned, when some parts of the work should be automated, it’s also possible to use MCP there.

alexwwang · 2026-05-28T08:03:51 1779955431

Maybe the same type. Each time I call the LLM api the fan starts to work and make big noise. The temperature in the room noticeably rises by 1-2 degrees.

embedding-shape · 2026-05-28T11:33:51 1779968031

> Each time I call the LLM api the fan starts to work and make big noise

So every time you do HTTP calls? Nothing there should spin up your fans, unless you use an agent with a horribly broken TUI; I've heard there are a few of those out there. But remotely calling LLM APIs really shouldn't be taxing on your local device. Something somewhere is wrong/bad if that's what you're seeing.

alexwwang · 2026-05-28T11:39:35 1779968375

If the horribly broken TUI you mentioned is OpenCode, I’d say yes. That’s exactly what I am experiencing.

embedding-shape · 2026-05-28T12:15:03 1779970503

Sure, if that's what you're using, then that's definitely buggy. Unless it's doing compilation or something that actually uses your resources, just making HTTP calls shouldn't be heavy for your computer. Claude Code was mainly what I was thinking about, as it is similarly broken, but I'm sure there are more out there, as most of them seem vibe-coded at best.

alexwwang · 2026-05-28T13:37:46 1779975466

Yes. Whichever *code it is, they all behave the same when working. The Node.js backend is awful.

Filligree · 2026-05-28T12:13:36 1779970416

Is it a local LLM? Sibling seems to be assuming remote, but I have trouble imagining a TUI that inefficient.

alexwwang · 2026-05-28T12:16:16 1779970576

No. Simply the REST API calls in the OpenCode TUI. I don’t know why; maybe the MBP is too old, as it has served for 6+ years.

Filligree · 2026-05-29T12:58:24 1780059504

Not counting compilation passes, the rest of OpenCode is trivial enough that it should work on a 1980s PC.

alexwwang · 2026-05-29T20:02:24 1780084944

It’s just so weird.

alexwwang · 2026-05-27T12:00:06 1779883206

Understandable. You don’t want to lose control of your codebase, and you don’t trust that an LLM is competent to handle it fully.

lukan · 2026-05-27T12:17:37 1779884257

No. Because they still hallucinate at times. Confuse things. Forget things. Or none of the above, as it is anthropomorphizing, but the result is the same. They can make incredible working one shots, you start to trust them, then you trust too much and .. feel the result.

alexwwang · 2026-05-27T12:26:48 1779884808

Yes. I am fighting the LLM’s disobedience when it works through my pipeline commands. I believe these violations are caused by its hallucinations. So I am still developing a mechanical system to monitor agents’ behavior automatically. I believe these routines and monitors will act as scaffolding that keeps the LLM on the right path all the time.

xenadu02 · 2026-05-27T18:56:49 1779908209

The percentage of times I prompt claude "what about checking if there are any child processes running?" or "Would using a lock here greatly simplify the design?" only to have myself be correct is approaching 100%. That is it isn't just claude sycophantically agreeing with me. The code itself becomes smaller, simpler, and more reliable with fewer bugs.

The agents tend to produce working code, but the larger the scope, the bigger the mess they tend to make. They will happily evolve toward a local maximum but leave world-destroying bugs lurking in the implementation.

The other issue is that claude regularly ignores explicit instructions in CLAUDE.md or in prompts. It will "helpfully" decide to just start doing whatever it wants or reinterpret instructions completely differently than it did the last 100 times.

It has nothing to do with losing control or trust. LLMs are not conscious. They have no executive function. They aren't even thinking. They're just models predicting the next word in the script. They are very useful tools but that's all they are: tools.

notgenerated · 2026-05-28T09:32:51 1779960771

I also feel like we still need to steer Claude. It doesn't always help to have stuff in the CLAUDE.md (even when it's lean). I have a lot of cases where I still need to remind the agent to do something even if it's routine.

To me, that connects with working longer on the planning and specs. It requires reading and re-reading, but when that's done, implementation is usually much cleaner and adheres to your standards.

alexwwang · 2026-05-28T05:49:02 1779947342

Yes. They are tools. So my approach, or at least what I am trying, is to keep polishing the skills and to check the LLM’s output in loops with MCP, raising an alert on any abnormality as soon as possible so the LLM won’t go on to the next step and make things worse.
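
A rough sketch of that kind of gate, under stated assumptions: the step names, the `produce` callables standing in for LLM calls, and the `check` functions are all hypothetical, and a real setup would call an actual model and probably an MCP service instead. It only shows the control flow: check each output, and stop before the next step when a check fails.

```python
from typing import Callable, List, Optional, Tuple

# (step name, function producing the output, function validating the output)
Step = Tuple[str, Callable[[], str], Callable[[str], bool]]


def run_gated(steps: List[Step]) -> Tuple[List[str], Optional[str]]:
    """Run steps in order; stop at the first step whose output fails its check.

    Returns the names of the completed steps and the name of the failed
    step (None if every step passed).
    """
    completed: List[str] = []
    for name, produce, check in steps:
        output = produce()
        if not check(output):
            # Alert early: later steps never run on top of a bad output.
            return completed, name
        completed.append(name)
    return completed, None


calls: List[str] = []


def fake_step(name: str, output: str) -> Callable[[], str]:
    """Stand-in for an LLM call; records that it ran and returns fixed text."""
    def produce() -> str:
        calls.append(name)
        return output
    return produce


steps: List[Step] = [
    ("plan", fake_step("plan", "PLAN: add a lock"), lambda o: o.startswith("PLAN:")),
    ("test", fake_step("test", "tests skipped"), lambda o: "passed" in o),
    ("deploy", fake_step("deploy", "deployed"), lambda o: True),
]

done, failed = run_gated(steps)
print(done, failed)
```

In this example the "test" step fails its check, so "deploy" is never invoked.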

alexwwang · 2026-05-27T11:57:41 1779883061

Seems you have to make a face-to-face appointment, without any online devices in hand.