Hacker Newsnew | past | comments | ask | show | jobs | submit | freakynit's commentslogin

I have said this before as well: these top-of-the-line models write clever, convoluted code. The code looks intelligent from above, but is a maintenance headache. Makes entire thing fragile for future developments on top of it.

The smaller models, especially the aforementioned ones, they fail much more, but, do not write that insanity of the code. They do simple, non-clever coding like humans do. Much easier to maintain and build upon.

Qwen-3.6-27b is a wonderful model. Exceptionally good for it's size, and excellent in general as well. And with mtp available now, it can run at 60+ tps on a single 3090... this is roughly 30% faster tgs than most of the hosted ones being served from giant data-centers.


It keeps "branding" intact. This is similar to red-bull spending $3 BILLION annually on marketing... it's to keep the brand alive in the minds of folks and "associate" them as kinda the de-facto in given industry.


"damaged", even after paying 99$ : https://github.com/electron/notarize/issues/205#issuecomment...

See last 20'ish comments.


You can run the two agents in GAN like loop.. each trying to better the other. Give them a common task like design a better alternative to transformer model that uses max O(nlogn) memory, and the result comes closest to existing n^n implementations.

Good idea actually.. why haven't I tried this before.


As much as I absolutely love Mimo V2.5 Pro (it's a genuinely good model), I absolutely hate the way they calculate usage in their token plan.

For example: For a super small task in a small project that should not be consuming more than 500K total tokens after all tool calls included, their shown usage shot up to 152 million tokens.

But, when I scroll down on the same page, a table shows usage as 3 million tokens, out of which 2.5 million were cached.

This is such a huge conflict on the very same page. The bad thing is that the usage progress bar is shown against that 150 million token usage, not against that 3 million one.

This has been in discussions for at least past 3 months on reddit as well, and was precisely the reason I subscribed to their lowest tier, and for a single month only.

Update: their own harness, mimocode, shows total token usage as just 63.1K. We now have 3 entirely different values, differing in 3 orders of magnitude.

Update 2: So, I did the exact same task this time using DS4Pro, and total token usage was just 101K (as shown by opencode).


It's very confusing. They have tokens for their API and credits for their "token plan".

Even worse... they use both terms on the same page in dashboard.

"""

Credits 4,100,000,000 Credits

Total Token Consumption

"""


Ooooo... I now realize the trick. It's a mental play... give 1000x of "credits" but charge in same old tokens.

I prefer this to Anthropics "in the next 5 hours you have X tokens to spend, and we are not going to tell you what X is"

Here token price is model price + busy hours surcharge, and BTW nowhere near 1000x


No doubt this is better than Anthropics... and yes, I computed the ratio again: it's 38.92 (at least based on what's shown in my dashboard).

Watch Dogs: Legion

Nice... I built something same last year (not active anymore): https://www.producthunt.com/products/zenquery

It was able to query csv, json, parquet and xlsx files, all locally. You could also mix files of different types in single session, and, manage multiple sets of files as individual sessions that you can switch easily whenever needed.


I built https://pagey.site initially for my own self since I was creating an insane lot of small one-page sites using LLM for various utilities/calculators/explanations etc. and needed a single place to host them.

Initial version was super minimal, but then, a lot of people started using it. So, revamped it into full product and launched it. Currently hosting over 300 sites, with 10 new sites getting added daily. All free for now. I personally has 53 sites as of now.

Sample stuff of mine:

1. [latency-numbers-2026] - https://latency-numbers-2026.pagey.site/

2. [NVIDIA (NVDA) — The Short / Underperformance Thesis] - https://nvidia-stock-analysis.pagey.site/

3. [Shamir's Secret Sharing] - https://shamirs-secret-sharing.pagey.site/

4. [Build Software Like It Needs To Last] - https://building-software-the-right-way.pagey.site/

5. [NPM Supply Chain Attack Techniques] - https://npm-supply-chain-attack-techniques.pagey.site/

6. [NPM Ecosystem Threat Report (May 19, 2025 - June 1, 2026)] - https://npm-supply-chain-attacks-25-26.pagey.site/

https://pagey.site


I loved the 'Build Software Like It Needs To Last' one. I feel like it's something that could be made into a poster and hanged where I could see it first thing in the morning.

Did this research last week on the same thing:

NVIDIA (NVDA) — The Short / Underperformance Thesis - https://nvidia-stock-analysis.pagey.site/

This is pump and dump on the largest scale we have ever seen. It's effectively privatize the profits and socialize the losses.


This doesn't make sense. It completely misses that there are tons of other companies that will happily take the Nvidia capacity at the same price if their biggest customers reduce spend.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: