Hacker Newsnew | past | comments | ask | show | jobs | submit | bushbaba's commentslogin

I'm perfectly happy at claude opus 4.6. All improvements since then have not meaningfully improved my day to day. If i can get 4.6 on my laptop for 5-10k, i'd gladly start shifting my ~1k/month Anthropic spend over.

Some of the harness even let you run a local model for most things, and only pay for the latest frontier models when needed, which cuts down cost drastically.


Not necessarily, inference speed also has huge time aspect. For example anthropic takes nearly twice as long as OpenAI models for my tasks with both having similar success rates.

Easy to test against by going multi model audit

The top U.S. officials do the same when traveling to any country. Everyone does this to everyone else.

Not true. Israel has a reputation for being exceedingly intrusive and aggresssive with US personnel. Just go watch intel community interviews online.

As is the US. Go watch same interviews.

So? The claim was "Everyone does this to everyone else"

Agreed. The whole intensity discussion is pointless.

Saab uses the Bombardier Global 6500 here. Which is entirely manufactured in Canada. This is less politics and more about economics


Not 'entirely', the 6500's wings are made in Japan, the rear fuselage and tail in Northern Ireland and the engines in England.


Not a valid excuse without knowing what their historical growth rate has been. And how much of the instability is load related.


GitHub has been publishing their growth numbers since at least 2016: https://octoverse.github.com/2016/

However, they have reported numbers along rather inconsistent dimensions. Like, historically they've focused on number of repos and users and later PR's and issues, and often catch-all terms like "contributions" which includes all of those + comments etc... but the number of commits alone (which apparently is the main culprit now?) has been mentioned very sporadically. This has made it hard to get a consistent sense of historical growth.

Without any other information, however, it is reasonable to assume that a 14x in commits is the prime candidate for instability. Especially since commits are write traffic, which is much harder to scale than read traffic. Plus every 3 - 5x increase in scale can reveal bottlenecks in your distributed systems that you never knew existed, so they probably have like 2 - 3 "generations" of bottlenecks to figure out!


Makes prior assumptions that getting tens of gigs of ram is cheap thrown out the window. Would likely lead to super fast SSDs such as optain being way more valuable


The price of SSDs is similarly depressing.


Not really I’d not call Google the safest/most popular choice for AI or Cloud.


If cabins are water tight you risk carbon monoxide poisoning


Carbon monoxide... in an EV?


If you've ever built software, you'll know that regressions are all too common. Especially when AI/ML is involved.

It's likely they patch this and cause 2 other bugs in the process.


> If you've ever built software, you'll know that regressions are all too common. Especially when AI/ML is involved.

"AI/ML" has delivered far more complete testing criteria than any "QA expert" has. It's absolutely crazy to me the number of people who defend the status quo in software testing when software quality has been on the decline for over a decade. But sure. "AI/ML" is the problem, not shit developers who never considered that angle in the first place.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: