I think pooling/donating tokens will be a thing. Not sure if like this, but in some format. The Django project, for example, came out and said they don't want your tokens, but I think a lot of people/projects will (do?) want your tokens.
I guess if you have a subscription with a token allowance that you are not going to use up this week, it’s better to let someone else use those tokens rather than throwing them away. So using the food analogy it’s more like a store giving away unsold sandwiches to the homeless at the end of the day instead of throwing them away.
Donating tokens to a software project is a bit like donating food to a hungry person.
I think it might be beneficial to use blockchain, so that the donor can audit which prompts the token-pool they donated too performed. Perhaps donating tokens can also give you votes on which prompts are entered.
It’s a lot more like giving a hungry Hindu a gift card to a specific non-veg restaurant. Maybe they’ll use and enjoy it, maybe they’re vegetarian and will be insulted; either way that restaurant benefits. Especially if the hungry person exceeds the value of the gift card.
The good ones all seem to be pointing in the direction of Django. Which, on its own, says a lot about how likely people will care about vibe-coded anything, whether pooled or not.
This is fitting/perfect. Pokemon go is THE archetypical surveillance capitalism app. Be a drone in surveillance capitalism, know that your behavior will be used like this. Drones generally don't know or care though. Drones just have fun with tech yay fun awesome. Pokemon, gotta catch them all!!
Upd: I meant big picture, not with respect to this model release. Where do subscriptions figure into their strategic vision. Will consumers end up paying enterprise prices in the future?
In the blog post they say when sufficient capacity allows them to do so they aim to restore Fable 5 as a standart part of subscription plans and intend to do so as quickly as they can.
I don't think they'll phase out subscriptions ever, their whole play has been to drive demand from the bottom up. Get engineers hooked on building with claude at home, then get them to demand the ability to use it at work, and bend over their employer with no lube.
They'll probably tighten the quotas to reign in whales though.
They almost certainly already make a fuckload more money off API pricing than they do subscriptions, even if there might be more total subscription users. So offering subscriptions even at some loss is probably going to continue. Honestly, I'd be surprised if they even lost money on most subs; there are definitely Token Whales out there who mess up all the accounting up, though.
Realistically I think Anthropic just has insane demand but finite capacity to run models, and Fable will just make them more money if they dedicate it to API pricing. I suspect the goal here is something like: get individual engineers/PMs on their personal plans to taste Fable and then go to their meetings and say "Yes doubling the price of every single input/output token is a good idea, boss".
But how is this sustainable? It's not like paying $5000 per feature means you'll be refunded if prompting "make no mistakes" didn't work.
The only reason why I pay $200 is because LLM's errors costs me that much, at worst. If "make no error" starts working - sure. But surely, unless you have millions of dollars of cash to burn, a coin flip that costs $5000 is an insane idea?
I certainly hope not. PAYG is not predictable enough for smaller companies or individuals. Where I work (non-tech company), PAYG would never fly. We aren't big enough for that. Of course, you can set usage budgets, but there's a pretty big difference between $200/user/month vs. the equivalent PAYG usage being closer to $1,000/user/month, if you currently use the subscription plan to its limits each week.
Going PAYG only will effectively take these tools away from a huge amount of people and accelerate the push for local LLMs.
OTOH, accelerating the push for local LLMs would also be fine with me.
I doubt it, given the importance of those subscriptions for building and maintaining market awareness.
The AI landscape is changing rapidly, and with Apple announcing the option to change the AI backend, and potential requirements enable AI choices as well, similar to EU browser choice requirements (this is more reading tea leaves than any actual requirements I am aware of). The new OS changes coming to support Googlebook, and deep Copilot/AI integration into Windows will make maintaining user facing subscriptions essential for independent model developers like OpenAI, Anthropic, and Mistal to remain relevant longer term.
If the don't maintain that relevance there is increasing likelihood that they will get consumed by other companies whether it's Apple, Microsoft or Google to form a foundation for their OS, or other cloud providers.
That make sense, but what about the specific bifurcation we're seeing here of super primo models versus still good models being available to subscriptions?
It's kind of annoying not getting access to the primo model and paying 200 bucks a month. I understand 200 bucks a month is basically nothing though.
Like I don't totally understand why they'd let me have it for a couple weeks and then take it away and say I can have it but I have to pay retail and retail is like $1,000 a day.
It's better to have loved and lost than to have never loved at all??
It's a trade-off. Every hyperscaler is buying and building compute capacity as fast as they can dodge red tape. There is limited compute capacity, and scarcity is a real thing.
As a consumer I can choose to buy subscriptions to a range of things, including $5 droplets or VMs on a broad range of cloud hosting providers. I can even buy cheap bare metal at a bunch of providers at an affordable retail rate.
I can also buy "unlimited" AI packages that will be optimized to fit the cost model from a variety of services, with different impacts, such as rolling outages when I consume a daily or hourly allotment.
Right now VC and the investor class are subsidizing the rapid evolution of the services and availability, but that VC is running out. In more traditional economies, AI would have developed and rolled out more slowly, and through metered subscriptions, with the eventual rolling out of "unlimited" packages like telephone, internet, or cell services once the market became commoditized.
We have seen a big inversion of that with the race to "win" AI marketshare. Now the true cost is being exposed, and the most competitive and capable models are hideously expensive to operate, so it makes sense that we are moving to metered billing for a utility service. If you want gas, you can buy regular or premium. If you have a premium car you definitely want the premium, but for most people regular is good.
Give it a couple of years, and the survivors will settle around fairly industry standard models of consumer grade services, pro-sumer accounts, and business/enterprise models.
Things are still shaking out, but I get the sadness. Luckily I work at a big tech company who is banging the drum on doing experimentation so I use my prosumer claude pro and other accounts at home for hobby stuff, and save my heavy lifting and potentially experimentation for work :P
It's interesting living through all this civilizational flailing. Remarkable. I read that when people invented writing it was very controversial because the facility of memory was the prized intellectual capability of the time. I never thought I'd see this kind of thing in person
This is going to be very different though. These companies will just start investing in in house cheaper inference that's been on the front page here the last few weeks.
Oh absolutely. When asked by executives how I think they should adopt AI I've always been upfront about its strengths and weaknesses, and unlike a lot of technologies you can try out a lot of different things on much cheaper technology. As long as they throw out the notion that "more expensive" AI is "better" AI, it gives them a good way to compare workflows with and without machine learning augmentation.
Certainly you realize that these comments exist for more than a single person right? You expect potentially hundreds of viewers to each burn through AI tokens instead of just getting a direct and relevant answer here? This has the same vibe as the old forum posts where the only response was a "google it".
reply