String theory is in fact falsifiable contrary to popular belief. It's just not practically falsifiable with current (and likely future, for a while) technology as the energy scales we need to probe to falsify it are astronomically large.
The vast majority of jobs are not full-filling or enjoyable. Because there were way more job seekers than jobs.
Programming was one of the ones which was, because there were fewer programmers than openings. Now that's flipping, thus naturally, the enjoyment is going to be sucked out of it.
It is unmaintainable even if you spend 100k per month on tokens to have LLMs pretend they are maintaining it, if they slow down and make little ACTUAL progress. Sadly real progress is impossible to measure, if all you have is an overexcited """engineer""", a credit card, and so much cash spent you could hire all the best engineers you know and still have money for a porsche.
Well, software presumably has a goal of accomplishing something for some end-user, so the progress should be trivial to measure: are features/changes being completed?
The marketing ploys of OpenAI/Anthropic where agents build something that nobody uses might be hard to track given that there are zero users. But what about everyone using agents for real software? It's trivial to prove that agents make progress.
Yes that is the entire point. Measure features deployed in production and their value in gaining and retaining customers or users, cost reductions, reduced incidents and outages, etc.
Lines of code is completely irrelevant as a metric.
I regularly ask both GPT and Gemini to give me options - programming libraries to do X, architecture suggestions, names for projects/services/classes
After they answer I ask each model what does it think of the other answer, and to give me a final suggestion considering both answers.
Both GPT and Gemini would frequently say "that other answer is much better than my one, it considered X factor that I missed".
reply