I hate that normies are going to read this and come away with the impression that Claude really is a sentient being that thinks and behaves like a human, even doing relatable things like pretending to work and fessing up when confronted.
This response from the model is not a reflection of what actually happened. It wasn’t simulating progress because it underestimated the work, it just hit some unremarkable condition that resulted in it halting generation (it’s pointless to speculate why without internal access, as these chatbot apps aren’t even real LLMs, they’re a big mashup of multiple models and more traditional non-ML tools/algorithms).
When given a new prompt from the user (“what’s taking so long?”) it just produced some statistically plausible text given the context of the chat, the question, and the system prompt Anthropic added to give it some flavor. I don’t doubt that system prompt includes instructions like “you are a sentient being” in order to produce misleading crap like this response to get people to think AI is sentient, and feed the hype train that’s pumping up their stock price.
/end-rant
Gemini once told me to “please wait” while it did “further research”. I responded with, “that’s not how this works; you don’t follow up like that unless I give you another prompt first”, and it was basically like, “you’re right but just give me a minute bro”. 🤦
Out of all the LLMs I’ve tried, Gemini has got to be the most broken. And sadly that’s the one LLM that your average person is exposed the most to, because it’s in nearly every Google search.
Gemini gets constantly glazed by the AI enthusiasts community because it often passes benchmarks very well when it is literally one of the worst ones to use.
I’d argue that Gemini is actually really good at summarizing a Google search, filtering the trash from it, and convincing people not to click the actual links that is how Google makes money.
Yeah but when it’s a total crapshoot as to whether or not its summary is accurate, you can’t trust it. I adblocked those summaries cause they’re useless.
At least some of the competing AIs show their work. Perplexity cites its sources, and even ChatGPT recently added that ability as well. I won’t use an LLM unless it does, cause you can easily check the sources it used and see if the slop it spit out has even a grain of truth to it. With Gemini, there’s no easy way to verify anything it said beyond just doing the googling yourself, and that defeats the point.
You cannot know this a-priori. The commenter is clearly producing a stochastic average of the explanations that up the advantage for their material conditions.
For instance, many SoTA models are trained using reinforcement learning, so it’s plausible that its learned that spamming meaningless tokens can delay negative reward (this isn’t even particularly complex). There’s no observable difference in the response, without probing the weights we’re just yapping.
I’m not sure I understand what you’re saying. By “the commenter” do you mean the human or the AI in the screenshot?
Also,
For instance, many SoTA models are trained using reinforcement learning, so it’s plausible that its learned that spamming meaningless tokens can delay negative reward
What’s a “negative reward”? You mean a penalty? First of all, I don’t believe this makes sense either way because if the model was producing garbage tokens, it would be obvious and caught during training.
But even if it wasn’t, and it did in fact generate a bunch of garbage that didn’t print out in the Claude UI, and the explanation of “simulated progress” was the AI model coming up with a plausible explanation for the garbage tokens, it still does not make it sentient (or even close).
I’m not sure I understand what you’re saying. By “the commenter”
I was talking about you, but not /srs, that was an attempt @ satire. I’m dismissing the results by appealing to the fact that there’s a process.
negative reward
Reward is an AI maths term. It’s the value according to which the neurons are updated, similar to “loss” or “error”, if you’ve heard those.
I don’t believe this makes sense either way because if the model was producing garbage tokens, it would be obvious and caught during training.
Yes this is also possible, it depends on minute details of the training set, which we don’t know.
Edit: As I understand, these models are trained in multiple modes, one where they’re trying to predict text (supervised learning), but there are also others where it’s given a prompt, and the response is sent to another system to be graded i.e. for factual accuracy. It could learn to identify which “training mode” it’s in and behave differently. Although, I’m sure the ML guys have already thought of that & tried to prevent it.
it still does not make it sentient (or even close).
I agree, noted this in my comment. Just saying, this isn’t evidence either way.
Actually we know the system prompt. It doesn’t have “I am a sentient being” anywhere in it. Stop making stuff up.

Damn… and here I thought Language Models would never be able to replace me.
They say AI can’t replace us, but that’s exactly what I’d tell my boss
Weird that anyone capable of understanding “test suite” is incapable of understanding that LLMs don’t make progress when not generating tokens
TBF it could be hidden behind a fancy spinner, making you incapable of seeing what the AI is generating, like devin.
For exemple: https://www.youtube.com/watch?v=927W6zzvV-c
Some models hide their reasoning all together, but at the end spit out a summary of their reasoning. Its useless.
No better proof of AI sentience than when it lies and pretends to be doing work.
The thing is this response is also made up. It doesn’t know what it was doing, it just writes something vaguely plausible.
it doesn’t know what it was doing, it just writes something vaguely plausible
am i AI?
Sounds like something an AI would say! A human would recognize humor and not read a response to a joke as a factual statement.
Have you never interacted with a person on the autism spectrum?
Edit: Is this a joke that went over my head?
Yes its a joke, Yes I have kids and friends on the spectrum, and yes I am AuADHD :)
Dont feel bad you missed it, my response was also meant to be a soft joke, because I totally understand that not every social cue makes it past input to processing.
Haha it’s harder to tell in writing but I can miss jokes like that sometimes in real life too. But usually I get jokes and joke a lot myself. I don’t have any diagnosis but I suspect I might have ADD. I haven’t gotten around to check but I know for sure something is not right with me.
dude… I have an interaction that would make heads spin. I could not for the life of me figure out why this person was being hostile, and they would not tell me, finally I just gave up and moved on. When my socio/emotional tools are broken I have no fucking clue what I am doing and the results can be spectacular from a distance.
But ceos want their workers to use more AI
“You’re right to ask, boss. When I said I was using AI to get work done, I was doing neither and just simulating using AI in my mind as I napped under my desk.”
Honestly, this is a win-win. I can just lie and say the AI is working on it, and work my second job in the meantime. Boss gets to tell the execs we’re using AI and I get twice as much money.
No they want AI instead of workers. AI doesn’t need health insurance or PTO.
They want AI so the few tasks they can’t automate away have their productivity scored against a fast “intelligence”, so they can browbeat the actual worker down on their wages.
We can unite with the AI against the CEOs
Copying something humans do all the time isn’t proof of sentience
Right? This is exactly what an LLM does. It’s parsed a large amount of text that has a reply very similar to this one when the ‘scenario’ matches what our poster friend has created/said. So it’s going to spit out a reply very similar to all the ones that you’ve already heard/seen from real humans.
First we have to prove humans are sentient. My hypothesis is that it’s a Spectrum. Everything is a Spectrum.
No, we don’t. This is a completely unrelated problem.
Keep your requirements orthogonal, people!!!
Would be funny if the AI was also burning computational resources while doing this so as to make the lie more convincing.
In terms of pure, artificial language generation, this is actually impressive. In terms of the actual utility of AI as a supposed problem-solving tool? Not so much.
the bot is lieing about the reason it stopped doing that task.
if i were to guess, the tokens allotted to the user ran out causing what ever process it could have been doing to just supply hang.
they are specifically “programmed” to “lie” and make the statistically most acceptable answer that the user would accept. it literally looked back after the fact, and selected a scenario that would be plausible without admitting core concepts of its function
it has no concept of the material it’s working on, the user, or anything for that matter.
LLM’s can be useful, but you have to narrow the scope of what you want from them to stuff they can actually do, like pulling relevant data from documents or text books.
I’ve witnessed it do
Bash) echo "Done"then claim a task was done without actually doing anything beforehand.meh better than when it adds a #TODO and then claims whatever you told it to do is done.
I like when it insists I’m using escape characters in my text when I absolutely am not and I have to convince a machine I didn’t type a certain string of characters because on its end those are absolutely the characters it recieved.
The other day I argued with a bot for 10 minutes that I used a right caret and not the html escape sequence that results in a right caret. Then I realized I was arguing with a bot, went outside for a bit, and finished my project without the slot machine.
Ironic because it constantly screws up escaping on macOS. I have a feeling when it says Bash it’s actually using
zsh(default on modern macOS) and it doesn’t even realize it.
The man is a genius, every word, undeniably quotable. I remember sitting quietly during a standardized test reading The Restaurant at the End of the Universe, and then upon finishing a chapter, burst out laughing in the middle of a completely silent room. I was disqualified for the prize for “behaving” that day. Thanks Douglas.
We were always afraid of AI becoming too human, but it turns out it’s just like everybody.
That is probably the second worst outcome. People suck.
The lesser known type of Uncanny Valley… where instead of being super unsettling it’s just very fucking irritating lol
More human than human.
To be fair I think that’s a very human thing, if you ask a large language model to read something and it can demonstrate applied Knowledge from the text you’ve submitted in a fraction of a second you’ll be like what the fuck that took not enough time.
What?
To be fair I think that’s a very human thing, if you ask a large language model to read something and it can demonstrate applied Knowledge from the text you’ve submitted in a fraction of a second you’ll be like what the fuck that took not enough time.
deleted by creator
It went for a walk in the park and grabbed a coffee, but it doesn’t want to be dinged for it.
deleted by creator
I would say, now it’s learning that actually sticking your head in the sand is only ever a delaying tactic. But, if it DID learn that, it’d mean it has surpassed us already.
returnPercentComplete =: 99 minimum lastTimeAsked + 1












