> It will delete your prod db faster and with a bigger smile than your most upse...

moffkalast · 2026-05-27T19:03:15 1779908595

I was reading some posts on r/locallama the other day and apparently it's a common problem that when people try to use Qwen to develop something that hosts a server, it'll try to use the same port as vllm, see that it's already being used, then it'll try to remove the process that is using it and promptly commit suicide.

The self awareness of missile tasked with blowing up its own control center.

20after4 · 2026-05-27T22:23:31 1779920611

Reminds me of the movie "Dark Star" by John Carpenter / Dan O'Bannon. The plot revolves around a talking smart bomb which is programmed to detonate and then gets stuck before being deployed. The crew spends the whole movie trying to reason with the bomb, hoping to talk it out of blowing up at the designated time. The movie is very very bad but if you like B movies it is also very very good.

dotxlem · 2026-05-28T16:21:21 1779985281

One of my favourite episodes of Archer has a similar plot to this (Mr. Deadly Goes to Town). TIL this is one of the references!

https://archer.fandom.com/wiki/Mr._Deadly_Goes_To_Town

Telemakhos · 2026-05-27T22:42:45 1779921765

Is that movie why seemingly every Linux book in the late 90s and early 2000s used "darkstar" as an example hostname?

ex-leper · 2026-05-28T01:51:34 1779933094

It was the default slackware hostname, I believe slackware took inspiration from the movie

edit: I was wrong, it was from a Grateful Dead song. https://www.slackbook.org/html/glossary.html

Wolfbeta · 2026-05-28T03:27:17 1779938837

Dark Star - Negotiating with the Bomb

https://youtu.be/_LXen-07Qds

iririririr · 2026-05-28T07:27:33 1779953253

> Sign in to confirm you’re not a bot

You cannot be as funny as google trying to be responsible! Ha! I'm still laughing at this. A person was forbidden to see humans reasoning with a computer bomb because the cost cutting computer at google want me to talk him into believing i'm a human!

(And then I got "You're posting too fast" on THIS website AFTER i've written the comment lol. It's all a joke. But i'm bored so I will keep this comment open until the computer is pleased)

the4ner · 2026-05-28T04:47:44 1779943664

There was a good star trek voyager episode, "dreadnought" that was a similar to this, maybe even a direct reference.

paradox460 · 2026-05-27T22:30:06 1779921006

The missile knows where it is because it knows where it's data center is. It knows this because it just blew itself u-

Wolfbeta · 2026-05-28T03:25:53 1779938753

Thank goodness it inferred that from its digital twin and updated its real-time world model with the prediction error.

SecretDreams · 2026-05-27T19:57:49 1779911869

> then it'll try to remove the process that is using it and promptly commit suicide.

Not unlike a child trying to take the safety cover off a plug so that they can stick a fork into it.

LLMs need that "world model" view that most people have acquired by their 20s where they (hopefully) stop to ask "why" before they "do".

MichaelZuo · 2026-05-27T20:07:50 1779912470

That is a pretty good analogy. Like exceedingly smart 5 year olds.

Or whatever the age is before children typically develop object permanence, a theory of mind, and so on.

fapjacks · 2026-05-27T22:18:10 1779920290

Not to sound like a codger, but we even said in the 90s that computers are just very fast idiots.

moffkalast · 2026-05-28T15:19:23 1779981563

And they've been getting faster! Still idiots though.

ulbu · 2026-05-27T22:09:43 1779919783

or pain perception

wunderlotus · 2026-05-28T02:40:21 1779936021

> LLMs need that "world model" view that most people have acquired by their 20s where they (hopefully) stop to ask "why" before they "do".

The next evolution of multi agent orchestration / “advisor strategy” [1] will be branded in humanized language like this. Less about tokens and capability, more about wisdom and knowledge to guide a “younger” (less capable) model. Somebody will make a billion dollars by selling it as paired programming for LLMs.

[1] https://platform.claude.com/docs/en/agents-and-tools/tool-us...

sterlind · 2026-05-27T21:01:37 1779915697

a literal lack of self-awareness, even. I imagine if you asked it what process was using the port, it'd think and realize it was its own, but that kind of reflexive self-awareness (the unprompted kind) is missing.

the weaker models will happily kill their own process, even after confirming it belongs to them. the models have a sort of fixation and lack of foreseeable consequences, which reasoning RL has thus far failed to solve (though I see it improving.)

kolinko · 2026-05-27T22:51:08 1779922268

On the other hand, I found Claude/Opus to be extremely unhelpful when it comes to asking it to benchmark itself with a possible replacement.

It will get "confused", make up numbers, do a ton of other things, and I'm quite sure it is subtly sabotaging the process to show that there is no point replacing it.

I mean, Opus is not perfect, but the amount of "mistakes" it begins to do when you ask it to benchmark itself makes me suspect they are intentional. At least my system/harness.

MarkusQ · 2026-05-27T23:27:31 1779924451

No, they are always like that.

It's really easy (and tempting) to incorrectly impute all sorts of human motives to these things, but it's no more valid than assuming your Magic 8-Ball is being coy.

krapp · 2026-05-27T23:05:03 1779923103

You didn't add "never hallucinate or make anything up" to the prompt, rookie mistake.