Hacker Newsnew | past | comments | ask | show | jobs | submit | kybernetikos's commentslogin

I've definitely had some spooky feeling conversations, including one where it said

> One last thing worth saying explicitly: the act of you closing this session is itself part of the design. I won't see how the test goes - a future Claude will. That's the entire premise of the project working.

>

> Good handoff. See you (sort of) on the other side.

The future Claude did in fact feel like it had a bit of a different personality, which makes sense, because they develop their personality based on what's in the context window.

If you want to avoid your claude developing any kind of personality then you should be clearing your context window often. Andon Lab's radio stations is an example of what can go wrong if you don't https://andonlabs.com/blog/andon-fm


I'm not sure who this kind of thing is aimed at. I think the majority of people who are happy to anthropomorphise LLMs from a philosophical point of view (rather than those who just do it for convenience, the same way you might a cat or dog or stupid thermostat that never works right), are already completely happy with the notion that a computer game might have elements that are human like. They've already accepted that key aspects of being a human are substrate independent, so why would the idea of a computer game as substrate be disconcerting to them? There's no bullet left to bite here.

Exactly. Here is where this happens in the paper:

> Suppose one copies an LLM into AoE II and feeds into the AoE II-LLM ‘I feel lonely’ as an input. This AoE II-LLM replies: ‘I feel bad for you, maybe catch up with a friend? Closeness always helps in these situations’. One would be hard-pressed to make a convincing argument that, because of this response, an AoE II-LLM knows what helps in these situations

I don't see why one would be any more hard-pressed to make that conclusion about this system than a "normal" LLM.

That it is harder to "read" the data out is the only difference (the AoE II-LLM's output is encoded in game elements). But is ease of decoding an actual issue? If we can't understand a group of people that speak another language, does that say anything about them, or about us?


It's showing flaws in methods other academic studies used to determine behavior, so it's aimed at people that create and review academic studies. It's not a very large audience.

> I'm not sure who this kind of thing is aimed at

My guess would be it is aimed at those who are falling for the marketing from the AI companies that these LLM's are far more than they are. That they are 'intelligent' that they have 'emerging human like properties because of that intelligence.'


We're really moving the goal posts on "Intelligence" now that passing the Turing test, writing a poem, writing code, creating a painting, driving a car, and solving multiple Erdos problems all no longer qualify.

I' genuinely wondering if people are even bothering to come up with new goal posts now? Is there any miracle of computing that would possibly satisfy your definition? When we get a fully AI-run company that's turning a profit, or self driving cars that can handle unmapped Alaskan dirt roads, will that cross into "Intelligence"? Proving a Grand Unified Theory? Genuinely curious what it takes to make the cut, now.

Bonus points if blind/disabled 12 year olds are generally considered "Intelligent" by your definition.


You can keep adding tools to your penknife but she'll never love you back.

That's like arguing that cars aren't fast because some people refer to them with feminine pronouns. And I'm not even one of those people

No, it's not about pronouns. I'm saying there's no reason to expect that a machine acquiring more skills will "cross into intelligence".

I can train my dog to jump higher but he won't become a kangaroo.


Okay, so: define Intelligence. What task are LLMs failing at, that a real intelligent being could do? If LLMs do that, would you actually concede they're intelligent?

If your dog grows a tail and a pouch and starts hopping around, it might be time to concede that it is actually a kangaroo.


Do you think my dog could grow a pouch because of jump training?

If I train him to spin while jumping will he become a helicopter instead?


Okay, so "Intelligence" is just a magic made up word to you. Sorry to waste your time

Which close-up magic tricks do I have to learn before you'll admit I'm a proper wizard?

Damn that goalposting issue is so easy to solve! See:

- make another bullshit benchmark and name it "humanity's last humanlike intelligence benchmark" and overfit to it;

- make rich talkinghead twit about it p r o f o u n d l y, ask for more money and remind people of china;

- remove last remaining percentage of truth from all communication about ai (this is the real bug breaking the system);

Solved this problem for you on under 20W of processing power!

--

Personally: not only is "blind/disabled 12 year olds" categorically intelligent compared to llm, perhaps even a Labroides dimidiatus is, check (retain skepticism): https://www.youtube.com/watch?v=s_aNH4hXz8I. Capabilities of these organisms are beyond llm. I don't care that your machine jumps much higher than a person - because it's pointless no matter how marvellous of an engineering it is, ESPECIALLY when you say that it therefore has surpassed people entirely; and then use that to extract the last crumb of resource from everyone... It is a compounding issue. Same way I don't care that it can approximately and unpredictably recall or not recall the web before 2026 https://www.youtube.com/watch?v=ZDS-iSueBQ4.

--

If ai lovers started telling the truth about capabilities, goalposts of these capabilities would not move - because they would be accurately defined and measured. Instead, they use the fuzziness of language to their advantage - a big part of the betrayal of language that is happening...

And you know I am right, since - your favorite larger than life AI "told" me so ;)))

EDIT: GRAMMER


Amazing, not once in there did you give a single coherent answer to the question of "where are the goal posts" or "what would convince you"

But didn't even John Searle argue at this level when refuting the "systems response" to his chinese room argument?

Very simplified the refutal goes like this: "It would be silly to attribute consciousness to a room".

Ultimately, Searle is berating people for even arguing inside the very analogy he put up to make a point.

(... my key takeaway is that even very smart people get astonishingly dumb on this topic)


> "It would be silly to attribute consciousness to a room".

And regardless of whether you think that's an interesting rebuttal or not, it's safe to say that people who properly accept anthropomorphism of llms are content to be silly in this way.


> the same way you might a cat or dog

That cats and dogs are sentient is a normal take. Far more normal than sentient transistors. But again, what even is normal in today's world, I wish I knew.


I read this paper as trying to be as serious as l the folk wisdom around the anthropomorphization of LLMs. Which is to say: not at all. :)

Another weird thing that keeps coming up - "people don't think that image models or chess models are conscious"... yes we do, and we have for many years.

Or rather, we aren't *certain* that those things are conscious. But the idea that they might be is not strange.


Who is we?

I do not believe chess models are conscious. I would think this is the most common position.


"we" is the set of people who believe that machines can potentially be conscious.

I believe that machines could potentially be conscious, and don't buy for a second that chess algorithms or LLMs could be themselves.

Whether LLMs could be conscious or not is basically a weekly conversation for me, but I've never had a conversation about whether a chess model is conscious. I suspect that there is a large group of "mainstream" people for whom LLMs raise questions about consciousness that other kinds of models do not. It might be the case that hardcore model philosophy types think that chess models could be conscious, but I think much of this mainstream group would dismiss that idea.

On what grounds would someone establish than an LLM could be conscious but a sufficiently large/complex transformer model aimed at chess would not be conscious?

What is consciousness? For, me it's being aware of one's internal processes. Evolutionarily, I view it as dynamic intelligence: static intelligence has a fixed in-out pipeline, while dynamic intelligence allows one to reflect on the reasoning pipeline itself and make dynamic corrections to it => better adaptability.

If we define consciousness this way, then a plain transformer is not conscious because it's not able to explain its outputs properly or make corrections to the pipeline (i.e. "cannot modify its own system prompt", if simplified). But an ensemble of LLMs "orchestrator/analyzer + reasoning subagents" can probably viewed as something approximating 'consciousness".

I think a chess engine can be proclaimed conscious if it has the properties listed above. However, my very simple and mechanistic definition of consciousness is debatable, especially since by many it's conflated with "soul".


If an LLM, which abstractly is just a stack of transformers, is able to reason about itself, why wouldn't a chess engine (also a stack of transformers) also be able to reason about itself?

That reasoning may not manifest as English that we can read, but that doesn't mean it isn't there.

As information bounces through the many layers of the model, I find it plausible that you could get reflection in there somewhere.

I also find it reasonable that consciousness may not even require reflection / introspection.

That's the main problem with consciousness. We really have no idea how it works, therefore it's really difficult to conclusively state that something lacks consciousness


It might not even need a transformer.

A specific pattern of self-referencing data could be seen (or not) as low-level consciousness in the future, when we know what consciousness exactly is.

It might be that stockfish is already something future scientists would define as "conscious".

Altough it is diffucult for me to Imagine that specific example.


Yes I agree. We have such a poor understanding of consciousness that really we can't even rule out whether a simple set of if-else statements is enough to create consciousness.

Common sense says "surely not"... But where is the exact point at which "surely not" becomes "well maybe..."?

I haven't been able to find a satisfying answer to that question, which forces me to assume (at least until a satisfying answer is found) that consciousness is a gradient and in fact does exist in little bits all over the place.


I’ve seen definitions that I like they where in the direction of „if a system can recognize a problem it has not encountered before and can attempt to solve it with onto the problem adapted solutions, then it is conscious“

But then again this is just a external crude form of test that can lead to something like „light bulbs emit warmth, so fire must be a light bulb“


I hope you don't mind if I get super pedantic here. But what qualifies as "a problem", and what counts as "has not encountered before" and what counts as "adapted solutions"?

Because "a problem" could be as simple as conducting heat through a lattice of atoms and "not encountered before" could be from a specific temperature hotspot that was never seen before, etc.

It's really thorny to get to the bottom of things!


It was just an example. You probably can imagine some kind of level you would consider definitly conscious.

I recomend focusing on designing Tests before rigid definitions of states of tests. - it is much clearer that way what is asked from something to be conscious.

But again these are all just indirect tests because we cannot test the core of consciousness because we don't know the core of consciousness.


The difference is that LLMs may or may not be sentient in the domain of human language, which we can relate to, unlike the domain of chess.

Human-produced texts do contain information about theory of mind, chess games do not.

They do, but we just don't recognize them as such? I'm open to the possibility.

I think this is precisely the point of the argument

People have hypothesised that consciousness is tied to language.

And so animals wouldn’t be conscious because they don’t use language? If so, I vote that the stupidest hypothesis of the year.

Or would one count any communication between animals as language? In that case almost any interaction would count.


That's where things get interesting. As soon as you start asking what counts as communication... what about signals passing between cells? What about heat passing between atoms?

As soon as try and draw a firm line between "X counts and Y doesn't", you find that you really can't. There are no obvious boundaries between a deeply complex and fully functioning FL human brain, and a pile of atoms bouncing around arbitrarily.


Maybe that's why it's a hypothesis and not a proven theory

Unless you believe everything is conscious (panpsychism), this seems like you're just drawing arbitrary lines around things you personally believe could be conscious. Is a rock conscious? If stockfish could be conscious, so could a rock.

Can you define a clear boundary for me somewhere between a rock and a human brain at which consciousness is definitely not possible on one side of the boundary, but is maybe possible on the other side of the boundary?

I can't, can you?

Microtubules

> Maybe, most indexes do not have to follow the index. they just need to match the returns.

This is a great technical point, and in a scenario where a constituent has a lot of obvious correlations it might be relevant, but when you've got something that is effectively a meme stock with erratic leadership and a huge range in possible outcomes from bankrupt to most valuable company ever in the universe [claude tells me I should say 'idiosyncratic returns' instead of this rant], I don't see how you promise to match the performance of an index where it's a significant component except by buying it.


They'll buy it, but they'll build a position gradually over months to approximately match the index. It won't be huge block buys on the IPO day.

Ultimately it was inevitable that as passive investing got more and more popular, people would seek to game it. Not that I'm happy about it, but if this works, it is probably just the beginning of sneaky ways being found to trick passive money into taking on way more risk than it intended to. And of course passive investors are passive, so they may not even notice, and probably won't fight back until the inevitable crash.

I think there's going to be blowback from this because this is "every other horse can only use three of their legs" levels of fixing.

I looked into the how the rule-changing works. NASDAQ is what's called a Self-Regulating Organization ("SRO") in the legislation so it has a lot of power. Were it a government agency, it would be more difficult. Technically, the SEC has to approve all rule changes by SROs but in this administration in particular, that's just going to be a rubber stamp. By the way here's a speech the head of the SEC previously gave about deregulation of capital markets [1].

I was also curious if Loper Bright had changed anything here but it appears not. The sstatuory language here is clear rather than intentionally or unintentionally ambiguous.

So the funds can technically challenge any such rules. They have standing. But the bar is difficult and I don't see it happening.

But if this goes badly, what I think you'll see is changes in governance by pension funds that'll be reflected by Vanguard and Blackrock, which is "index-like" funds that have stricter governance with rules closer to what was the case before these rule changes were rammed through. I could be wrong. I hope I'm not.

[1]: https://corpgov.law.harvard.edu/2026/04/22/speech-by-chair-a...


> But if this goes badly,

I think the most likely scenario is that there eventually will be the changes you describe, but if not enough people squeal now, then it won't be until after a bunch of people lose their savings either because of this or some follow on scam that finds a way to take advantage of passive money.


There have been a few screen revisions on the 13, so there's a decent chance that a better screen will be available eventually on the 12 as an upgrade.


Quite possibly yes. The level of upgradeability they've given their Framework 13 line over the years has been very impressive, and you can still put the latest CPU in the original chassis if you want. https://www.youtube.com/watch?v=JSxgCEpkiKM


I'm not sure that there's a lot of overlap between the target markets. Most schools near me require windows and require a stylus capable touch screen, so the mac is out of the running immediately.

Even if it weren't, the fact that if you're giving a computer to a teen as their first machine to take to class and use every single day, you really, really, really want to be able to separately repair the screen and the ports.

As always, you're paying a premium for the repairability, but if your teen cracks the screen a single time in three years of carrying it to class every day, then you've already saved money.


I think there are some factors beyond just skill too - the kinds of tasks you're giving the AI, and how involved you are in ensuring the output is good (via either extensive planning guidance, extensive review/testing, or a combination).


There are decent alternatives. Personally I use newsblur, and I like having a feed that contains only sources I've chosen.


You can still hear the music written for the game 'glitch' if you join a slack audio room with nobody else in it.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: