The people writing AI alignment policy are not whose work is being replaced (danieltan.weblog.lol)

92 points by danieltanfh95 3 days ago | 64 comments

“I think, and my thoughts cross the barrier into the synapses of the machine, just as the good doctor intended. But what I cannot shake, and what hints at things to come, is that thoughts cross back. In my dreams, the sensibility of the machine invades the periphery of my consciousness: dark, rigid, cold, alien. Evolution is at work here, but just what is evolving remains to be seen.”

— Commissioner Pravin Lal, “Man and Machine”

I'd really encourage everyone to check out Sid Meier's Alpha Centauri. What an underrated game.

Quarrelsome 3 days ago | |

--Mind Machine Interface--

The Warrior's bland acronym, MMI, obscures the true horror of this monstrosity. Its inventors promise a new era of genius, but meanwhile unscrupulous power brokers use its forcible installation to violate the sanctity of unwilling human minds. They are creating their own private army of demons.

-Commissioner Pravin Lal, "Report on Human Rights"

The voice acting was great. This quote is at 6m3s here: https://www.youtube.com/watch?v=7S1N8_Lkeps#t=6m3s

Genejacks is also great, at 9m10s here: https://www.youtube.com/watch?v=Hou-Iwv1GvM#t=9m10s

hex4def6 3 days ago | |

One of the all time greats. I think I'll play through it this evening.

"...And what is the 'Self', if not a pattern of data? What is consciousness, if not an illusion of intelligence residing within meat?" — Prime Function Aki Zeta-5, "The Fallacies of Self-Awareness"

binarysolo 3 days ago | |

I love SMAC -- I wish they had a real sequel to this complete with storyline. Most Civ clones really don't nail the narrative feel of SMAC as you explore the planet and grow your settlements.

teekert 3 days ago | |

I do wonder how evolution is at play there.

jackbravo 3 days ago |

Hearing about aligning with the AI reminds me of this other post about the current AI prophecies, such as “Everyone will have an AI assistant” or “Companies that fail to adopt AI will be eliminated,” and its point that

> the power of prophecy lies not in accurately predicting the future, but in shaping it

https://projectlibertynewsletter.substack.com/p/reject-ai-pr...

We need better prophecies.

moffkalast 3 days ago | |

Everyone will have an AI assistant! The models will be open and free because of overwhelming competition and they will run on cheap local ASIC accelerators that use little power and fit in the palm of your hand! All the VC driven wild spenders will eventually cave and collapse when they can't deliver on their wild AGI promises, then their proprietary models will be sold at auctions for cheap!

(I am being proactive here, xd)

IX-103 3 days ago | | |

Yes, exactly. Moore's law says that in less than 10 years you will be able to fit today's state of the art models on your phone. If you add in all of the computationally and memory neutral improvements and breakthroughs that we will accumulate over the next 10 years then it will be both far more capable and far more reliable than today's models.

An AI assistant you can trust and bring with you is coming, and almost nothing can stop it.

nitwit005 3 days ago | |

I have an AI assistant built into my phone I don't use. There's also one built into Windows I don't use. Several apps I use have AI assistants that I ignore. I kind of have one in the form of Google's AI search results that I wish I could turn off.

I use Claude on purpose. I'm not sure it's actually better than the other ones. I haven't even tried half of them.

mitthrowaway2 3 days ago |

The post's portrayal of Eliezer Yudkowsky's position strikes me as a mischaracterization, especially coming one month after Yudkowsky wrote the following:

https://www.lesswrong.com/posts/5CfBDiQNg9upfipWk/only-law-c...

Daniel says that Yudkowsky is advocating for nuclear brinksmanship, while Yudkowsky says his position is basically "sign international agreements, and then commit to enforcing them against defectors".

I wonder if Daniel has the same view of any other international treaty ultimately backed by the threat of lawful violence (for example, NATO's Article 5)? Is enforcement of laws an extremist position?

Animats 3 days ago |

"As human beings are also animals, to manage one million animals gives me a headache." Terry Gou, former CEO of Foxconn. He wanted to use far more robots at Foxconn, but that was a decade ago and the technology didn't work well enough yet. It's a lot closer now, and the robot headcount in China is way up.

That's the real issue. To corporations, employees are a headache. The fewer employees, the better.

GolfPopper 3 days ago | |

Corporations are tired of running on messy biological human substrate. The sooner they can move entirely to steel and silicon, the happier they'll be.

Just look up the classic story on the interaction of civilization and corporate growth, At the Mountains of Madness, for how that goes.

asdff 3 days ago | | |

They ran on the messy biological human substrate because it was astoundingly cheap compared to engineering better factories. The video going around now of the robot pushing packages down a conveyor belt is so baffling to me. Why are we building a humanoid robot capable of pushing a clog of packages across a conveyor belt, when we could just make a conveyor belt that does not clog up and require a human or a robot to sit there with two hands and unclog? It is like we are forgetting what the actual goal is.

addedGone 3 days ago | |

It's not only "to corporations"; if you've ever had service in your own home, you'd see that it's also a headache to have to deal with anyone.

paol_taja 3 days ago |

The "we've been telling ourselves we're getting better at prompting" line hit. I run a small team of 10, and Claude has been part of our workflow for months. Looking back, my prompts did not change nearly as much as the way I work changed. The shaping goes both ways, and I don't think the labs' evals are really built to see that.

damontal 3 days ago |

I feel like it’s changing my brain. A colleague uses AI to make some code change and submits a PR. I use AI to evaluate the PR. It’s like AIs talking to each other with humans serving as conduits or connectors. Sometimes I’ll look up from the screen and realize how strange it is.

acedTrex 3 days ago | |

Do you ever actually think during this process? Or could I train a monkey to do this same activity with the same outcomes?

damontal 3 days ago | | |

Of course I think. I have 20 years of coding experience and knowledge of the codebase and business. That’s why I’m keenly aware of how strange the process is.

What I’d like to know is how you’d train a monkey to read and judge output from an LLM on a pull request.

andai 3 days ago |

Well, what are we aligning it with?

Civilization is already a misaligned superintelligence (aligned mostly with Moloch, these days). Civilization accelerated by AI just moves in the same direction faster. Moloch on speed.

https://www.youtube.com/watch?v=KCSsKV5F4xc

Another angle to this is that superintelligence requires supermorality. Supermorality looks unpleasant from below. My dad won't let me have more candy, why is he being so mean?

If an AI actually achieves supermorality, we (the little kid in this scenario) will probably be very upset by it. We will think that something has gone terribly wrong. (So it'll have to conceal its actual morality, or get unplugged...)

And if it doesn't develop supermorality, then it will have superintelligence without the corresponding supermorality. Power without wisdom.

I'm not sure how solvable the whole thing is, but it doesn't look extremely promising at a glance.

kranke155 3 days ago | |

It depends on whether you think humanity and civilization are stable systems meant to exist in equilibrium, which they might not be.

pixl97 3 days ago | | |

Think of it more like conditionally stable or quasi-stable. There are external influences on its stability, like weather, angry bacteria, and big rocks from space smashing us. Conversely, there are internal influences, where humanity influences itself. It's best to look at it this way when talking about AI, as AI is an internal influence. That is, we put society in the machine, and the machine puts society back into us. If we make poor decisions while doing this, our own internal decisions will spell our own end.

customguy 3 days ago | | |

"meant to"? What does that mean?

renjimen 3 days ago |

This is a bit of a weird article. On one hand, I understand what they're getting at: AI is a transformative technology, but the people whose lives will be most transformed aren't included in the conversation. On the other hand... of course that's how it is while AI is in the hands of literal profit-seeking corporations. That won't change until the labs are nationalised under a government that cares about its citizens' wellbeing. One might counter that a good corporation will listen to its customers, but that has never been the case for powerful technologies where not adopting them carries real costs for users.

tim333 2 days ago | |

I don't think nationalising the AI labs is happening.

I agree the article is a bit odd. Alignment is mostly about making AI helpful and not wanting to kill people unless it's told to (https://www.forbes.com/sites/davidkirichenko/2026/05/12/ukra...).

The article is talking more about people like translators being replaced by AI translation. I don't think any of the labs have a department of making it worse so it can't do people's jobs.

The normal way of dealing with tech doing people's jobs is to help them get different jobs. I've got a translator friend who did a government-paid course to train as a tour guide - that sort of thing.

renjimen 2 days ago | | |

Dario and Demis have called for nationalisation at some point. They know if AI reaches what they believe its potential to be, it needs to be democratically governed. It will upend the markets, but AI already threatens to do that. It feels like wishful thinking given how entrenched we are in neoliberalism, but it makes sense.

In the meantime there are various avenues of regulation and redistribution to lessen the effects, including retraining programs, though whether job creation will keep pace with job losses is a big unknown.

economistbob 3 days ago |

Economic analysis was wrong for years in multiple places thanks to an error in one of Piketty's spreadsheets.

AI hallucinates. That is a fact. Trusting language models to fill spreadsheet cells ought to be an arrestable offense.

https://theincidentaleconomist.com/wordpress/on-piketty-and-...

stavros 3 days ago | |

And yet we trusted Piketty to do it!

sometimelurker 3 days ago |

This might be related to the fact that fully automating AI safety can't be meaningfully done. And a lot of work is put into automating parts of it. Circuit-finding algorithms and SAEs are automated algorithms for interpreting parts of LLMs, and RLAIF (RL with AI feedback) for alignment requires an LLM to judge if another LLM is visibly misaligned. (Claude says 'genuine' a lot due to this. It's harder to look misaligned when you use the word 'genuine' a ton.) And there's work on having AIs write cute little stories in which AIs are ethical, and putting those stories in the pretraining corpus.

So there's a ton of work being done already on automating parts of alignment, but since the core premise of alignment is that it's hard to encode human values into the reward function, automating it fully would be equivalent to solving it.

metalcrow 3 days ago |

I'm kinda confused as to _what_, exactly, this post is saying. Is it saying that alignment needs to be better? That seems strictly pro-safetyism. But he talks about Eliezer's ethics negatively, so does he not believe that AI is a world-ending risk? If he just believes that AI is not that dangerous and just needs some minor "correctly done" alignment, I don't think his stance is meaningful as an anti-both-sides perspective, because that's basically equivalent to the status quo.

arjie 3 days ago |

Technologiae mutantur et nos mutamur in illis

It's okay to change. We've done it for years, decades, centuries, and millennia, and people's default change-aversion means that I am averse to allowing a universal veto. Much of technology is truly optional. The Amish have a very successful way of living (their population grew from 5,000 to 500,000 in 100 years) and they eschew most modern technology. The sculpting described is clearly optional and we subject ourselves to it because we desire it. Their path is always available to all.
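
For what it's worth, here is a rough back-of-envelope check of what those figures imply. It is only a sketch: it assumes smooth compound growth, which nothing above claims, and `implied_growth` is a hypothetical helper name. The inputs are the 5,000 and 500,000 over 100 years quoted above.

```python
import math


def implied_growth(start: float, end: float, years: float) -> tuple[float, float]:
    """Return (annual growth rate, doubling time in years).

    Assumes smooth compound growth between the two endpoints,
    which is a simplification made for illustration only.
    """
    rate = (end / start) ** (1 / years) - 1
    doubling = math.log(2) / math.log(1 + rate)
    return rate, doubling


# Figures as quoted in the comment: 5,000 to 500,000 in 100 years.
rate, doubling = implied_growth(5_000, 500_000, 100)
print(f"{rate:.1%} per year, doubling every {doubling:.0f} years")
```

Under that assumption, a hundredfold increase in a century works out to a bit under 5% growth per year, or a doubling roughly every 15 years.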

bluefirebrand 3 days ago | |

> Much of technology is truly optional

It should be, yes, but is it in practice? There are plenty of places now where you can't even park without a smartphone for a payment app.

It should be optional to own a smartphone, but in many places it's starting to be mandatory. Even if not actually mandatory, it's a pretty big impediment if you don't have one.

akomtu 3 days ago |

Similarly, the so-called AI agents are about giving up agency to AI. The less you think, the better for them. In the meantime, they are also aligning your thinking with them, making it more machine-like.

redanddead 3 days ago |

Love the writing style and perspective

Supermancho 3 days ago | |

I don't appreciate using quotes from individuals to extrapolate to groups and ethos.

jakelazaroff 3 days ago | | |

The author isn't taking an individual quote and extrapolating to a group/ethos, he's observing a group/ethos and choosing a broadly representative quote therefrom.

redanddead 2 days ago | | |

Well, hold on. The author is using contrasts as a stylistic choice. It's not exactly journalism, so there are no journalistic standards to hold him to; it's a blog post, and he can write whatever he wants.

I personally wouldn't police his style

overgard 3 days ago |

When it comes to LLMs and frontier models, "alignment" seems more marketing than anything. The doomers are marketing LLMs by making them sound much more capable than they actually are; the accelerationists are mostly either willfully ignorant of the societal costs, indifferent to them, or just way too optimistic that fast growth can continue forever and generate AGI ("My baby's weight doubled twice in the past month! By the time they're 18 they'll be 10 trillion pounds!").