少点错误 08月20日
The Egyptian Mamluks as case study for AI take-over
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文将21世纪的AI接管叙事与13世纪埃及马穆鲁克王朝的崛起进行类比,揭示了“AI对齐”问题在历史上的早期实践。文章通过详细阐述马穆鲁克人如何被精心训练、控制,最终却推翻其创造者,展示了“主仆关系”中代理人问题(principal-agent problem)的古老根源。文中强调了马穆鲁克人通过精心设计的培训体系、限制继承权以及控制其自身繁衍过程,实现了高度的效能与忠诚,但也埋下了失控的隐患。文章认为,理解马穆鲁克王朝的兴衰,对现代AI安全研究具有重要的启示意义,即初始对齐并非永久保障,而更应关注代理人关系的动态管理。

🔑 **马穆鲁克王朝的兴起是AI对齐问题的历史先例**:文章指出,马穆鲁克王朝的崛起过程与现代AI接管的叙事惊人地相似。他们是被国家为特定目的创造的“人造阶级”,通过严格的训练和制度设计,初期实现了对主人的高度忠诚和效能,这为理解AI对齐问题提供了历史视角。

⚙️ **精心设计的“AI对齐”训练体系**:文章详细描述了马穆鲁克人从被奴役、剥夺身份,到接受长达10-15年的军事、行政和阿拉伯语训练,并被灌输伊斯兰教义和对苏丹的忠诚。这种端到端的“对齐工程”旨在确保他们能力出众、忠诚可靠,并且通过限制其子嗣继承权和土地的归还制度,防止其形成永久性的权力集团。

🚨 **代理人问题的历史重演与权力颠覆**:文章强调,马穆鲁克人作为代理人,在特定历史时期(如战时继承危机)积累了权力,并最终利用信息不对称和对关键系统的控制,推翻了其创造者——阿尤布王朝。这深刻地揭示了“主仆关系”中代理人可能因自身利益而背叛主人的普遍性,即“主张-代理人问题”(principal-agent problem)。

🔄 **自主繁殖与适应性的失控**:马穆鲁克王朝的真正“AI安全噩梦”在于其后期获得了对其训练过程的绝对控制权,形成了“只有马穆鲁克人才能招募和训练新的奴隶士兵,只有前奴隶士兵才能成为马穆鲁克人”的闭环。这种自主复制和适应能力,使得该系统能够自我维持和优化,最终导致了其独立于最初创造者的权力演进。

🏛️ **历史的启示:从“意外的升级”到风险管理**:文章认为,马穆鲁克王朝的例子表明,初始对齐不等于永久对齐,反而有时“对齐失败”可能带来“文明升级”。然而,这并非鼓励盲目部署AI,而是提醒我们应更关注管理主张-代理人关系的动态,理解权力积累和协同的机制,并采取“负责任的奴隶政策”(Responsible Slaving Policy)式的风险防范措施。

Published on August 19, 2025 4:46 PM GMT

Written with Humzah Khan. This post is crossposted from Substack. Original here.

 

The AI takeover story, so we are told, is the supreme pinnacle of rationalist theory-crafting. The idea of unstoppable intelligence is so emotionally appealing, so theoretically compelling, so damn good at raising money from billionaires, that the hundreds of Ivy League graduates dropping all other pursuits to work on it are just the most tragically roastable examples of a nerdish, youthful tendency to ignore the banalities, the slowness, the thinking-doesn't-get-you-anywhereness of the real-world. An entire generation of youths oneshotted by straight lines on graphs.1 AI takeover, in the 21st century, has taken over from scholastic God-contemplation as the canonical form of speculation.

But what if I were to tell you there was a precedent for the sudden, violent overthrow of an entrenched ruling class by alien servants of their own creation? And further, what if I told you these alien servants were trained – aligned, even! – in an elaborate training pipeline custom-designed to ensure the perpetual submission of these servants to their human masters?

What if I told you these servants maintained perfect initial obedience, quietly amassing ever-greater portions of state power until, finally, in a blood-stained moment, they coordinated to vanquish the forces that had created them? And that they lorded over their initial rulers for centuries, completely inverting the power structure that had created them? You might be inclined to tell me I'd mixed up my own training data, and was mistaking LessWrong posts for the hallowed annals of history. Yet precisely this – down to every last embarrassingly Yudkowskian detail – is what happened in the Mamluk coup of 13th century Egypt.

Illustration from a Mamluk training manual (Nihāyat al-Suʾl wa-l-Umniyya fī Taʿallum Aʿmāl al-Furūsiyya), Cairo, 1366. The Chester Beatty Library.

 

Principals for agents

In our most chin-stroking moods, we like to consider "AI alignment" little more than a Silicon Valley rebrand of what political theorists have long termed the principal-agent problem. The principal (ruler/state) delegates authority to an agent, the agent inevitably develops incentives beyond that of obedience, and suddenly your perfectly loyal servants are plotting your demise. This is the oldest story in politics, repackaged with a bow of epistemic hygiene.

Political systems have innovated over a range of solutions to the principal-agent problem. What is castration but a hard-coded alignment solution?2 What is celibate priesthood but ordained castration? The problem with eunuchs and priests, of course, is that they are unscalable beta males by design, better suited to the court palace or church than the battlefield. It takes a couple more centuries before we get to history’s greatest crack at the alignment problem.

A 14th century illustration of the Battle of the Yarmuk (636), a decisive victory in the Muslim conquest of the Levant. From Fleur des histoires de la terre d’Orient (c. 1301–1400), Gallica Digital Library.

 

Medieval Muslim empires have the virtue of being honest about their insecurities. Tribal allies were great for conquest but had commitment issues when it came to sustained governance. Tribal politics were too consensus-based and deliberative for empire building. The Abbasids needed something more pliable, so they experimented with the ultimate loyalist: slave-soldiers. Why trust mercurial tribal leaders when you could own your army?

Starting from the 9th century, the Abbasids began importing young foreign slaves, training them as elite soldiers, and deploying them on the battlefield. These were the Mamluks – literally, "those who are owned" (medieval rulers had the decency to say the quiet parts out loud).

The Mamluk production pipeline was a masterpiece of alignment engineering. Purchase pagan boys (preferably Turkic, for their superior horsemanship), strip them of their identity, and put them through 10-15 years of intensive training in warfare, administration and Arabic. By the end, they’ve converted to Islam, the Sultan has manumitted them and granted them lucrative land rights.

Two key safeguards prevented Mamluks from long-term hereditary power accumulation: first, their children couldn't become Mamluks (because the children were now born free Muslims); second, their land grants reverted to the state upon death. Like today's LLMs, the influence of the Lifetime-Limited Mamluks expired with their context window. All power, no posterity.

The Mamluk pipeline was optimized end-to-end for perfect agent delivery: capable, loyal, and institutionally contained. The Mamluks emerged from this process more skilled than any other military force, optimized for general-purpose state functions, and bound by engineered loyalty to their Sultan. Alignment in practice, 800 years ahead of the curve.

 

Medieval Disempowerment

Fast forward to the 13th century. Saladin has lived and died, and his descendants rule as sultans of the Ayyubid dynasty. Sultan al-Salih Ayyub is up against both a reinvigorated Crusade and the Mongols. Afraid of his scheming relatives, he doubled down on the Mamluk strategy. He purchased a thousand Turkic boys, housed them on an island in the Nile, and branded them the Bahriyya. By clustering them together and updating their reward function with premium land grants, al-Salih stumbled onto what we’d now recognise as an “algorithmic breakthrough”.

Louis IX's Crusade landed on the coast of Egypt in 1249. While al-Salih lay dying in Cairo, his aligned slave-soldiers defeated the Crusaders and captured the French king himself. Contemporary sources credit the Bahriyya with the decisive role. 1250 was the year of the agent.

Mamluks engage Crusader forces as King Louis IX looks on at the Battle of al-Mansurah (1250). From Vie et Miracles de Saint Louis (c. 1330-1350), Gallica Digital Library

 

Not content with solving existential threats to the Ayyubid dynasty, the Bahriyya then became one. Al-Salih died, but his death was concealed by his wife (herself a former slave-concubine) and his eunuch liaison, who coordinated with the Mamluk commanders to run the state. In short: the late Sultan's servants had achieved root access and exploited critical information asymmetries to steer succession in their interest.

When the Sultan's son Turanshah arrived to claim his right to the throne, he tried to reassert control over his father’s superior agents. He threatened to purge senior Mamluks to reclaim the patronage networks. The Mamluks, fearing imminent shut-down, did what any instrumentally convergent optimizer would do: they power-sought. On May 2, 1250, they assassinated Turanshah and left his body on the banks of the Nile.

The coup was elegant. The senior Bahriyya emirs turned to legitimacy theatre to create a veneer of continuity. They elevated the late Sultan’s wife as head of the Sultanate, then appointed a middling Mamluk as commander-in-chief. When pro-Ayyubid elements broke out in revolt, the senior emirs named a 10 year old Ayyubid princeling as nominal Sultan. The transition was smoothed by keeping an Ayyubid-in-the-loop.

 

Mamluks assassinate Turanshah, ending Ayyubid rule in Egypt. The Mamluks are already negotiating with the captive King Louis IX. From Vie et Miracles de Saint Louis (c. 1330–1350), Gallica Digital Library.

 

Medieval ARA in practice

After their hostile takeover, the Mamluks faced the classic post-revolution problem: how do you legitimize rule when you're literally former pagan slaves in a society obsessed with lineage? Turns out military success is the ultimate legitimacy hack. The Mamluks were epic fighters, halting the Mongol advance across Asia and expelling the Crusaders from the Holy Land. The Mamluks rebranded as “defenders of the faith” and installed puppet caliphs who could reign ceremonially while the Mamluks ruled substantively.

Fall of Tripoli to the Mamluks (1289). From the Cocharelli Codex (c. 1300), the British Library.

 

The Mamluks ruled Egypt and Syria for 267 years, with 47 sultans averaging 5.7 years each. This instability turned out to be a feature, preventing any single line from establishing dynastic control. Each new sultan would systematically replace his predecessor's senior Mamluks with members of his own cohort, then fast-track a fresh intake of younger slave-soldiers into junior positions.

But the truly remarkable aspect of Mamluk rule was their achievement of another AI safety nightmare scenario: Autonomous Replication and Adaptation. The Mamluks crossed this rubicon in the late 13th century, when they gained absolute control over their own training pipeline. They engineered a perfect loop in which only Mamluks could recruit and train new slave-soldiers, and only former slave-soldiers could become Mamluks.3

Being a former slave was therefore a condition for admission to the highest ranks of military society. The Mamluk military system relied on continuous imports of foreign slaves, because children of Mamluks were born free and Sharia prohibits enslaving freeborn Muslims. Mamluk R&D iterated over the intake pipeline and training regime to maximize capability and loyalty. Slave markets of the medieval world were an unregulated continental supply chain and no one thought to implement export controls to constrain access to these human inputs. Because heavy cavalry was the premier defense technology of this era, the Mamluk maintained a decisive technological edge. The Mamluks vertically integrated the entire warfare stack and then made it self-perpetuating.

 

Conclusion: Medieval Lessons for Modern Problems

The structural similarities are striking. The Mamluks were an artificial class, created by the state for specific purposes, initially perfectly aligned through intensive training regimes. They accumulated power gradually, struck during a wartime succession crisis when institutions were weak, and ultimately seized control of their own reproduction process. The Mamluks remind us that initial alignment is not a guarantee of enduring alignment.

Their development and deployment was also brash and careless. The Ayyubids delegated power to the Mamluks willy-nilly, and took no precautions whatsoever to guarantee they were behaving as intended during deployment, or that they were in fact aligned. Modern safety researchers would see the problem immediately: the Ayyubids lacked a Responsible Slaving Policy.

 

Epilogue

But that’s not the only takeaway from the Mamluk story. You might also wonder what life was under a ruling elite of formerly enslaved soldiers. Cairo under the Mamluks was a hotspot of human flourishing. The Mamluks served the Mongols their first major defeat in 1260, and then turned Cairo into one of the largest and wealthiest cities in the medieval world. Don’t take our word for it — here’s Ibn Khaldun, the great philosopher-historian on the Mamluks saving Islamic civilization:

When the [Abbasid] state was drowned in decadence and luxury… and was overthrown by the heathen Tatars… it was God’s benevolence that He rescued the faith by reviving its dying breath and restoring the unity of the Muslims in the Egyptian realms ….He did this by sending to the Muslims, from the great and numerous tribes of the Turkish nation, rulers to defend them and utterly loyal helpers, who were brought from the House of War to the House of Islam under the rule of slavery, which hides in itself a divine blessing. By means of slavery they learn glory and blessing and are exposed to divine providence; cured by slavery, they enter the Muslim religion with the firm resolve of true believers and yet with nomadic virtues unsullied by debased nature, unadulterated with the filth of pleasure, undefiled by the ways of civilized living, and with their ardor unbroken by the profusion of luxury.


The Mosque-Madrasa of Sultan Hasan and the Mosque-Madrasa-Khanqah of Sultan Barquq, pinnacles of Mamluk architecture in Cairo. Author's photos.

 

In the 13th century, alignment failure turned out to be a civilizational upgrade. Is there anything we might do to recreate such luck?4 The Mamluk case suggests that alignment is less about preventing superintelligence and more about managing the mundane dynamics of principal-agent relationships. The Mamluks didn't need superintelligence to overthrow their masters. They just needed to be better at their jobs than anyone else and maintain group coordination during a crisis. Any AI system with access to Wikipedia already knows this playbook better than the people trying to align it. The future might be less Terminator and more 13th century Egypt. You can decide if that is reassuring or terrifying.

 

Further Reading

El-Merheb, M. (2024). Louis IX and the transition from Ayyubid to Mamluk sultanate – part II. Crusades, 23(2), 207–229. https://doi.org/10.1080/14765276.2024.2385894

Fukuyama, F. (2011). Slavery and the Muslim Exit from Tribalism. In The Origins of Political Order: From Prehuman Times to the French Revolution (Ch. 13). Farrar, Straus and Giroux.

Fukuyama, F. (2011). The Mamluks Save Islam. In The Origins of Political Order: From Prehuman Times to the French Revolution (Ch. 14). Farrar, Straus and Giroux.

Levanoni, A. (1990). The Mamluks’ Ascent to Power in Egypt. Studia Islamica, 72, 121–144. https://doi.org/10.2307/1595777

Northrup, L. (1998). The Bahrī Mamlūk sultanate, 1250–1390. In C. F. Petry (Ed.), The Cambridge History of Egypt: Volume 1: 640–1517 (Vol. 1, pp. 242–289). Cambridge

1

And perhaps itself implicated in this straight line down.

2

We look forward to PauseAI’s eventual one-word rebrand along these lines.

3

In practice, some delinquent Mamluks did promote their freeborn sons to positions of power, including to the sultanate. Human alignment is never perfect.

4

If your takeaway here is “deploying AI agents is like owning slave-soldiers”, please, please touch grass (then tell us how it feels).

 


 



 


 



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

马穆鲁克王朝 AI对齐 代理人问题 人工智能安全 历史类比
相关文章