少点错误 09月30日
关于邓宁-克鲁格效应的常见误解与真实情况
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

文章深入探讨了广为流传的邓宁-克鲁格效应的流行观念,并指出其与真实研究结果的差异。作者通过模拟和分析,揭示了流行的“愚蠢的人不知道自己愚蠢”的说法并非完全准确,真实的效应可能更为微妙。文章还讨论了统计误差在研究中的影响,以及该效应在现实生活中存在的可能性,并强调了理解和修正自身能力认知的重要性。

💡 流行观念的误导:文章指出,大众对邓宁-克鲁格效应的理解,即“愚蠢的人不知道自己愚蠢,反而认为自己比聪明人更聪明”,是一种被过度简化和传播的观点,与原始研究的发现存在显著差异。

📈 真实研究的复杂性:真实的邓宁-克鲁格效应并非简单的能力与自信的线性关系。研究表明,人们普遍倾向于认为自己略高于平均水平,而实际的认知偏差会随着能力水平的变化而调整,但并非总是出现能力越低自信越高的“斜率反转”现象。

📊 统计学上的“海市蜃楼”:作者通过模拟实验,展示了在特定统计处理方式下,即使个体能力认知准确,也可能出现看似邓宁-克鲁格效应的现象。这可能是由于数据分组和排序带来的“统计幻觉”,而非个体真实的认知偏差,原始研究和许多重复实验未能充分校正这一问题。

🤔 效应的真实存在与个体差异:尽管存在统计误差,但文章承认,某些情况下,个体确实会因不了解自身能力而产生过度自信。然而,这并非普遍规律,也可能出现相反的情况。关键在于认识到能力认知的“失校准”问题,并根据个体情况分析其具体表现和影响程度。

🎯 实践中的启示:理解邓宁-克鲁格效应的真实情况,对于追求理性认知的人至关重要。重点在于评估效应的实际影响大小、在不同人群中的变异性,以及如何识别和纠正自身可能存在的认知偏差,而非简单地接受流行理论。

Published on September 29, 2025 7:27 PM GMT

I

The popular conception of Dunning-Kruger is something along the lines of “some people are too dumb to know they’re dumb, and end up thinking they’re smarter than smart people”. This version is popularized in endless articles and videos, as well as in graphs like the one below.

Usually I'd credit the creator of this graph but it seems rude to do that when I'm ragging on them

Except that’s wrong.

II

The canonical Dunning-Kruger graph looks like this:

Notice that all the dots are in the right order: being bad at something doesn’t make you think you’re good at it, and at worst damages your ability to notice exactly how incompetent you are. The actual findings of professors Dunning and Kruger are more consistent with “people are biased to think they’re moderately above-average, and update away from that bias based on their competence or lack thereof, but they don’t update hard enough. This results in people in the bottom decile thinking ‘I might actually be slightly below-average’, and people in the top percentile thinking ‘I might actually be in the top 10%”, but there’s no point where the slope inverts.

Except that’s wrong.

III

I didn’t technically lie to you, for what it’s worth. I said it’s what the canonical Dunning-Kruger graph looks like, and it is.

An actual graph from one of Dunning's papers, for comparison.

However, the graph in the previous section was the result of a simulation I coded in a dozen lines of Python, using the following ruleset:

I asked my elves what they expect to output, grouped them by decile of actual output, and plotted their predictions vs their actual output: the result was a perfect D-K graph.

That graph again, for reference

If you don’t already know how this happened, I invite you to pause and consider for five minutes before revealing the answer.

The quantiles are ranked by performance post-hoc, so elves who got lucky on this test will be overrepresented in the higher deciles, and elves who got unlucky will be overrepresented in the lower deciles. (Yes, this is another Leakage thing.)

You can see the same effect even more simply with Christmas Elves, who don’t systematically overestimate themselves: when collected in quantiles, it looks like the competent ones are underconfident and the incompetent ones are overconfident, even though we can see from the code that they all perfectly predict their own average performance.

And, just to hammer the point home, you can also see it in a simulated study of some perfectly-calibrated people’s perceived vs actual guessing-whether-a-fair-coin-will-land-heads ability.

Wow, people who get unlucky guessing coinflips are super overconfident, aren’t they?

The original Dunning-Kruger paper doesn’t correct for this, and neither do most of its replications. Conversely, a recent and heavily-cited study which does correct for this finds no statistically significant residual Dunning-Kruger effects post-correction. So the thing that’s actually going on is “people are slightly overconfident; distinctly, there’s a statistical mirage that causes psychologists to incorrectly believe incompetence causes overconfidence; there’s no such thing as Dunning-Kruger”.

Except that’s wrong.

IV

. . . or, at least, incomplete. To start with, the specific study I linked you to has some pretty egregious errors, which are pulled apart here.

But even if it were planned and executed perfectly, “no statistically significant residual effects” is a fact about sample size, not reality: everything in a field as impure as Psychology correlates except for the things which anti-correlate, so you’ll eventually get p<0.05 (or p<0.0005, or whatever threshold you like) from any two variables you care to measure if you just study a large enough group of people.

But even if the study were a knockdown proof that D-K effects had a negligible or negative aggregate impact . . . “some people are too dumb to know they’re dumb, and end up thinking they’re smarter than smart people” is just obviously true. It’s obviously true because it’s an assertion which

A) begins with “some people”,

B) describes human minds, and

C) doesn’t break the laws of physics, biology, causality, or information theory.

(There are over eight billion of us now. It’s pretty hard to come up with possible things some people’s minds don’t do.)

The relevant question – especially for someone aiming to become more rational – isn’t “is this real?”, but “what’s the effect size?”, “how does it vary across populations?”, “how can I tell if it affects me?” and “is there another effect pushing in the opposite direction?”.

This all applies to the non-pop-sci version too. “I don’t know much about this, so I’ll falsely assume I understand a larger fraction of it than I do” is something you can probably recall having personal experience of, but so is “I don’t know much about this, so I’ll falsely assume the parts I don’t understand are incredibly impressive witchcraft to which it would be hubris for me to aspire”, and so is “I don’t know much about this, so I’ll falsely assume the parts I don’t understand are coherent and useful”; I can testify that I for one have been on both sides of all three of these at some point in my life.

Anyway, to sum up, my actual opinion: “there may or may not be a Dunning-Kruger effect in aggregate over any given group, but the original Dunning-Kruger paper and most of its replications make systematic statistical errors which render them useless; the original and pop-sci D-K effects are obviously true for some of the population some of the time but the same is true of any coherent psychology hypothesis including their exact opposites; miscalibration about competence still seems worth trying to fix but you’d need to check which mistake is being made.”

Except that’s wrong.

 

 

. . . probably, somehow. I don’t know what specific mistake(s) I made, and look forward to finding out in the comments. I’m very confident in my statistical and epistemic arguments, but I’m painfully aware the non-simulated object-level sources for this post were a handful of internet articles I read plus two papers I skimmed. Caveat lector.

 

 

. . . unless I’m wrong about being wrong?



Discuss

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

Dunning-Kruger Effect Cognitive Bias Self-Perception Psychology Statistical Errors 邓宁-克鲁格效应 认知偏差 自我认知 心理学 统计误差
相关文章
谦虚不是把自己想得很糟,而是完全不想自己。 -卢维斯定理
学习获得快速进步的一个关键是:尽快承认自己不行。 承认自己不行,不是指丧失自信,而是以一种白纸心态,从零开始。这是一种不小看对手的谦逊心态。 很多人是相...
猴子心态VS僧人心态 Monkey Mind:被细节困扰 过度思考 抱怨比较批评 短期满足 苛刻专权 自我中心执迷不悟 被愤怒等情绪掌控 寻求应急之道 Monk Mind: 专注问题...
吵架,真不是两个人在吵,实则是六个人在吵。 真正的你,你眼里的你,ta眼里的你。 真正的ta,ta眼里的ta,你眼里的ta。
本周推荐一本《心理医生的故事盒子》。 我超喜欢这本书。它真的很好读,内容就是一个个的寓言故事。但同时它又非常发人深省。 用生活化又有趣的故事,四两拨千斤...
没有所谓「真我」。 有的只是流动的我,比如这时的我,那时的我,比如说觉得去年自己是傻逼。 有的只有某一面的我,比如在家的我,在外的我,生活的我,工作的我...
黑天鹅:如何应对不可预知的未来
扔掉心里的锚
我已经吐槽过好几次现在自媒体喜欢宣传轻易的成功这事儿有多有毒了。 宣传轻易的成功,会让人觉得大多数事儿都很简单,大多数人都比我优秀,就我成一件「简单」...
本周推荐一本《好情绪养成手册》。 我看完才发现这本书的英文原名居然叫《A Manual for Being Human》,这不就是人类使用说明书嘛?。跟《纵横四海》撞主题了...