为什么有些国家富有，有些国家贫穷？

资本？知识？制度？地理？文化？经济学中最大的问题至今没有共识。

阶段 1/5

数据究竟显示了什么

Hans Rosling: 200 Countries, 200 Years, 4 Minutes

5000万次观看。200年的全球收入浓缩在4分钟里。但究竟是什么造就了这一格局？

汉斯·罗斯林的动画图表令人着迷：在人类历史的大部分时间里，各国都聚集在贫穷且多病的角落，随后一些国家飞速迈向富裕与健康，另一些却几乎原地不动。要理解这种分化，你需要先掌握一个指标，也要了解它的局限。

按购买力平价计算的人均GDP。经济学家用人均GDP衡量生活水平——把商品和服务的市场总值除以人口。但直接用原始美元金额做跨国比较会产生误导：在纽约理一次发要\$30，在达卡只要\$2。购买力平价（PPP）调整了这些价格差异，从而能更诚实地比较实际的物质生活水平。

按PPP计算，数据触目惊心。美国约为每人\$80,000。印度约为\$9,000。刚果民主共和国约为\$600。最富裕国家的人均产出是最贫穷国家的100多倍。这不是四舍五入的误差——而是全球经济的决定性事实。

GDP遗漏了什么。人均GDP是起点，而非终点。它遗漏了非正规经济——未登记的企业、街头小贩、自给自足的农户和家庭生产，这些在许多发展中国家占到经济活动的30–60%。它遗漏了不平等——一个国家的GDP在上升，大多数公民却可能变得更穷。它遗漏了环境退化：一个国家砍光森林、摧毁自身生产基础的同时，GDP上却记下了增长。它还遗漏了支撑整个经济、却从不进入任何国民账户的无偿照护劳动。

观点

“Gross National Product counts air pollution and cigarette advertising, and ambulances to clear our highways of carnage… it measures everything, in short, except that which makes life worthwhile.”
— Robert F. Kennedy, University of Kansas, March 18, 1968

GDP衡量的是重要的东西吗？

伊斯特林悖论表明，超过一定门槛后，更高的GDP并不能让人更幸福。不丹以追踪“国民幸福总值”闻名。但人均GDP较高的国家同样拥有更低的婴儿死亡率、更长的预期寿命和更多的政治自由。GDP是一个糟糕的指标，还是一个不完整的指标？

流行版本

Two dinner-party versions cancel each other out. One: “GDP is all that matters — growth is the only number a government should chase.” The other: “GDP is meaningless — it counts oil spills as progress and ignores everything human.” The first treats a thermometer as the patient’s health; the second throws out the thermometer because it doesn’t measure happiness. Both miss what GDP is for: a measure of market output, which turns out to be tightly correlated with the things the critics say it ignores — clean water, schooling, child survival, the leisure to ask whether money buys happiness.

最强支持论点

The Easterlin paradox — rich countries aren’t reported as happier than middle-income ones — was the empirical spine of the “GDP misleads” case for decades. Then Stevenson and Wolfers (2013) reopened the data with larger samples and found a robust, roughly log-linear relationship between income and life satisfaction that holds at every level — within countries, between countries, and over time. There is no satiation point where more income stops buying reported well-being. If you care about how people say their lives are going, income is one of the strongest predictors you have. The factor-of-100 gap is also a factor-of-many gap in stated life satisfaction.

最强反对论点

Simon Kuznets, who built the national-income accounts that became GDP, warned in 1934 that “the welfare of a nation can scarcely be inferred from a measurement of national income.” The architect told us not to use it as a welfare measure. Marilyn Waring pressed the sharpest version: the accounts systematically erase subsistence agriculture and unpaid care work — the labor that feeds and raises most of humanity — because it never crosses a market. A country can grow its GDP by commercializing what families used to do for free, and the number rises while nothing improves. The measure is not neutral; it sees what is sold.

判断

GDP is an imperfect but indispensable starting point. Track it alongside health, schooling, distribution, and environmental damage, not instead of them — the HDI and its cousins exist precisely to keep the other columns open. But the measurement debate does the opposite of what its loudest users intend: it does not dissolve the puzzle, it confirms it. Every honest revision — informal economy in, environmental costs out, capabilities counted — leaves Norway vastly richer than Niger on every axis a reasonable person would name. The gap is real. Now we have to explain it.

Full resources: Ch 7 § Macroeconomic Measurement

我们能衡量这个差距吗？

“我们可以看到世界如何从分裂走向趋同。今天，每个人——包括最贫穷的人——都活得更长了。看似不可能的事情已经发生了。”
— Hans Rosling, Factfulness, 2018

罗斯林的乐观有其事实依据。自1990年以来，已有超过十亿人摆脱了极端贫困。撒哈拉以南非洲的预期寿命在2000年至2020年间从50岁上升到62岁。但罗斯林的动画掩盖了一个关键细节：大部分趋同由中国和印度两国驱动，它们加起来占了世界人口的三分之一。把它们排除在外，情况远不那么乐观。许多撒哈拉以南非洲国家今天的实际人均收入比1980年还要低。

“GDP告诉我们的是市场交易，而不是幸福感。我们应该问的不是‘一个国家生产了多少？’而是‘它的公民能做什么和成为什么？’”
— Amartya Sen, Development as Freedom, 1999

阿马蒂亚·森的可行能力方法坚持认为，收入是手段而非目的。一个GDP高但没有医疗保障、没有政治自由、女性没有安全保障的国家，在任何有意义的层面上都不算“发达”。古巴的人均GDP低于拉丁美洲大部分国家，预期寿命却高于美国。喀拉拉邦是印度较贫穷的邦之一，识字率和健康状况却可与富裕国家媲美。但森的批评并不消解这个谜题——即使用多维指标衡量，富国与穷国之间的差距也依然巨大且持续。物质繁荣，无论衡量得多么不完美，都与几乎所有其他好的结果高度相关。

目前的结论

富国与穷国之间的差距真实、巨大且持续——即便调整了PPP、非正规活动和替代福利指标之后依然如此。罗斯林的动画显示差距在某些维度上正在缩小，但核心谜题依然存在：为什么有些国家飞速发展而另一些停滞不前？衡量只告诉我们差距存在，并没有告诉我们原因。

第一个正式解释来自索洛增长模型——半个世纪以来增长经济学的主力模型。它的答案是：穷国之所以穷，是因为它们没有积累足够的资本。简单、优雅，但最终不够充分。

阶段 2/5

资本深化

“这类问题对人类福祉的影响简直令人咋舌：一旦你开始思考它们，就很难再想别的事情。”
— Robert Lucas, “On the Mechanics of Economic Development,” Journal of Monetary Economics, 1988

这句话开启了现代增长理论。卢卡斯描述的是当你意识到国家间收入差距达50倍——而现有理论无法解释它时的感受。

经济学中最有影响力的增长模型是罗伯特·索洛在1956年构建的。其逻辑很直观：储蓄越多的国家积累的资本（机器、工厂、基础设施）越多，而资本越多意味着每个工人产出越多。穷国之所以穷，是因为储蓄不够。

$$y = A \cdot k^{\alpha}$$

生产函数。每个工人的产出取决于每个工人的资本和技术：

其中 $s$ 是储蓄率，$n$ 是人口增长率，$\delta$ 是折旧率。储蓄率越高，稳态收入越高；人口增长率越低，稳态收入越高。

$$y^* = A \left(\frac{s}{n + \delta}\right)^{\frac{\alpha}{1-\alpha}}$$

Income in the long run rises with the saving rate $s$ and falls with population growth $n$ and depreciation $\delta$. Save more, breed slower, and you get richer — but only up to a ceiling set by technology $A$. Two countries with the same $s$, $n$, and $\delta$ should converge to the same income.

直觉模式

把每个国家想象成在往浴缸里放水。储蓄是水龙头（新资本流入），折旧和人口增长是排水口（旧资本磨损，更多工人共用同样的机器）。水位——人均资本——在流入等于流出时稳定下来。水龙头更大（储蓄率更高）或排水口更小（人口增长率更低）的国家，最终拥有更多的人均资本，收入因而更高。

其中 $y$ 是每个工人的产出，$k$ 是每个工人的资本，$A$ 是全要素生产率（TFP），$\alpha$ 是资本的收入份额（通常约为1/3）。关键特征：资本存在边际收益递减。一个国家的第一家工厂带来巨大的产出；第一千家则增加得少得多。

稳态收入。从长期来看，经济收敛到一个稳态，此时投资恰好替代折旧的资本：

The longer story — how growth theory traveled from the classical stationary state through Solow to the ideas revolution of the next stage — is the spine of a forthcoming thread-tracing walkthrough on the growth tradition.

观点

“Aid is not benign — it’s malignant. No longer part of the potential solution, it’s part of the problem — in fact, aid is the problem.”
— Dambisa Moyo, Wall Street Journal, March 2009

外国援助能弥合差距吗？

如果索洛模型是对的，穷国缺乏资本，解决方案似乎很明显：从富国转移资本。这正是外国援助的逻辑。数万亿美元已经花在了这个前提上。丹比萨·莫约认为，这让事情变得更糟了。

流行版本

Two bumper stickers fight to a draw. “Africa needs more aid — the rich world is stingy.” And: “Aid doesn’t work — it’s money down a hole.” Both treat aid as one undifferentiated thing. It isn’t. A vaccine campaign and a balance-of-payments loan to a kleptocrat are both “aid,” and they do opposite things. The interesting argument is not for or against aid in general; it is about which kind of aid does what, and why the Solow logic that motivates the whole enterprise misfires at the national scale.

最强支持论点

Jeffrey Sachs makes the “big push” case, and it is pure Solow: poor countries are stuck in a low-capital trap; inject enough capital at once — clinics, roads, fertilizer, schools — and you exploit diminishing returns in reverse, kicking the economy onto a self-sustaining growth path. The historical exhibits are real. The Marshall Plan rebuilt a devastated Europe in a few years. South Korea absorbed enormous US aid in the 1950s and 60s before its takeoff. The Millennium Villages Project tried to run the experiment village by village. The logic is airtight inside the model: where capital is scarce, its return is high, so capital should work hardest exactly where it is poured in.

最强反对论点

William Easterly answers that aid mostly funds planners, not searchers — grand top-down schemes that no one is accountable for, captured by elites, generating Dutch disease that strangles exports, and blunting the very pressure to reform that produces growth. Moyo adds that many of the largest per-capita aid recipients saw incomes fall over the decades of heaviest assistance. The Marshall Plan and Korea are the wrong analogy: those were already-industrial societies with skilled workforces and functioning institutions, rebuilding what they had lost — not poor agrarian economies building it for the first time.

判断

Aid works for specific, targeted things — vaccines, deworming, bed nets, oral rehydration salts. These save lives at extraordinary cost-effectiveness, and the case for them is overwhelming. What aid has never done is drive sustained national growth. No country has developed primarily on foreign aid. Korea, China, and Botswana grew through institutional reform, export discipline, and domestic policy — aid was at most a lubricant. The Solow model’s mistake was never algebraic. It was the assumption that capital is the binding constraint, when the real constraint is whatever decides whether capital gets used productively. That “whatever” is the subject of the next three stages.

Full resources: Ch 20 § Development Economics (poverty traps and the big push)

资本是答案吗？

“贫困陷阱意味着穷人太穷了，储蓄不足以实现增长。外部援助可以提供摆脱困境所需的‘大推进’。所需的金额相对于富裕世界的收入来说很小——大约占GNP的0.7%。”
— Jeffrey Sachs, The End of Poverty, 2005

萨克斯的论点遵循清晰的索洛逻辑：如果边际收益递减让资本在稀缺的地方更具生产力，那么从富国向穷国转移资本应当产生爆发性回报。问题出在实证层面：几十年的援助带来的增长成效令人失望。萨克斯回应说援助规模太小、太零散、目标不够精准。千年村庄项目在某些维度上显示出有希望的成果，但未能证明大推进理论所预测的自我维持式增长。

“在过去五十年中，西方在外国援助上花费了\$2.3万亿，却仍然无法将12美分的药物送到儿童手中以预防半数疟疾死亡。大推进以前行不通，现在也行不通。”
— William Easterly, The White Man’s Burden, 2006

伊斯特利的批评超越了“援助被浪费”，直指知识和激励的更深层问题。自上而下的规划者——无论是世界银行还是政府——都不具备有效分配资源所需的本地知识。只有“搜索者”——企业家、本地组织、社区——才能发现每个情境中什么真正有效。援助绕过了这个发现过程。这一批评自有其局限：一些自上而下的干预（消灭天花、PEPFAR艾滋病防治项目）取得了惊人的成功。但伊斯特利说得对，持续增长从来不是靠外部转移得来的。

目前的结论

索洛模型是必不可少的脚手架。它让你明白，仅靠资本积累无法解释差距——这本身就是关键洞见。各国之间的大部分差异藏在 $A$——全要素生产率——之中，而索洛无法解释这个变量。外国援助辩论就是它的实际后果：如果核心瓶颈仅仅是资本，援助就会起效。事实并非如此——至少在宏观层面上并非如此。还有别的东西在决定资本能否被有效利用。

如果资本无法解释差距，那什么可以？下一代增长经济学家给出了一个激进的答案：知识。知识不会折旧，可以在不被消耗的情况下共享，而且会复合增长。但如果知识是增长的引擎，穷国为什么不直接照搬富国已有的知识呢？

阶段 3/5

知识的答案

Economics Explained: the 2024 Nobel Prize and why nations fail

2024年诺贝尔经济学奖授予了“制度”这个答案。但在制度之前，经济学家尝试过“知识”——阿西莫格鲁自己的研究正是建立在那个理论的不足之上。

1990年，保罗·罗默发表了开创内生增长理论的论文。他的洞见看似简单：知识与实物商品根本不同。一台机器同一时刻只能被一家工厂使用。而一项知识——半导体的设计、疫苗的配方、搜索引擎背后的算法——可以被所有人同时使用却不会被耗尽。这种非竞争性彻底改变了增长的数学。

非竞争性与收益递增。物质资本是竞争性的：一家工厂在用一台车床，另一家就不能用。知识是非竞争性的：一旦有人发明了晶体管，地球上每一家芯片制造商都可以同时使用这个设计。使用同一项知识的工人数量翻倍，产出大约也翻倍——却无需让知识翻倍。

其中 $g$ 是知识（因而也是收入）的增长率，$\delta_A$ 是研发部门的生产率，$L_A$ 是研究人员的数量。研究人员越多，新知识就越多，增长就越快——而且效果是永久性的，并非暂时的。

$$g = \delta_A \cdot L_A$$

where $L_A$ is the number of researchers and $\delta_A$ their productivity at generating ideas. More minds working on discovery means faster permanent growth — not a one-time bump but a steeper trajectory. The contrast with Solow is total: there, more saving raised the level of income and then stopped; here, more research raises the growth rate and keeps it raised.

直觉模式

索洛模型说你靠多储蓄变得更富。罗默则说你靠多思考变得更富。投资于研发、奖励创新的国家增长更快——而且是永久性的。正因如此，硅谷对美国的繁荣比匹兹堡的钢铁厂更重要。

罗默模型。劳动力中的一部分在研发部门工作，产出新知识。新知识提升生产率，从而产生利润，进而资助进一步的研发：

增长核算。当经济学家把跨国收入差异分解为资本、劳动力和TFP的贡献时，结果发人深省。TFP——与技术和知识密切相关——贡献了50–70%的收入差异。资本和劳动力加起来只解释了30–50%。知识这条线索在数量上占主导地位。

观点

“It doesn’t matter whether the cat is black or white, as long as it catches mice.”
— Deng Xiaoping, attributed, 1961

“中国模式可以复制吗？”

中国在40年里使8亿人脱贫——人类历史上最伟大的经济转型。它是在一党执政下做到的。这是否证明威权治理能带来增长？

流行版本

Two popular readings collide. “China proves authoritarianism works — strong leaders deliver growth that messy democracies can’t.” And: “China succeeded despite authoritarianism, not because of it.” Both leap straight to the political form and skip the thing that actually changed. The question worth asking is narrower and harder: what did China do, and is the doing separable from the doer?

最强支持论点

The strong version of the authoritarian-advantage case points to East Asia’s record — Singapore, South Korea under Park, Taiwan, China — and to the weak cross-country correlation between democracy and growth once you control for institutions and education. Daniel Bell pushes it furthest with “meritocratic governance”: a competent, insulated technocracy can plan for decades, override NIMBY veto points, and execute infrastructure at a scale electoral cycles punish. On this reading, the binding variable is state capacity — the ability to set a goal and actually deliver it — and authoritarian states sometimes have more of it.

最强反对论点

The same Chinese system that produced Deng’s reforms produced Mao’s Great Leap Forward — 30 to 45 million dead in a famine engineered by the absence of any mechanism to tell the leader he was wrong. Authoritarian systems have no error-correction: when they get it right the upside is fast, and when they get it wrong there is nothing to stop them. And the growth itself traces to specific institutional reforms, not to the one-party form — household decollectivization that gave farmers the residual, Special Economic Zones that created secure-enough property rights for foreign capital, WTO accession that locked in openness, and a quiet extension of property rights to private entrepreneurs. Those are institutional moves that happened to occur under authoritarian rule.

判断

China’s growth came from institutional content that happened to be delivered under authoritarian rule — not from authoritarianism as such. The replicable part is the content: secure property rights, market incentives, export discipline, openness to foreign technology. The political form is not the lesson; it is the packaging. This is why the ideas stage bridges into the institutions stage. Ideas are non-rival and available, but a society adopts them only when its rules make adoption pay. China rewrote those rules. That is what catching up actually looks like.

Full resources: Ch 18 § Extractive vs. Inclusive Institutions

知识够了吗？

“一种非竞争性、部分可排他的商品，绝不只是一个细微差别。把这个特征放进增长模型，你会得到一个完全不同的理论：由知识驱动、由激励维系、没有自然停止趋势的增长。”
— Paul Romer, “Endogenous Technological Change,” Journal of Political Economy, 1990 (Nobel, 2018)

正是这篇论文为罗默赢得了2018年的诺贝尔奖。他的洞见是：知识的生产需要前期成本，使用时的边际成本却为零。这就产生了收益递增，打破了索洛框架。在罗默的世界里，增长是由资源有意识地投向研究所驱动的——因此内生于政策选择。投资于知识的国家增长更快，而且是永久性的。由此可见：收入差距反映的不只是资本存量的差异，更是知识生产体系的差异。

“如果知识是增长的引擎，而知识又是非竞争性的，那发展中国家为什么不直接采用已经存在的技术？障碍不是知识本身，而是决定知识能否被使用的制度。”
— the line of argument in Daron Acemoglu, “Directed Technical Change,” Review of Economic Studies, 2002

阿西莫格鲁对纯粹知识驱动增长的批评，把这个领域推向了下一个阶段。如果知识可以自由获取，那真正的核心瓶颈一定是别的东西——决定知识能否被采纳、改造和部署的制度环境。因此诺贝尔奖颁给的是制度而不只是知识：知识解释了增长的近因机制，而制度解释了为什么有些国家启动了这台引擎而另一些没有。

目前的结论

知识是增长的近因引擎——这是罗默留下的持久贡献。TFP贡献了跨国收入差异的大部分。但知识内生于制度、激励与社会结构。如果技术是唯一的障碍，穷国就可以靠模仿走向繁荣。它们做不到。要理解原因，就得看博弈的规则，而不仅仅是棋盘上的棋子。

如果技术可得却未被采用，瓶颈一定在别处。下一个阶段给出了过去二十年最有影响力的答案：制度——决定谁投资、谁创新、谁获取回报的博弈规则。

阶段 4/5

制度的答案

“Countries differ in their economic success because of their different institutions, the rules influencing how the economy works, and the incentives that motivate people.”
— Daron Acemoglu & James Robinson, Why Nations Fail, 2012

阿西莫格鲁、约翰逊和罗宾逊因证明殖民时代确立的制度——早在几个世纪前就已奠定——至今仍在决定哪些国家今天富裕，而获得了2024年的诺贝尔奖。证据就写在殖民者的死亡率记录里。

2001年，阿西莫格鲁、约翰逊和罗宾逊（AJR）发表了那篇重塑发展经济学的论文。他们的论点是：国家的财富不是由地理、文化或自然资源决定，而是由制度——治理经济和政治生活的博弈规则——决定的。

阿西莫格鲁、约翰逊和罗宾逊把200年前欧洲殖民者的死亡率作为今天制度质量的工具变量；这一相关性能够预测当代的GDP差距。在死亡率低的地方，殖民者建立了保护产权和政治话语权的包容性制度；在死亡率高的地方，他们建立了旨在榨取资源的榨取性制度。这些制度延续至今——为榨取而设计的那一套制度，至今仍让那些国家陷于贫困。

The extractive institutions AJR build on are not an abstraction. They were forced-labor systems, monopoly trading companies, racial legal hierarchies, and tax regimes designed to move wealth out — the colonial state as an extraction machine, documented economy by economy in the history of imperialism.

AJR sit at the end of a long institutionalist lineage — Veblen’s habits and Commons’s working rules, Coase on transaction costs, Douglass North on rules and enforcement, and finally the AJR turn to colonial origins. That tradition, traced as intellectual history, is History of Economic Thought Ch.15 (The institutionalist tradition).

观点

“The colonial origins of comparative development… where Europeans faced high mortality rates, they could not settle and set up extractive states; these institutions persisted after independence.”
— Acemoglu, Johnson & Robinson, American Economic Review, 2001

“殖民主义导致了贫困吗？”

2024年诺贝尔奖认为殖民制度是全球收入差距的根本原因。但殖民主义是一件事——还是这个标签涵盖了一百个不同的故事？

流行版本

Two extremes shout past each other. “Colonialism explains everything — poor countries are poor because the West looted them.” And: “Colonialism is an excuse — it ended sixty years ago; blame present governments.” The first collapses a complex, well-identified mechanism into a slogan; the second waves away an effect that the cleanest natural experiments in development economics keep confirming. The real question is how much, through what channel, and what it does and doesn’t excuse.

最强支持论点

The empirical case is unusually tight. AJR’s settler-mortality instrument isolates institutional quality from reverse causation. Melissa Dell’s study of Peru’s colonial mita — the forced-mining-labor draft — finds that districts subjected to it are measurably poorer today, with the boundary visible in household consumption centuries later. Nathan Nunn shows that the regions of Africa most ravaged by the slave trade are the poorest now, with the damage running through shattered trust and broken state-formation. Three independent designs, three different natural experiments, one consistent answer: the institutional scars of extraction persist for centuries. The deepest of those scars, the Atlantic slave trade, is the empirical anchor under Nunn’s result.

The narrower question — did the West get rich because of slavery? — is its own walkthrough: Did the West get rich because of slavery? Here the slave trade enters as one channel by which extractive institutions were imposed and persisted in the colonized world; that walkthrough takes the West-because-of-slavery argument as its central frame.

最强反对论点

Jared Diamond answers that geography came first — the head start in domesticable crops and animals, the disease environments, the accidents of latitude that shaped who could develop and who couldn’t, long before any colonizer arrived. David Albouy challenged the AJR settler-mortality data directly, arguing the mortality figures are noisier and more selectively coded than the headline result admits. And the counterexamples bite: Ethiopia was essentially never colonized and is poor; Botswana was colonized and became one of Africa’s richest and best-governed states. If colonial institutions were destiny, neither should exist.

判断

Colonial institutions left real scars — the AJR, Dell, and Nunn evidence is too consistent across too many independent designs to dismiss. But the story is more complex than a single cause. Colonialism is a major, well-documented source of institutional dysfunction across much of the developing world; it is one cause among several, interacting with geography and contingency rather than overriding them; and it does not relieve post-colonial governments of responsibility for what they built on the rubble. Botswana inherited extraction and chose differently. The scar is real. It is not a sentence.

Full resources: Ch 18 § The AJR Framework

The deeper version of this dispute — whether the European rise itself was institutions or geography or coal-and-colonies — runs through the Did Britain have to industrialize first? walkthrough, where the Pomeranz case for contingency is argued at full strength. Here the AJR institutional thesis is the frame, and geography enters as its strongest counter.

制度、地理还是文化？

“国家之所以失败，是因为其榨取性的经济制度没有为人们储蓄、投资和创新创造激励。榨取性的政治制度通过巩固从榨取中受益者的权力来支持这些经济制度。”
— Daron Acemoglu & James Robinson, Why Nations Fail, 2012

阿西莫格鲁-罗宾逊的框架优雅而简洁：包容性制度开启良性循环（广泛参与带来投资，带来增长，带来更具包容性的制度），榨取性制度则形成恶性循环（权力集中带来榨取，带来贫困，带来权力进一步集中）。诺贝尔委员会认可了这一观点。但优雅不等于完整。中国在阿西莫格鲁-罗宾逊归类为榨取性的制度下取得的爆发性增长，是这一框架最棘手的案例。他们的回应——榨取性制度下的增长真实存在但不可持续——仍然只是预测，而非已被证明的事实。

“你无法摆脱疟疾的束缚。你无法摆脱在没有通航水道的热带内陆国家的束缚。地理不是命运，但它对命运的约束远超制度主义者所承认的。”
— Jeffrey Sachs, “Institutions Don’t Rule,” NBER Working Paper, 2003

萨克斯的地理反论有扎实的实证支持。即使控制了制度因素，热带国家仍然比温带国家更穷。非洲内陆国家的运输成本是沿海国家的3–5倍。疟疾流行地区的增长持续偏低。但地理与制度之间的相关性恰恰就是识别挑战——是疟疾导致了坏制度，还是坏制度导致无法控制疟疾？新加坡地处热带，却是地球上最富裕的地方之一。问题不是在地理或制度之间二选一，而是它们如何相互作用。

目前的结论

制度几乎肯定非常重要——来自AJR、戴尔、纳恩等人的因果证据既扎实又多样，不容忽视。但“制度”是一个宽泛的类别，榨取性与包容性的二元划分是一种有用的简化，并非完整的理论。地理、文化和历史偶然与制度相互作用，而不是互相替代。诚实的立场是：制度对发展必不可少，但它们如何产生、如何变化、如何与其他力量互动，其故事远比任何单一框架所能概括的更复杂。由此引出最难的问题：如果制度如此重要，你又该如何真正改变它们？

The Botswana-and-Zimbabwe contrast, and the wider story of which post-colonial states escaped extraction, sits in Economic History Ch.17 (China Reform and the Asian Century).

宏大理论令人信服，但能经得起数据的考验吗？最后一个阶段用微观证据去面对宏观问题。随机对照实验——与测试药物所用的方法一样——能告诉我们如何让整个国家摆脱贫困吗？

阶段 5/5

包容性与榨取性

“我们想证明，在世界上最大的问题面前取得进展是可能的。”
— Esther Duflo, Nobel Prize Lecture, 2019

两个世纪的增长理论之后，这个领域仍然无法就什么使国家富裕达成共识。迪弗洛、班纳吉和克雷默因另辟蹊径而获得诺贝尔奖：别再争论了，去做实验。

RCT革命。发展经济学中的随机对照试验（RCT）运作方式和临床试验相同。你把村庄、家庭或个人随机分入处理组（接受干预）和对照组（不接受）。两组结果之间的差异就是因果效应，没有观察性研究中那种选择偏差的困扰。

结果令人耳目一新。GiveDirectly的无条件现金转移增加了受助者的收入和福利，浪费极少。米格尔和克雷默（2004）证明，给学童驱虫有巨大的长期回报——每个孩子只需\$0.50的治疗，就能让终身收入增加数千美元。信息干预也改变行为：告诉农民有新的种子品种，采用率就会提高；告诉父母教育的回报，入学率就会上升。

结构估计。弥合这一差距的一种办法，是构建具有明确机制的经济模型，用微观数据估计其参数，然后在宏观层面模拟反事实情境。谢长泰和克莱诺（2009）表明，把资源从生产率较低的企业重新配置到生产率较高的企业，可以让印度和中国的制造业TFP提高40–60%。资源错配这条线索把微观证据（为什么特定企业效率低下）与宏观结果（为什么整个经济体贫穷）连接了起来。

$$\text{ATE} = E[Y_i(1) - Y_i(0)] = \bar{Y}_{\text{treatment}} - \bar{Y}_{\text{control}}$$

the difference between the outcome each person would have under treatment, $Y_i(1)$, and under no treatment, $Y_i(0)$. You can never observe both for one person — the fundamental problem of causal inference — but randomization makes the control group’s average a valid stand-in for the treated group’s missing counterfactual. The estimate is clean. The question is what it is clean about.

直觉模式

RCT的力量很简单：随机把一些村庄分配到某个项目中，把另一些留作对照组。结果上的任何差异都一定由该项目造成，因为两组在干预之前平均而言是相同的。这正是因果证据的金标准。局限同样简单：结果告诉你这些村庄里发生了什么，却不一定告诉你在别的地方会发生什么。

聚合问题。RCT评估的干预是微观的。富国与穷国之间的收入差距却是宏观的。一个把个人收入提高20%的驱虫项目，对那些人来说意味着翻天覆地的变化，但在国民收入统计里几乎看不见。刚果民主共和国和丹麦之间的差距，不是由蚊帐短缺造成的。这就带来了一个根本性的矛盾：我们能严格评估的干预太小，解释不了发展差距；而大到足以解释差距的力量——制度、产业政策、宏观经济管理——又无法被随机化。

The structural alternative. Where RCTs go small, structural estimation goes for the whole machine. Hsieh and Klenow (2009) estimate that misallocation — capital and labor stuck in low-productivity firms instead of flowing to high-productivity ones — could account for differences in aggregate TFP of 30–50% between India, China, and the United States. The result depends on a fully specified model rather than a randomized comparison; it buys economy-wide reach at the cost of the design transparency a trial has. The two methods answer different questions, and neither answers the other’s.

直觉模式

这样想：RCT告诉你某种药物可以治愈某种疾病。但病人同时得了五十种病，其中一些是医院本身造成的。你既需要有针对性的药物，也需要一个关于医院出了什么问题的理论。

The arc from Arthur Lewis’s 1954 dual-economy model, through the structural-adjustment era, to the credibility revolution that brought RCTs to development is itself a tradition — traced as intellectual history in History of Economic Thought Ch.16 (Development economics).

实验还是宏大理论？

“随机实验从根本上改变了发展经济学。我们现在可以有信心地说哪些干预措施改善了穷人的生活。挑战在于将我们学到的东西推广应用。”
— Abhijit Banerjee & Esther Duflo, Poor Economics, 2011

班纳吉和迪弗洛的贡献，在方法上与在实质上同样重要。通过坚持实验证据，他们抬高了整个领域的证据门槛。在RCT之前，发展政策由意识形态和轶事驱动；在此之后，针对特定干预的可信证据基础终于建立起来。但“把学到的东西推广开来”恰恰是微观-宏观差距咬人的地方：在50个村庄有效的干预，推广到5000万人时可能就会失效，因为一般均衡效应——价格变化、劳动力市场变动、政治反应——会随之显现。

“对于发展研究来说，没有什么比对随机对照试验的痴迷更具灾难性的了。我们在研究蚊帐，而我们应该研究为什么有些国家能工业化而其他国家不能。”
— Lant Pritchett & Justin Sandefur, “Context Matters for Size,” Center for Global Development, 2013

普里切特的批评引起了许多发展从业者的共鸣。真正摆脱贫困的国家——韩国、台湾、中国、博茨瓦纳——靠的是产业政策、制度改革和宏观经济管理，而这些都无法通过随机实验来研究。RCT革命可能在不知不觉中把这个领域的注意力缩窄到了那些适合做实验的问题上，而真正重要的问题——关于制度、权力和政治经济学——则研究不足。但普里切特的替代路径——通过案例研究和结构模型来研究工业化——也有其自身的证据挑战。

The growth trajectories Pritchett points to — the Asian tigers, China’s takeoff, the Botswana exception — are the historical baseline in Economic History Ch.17 (China Reform and the Asian Century).

结论

对“为什么有些国家贫穷？”这个问题的诚实回答是分层的：制度和知识是根本原因，通过产权、人力资本、技术采纳和政治稳定发挥作用。RCT帮助我们理解这些渠道中的具体机制。结构模型帮助我们思考规模和一般均衡。但没有单一理论能解释一切。开启这次探索的问题至今仍然真正开放——而这本身，就是一件值得理解的重要事情。

目前的结论

我们从汉斯·罗斯林的动画图表开始：200个国家，200年，一场惊人的分化。五个阶段之后，以下是你现在知道的：

Measurement. The gap is real — it survives PPP, the informal economy, capability accounting, and every reasonable revision. The debate about GDP confirms the puzzle rather than dissolving it.
Capital. Solow taught us that capital deepening alone predicts a factor-of-three gap, not a factor-of-fifty — necessary, but explanatorily insufficient, with most of the difference hidden in a technology residual.
Ideas. Romer named the residual: non-rival ideas are the proximate engine, and TFP is the majority of the gap. But ideas are public, so the mystery becomes why poor countries can’t adopt them.
Institutions. AJR’s answer — the rules of the game decide whether ideas get used — is the deepest cause the field has identified, with settler-mortality evidence too consistent to dismiss, though it interacts with geography and contingency rather than replacing them.
Experiments. RCTs deliver clean causal evidence on specific interventions and save real lives, but operate at the wrong scale to explain national outcomes; structural models reach further at the cost of design transparency.

国家间的收入差距，是数个世纪制度分化演变的产物，被知识生产和采纳的差异所放大，被地理和历史所塑造，被抵制变革的政治结构所维系。没有哪种宏大理论能捕捉这一切。声称掌握“答案”的研究者们，无一例外只是在描述一个更大拼图中的一块。

下次有人告诉你“一切都归结于制度”、“只要多给援助就行”或“文化决定命运”时，你手里已经有了五个框架可以用来评估这个说法。没有哪个是完全正确的。每一个都捕捉了一个真实的机制。这个问题——经过数十年的杰出研究——至今仍然真正开放。这不是经济学的失败。这是社会科学中最难的问题，而要诚实地面对它，就需要同时在头脑中持有多个框架。

返回大问题

GDP衡量的是重要的东西吗？

流行版本

最强支持论点

最强反对论点

判断

我们能衡量这个差距吗？

目前的结论

外国援助能弥合差距吗？

流行版本

最强支持论点

最强反对论点

判断

资本是答案吗？

目前的结论

“中国模式可以复制吗？”

流行版本

最强支持论点

最强反对论点

判断

知识够了吗？

目前的结论

“殖民主义导致了贫困吗？”

流行版本

最强支持论点

最强反对论点

判断

制度、地理还是文化？

目前的结论

实验还是宏大理论？

结论

目前的结论

相关问题