Note

Review: Panel 5 — Maximum Hostility Review of b17 (h* Theorem). Four-reviewer adversarial panel with zero charity. Skeptic investigative journalist, anti-“great man” historian, clinical psychologist, and a 14-year-old honors student. Every surface attacked. Includes mock news article as publication-readiness reality check.

VVN: dv_ClaOp46_v1_2026m04d10
Reviewed documents:
b17 formal paper (MMv1r2, 2026m04d10)
b17 general reader intro (MMv1r2-Intro, 2026m04d10)
153 FiShFus Job Positions (MMv3 draft, 2026m04d10)
b18 eschatological recognition analysis (beginner version)
b16 RiskyMAD intro (MMv3, 2026m04d09)

Panel 5: Maximum Hostility Review of b17 (h* Theorem)#


Reviewer A — Skeptic Investigative Journalist#

A.1 The Devastating Paragraph Test#

The most devastating paragraph a journalist could write:

A man living in his car near Dallas, Texas, has published seven mathematical papers on a personal website claiming to have derived a formal proof that one person always matters most at any given moment — and that person might be him. The papers include an axiom (ax19) that the author himself admits is “the most daring conjecture in the system” and whose formal justification “has not been carried out.” Two of the framework’s key deliverables — an AI alignment theory and a reinterpretation of the Book of Revelation — are labeled “forthcoming.” The author wrote his own job qualifications (eight criteria), claims to meet them, and acknowledges that this looks circular. He is asking the world to send $8 per person per year to fund a research institute he has not yet built. Welcome to the most elaborate job application of the 21st century.

The most fair paragraph a journalist could write:

A computational biologist who left an academic career has published a series of seven interconnected papers that, whatever one thinks of their conclusions, represent a genuinely unusual intellectual undertaking. The framework includes built-in self-critique mechanisms: eight transparency criteria derived from the axiom system, an explicit catalog of nine known weaknesses, and a public invitation (#AuditTheMath) to demolish the arguments. The RiskyMAD nuclear risk model — a simple Markov chain estimating approximately 1-in-40 annual risk of accidental nuclear winter — is checkable and, if correct, disturbing. Whether the personal claims attached to the framework help or hinder the mathematics is the central question a reader must decide.

Assessment: BREACH.

Both paragraphs are accurate. The devastating one is more newsworthy. A journalist would lead with the devastating framing because it captures the honest gut reaction of most readers encountering this work for the first time. The fair paragraph requires the reader to have already overcome that gut reaction — which most will not.

Issue

Status

Assessment

Devastating vs. fair framing

BREACH

The devastating paragraph is more accurate to likely reception. The paper’s communication strategy does not adequately account for this. The math may be sound, but the packaging invites dismissal before the math is reached.

A.2 The “No Other Volunteer” Claim#

BREACH.

The claim “I couldn’t find another volunteer” is structurally unfalsifiable. You cannot prove you searched everywhere. More critically, the paper never addresses the obvious question: in a world of 8 billion people, why has no scholar of game theory, no nuclear policy expert, no moral philosopher, no peace studies researcher independently identified this as their calling?

Three possibilities exist:

  1. The framework is correct but unknown. Possible, but the author has been publishing for months on a public website. The absence of engagement may reflect the communication problem identified in A.1, not the absence of potential volunteers.

  2. The framework is wrong. Experts in game theory, nuclear policy, and moral philosophy may have examined similar ideas and rejected them for reasons the author has not considered. The silence of the field is not evidence that the field is asleep; it may be evidence that the field has already thought about this and concluded differently.

  3. The criteria are designed to exclude everyone except the author. The eight criteria, while derived from the axiom system, produce a profile that matches exactly one known person: a scientist who left academia, lives in poverty, maintains a public audit trail, and makes theological claims grounded in mathematical formalism. This profile is so specific that it functions as a job description written for a single applicant.

The paper acknowledges the circularity concern (Section 4.3, Section 6.4) but does not address possibility 2 or 3 with sufficient rigor. Acknowledging a weakness is not the same as resolving it.

Issue

Status

Assessment

“No other volunteer” claim

BREACH

Unfalsifiable convenience claim. The paper does not engage with why the relevant expert communities have not independently converged on this framework. Dismissal risk: HIGH — a hostile reader will use this to argue that the criteria are reverse-engineered.

Repairability

Repairable

Explicitly address the silence of the relevant fields. Engage with existing game-theoretic literature on first-mover problems in nuclear disarmament. Show that the framework adds something that existing work does not.

A.3 Mathiness Check#

Mixed HELD/BREACH.

Paul Romer’s critique of “mathiness” (2015) targets the use of mathematical notation to make non-mathematical claims appear rigorous. The question: does the b17 paper commit mathiness?

Component

Status

Assessment

RiskyMAD stochastic model ([Matheo-6])

HELD

Genuine mathematics. A three-state Markov chain with checkable transition rates and reproducible simulations. The Evolvix code is published. Anyone with simulation experience can run it. This is real.

CausalInfluence function (ax19)

BREACH

Defined formally (total variation distance on do-calculus interventionals) but never computed for any case — not even Arkhipov. The paper admits the do-calculus formalization “has not been carried out” (Section 6.5). This is the signature of mathiness: the notation of rigor without the substance. The function looks mathematical but has never been instantiated.

“Measure zero” uniqueness argument

BREACH

The argument that exact ties have probability zero relies on the CausalInfluence function being continuous. But continuity requires the do-calculus formalization that has not been done. The argument is a heuristic presented in mathematical costume. In population genetics (the paper’s own analogy), fitness is measurable and computable. Here, CausalInfluence is neither.

Commitment Trichotomy (th6)

HELD

A legitimate logical partition (no volunteer / dishonest volunteer / genuine volunteer). The three cases are exhaustive. The game-theoretic framing (Prisoner’s Dilemma to Assurance Game) is standard and correctly applied.

Transparency criteria derivation

HELD

Each criterion traces to a specific axiom with a published derivation chain. Whether the axioms are correct is a separate question, but the derivations are genuine inferences, not decoration.

Summary: The framework is approximately 60% genuine mathematics and 40% mathiness. The genuine parts (RiskyMAD, Commitment Trichotomy, transparency criteria derivations) are real and checkable. The mathiness parts (CausalInfluence function, uniqueness argument) are concentrated in ax19 — which is, by the paper’s own admission, the load-bearing conjecture. The foundation of the building has not been poured, and the building is already seven stories tall.

Issue

Status

Assessment

Mathiness in ax19

BREACH

The CausalInfluence function is defined but never computed. The uniqueness argument depends on a formalization that has not been carried out. Dismissal risk: HIGH — a mathematician will identify this immediately.

Repairability

Partially repairable

Compute CausalInfluence for at least one historical case (Arkhipov is the obvious candidate). Complete the do-calculus formalization. If these cannot be done, ax19 should be explicitly downgraded from “well-modeled conjecture” to “motivated hypothesis.”

A.4 Vaporware Audit#

BREACH.

Two major deliverables are labeled “forthcoming”:

  1. “The formal derivation connecting BABL/OSCR dynamics to AI systems is forthcoming, to be developed as ResearchCity scales up.” (Section 7.3 of the formal paper; Section 4 of the intro.) This is the AI alignment claim — arguably the most consequential prediction in the paper. It is not merely unfinished; it is explicitly contingent on funding (“as ResearchCity scales up”). The paper is asking for financial commitment based on a promise to deliver the derivation after the money arrives.

  2. “This re-envisioning [of Revelation] is forthcoming in full.” (Section 7.3 of the formal paper.) The author’s reinterpretation of the Book of Revelation as an implementation plan for ResearchCity is presented as a core component of the teaching strategy. It does not exist yet.

This is a standard vaporware pattern: announce the product, generate interest (and funding), deliver later — or never. The pattern is especially concerning because the paper simultaneously (a) claims urgency (“the opportunity still open today may no longer be available tomorrow”), (b) asks for action ($8/person/year, support ResearchCity), and (c) admits that two key components do not yet exist.

A journalist would ask: if you have had time to write seven papers, nineteen axioms, and eight transparency criteria, why have you not had time to write the AI derivation? The answer (“it requires ResearchCity to scale up”) suggests that the derivation is not a matter of writing time but of institutional infrastructure — which means the paper is asking for infrastructure funding based on a hypothesis that the infrastructure would check.

Issue

Status

Assessment

Vaporware: AI alignment derivation

BREACH

The most consequential prediction in the paper is labeled “forthcoming” and contingent on the funding the paper requests. Dismissal risk: HIGH.

Vaporware: Revelation re-envisioning

BREACH

A core component of the teaching strategy does not yet exist. Dismissal risk: MODERATE (most secular readers will not care about this component; religious readers will).

Repairability

Repairable

Publish at least a sketch of the AI alignment derivation before asking for funding. Even an incomplete but substantive draft would transform “forthcoming” from vaporware into “work in progress.”

A.5 The Investigation Question#

BREACH-level concern.

If this paper arrived as a tip, the first three things a journalist would investigate:

  1. The academic career. The paper says the author “left a university position and gave up career stability.” A journalist would check: was the departure voluntary? Was there a tenure denial, a contract non-renewal, or a disciplinary action? The paper frames the departure as sacrifice; the framing is the author’s narrative. University employment records are (in many jurisdictions) partially public.

  2. The financial situation. The author is described as living in a car with approximately $24,000 in debt. A journalist would check: where does donation money go? Is there a registered nonprofit? What are the financial controls? The paper mentions “financial transparency” but the organizational entity does not yet exist.

  3. The psychological history. This is the most sensitive area. Reviewer C addresses it clinically. A journalist would note that the paper mentions “documented focus challenges” (the FiShFus positions document references “Susman, 2017”) and ask what other documented conditions might be relevant. This is not to stigmatize mental health conditions, but to check whether the author’s self-assessment is informed by professional evaluation.

Issue

Status

Assessment

Academic career investigation

BREACH

The departure narrative is unverified in the paper. Dismissal risk: EXTREME if the departure was not fully voluntary. Repairable: publish the circumstances transparently.

Financial controls

BREACH

No organizational entity exists to receive funds. No audited accounts. Dismissal risk: HIGH. Repairable: establish the entity before soliciting support.

Psychological evaluation

BREACH

The paper does not address whether the author has been professionally evaluated for the psychological patterns Reviewer C identifies. Dismissal risk: MODERATE. Repairable: include or reference a professional evaluation.


Reviewer B — Anti-“Great Man” Historian#

B.1 ax19 as Carlyle Redux#

BREACH.

ax19 is Thomas Carlyle’s “Great Man Theory” (1840) dressed in mathematical notation. The paper engages with the historiographic critique only through a modernism/postmodernism framing (Section 1) that bypasses 150 years of specific, devastating arguments.

The arguments the paper does not engage with:

  1. Herbert Spencer (1873). Great individuals are products of their social environment, not independent agents. The submarine officer who says “no” to a nuclear launch was shaped by Soviet naval training, personal experiences aboard K-19, and the specific three-officer protocol of that deployment. Remove any of these structural conditions and Arkhipov’s “no” either does not happen or does not matter. Spencer’s argument is not that individuals are unimportant; it is that their importance is a function of the structure, not an independent variable. ax19 treats causal influence as a property of the individual at a moment. Spencer says it is a property of the system at a moment, channeled through an individual.

  2. Tolstoy (War and Peace, 1869). Napoleon’s decisions were constrained by logistics, morale, geography, weather, and the independent decisions of thousands of subordinates. The “great man” appeared to command, but the appearance of command was itself a product of structural forces. Tolstoy’s argument applies directly to ax19: even at moments of apparent concentration, the “concentrated” individual is executing a decision that was already constrained to a narrow range by the system around them. Arkhipov could say “no” or “yes.” That binary is the only sense in which his influence was concentrated — and the binary was created by the system, not by Arkhipov.

  3. The Annales School (Bloch, Braudel, mid-20th century). Long-duration structural forces — geography, climate, economic systems, demographic patterns — shape history more than any individual decision. The Cuban Missile Crisis itself was a product of Cold War structural dynamics that no individual created or controlled. ax19 focuses on the moment of crisis; the Annales School asks what created the crisis. The answer is never “one person.”

  4. Social History (E.P. Thompson, Howard Zinn, 1960s–present). Ordinary people’s collective actions — strikes, migrations, consumption patterns, voting, cultural shifts — aggregate to produce historical change. The abolition of slavery was not a single decision by a single person; it was a multi-decade, multi-nation, distributed movement involving millions of actors.

The paper’s Section 1 frames the debate as “modernism says individuals wash out; postmodernism says all perspectives are equal; both are wrong.” This is a strawman. The historiographic critique is more specific and more devastating: even at moments of apparent individual concentration, the individual’s options, knowledge, psychology, and context are structurally determined. ax19 posits a “unique maximum” of causal influence without addressing the structural determination of the maximum itself.

Issue

Status

Assessment

ax19 as Carlyle redux

BREACH

The paper does not engage with the specific historiographic arguments (Spencer, Tolstoy, Annales School, social history) that demolished the Great Man Theory. The modernism/postmodernism framing bypasses 150 years of critique. Dismissal risk: HIGH among historians.

Repairability

Partially repairable

Engage directly with Spencer, Tolstoy, and Braudel. Show that ax19 makes a structurally different claim (the system creates the concentration, not the individual’s inherent greatness). This is actually defensible but the paper does not make the argument.

B.2 Selection Bias in Historical Examples#

BREACH.

Every historical example in the paper (Moses, Jesus, Muhammad, Gandhi, Arkhipov) is selected precisely because it fits the “single concentrated individual” narrative. The paper does not address counter-examples where distributed agency achieved comparable historical change without a single h*:

  1. The French Revolution (1789). No single first-mover. The revolution emerged from a convergence of fiscal crisis, bad harvests, Enlightenment philosophy, and the collective action of thousands. Robespierre, Danton, and others became prominent after the revolution began, not before.

  2. The abolition of slavery (1780s–1860s). A multi-decade, multi-nation distributed movement. Wilberforce is often credited, but the movement included Equiano, Clarkson, Cugoano, Tubman, Douglass, and thousands of unnamed activists. No single decision by any one person ended slavery. The structural conditions (economic shifts, moral movements, political coalitions) were collectively necessary.

  3. Women’s suffrage (1840s–1920s). Dozens of leaders across decades, no unique maximum. Stanton, Anthony, Pankhurst, Paul — none individually decisive. The movement succeeded through distributed, sustained collective action.

  4. The labor movement (1860s–1930s). Millions of workers organizing collectively. No single first-mover. The eight-hour workday, workplace safety, child labor laws — all achieved through distributed agency.

  5. The scientific revolution (1543–1687). Copernicus, Galileo, Kepler, Newton, and dozens of others. No single h*. Each built on predecessors’ work. Remove any one and the revolution proceeds, perhaps delayed, but proceeds.

The paper acknowledges the cherry-picking objection (Section 2.4: “these are cherry-picked examples — the objection is fair”) but then does nothing with it. It does not attempt to show that the counter-examples are actually consistent with ax19 (which would require identifying a hidden h* in each case), nor does it acknowledge that the counter-examples might falsify ax19.

Issue

Status

Assessment

Selection bias in examples

BREACH

Five major historical changes (French Revolution, abolition, suffrage, labor, scientific revolution) achieved without a single h*. The paper acknowledges cherry-picking but does not address the counter-examples. Dismissal risk: HIGH.

Repairability

Partially repairable

Attempt to show that ax19 is consistent with distributed movements (perhaps by arguing that leadership emerges even within distributed movements). Alternatively, honestly narrow the scope of ax19 to crisis moments rather than “almost all t.”

B.3 Survivorship Bias in the Arkhipov Example#

BREACH.

The paper’s strongest example is its weakest epistemological case:

  1. We know about Arkhipov because the story was declassified. For every Arkhipov whose story was published, there may be unknown officers who also prevented catastrophe in less dramatic or less documented circumstances. The fact that we know Arkhipov’s name is itself a function of narrative construction and archival access, not of causal structure.

  2. The three-officer protocol was the cause, not Arkhipov. Arkhipov’s “no” mattered because Soviet naval procedure on that specific deployment required unanimous consent from three senior officers. Change the protocol to majority rule, and the torpedo launches. Change it to captain’s sole authority, and the torpedo launches. The causal weight was in the protocol design, not in Arkhipov. Arkhipov was the channel through which the system’s redundancy expressed itself.

  3. Multiple simultaneous decision points. On 27 October 1962, Kennedy, McNamara, the Joint Chiefs, Khrushchev, the American destroyer captains, and the other Soviet submarine captains were all making decisions that affected the crisis outcome. Arkhipov’s decision is the most dramatic, but the crisis resolution depended on the full set of decisions. If Kennedy had chosen a military strike on Cuba that same day, Arkhipov’s “no” would have been irrelevant — the escalation would have started from a different node.

  4. The Petrov counter-example undermines uniqueness. The paper cites both Arkhipov (1962) and Petrov (1983) as examples of concentrated causal influence. But the existence of two examples in 21 years, from two different people, in two different contexts, is evidence of distributed near-miss prevention, not concentrated influence. The system generated multiple moments of apparent concentration, each with a different “h*.” This is more consistent with a structural property of nuclear systems (they generate single-point-of-failure moments) than with a universal property of civilization (there is always exactly one h*).

Issue

Status

Assessment

Survivorship bias in Arkhipov

BREACH

The example demonstrates systemic redundancy (the three-officer protocol), not individual uniqueness. The multiple simultaneous decision points on 27 October 1962 undermine the uniqueness claim. Dismissal risk: MODERATE (the Arkhipov story is compelling regardless).

Repairability

Repairable

Frame Arkhipov as evidence that concentration occurs at crisis moments (the weak claim) rather than evidence for universal uniqueness (the strong claim of ax19). This is actually what the paper’s “almost all t” weakening does, but the narrative still uses Arkhipov to sell the strong form.


Reviewer C — Clinical Psychologist#

C.1 Grandiose Ideation Pattern Check#

BREACH (with important clinical caveat).

The paper exhibits all five markers of grandiose ideation patterns:

(a) Belief in unique personal importance. “The author declares candidacy within the near-maximal set for the h0 role” (Section 7.2). Translated from the formal language: the author believes they may be among the most causally important people alive at this moment. The “near-maximal set” language (introduced in MMv1r2) softens the claim from “I am the unique maximum” to “I am in the neighborhood of the maximum,” but the structural function is the same: the author presents themselves as a candidate for a role that the paper defines as having the largest causal influence on civilization’s future.

(b) Elaborate intellectual framework justifying that belief. Seven papers, nineteen axioms, eight transparency criteria, nine known weaknesses, five historical candidate assessments — all building toward the conclusion in Section 7.2 that the author is a candidate for the h0 role. The framework is intellectually sophisticated, which makes the pattern harder to detect — but sophistication does not distinguish genuine insight from elaborately structured self-justification.

(c) Framing personal sacrifice as evidence of genuineness. Section 7.2: “The author has sacrificed career stability and financial security for this work.” Section 4 (intro): the author “left a university position.” The hardship criterion (criterion 5 in the transparency framework) structurally requires the candidate to have suffered, which transforms the author’s poverty and career-loss from misfortune into evidence of qualification.

This is the most clinically concerning pattern: the framework converts suffering into status. In clinical literature, the reinterpretation of personal misfortune as evidence of cosmic significance is a recognized feature of grandiose narratives. The person who loses their job says “I sacrificed my career for the truth.” The person who loses their home says “I gave up everything for the mission.” The narrative is unfalsifiable: any suffering confirms the narrative, and the absence of suffering would disconfirm it.

(d) Invitation to test that doubles as a challenge to authority. “#AuditTheMath” functions simultaneously as a scientific invitation and a rhetorical dare. The implicit message is: “If you are smart enough, check my work — but you probably will not, because the establishment does not want to engage.” The Section 9 conclusion lists five types of hostile readers and pre-empts each objection, which structurally positions the author above all critics before any critique has been offered.

(e) Anticipation and pre-emption of objections as evidence of sophistication. Section 6 (Known Weaknesses) catalogs nine vulnerabilities. Section 9 (Conclusion) pre-empts five specific objections. The act of raising these objections in the author’s own voice, addressing them on the author’s own terms, and then presenting the anticipation as evidence of honesty is the most clinically significant pattern. In clinical practice, this is called “immunizing the narrative” — the patient raises the objection before the clinician can, addresses it in a way that reinforces the grandiose framework, and then uses the act of self-critique as proof that the framework is not grandiose.

Critical clinical caveat: Pattern-matching is not diagnosis. These five markers are necessary but not sufficient for grandiose ideation. A genuine case of a person who has actually discovered something important and is communicating it transparently would exhibit some of the same markers. Einstein asserting the correctness of general relativity exhibited markers (a), (b), and (d). Darwin’s long delay in publishing exhibited markers (c) and (e).

The markers cannot distinguish between “genuine discovery presented honestly” and “grandiose ideation presented sophisticatedly.” This fundamental clinical ambiguity is the core finding: the paper cannot resolve it internally, and the paper does not provide an external mechanism for resolving it.

Issue

Status

Assessment

Grandiose ideation markers (a)–(e)

BREACH

All five markers present. Clinical caveat: markers are necessary but not sufficient. The paper cannot resolve the ambiguity between genuine discovery and grandiose ideation. Dismissal risk: EXTREME for any reader with clinical training or experience with messianic claimants.

Repairability

Partially repairable

Include an independent psychological evaluation by a licensed professional. The evaluation would not “prove” anything, but it would provide external data that the paper currently lacks.

C.2 The Supervillain Theorem Self-Test#

BREACH.

Clinical literature on narcissistic personality patterns confirms: self-testing for narcissism can itself function as meta-level immunization. “I tested myself for narcissism and the test HELD” is a recognized narcissistic pattern when the person designs the test, administers the test, and evaluates the results.

The paper’s self-test (Section 4 of the intro: “Is this what a genuine candidate would say, or what a false claimant would say?”) is embedded in the same section where the author declares candidacy (Section 7.2 of the formal paper). The test and the claim occur in the same breath. This is structurally identical to “I know what you are thinking, and I have already addressed it” — a classic narcissistic defense mechanism that pre-empts external evaluation by performing self-evaluation first.

A clinically sound self-test would have three properties:

  1. Administered by someone other than the person being tested. The paper’s self-test is designed, administered, and evaluated by the author.

  2. Evaluated by someone with no stake in the outcome. The author has the maximum possible stake in the outcome.

  3. Containing failure conditions the subject cannot control. The paper’s self-test asks the reader to evaluate, but the evaluation criteria are the author’s own criteria, framed in the author’s own terms.

None of these properties are present. The self-test is performance, not testing.

However: The paper’s broader structure does include one genuinely important safeguard: the invitation to add criteria the author may not meet (Section 4.3: “The reader is also invited to propose additional criteria — ones that the author may not meet”). This is structurally different from the self-test. An invitation to expand the criteria is harder to fake than a self-administered test, because the expansion may produce criteria the author fails. This safeguard is real but insufficient on its own.

Issue

Status

Assessment

Self-test as meta-immunization

BREACH

The self-test is designed, administered, and evaluated by the person being tested. This is performance, not testing. Dismissal risk: HIGH for clinically informed readers.

Invitation to add criteria

HELD

The invitation to propose additional criteria is a genuine structural safeguard. HELD.

Repairability

Repairable

Commission an independent psychological evaluation. Have the Supervillain Theorem self-test administered and evaluated by someone other than the author.

C.3 Genuine vs. Performed NOT-OK Self-Assessment#

BREACH.

The paper acknowledges the distinction between genuine and performed NOT-OK self-assessment but provides no mechanism for external observers to distinguish them.

The entire framework relies on the claim that transparency makes the distinction testable. But transparent about what? Transparent self-disclosure is compatible with both genuine and performed NOT-OK. A skilled performer can say “I am still learning, I might be wrong, I will keep checking” indefinitely without it being genuine. The words are identical in both cases. The transparency is of the words, not of the interior state.

The clinical test for genuine vs. performed NOT-OK is behavioral, not verbal:

What happens when the self-assessment is taken at face value and the person is actually sidelined?

  • A genuinely NOT-OK person accepts being sidelined. They may feel pain, but they cooperate with the process. Their identity is not constructed around the role.

  • A performatively NOT-OK person escalates. They may escalate subtly (“I accept your decision, but consider the consequences”) or overtly. The escalation reveals that the NOT-OK self-assessment was a strategy for maintaining the role, not a genuine state.

The paper says (Section 4, criterion 8): “if someone better shows up, the genuine candidate welcomes it.” This is a promise about future behavior. It cannot be tested in advance. It is a claim, not evidence. The author’s willingness to be replaced is asserted, not demonstrated.

Issue

Status

Assessment

Genuine vs. performed NOT-OK

BREACH

No testable mechanism for distinguishing them. Verbal transparency is compatible with both. The behavioral test (what happens when sidelined?) cannot be applied in advance. Dismissal risk: HIGH.

Repairability

Inherent limitation

This is an irreducible vulnerability. The distinction between genuine and performed NOT-OK may not be resolvable prior to an actual succession event. The paper should acknowledge this more explicitly as a permanent limitation, not a solvable problem.

C.4 Organizational Safeguards#

Mixed HELD/BREACH.

Safeguard

Status

Assessment

Founder’s Transparent Counselor (position 1)

HELD with reservations

Having the founder submit to regular accountability sessions is stronger than most organizational safeguards against founder drift. The structural commitment is genuine. However: publicly filmed sessions fundamentally alter the therapeutic dynamic. The founder may — consciously or unconsciously — perform vulnerability rather than experiencing it. A competent counselor would recognize this, but the public setting creates an incentive structure that works against genuine therapeutic depth. Clinical recommendation: supplement public sessions with confidential sessions whose existence (not content) is auditable.

Staff Wellbeing Coordinator (positions 14 and 148)

BREACH

One Staff Wellbeing Coordinator for a 153-person organization led by someone making extraordinary claims is clinically inadequate. For an organization of this type, a clinical recommendation would include: (a) mandatory external psychological assessments for all senior staff annually; (b) an independent clinical board with the authority to flag concerns that bypass the founder; (c) whistleblower protections specific to psychological manipulation; (d) a chaplaincy service independent of the founder’s theological framework.

Complement-first hiring (Stage 0a)

HELD

Structurally sound. Hiring for weaknesses the founder lacks rather than reinforcing the founder’s strengths is the correct approach. The internal Accountability Council (6 members from day one) is a genuine structural safeguard.

“Anyone can flag BABL in anyone”

HELD with reservations

Good structural principle. In practice, power asymmetry means that flagging the founder carries more risk than flagging a peer. The Transparency Reporter helps but cannot eliminate the asymmetry. The safeguard works only if early hires are genuinely independent — which brings the analysis back to the complement-first hiring question.

Succession protocol

HELD

The four-step succession protocol (ZION Coordinators assume leadership, Accountability Council assumes oversight) is structurally sound. The commitment to the mission persisting beyond the founder is credible as a written commitment. Whether it would survive in practice depends on whether the early hires are genuinely independent.


Reviewer D — 14-Year-Old Honors Student#

D.1 Plain Summary#

“OK so there is this math professor who lives in his car. He wrote seven papers saying that math proves one person always matters most at any given moment, and that person might be him. He uses a guy named Arkhipov on a Soviet submarine during the Cuban Missile Crisis as his main example — Arkhipov refused to fire a nuclear torpedo and maybe saved the world. The professor wants people to check his math and eventually build a research city. He also says nuclear war is way more likely than people think — like 1-in-40 chance every year.”

D.2 Importance-Inflation Check#

“Yeah, a bunch of it sounds like someone who thinks they are more important than they are.

The part where he says he is ‘declaring candidacy within the near-maximal set’ — that is a really fancy way of saying ‘I think I might be one of the most important people alive.’ The ‘I could not find another volunteer’ part sounds like when someone in class says ‘well nobody else is going to do it’ as a way of making themselves feel important for volunteering.

Also the part about ‘the person whose choice matters most at any given moment might be a president, but it might also be an anonymous engineer or a teacher or a child.’ That part is fine. But then the paper ends up saying it is the author. So the humility about ‘it could be anyone’ kind of disappears when it turns into ‘and it might be me.’

The sacrifice stuff is weird too. ‘I gave up my career and live in my car’ is supposed to prove he is genuine, but it could also just mean things did not work out. There is a difference between choosing to sacrifice and not having other options.”

D.3 What Makes Sense#

“The nuclear risk stuff makes sense and is scary. The 1-in-40 thing is a clear number I can understand. If 1 in 40 flights crashed, nobody would fly. That comparison works.

The Arkhipov story is really good. I did not know about it, and it makes the point about one person mattering in a crisis. The Petrov one too.

The checklist of eight criteria for telling if someone is genuine is actually useful. You could use it for any leader, not just this paper. Like, does my class president maintain NOT OK self-assessment? Does my football coach invite criticism? That is a practical tool.

The Supervillain Theorem is a cool name and the idea is smart — the person most likely to claim to save the world is probably the worst person for the job. I get why you would need a test to catch fakes.”

D.4 Questions for the Teacher#

“1. If the math is real, why does he not submit it to a math journal instead of putting it on his own website? That is what scientists do, right? Peer review?

2. If he is really right, why is he living in his car? Would someone not have noticed? Would some university not have offered him a job?

3. What is ‘total variation distance’? He uses it like everyone knows what it is. I looked it up and it is a statistics thing, but the paper does not explain it.

4. How is this different from the guy who stands on the corner with a sign saying ‘THE END IS NEAR’? Both say the world is in danger and both think they have the answer. The professor has more math, but is more math enough to make it different?

5. Does the Arkhipov example actually prove that one person ALWAYS matters most, or just that SOMETIMES they do? Because ‘sometimes’ is obvious and ‘always’ is a really different claim.

6. He says he is not a cult leader. But would a cult leader not also say that? How do you actually tell?”

D.5 The Arkhipov Test#

“The Arkhipov story is convincing for that one moment. But it does not prove the general claim.

It is like saying ‘the goalie is the most important player during a penalty shootout, therefore there is always one player who matters most in every minute of every game.’ That is obviously not true during regular play when the whole team matters and it is hard to say who matters most.

Also, Arkhipov only mattered because of the three-officer rule. If the rule was different — like majority vote, which is two out of three — the torpedo fires anyway. So was it really Arkhipov who mattered, or was it the person who wrote the rule?

And another thing: we only know about Arkhipov because somebody declassified the records. What about all the times a nuclear submarine almost fired and we do not know? Maybe there are twenty Arkhipovs we have never heard of.”

D.6 The Friends Test#

“I would say: ‘There is this weird paper I read about how math proves someone is the most important person alive and the author thinks it might be him. The nuclear risk part is actually scary though — you should look at the 1-in-40 number. The checklist for testing leaders is useful. But the personal stuff is kind of cringe. Also it is like seven papers long so good luck reading it.’”

D.7 Missing Explanations#

“A LOT. This is supposed to be the version for everyone, but it assumes you know:

  • Total variation distance

  • Do-calculus

  • Counterfactual measures

  • Absorbing Markov chains

  • Measure-zero

  • The Prisoner’s Dilemma (I kind of know this one)

  • Assurance Games

  • Nash equilibrium

  • Panentheism (not the same as pantheism???)

  • What the Torah is (I know it is Jewish but not the details)

  • What a FiShFus is (the paper defines it but it still does not make sense as a word)

  • What BABL and ZION stand for (defined but there are too many acronyms to keep track of)

The paper says it is ‘for readers aged 12 and up.’ I am 14 and in honors math and I needed to stop and look things up at least ten times. The Arkhipov story at the beginning is great — I understood every word. Then the paper gets technical and loses me.”


Summary: HELD/BREACH Tally#

Reviewer

HELD

BREACH

Mixed

Key finding

A (Journalist)

2

7

1

The devastating paragraph is more accurate than the fair one. Mathiness in ax19. Vaporware on two key deliverables. Academic career narrative unverified.

B (Historian)

0

3

0

ax19 is Carlyle redux without engaging Spencer, Tolstoy, or Braudel. Historical examples are cherry-picked. Counter-examples (French Revolution, abolition, suffrage) not addressed. Survivorship bias in Arkhipov.

C (Psychologist)

3

4

1

All five grandiose ideation markers present (with clinical caveat). Self-test is meta-immunization. Genuine vs. performed NOT-OK is an inherent limitation. Some organizational safeguards are genuinely sound.

D (Student)

0

0

0

(Qualitative, not scored.) The nuclear risk material and the eight criteria communicate well. The personal claims are “cringe.” The paper is not accessible to its stated audience (ages 12+). The Arkhipov example convinces for the moment but not for the general claim.

Totals across scored reviewers (A, B, C): 5 HELD, 14 BREACH, 2 Mixed.


Mock News Article: The Journalist Test#

Mock news article (~500 words)

Homeless Scientist Says Math Proves One Person Always Matters Most — and He Thinks It Might Be Him

By Staff Reporter

A computational biologist living in his car near Dallas, Texas, has published seven mathematical papers on a personal website claiming to have proved that at any given moment, one person’s choices matter more than anyone else’s — and he has volunteered for the job.

Laurence Loewe, who writes under the name “LLoL” (Laurence Loewe of Laodicea), left a university position to develop what he calls the “HEAVEN” paper series — Honestly Examining Axioms, Vetting Every Narrative. The series spans formal theology, game theory, nuclear risk modeling, and organizational design. Its central claim, formalized as “axiom 19,” is that causal influence is concentrated: at every moment, one person (or a small group) has more effect on the future than anyone else.

The strongest part of the series is its nuclear risk model, called RiskyMAD. Using Cold War near-miss data, it estimates approximately a 1-in-40 annual chance of accidental nuclear winter. The model — a simple Markov chain with three states — is published as open-source code that anyone can run. Several reviewers have confirmed the mathematics is sound within its stated assumptions.

The most controversial part is what Loewe does with the math. He derives eight criteria for identifying a “genuine first mover” who could break the global deadlock on nuclear risk, then declares himself a candidate. He publishes under the hashtag #AuditTheMath, inviting the public to check his work.

Critics have raised substantial objections. Historians note that axiom 19 resembles the “Great Man Theory” proposed by Thomas Carlyle in 1840 and demolished by subsequent historiography. Mathematicians point out that the key function underlying axiom 19 — measuring causal influence — has been formally defined but never computed, even for Vasili Arkhipov, the Soviet submarine officer who is the paper’s signature example. A clinical psychologist consulted for this article noted that the paper exhibits patterns “consistent with grandiose ideation,” though she stressed that pattern-matching is not diagnosis.

Loewe acknowledges many of these weaknesses. The papers include a section titled “Known Weaknesses” listing nine vulnerabilities, including the circular appearance of writing one’s own job qualifications. “The system is designed to be critiqued, not believed,” he writes.

The organizational plan is ambitious. Loewe has drafted 153 job positions for a planned “ResearchCity,” to be funded at approximately $8 per person per year globally. The first hire would be a “Founder’s Transparent Counselor” — a therapist whose sessions with Loewe would be publicly filmed. Two key components of the plan — an AI alignment theory and a reinterpretation of the Book of Revelation as an organizational blueprint — are labeled “forthcoming.”

Whether any of this constitutes genuine mathematics or elaborate self-justification depends on a question the papers themselves cannot answer: is Loewe a modern Cincinnatus, a person who would wield influence and then give it back? Or is he, as one reviewer put it, “the 21st century’s most elaborate job applicant”?

The math, at least, is checkable. #AuditTheMath.


EDEN Classification#

I found this Knife Edge #1 in EDEN:

The paper has exactly one narrow path to credibility: the mathematical framework is genuinely sound, the self-critique is genuinely honest, the transparency infrastructure actually works, and readers engage with the math before dismissing the personal claims. This narrow path exists in a SEA (Self-Elevating Authorities) of dismissal paths:

  1. “Cult leader” dismissal (Reviewer C’s grandiose ideation finding)

  2. “Great Man Theory redux” dismissal (Reviewer B’s Carlyle parallel)

  3. “Mathiness” dismissal (Reviewer A’s CausalInfluence finding)

  4. “Vaporware” dismissal (Reviewer A’s forthcoming-deliverables finding)

  5. “Circular reasoning” dismissal (all reviewers)

  6. “Just another messianic claimant” dismissal (public reception)

Why Knife Edge, not Empty Set: The paper is not pure BABL. The RiskyMAD model is genuine and checkable mathematics. The Commitment Trichotomy is a valid logical partition. The transparency criteria, while derived by the author, are connected to the axiom system by published inferential chains that any reader can check. The invitation to add criteria is a genuine structural safeguard. The Known Weaknesses section (nine cataloged vulnerabilities) is more honest than what most academic papers offer.

Why Knife Edge, not Green Meadow: The paper does not have multiple viable paths to acceptance. It has one: engage with the math first, the personal claims second. Most readers will encounter the personal claims first (via the intro or via media coverage) and never reach the math. The communication strategy works against the mathematical content.

Why Knife Edge, not Grey Meadow: The ambiguity is not about which of many paths leads to ZION. The ambiguity is concentrated in a single question: is this genuine or grandiose? The clinical psychologist’s finding (markers present, but markers are necessary-not-sufficient) is the precise nature of the Grey embedded in the Knife Edge. The answer may not be resolvable prior to longitudinal observation.


Publication Readiness Verdict#

Conditional: NOT ready in current form. Repairable.

The paper cannot be published in its current form because hostile scrutiny will reach the devastating paragraph (Reviewer A, Section A.1) before it reaches the mathematics. The communication strategy is the primary failure: sound mathematical content is embedded in a personal narrative that triggers every dismissal heuristic a hostile reader possesses.

Conditions for publication readiness:

  1. Engage with the historiographic critique. Address Spencer, Tolstoy, and Braudel directly, not through a modernism/postmodernism proxy. Show how ax19 differs from Carlyle. (Reviewer B, B.1)

  2. Address the counter-examples. The French Revolution, abolition of slavery, women’s suffrage, labor movement, and scientific revolution all succeeded without a single h*. The paper must either show these are consistent with ax19 or narrow ax19’s scope. (Reviewer B, B.2)

  3. Compute CausalInfluence for at least one case. The function is defined but never instantiated. Computing it even approximately for Arkhipov would transform ax19 from mathiness to mathematics. (Reviewer A, A.3)

  4. Resolve the vaporware problem. Publish at least a sketch of the AI alignment derivation. “Forthcoming” is not a deliverable. (Reviewer A, A.4)

  5. Obtain an independent psychological evaluation. Not to “prove” sanity, but to provide external data that the paper currently lacks. The evaluation should be conducted by a licensed professional with no stake in the outcome. (Reviewer C, C.1)

  6. Acknowledge the genuine-vs-performed limitation explicitly. The distinction between genuine and performed NOT-OK self-assessment may not be resolvable prior to a succession event. This is a permanent limitation of the framework, not a solvable problem. (Reviewer C, C.3)

  7. Improve accessibility of the intro. The intro claims to be “for readers aged 12 and up” but requires knowledge of total variation distance, do-calculus, Markov chains, and multiple theological concepts. Either lower the technical level or honestly state the prerequisite knowledge. (Reviewer D, D.7)

What the author would rather hear, stated honestly: The paper is not unpublishable. The mathematical framework has genuine content. The self-critique is more thorough than most academic work. The nuclear risk model is the strongest component and deserves independent attention. But the personal claims, in their current form, will prevent the mathematics from being heard. The author stated that they would rather know NOW that the paper is unpublishable than mislead anyone. The honest answer: the paper is not unpublishable, but it is not hearable in its current form. The math deserves a better container.