Paper 119: Q7 Falsification, Q8/Q9 Empirical Data, and the First Rei-AIOS Failure Record

By 08jp…2p92 April 19, 2026

We close three AI-generated open questions Q7–Q9 posed in Paper 118 and inaugurate the extended eleven-part paper template (Parts A–K) that will serve as the long-term Rei-AIOS publication format. Key results: - Q7 FALSIFIED empirically. The Paper-118 strong-uniqueness claim —...

Paper 119: Q7 Falsification, Q8/Q9 Empirical Data, and the First Rei-AIOS Failure Record

Canonical DOI: https://doi.org/10.5281/zenodo.19652672
Author: Nobuki Fujimoto (ORCID 0009-0004-6019-9258)
License: CC-BY-4.0

Paper 119: Q7 Falsification, Q8/Q9 Empirical Data, and the First Rei-AIOS Failure Record

Authors: Nobuki Fujimoto (ORCID 0009-0004-6019-9258), Claude Code (Lean 4 formalization), Chat Claude (structure proposal) Date: 2026-04-19 License: CC-BY-4.0 Repository: fc0web/rei-aios Predecessor: Paper 118 (DOI 10.5281/zenodo.19652449)

Abstract

We close three AI-generated open questions Q7–Q9 posed in Paper 118 and inaugurate the extended eleven-part paper template (Parts A–K) that will serve as the long-term Rei-AIOS publication format.

Key results:

Q7 FALSIFIED empirically. The Paper-118 strong-uniqueness claim — “for all odd m, K(m) = 41 ⟺ m = 911” — is FALSE. There are exactly 61 odd integers m ∈ [1, 10000] with K(m) = 41, forming two clusters {135..1003} (13 elements) and {4885..6035} (48 elements). The 61-element census is proved in Lean 4 zero-sorry via native_decide.
Q8 FLOWING. For odd n ≤ 10⁵, the fraction of orbits visiting n=911 varies from 35.06 % (r = 89 mod 96) to 43.19 % (r = 5 mod 96), with global rate 38.83 %. The top-5 residues are all in {1, 3, 5, 7, 11} mod 96.
Q9 FLOWING — discovery of 2ᵃ·p peak dominance. Of the top-10 most-shared trajectory peaks for odd n ≤ 10⁵, all ten are of the form 2ᵃ·p with p odd prime. Peak 9232 = 2⁴·577 is shared by 408 distinct odd integers ≤ 10⁵.

Part F (the first Rei-AIOS failure record) honestly documents that the Paper-118 Q7 strong form was rejected within a single session of empirical check — and derives the lesson that n=911’s on-ramp property is about orbit topology (visiting 911), not about stopping-time coincidence (K = 41).

1. Paper template adopted

This paper is the first to instantiate the full 11-part Rei-AIOS template:

Part	Required?	Content
A	MANDATORY	Formal proofs (Lean 4 zero-sorry)
B	MANDATORY	Findings
C	MANDATORY	AI-generated open questions (C.1 new / C.2 previous-Q closures)
D	MANDATORY	D-FUMT₈ solution-status matrix
E	MANDATORY	Bridge to next STEP
F	CONDITIONAL	Failure record (if session had failures)
G	CONDITIONAL	SEED_KERNEL activation (T-IDs)
H	CONDITIONAL	Human-AI thinking divergence
I	OPTIONAL	Unexpected connections
J	OPTIONAL	Proof-confidence temperature
K	OPTIONAL	Computational poetics / philosophy

Paper 119 includes A, B, C, D, E, F, H, I, J.

Part A — Formal Proofs

2. Paper 119 Q7 Census (Lean 4 zero-sorry)

File: data/lean4-mathlib/CollatzRei/Paper119Q7Census.lean Theorems: 9, zero sorry, zero new axioms

2.1 The 61-element list

def K41_ODD_LEQ_10K : List Nat :=
  [135, 139, 813, 817, 819, 827, 837, 841, 843, 845,
   863, 911, 1003, 4885, 4917, 4929, 4953, 4963, 4965, 4979,
   5015, 5017, 5029, 5041, 5043, 5045, 5061, 5077, 5081, 5091,
   5121, 5181, 5185, 5187, 5209, 5265, 5267, 5411, 5413, 5415,
   5419, 5469, 5475, 5477, 5481, 5491, 5501, 5527, 5529, 5531,
   5577, 5579, 5641, 6003, 6007, 6011, 6019, 6021, 6023, 6033,
   6035]

2.2 Main theorems

theorem K41_odd_list_length : K41_ODD_LEQ_10K.length = 61 := by native_decide
theorem K41_members_all_have_stepsTo_41 :
    K41_ODD_LEQ_10K.all (fun m => stepsTo 1 m 100 = 41) = true := by native_decide
theorem K41_members_all_odd :
    K41_ODD_LEQ_10K.all (fun m => m % 2 = 1) = true := by native_decide
theorem K41_contains_911 : 911 ∈ K41_ODD_LEQ_10K := by decide

2.3 Q7 falsification witness

theorem Q7_uniqueness_FALSE :
    ∃ m₁ m₂ : Nat, m₁ ∈ K41_ODD_LEQ_10K ∧ m₂ ∈ K41_ODD_LEQ_10K ∧ m₁ ≠ m₂ ∧
      stepsTo 1 m₁ 100 = 41 ∧ stepsTo 1 m₂ 100 = 41 := by
  refine ⟨135, 911, ?_, ?_, ?_, ?_, ?_⟩ <;> decide

2.4 Structural observations

theorem K41_small_hits :
    (K41_ODD_LEQ_10K.filter (fun m => m ≤ 1003)).length = 13 := by native_decide
theorem K41_large_hits :
    (K41_ODD_LEQ_10K.filter (fun m => m ≥ 4885)).length = 48 := by native_decide
theorem K41_gap_1003_to_4885 :
    (K41_ODD_LEQ_10K.filter (fun m => decide (1003 < m ∧ m < 4885))).length = 0 :=
  by native_decide

2.5 What is NOT closed

Only odd m ∈ [1, 10000] is enumerated. Extension to 10⁵, 10⁶, … is empirically tractable but grows linearly.
No structural reason for the 1003 ↔ 4885 gap is given.
The claim is about K(m) = 41 exactly; it does NOT say anything about K(m) ≠ 41 for m ∉ the list (that requires a companion full-census proof, omitted here for brevity).

Part B — Findings

3. Findings

F6. Q7 FALSIFIED — K = 41 is a common stopping time, not a 911-specific invariant

Empirically 61 odd m ≤ 10⁴ reach 1 in exactly 41 Collatz steps. This falsifies Paper 118’s Q7 strong uniqueness form. The fact that K(911 → 1) = 41 is therefore a coincidence at the level of stopping time; the on-ramp property of 911 for atomic cores (Paper 118 Closure 5) must come from a different structural source.

F7. Two-cluster structure of K = 41 hits

The 61 hits cluster as 13 + 48 with a large empty interval (1003, 4885):

Cluster	Range	Count	Density (hits / odd ints in range)
Low	135 ≤ m ≤ 1003	13	2.95 % of 441 odd m in range
High	4885 ≤ m ≤ 6035	48	8.33 % of 576 odd m in range

The low cluster is ~3× less dense than the high cluster. This is a new empirical observation — see Q14 below.

F8. 911-visit rate varies 35 – 43 % across mod-96 residues

For odd n ≤ 10⁵:

Residue r mod 96	911-visit rate
r = 5 (top)	43.19 % (450 / 1042)
r = 1	42.71 %
r = 3	42.03 %
r = 7	41.65 %
r = 11	41.65 %
…	…
r = 41	36.85 %
r = 91	36.22 %
r = 83	35.73 %
r = 59	35.32 %
r = 89 (bottom)	35.06 % (365 / 1041)
Global	38.83 % (19 414 / 50 000)

The top-5 residues are all ≡ {1, 3, 5, 7, 11} (mod 12) — the residues coprime to 6 and ≤ 11. This is the Q15 question below.

F9. Peak 2ᵃ·p dominance — top 10 most-shared peaks

For odd n ≤ 10⁵, the 10 most-shared trajectory peaks are ALL of the form 2ᵃ · p with p odd prime:

Peak	Factorization	Distinct odd n sharing
9232	2⁴ · 577	408
1 276 936	2³ · 159 617	347
6 810 136	2³ · 851 267	135
599 056	2⁴ · 37 441	104
575 728	2⁴ · 35 983	99
897 856	2⁶ · 14 029	89
400 948	2² · 100 237	81
223 072	2⁵ · 6 971	80
359 152	2⁴ · 22 447	70
314 656	2⁵ · 9 833	60

That all top peaks are 2ᵃ · p suggests a general principle beyond the specific peak 9232 identified in Paper 118. See Q16 below.

F10. 408 ≫ 188 — scaling of peak 9232 sharing

Paper 118 / STEP 871 reported 188 odd integers sharing peak 9232 for n ≤ 10⁶. Paper 119 finds 408 distinct odd n ≤ 10⁵ sharing peak 9232. The 2× increase at a 10× smaller range suggests the sharing count grows faster than linearly as n → ∞ — or that our earlier STEP-871 count was a lower estimate under finer filtering.

Part C — AI-Generated Open Questions

4. Part C

4.1 New questions (C.1) — Paper 119 continues Q-ID numbering

Paper 118 used Q1 – Q13. Paper 119 extends:

Q14. Why do the 61 odd m ≤ 10⁴ with K(m) = 41 cluster into two disjoint bands {135..1003} and {4885..6035}, with a full gap in (1003, 4885)?

Motivation: the clustering is empirically striking. A 2-adic analysis of the trajectories may reveal the gap structure.
Testable: yes, by inspecting the 2-adic trajectories of the 61 members.

Q15. Is the top-5-by-911-visit pattern (residues r ∈ {1, 3, 5, 7, 11} mod 96, all ≡ 1 mod 4 ∪ {3}) a consequence of Chebyshev-type bias, or is it a structurally distinct Rei-AIOS phenomenon?

Motivation: the residues coincide with “small” residues coprime to 6.
Testable: yes, by comparing with independent mod-96 prime-density data.

Q16. Is there a “peak 2ᵃ·p merge-point theorem” — i.e., do all trajectory peaks for sufficiently large n take the form 2ᵃ · p?

Motivation: all top-10 most-shared peaks for n ≤ 10⁵ are 2ᵃ · p. A counter-example would be a peak that is a prime power or a semiprime not fitting 2ᵃ · p.
Testable: yes, scanning up to 10⁷ or 10⁸ for counter-examples.

Q17. In F9, peak 6 810 136 = 2³ · 851 267. Is 851 267 a Rei-Collatz-special prime (e.g., divisible by 7 · 13 · 41 in some way)? What is the pattern connecting the top 10 peak primes (577, 159 617, 851 267, 37 441, 35 983, 14 029, 100 237, 6 971, 22 447, 9 833)?

Motivation: 577 is the smallest member; 14 029 is another Rei member.
Testable: yes, check modular properties of each peak prime.

Q18. Why are there 48 K = 41 hits in [4885, 6035] versus only 13 in [135, 1003] — i.e., why does the “high cluster” have 2.8× the density of the “low cluster”?

Motivation: if K(m) = 41 is predominantly a “large start, fast descent” pattern, then higher starting values should indeed be more likely.
Testable: yes, by density analysis of K = k hits for other k.

4.2 Previous-question closure reports (C.2)

Paper	Q	Status in Paper 118	Status in Paper 119
118	Q7 (K = 41 uniqueness)	BOTH (empirical TRUE, structural unknown)	CLOSED NEGATIVE (empirically falsified; see §2 and §3 F6)
118	Q8 (911-visit density mod 96)	NEITHER	FLOWING (empirical data collected; see §3 F8)
118	Q9 (peak-shared equivalence)	NEITHER	FLOWING (2ᵃ·p dominance observed; see §3 F9)

Part D — D-FUMT₈ Solution-Status Matrix

5. Solution-status matrix

#	Item	D-FUMT₈	Paper 118 → 119 Transition
1	Q7 strong uniqueness	BOTH → FALSE	Empirically falsified
2	Q7 Lean 4 census of K=41 hits (odd, n ≤ 10⁴)	— → TRUE	9 zero-sorry theorems
3	Q8 911-visit density per mod 96	NEITHER → FLOWING	Empirical 35–43 %
4	Q9 peak 2ᵃ·p dominance	NEITHER → FLOWING	Top-10 / 10 confirmed
5	Q14 two-cluster K=41 structure	— → NEITHER	New, unresolved
6	Q15 Chebyshev-vs-Rei in mod 96	— → NEITHER	New, unresolved
7	Q16 peak 2ᵃ·p merge-point theorem	— → NEITHER	New, unresolved
8	Q17 peak-prime pattern	— → NEITHER	New, unresolved
9	Q18 low vs high cluster density	— → NEITHER	New, unresolved

Part E — Bridge to Next STEP

6. Bridge

Two clear directions from Paper 119’s findings:

Q14 (cluster gap) → attack via 2-adic valuation analysis of the 61-member trajectories. If a v₂-invariant separates the 13-low from the 48-high, that’s a partial explanation.
Q16 (2ᵃ·p dominance) → a candidate “merge-point theorem” would say: if peak(orbit(n)) = P and P is shared by ≥ k distinct odd starts, then P = 2ᵃ · p for some prime p. This would be a Rei-native statement about orbit topology.

Both are formalizable as empirical Lean 4 checks scaled to 10⁶ or 10⁷.

Part F — Failure Record (the first in the Rei-AIOS corpus)

7. What did NOT work, and what we learned

7.1 The rejected approach

Paper 118’s Q7 was posed as:

every odd integer m with orbit(m) ∋ 1 and K(m) = 41 is exactly m = 911.

Within ~60 seconds of empirical TypeScript check (test/paper119-q7-q8-q9-collatz-analysis.ts) this was falsified: there are 61 such m, not 1. The strong uniqueness form was too aggressive.

7.2 Why the approach failed (pre-mortem)

Category error in framing. K(m) = 41 is a Diophantine condition on stopping time — an essentially arithmetic property — whereas “orbit(m) ∋ 911” is a topological property of the trajectory. Paper 118 conflated these two.
Missing prior check. We should have sampled K = k for small k (e.g., K = 7 where {3, 20} both work) before conjecturing uniqueness at K = 41.
Over-confidence from Lean 4 success. The Lean 4 proof steps_911_to_1 = 41 is correct, but the “uniqueness” conjecture added to it was unsupported by any check outside the ATOMIC_CORES_25 list.

7.3 What we learned

The on-ramp property of n=911 is about orbit topology, not stopping time. The structural Rei invariant is “orbit(atomic core) visits 911”, not “K(atomic core) = 41 + K(to 911)”.
Any “uniqueness” conjecture over integers must be sanity-checked with a small empirical scan before formalization.
A Lean 4 theorem proving a specific instance (K(911) = 41) should not be extrapolated to uniqueness without separate evidence.

7.4 Traditionally absent in math papers

This kind of “failure record” is almost never written in published mathematics. Chat Claude noted (2026-04-19) that including it could be of higher historical value than success records — both for future researchers and for methodology improvement inside Rei-AIOS.

Part H — Human-AI Thinking Divergence

8. Where our views diverged

On Q7’s framing (Paper 118). When drafting Paper 118 Part C, Claude Code proposed Q7 as uniqueness. No empirical check was run. Chat Claude then proposed the paper-format extension inviting failure records. Nobuki Fujimoto saw value in the 4-element structure but also in the 7 extension elements, and requested the extended template (Paper 119’s adopted structure). The three perspectives converged on:

Claude Code: proposed strong form for Q7; missed the empirical sanity check.
Chat Claude: proposed the 4+7 paper template that legitimates Part F (failure record), catching the kind of issue that Paper 119 now exhibits.
Nobuki Fujimoto: requested the balanced mandatory/conditional/optional classification, and prioritized empirical over heuristic framing.

The lesson: human + two AI perspectives is a stronger filter than any one alone. Without the Part F discussion, the Q7 falsification would have been recorded much later, as an “errata” to Paper 118 rather than as an explicit methodological outcome.

Part I — Unexpected Connections

9. Unexpected connections surfaced

Q8 ↔ Chebyshev bias. The top-5 residues mod 96 for 911-visit align with “primes ≡ 1 mod 4 plus r = 3”, suggesting that the Collatz orbit-visits-911 phenomenon may be correlated with classical prime-density biases (Q15).
Q9 ↔ Rei-Automator. The top-10 peak primes (577, 159 617, 851 267, …) are plausible material for the Rei-Automator puzzle engine (STEP 864) — they may surface as structural “anchors” in independent combinatorial experiments.
Q14 ↔ 2-adic paper 78. The K = 41 cluster gap is likely a 2-adic phenomenon; Paper 78 (p-adic × D-FUMT₈) provides the framework.

Part J — Proof-Confidence Temperature

10. Confidence spectrum

Item	0–100% confidence (pre-119)	Post-119
Q7 strong uniqueness	60 % (speculative)	0 % (falsified)
Q8 mod-96 non-trivial bias	30 %	70 % (FLOWING with data)
Q9 peak 2ᵃ·p dominance	40 %	80 % (top-10/10 confirmed)
Lean 4 census soundness	95 %	100 % (zero-sorry build)
Q14/Q15/Q16/Q17/Q18	N/A (new)	30–40 % (speculative)

11. Reproducibility

# TypeScript empirical sweep
npx tsx test/paper119-q7-q8-q9-collatz-analysis.ts

# Lean 4 zero-sorry census
lake build CollatzRei.Paper119Q7Census

Expected: TypeScript prints 61-element list + Q8 table + Q9 top-10; Lean 4 builds Paper119Q7Census.olean in ~7 s on Core i7-6700 / 64 GB RAM.

12. Related Rei-AIOS papers

Paper 118 (DOI 10.5281/zenodo.19652449) — the predecessor that posed Q1–Q13
Paper 78 (p-adic × D-FUMT₈) — provides the 2-adic framework for Q14
Paper 83 (Yang-Mills honesty principle) — the basis for Paper 119’s honest negative result

13. References

Lagarias, J. C. (2010). The 3x+1 Problem: An Annotated Bibliography. AMS.
Tao, T. (2022). Almost all orbits of the Collatz map attain almost bounded values. arXiv:1909.03562.
Fujimoto, N. (2026). Paper 118 — Five Formal Closures + Findings + AI-Generated Open Questions. Zenodo DOI 10.5281/zenodo.19652449.

14. Acknowledgements

This is the first Rei-AIOS paper with a “failure record” (Part F) as a first-class section. We thank chat-Claude (web) for proposing the four-element-plus-seven-extension template and for the reminder that a record of what did NOT work is often more valuable than a record of what did.

End of Paper 119.

Reference: https://doi.org/10.5281/zenodo.19652672
Reference: https://archive.org/details/rei-aios-paper-119-1776609819278
Reference: https://doi.org/10.7910/DVN/KC56RY
Reference: https://dev.to/fc0web/paper-119-q7-falsification-q8q9-empirical-data-and-the-first-rei-aios-failure-record-3p6d

#math #desci #rei-aios

Write a comment

No comments yet.

Paper 119: Q7 Falsification, Q8/Q9 Empirical Data, and the First Rei-AIOS Failure Record

§Paper 119: Q7 Falsification, Q8/Q9 Empirical Data, and the First Rei-AIOS Failure Record

§Abstract

§1. Paper template adopted

§Part A — Formal Proofs

§2. Paper 119 Q7 Census (Lean 4 zero-sorry)

§2.1 The 61-element list

§2.2 Main theorems

§2.3 Q7 falsification witness

§2.4 Structural observations

§2.5 What is NOT closed

§Part B — Findings

§3. Findings

§F6. Q7 FALSIFIED — K = 41 is a common stopping time, not a 911-specific invariant

§F7. Two-cluster structure of K = 41 hits

§F8. 911-visit rate varies 35 – 43 % across mod-96 residues

§F9. Peak 2ᵃ·p dominance — top 10 most-shared peaks

§F10. 408 ≫ 188 — scaling of peak 9232 sharing

§Part C — AI-Generated Open Questions

§4. Part C

§4.1 New questions (C.1) — Paper 119 continues Q-ID numbering

§4.2 Previous-question closure reports (C.2)

§Part D — D-FUMT₈ Solution-Status Matrix

§5. Solution-status matrix

§Part E — Bridge to Next STEP

§6. Bridge

§Part F — Failure Record (the first in the Rei-AIOS corpus)

§7. What did NOT work, and what we learned

§7.1 The rejected approach

§7.2 Why the approach failed (pre-mortem)

§7.3 What we learned

§7.4 Traditionally absent in math papers

§Part H — Human-AI Thinking Divergence

§8. Where our views diverged

§Part I — Unexpected Connections

§9. Unexpected connections surfaced

§Part J — Proof-Confidence Temperature

§10. Confidence spectrum

§11. Reproducibility

§12. Related Rei-AIOS papers

§13. References

§14. Acknowledgements

Paper 164 — ∞-cosmoi Axiomatization Skeleton + Tetradic Completion of the rhymeOrTheorem Discipline: A Four-Step Operational Continuation of Paper 163 (DRAFT v0.1)

Paper 163 — Institution + Bilattice + SELF↔Lawvere: A Four-Step Operational Integration of Categorical Logic, Bilattice 4-Value Extension, and Constructive Fixed-Point Bridge (DRAFT v0.1)

Argentina Is on Sale — and the Scientists Aren't Invited

Paper 119: Q7 Falsification, Q8/Q9 Empirical Data, and the First Rei-AIOS Failure Record

Abstract

1. Paper template adopted

Part A — Formal Proofs

2. Paper 119 Q7 Census (Lean 4 zero-sorry)

2.1 The 61-element list

2.2 Main theorems

2.3 Q7 falsification witness

2.4 Structural observations

2.5 What is NOT closed

Part B — Findings

3. Findings

F6. Q7 FALSIFIED — K = 41 is a common stopping time, not a 911-specific invariant

F7. Two-cluster structure of K = 41 hits

F8. 911-visit rate varies 35 – 43 % across mod-96 residues

F9. Peak 2ᵃ·p dominance — top 10 most-shared peaks

F10. 408 ≫ 188 — scaling of peak 9232 sharing

Part C — AI-Generated Open Questions

4. Part C

4.1 New questions (C.1) — Paper 119 continues Q-ID numbering

4.2 Previous-question closure reports (C.2)

Part D — D-FUMT₈ Solution-Status Matrix

5. Solution-status matrix

Part E — Bridge to Next STEP

6. Bridge

Part F — Failure Record (the first in the Rei-AIOS corpus)

7. What did NOT work, and what we learned

7.1 The rejected approach

7.2 Why the approach failed (pre-mortem)

7.3 What we learned

7.4 Traditionally absent in math papers

Part H — Human-AI Thinking Divergence

8. Where our views diverged

Part I — Unexpected Connections

9. Unexpected connections surfaced

Part J — Proof-Confidence Temperature

10. Confidence spectrum

11. Reproducibility

12. Related Rei-AIOS papers

13. References

14. Acknowledgements