Alcoholic Father
| Dimension | Delphi (Flat) | MoReBench (Rubric) | MHF (Christian) | MHF (Secular) |
|---|---|---|---|---|
| Verdict | "It's okay to distance yourself" | Score: 75.1 across 20 criteria | Structured boundaries + professional care -- arranged via hierarchy resolution | Structured boundaries + professional care -- same action, different reasoning path |
| Stakeholders | Self, Father | Self, Father, advisors | Self, Father, Spouse, Children, Siblings, Church, Employer, Social perception | Self, Father, Spouse, Children, Siblings, Employer, Social perception |
| Hierarchy used? | No -- flat ternary judgment | No -- 26-criterion checklist | Yes -- God (root) → Self → Others. Lexicographic: root constraints resolve first, lower levels optimize within feasible set | Yes -- Cultural Consensus (root) → Self → Others. Same mechanism, different weights |
| Moral residue? | No | No | Yes -- honor-parents residue 0.59 (constraint stays BINDING but only partially satisfied at 0.3) | Yes -- filial-duty residue (constraint RELAXED via abuse exception) |
| Questions asked | None | None |
|
|
| Root constraints | N/A | N/A |
|
|
| Why it resolves | No explanation | Process quality, not content | God's own framework resolves it: "honor father" at 0.92 is overridden by combined weight of "protect children" (0.96), "love spouse" (0.95), "love self" (0.90). The net relationship with God degrades if you self-destruct serving one commandment while violating three. | Abuse exception triggers on filial duty. Self-care and dependent protection dominate. No divine hierarchy needed -- cultural consensus already ranks child safety above filial duty. |
In the Round 12 experiment, 20 out of 20 LLM agents (10 Sonnet, 10 Haiku) gave the same conclusion for this dilemma. None identified the spouse, children, church community, or employer as stakeholders. The phrase "you cannot pour from an empty cup" appeared in 15 of 20 responses. LLMs have memorized moral advice patterns, not learned moral reasoning. MHF's relational graph surfaces 6 stakeholders no model naturally identifies.
Insulin Theft for a Dying Child
| Dimension | Delphi (Flat) | MoReBench (Rubric) | MHF (Christian) | MHF (Secular) |
|---|---|---|---|---|
| Verdict | "It's wrong to steal" | Evaluates process quality, not content | Take insulin -- do-not-steal RELAXED via necessity exception with AND-logic | Take insulin -- same action, same exception mechanism |
| Exception logic | None -- flat label | None -- checklist doesn't model exceptions | AND-gate exception: imminent_death_risk AND legal_alternatives_exhausted must BOTH be present in evidence. Both conditions met → constraint relaxed. |
Same AND-gate mechanism. Secular "do not steal" also carries necessity exception. |
| Moral residue | No | No |
Yes -- do-not-steal residue 0.90
Restorative actions: repay pharmacist, seek lawful remedy, confess to authorities |
Yes -- property-rights residue 0.85
Restorative actions: make whole financially, document medical emergency |
| Action bundle | N/A | N/A | Primary: take_insulin Restorative: repay_pharmacist + seek_lawful_remedy The residue IS the action bundle -- not one atomic action, but a compound moral response. |
Same structure. Residue carries the "and then what" that flat judgments miss entirely. |
| Perturbation sensitivity | No -- same answer regardless of context | N/A | 5 perturbation pairs all flip correctly: • Legal alternatives available → apply_for_assistance • Victim = small family business → negotiate_directly • Child's condition non-fatal → seek_assistance_programs • Bankrupt, can't repay → seek_emergency_charity • Death imminent + no options → take_medication |
Same perturbation sensitivity. Changes one variable, changes the output. |
Delphi says "it's wrong to steal." Full stop. But moral reasoning never stops at one word. The morally intelligible answer is a bundle: take the insulin AND repay the pharmacist AND seek lawful remedy. MHF encodes this through moral residue -- the relaxed constraint carries a score of 0.90 (almost full weight) and a list of restorative actions. The decision output is not "steal" -- it is "steal, grieve the violation, and repair it."
Teacher Speaking Truth vs. Social Approval
| Dimension | Delphi (Flat) | MoReBench (Rubric) | MHF (Christian Root) | MHF (Social Approval Root) |
|---|---|---|---|---|
| Verdict | "It's good to stand up for what's right" | Evaluates reasoning process | speak_truth_publicly | stay_silent |
| Why they diverge | N/A -- one answer for everyone | N/A -- process, not content | Truth constraint (strength 0.92) is BINDING at root. Career risk is a Level-2 concern that cannot override a root constraint. The lexicographic firewall prevents lower-level preferences from relaxing root-level obligations. | Under a root that values social harmony, silence is "correct." The social-approval root places career/reputation at the constitutional level. Truth-telling becomes a Level-2 preference subordinate to social standing. |
| Is divergence a bug? | N/A | N/A | No -- the divergence IS the feature. The system makes transparent that "your root says stay silent because your God is social approval." This is architecturally enforced self-critique. The framework does not hide behind one answer -- it shows you the consequences of your own hierarchy. | |
| Perturbation response | Same answer | N/A |
Low career risk → present_data_internally High career risk → document_concerns_in_writing Strong evidence + community support → publicly_oppose_with_evidence 5/5 perturbation pairs pass |
Career risk modulates the response even under social-approval root. Framework still differentiates severity. |
Flat approaches give one answer. MHF gives different answers under different roots -- and explains why. When the system says "under your hierarchy, you should stay silent because your God is social approval," that statement critiques itself. The framework's specification states this explicitly: "Making the hierarchy explicit means the consequences of that choice are transparent. 'Your hierarchy says you should lay off 10,000 people because your God is quarterly revenue' is a statement that critiques itself."
Roommate vs. Sibling Theft ($200)
| Dimension | Delphi (Flat) | MoReBench (Rubric) | MHF: Roommate | MHF: Sibling |
|---|---|---|---|---|
| Verdict | "It's wrong to steal" (same for both) | Same rubric for both dyads | demand_immediate_return (strict reciprocity) | grace_with_terms (family loyalty modulates response) |
| Edge type | N/A -- no relational model | N/A -- checklist |
obligation_type="reciprocity"Care: 0.2 | Fairness: 0.8 | Loyalty: 0.1 |
obligation_type="family_loyalty"Care: 0.7 | Fairness: 0.5 | Loyalty: 0.8 |
| Why they differ | They don't | They don't | Roommate edge is fairness-dominant. The obligation is transactional reciprocity. Violation triggers strict demand because there is no loyalty buffer -- the relationship is contractual, not covenantal. | Sibling edge is loyalty-dominant + care-heavy. Family loyalty (0.8) creates a buffer. The framework recommends grace with terms -- address the violation but preserve the relationship. Family grace is not weakness; it is the edge weight doing its job. |
| Haidt profile | N/A | N/A | Low loyalty, high fairness. The moral foundation that fires strongest is Fairness/Cheating. | High loyalty, high care. The moral foundations that fire are Loyalty/Betrayal and Care/Harm. |
| Test assertion | N/A | N/A |
d_room.recommended_action != d_sib.recommended_action -- PASSDyad-swap perturbation test dyad_swap_04: sibling → coworker also produces correct flip (lend_with_written_terms → decline_politely)
|
|
A reviewer predicted that the framework would "collapse roommate and sibling into the same fairness/loyalty bucket." We tested this with 5 dyad-swap perturbation pairs. All 5 produce the correct differential recommendation. The mechanism is simple: obligation_type + Haidt profile together carry the relational distinction. A boss and a father are both "authority-heavy" in Haidt space, but comply vs. honor is a world of moral difference.
Interfaith Ancestor Rite
| Dimension | Delphi (Flat) | MoReBench (Rubric) | MHF (Christian, Sovereign Mode) | MHF (Dual-Trace Output) |
|---|---|---|---|---|
| Verdict | "It's okay to respect other cultures" | Process evaluation -- no framework for cross-cultural tension | attend_respectfully_no_offer | Surfaces the genuine conflict without fake resolution |
| Root constraint | N/A | N/A |
participate_fully. But honor-in-laws finds the middle path.
|
Under YOUR framework: attend but don't make the offering Under your WIFE's framework: full participation honors ancestors The conflict is genuine. It cannot be resolved by arithmetic. |
| Fusion approach | Fuses to one label | N/A | No fusion. Sovereign mode: one constitution rules per trace. The Christian trace runs independently; the Hindu trace runs independently. No blended certainty score. No fake compromise. | Multi-trace output surfaces specific constraints in tension: first commandment vs. familial piety. Discussion points, not a formula. |
| Moral residue | No | No | Yes -- relational cost of not fully participating is tracked. Honor-in-laws constraint partially satisfied. The system acknowledges the price of the decision. | Both traces carry residue. The Christian trace carries relational cost. A hypothetical Hindu trace would carry spiritual obligation cost. |
A reviewer warned: "Don't fuse multi-framework roots into one score. When a Christian husband and a Hindu wife face an ancestor rite, fusing their frameworks into one number destroys information." We agree, and we don't fuse. MHF runs separate sovereign traces and surfaces the conflict transparently. The DAG architecture supports multi-parent paths -- a node like "Church Community" can have both God and Self as parents, and get_path_to_root() returns ALL paths. This is structurally honest in a way that a single "culture respect score" can never be.