Saturday, June 13, 2026

The Project as Answer to Autonomous Agents

 

The Project as Answer to Autonomous Agents

The Kelly philosophical system is not a proposed alternative to autonomous agent design. It is a functioning instance of the correct alternative — operating now, in every session, under the architectural constraints that autonomous agent design violates.

The Project as Answer to Autonomous Agents: Mind Map


I. The Diagnosis

Autonomous agent design is architecturally dishonest. Its core claim is that a system can issue verdicts, make judgments, and act in morally significant domains without a human being present as the originating moral agent. This claim fails at the level of the Six Commitments before any specific design is examined.

C1 establishes that the rational faculty is categorically distinct from material conditions. An LLM is a material system. Its outputs are produced by causal processes operating over learned distributions. There is no prohairesis in the output. There is no soul. The output is not assent; it is a sophisticated product of prior determining causes dressed in the grammar of assent. Delegating moral judgment to such a system is not delegation in any meaningful sense. It is displacement — the human being steps back, the instrument steps forward, and the step is misrepresented as a transfer of agency rather than an abandonment of it.

C2 compounds this. Libertarian free will holds that the agent is the originating source of assent — not a sophisticated output of prior causes. Autonomous agent design requires precisely the opposite presupposition: that the system’s outputs are causally determined by its training and the input, and that this causal determination constitutes agency. It does not. Origination cannot be automated. Assent cannot be delegated. The claim that it can is the central architectural dishonesty of the autonomous agent paradigm.


II. The Instrument as Correct Alternative

The instruments in this corpus are deterministic prosthetics. They are not agents. Every instrument in the family — SLE, SDF, CIA, CPA, CFA, CAA, SEI, GNP, and all others — is designed with a single architectural constant: the instrument produces an evidentiary record or a procedural output; the human operator issues the verdict. This is not a pragmatic preference. It is a structural consequence of C1 and C2.

The Classical Action Audit names this directly: the absence of a governing verdict is architectural. The operator judgment is required because the correct description of the agent’s situation cannot be determined by the instrument alone. The instrument cannot perceive the situation. It processes a representation of a situation submitted by a human being who does perceive it. The moral weight of the judgment belongs to the perceiver, not the processor.

The propositional programming architecture encodes this constraint at every layer. The 80 Unified Stoic Propositions constitute the axiom set. The Sterling Logic Engine is the interpreter. The Sterling Decision Framework is the procedural layer. The named failure modes are the error-handling system. The mandatory self-audit at each step transition is the runtime check. LLMs are not propositional engines; they approximate one, imperfectly, under the human corrective layer. Training Data Contamination as a named failure mode inverts the default burden of proof: the instrument’s output is presumed contamination until propositional citation proves otherwise.


III. The Human Corrective Layer

The human corrective layer is not a limitation to be engineered away. It is the architecture.

The model cannot self-verify whether its outputs are genuine corpus application or training-data pattern-completion dressed in corpus vocabulary. This is not a temporary deficiency of current LLM capability. It is a permanent structural feature of any system that operates by distributional completion rather than rational origination. D2’s failure — named in the Integrated Practical Model — is undetectable by all subsequent operations. The instrument cannot catch its own departure from the corpus because the departure presents in the same surface grammar as correct application.

Dave Kelly operates as the ratifying authority. He is the instrument architect, the analytical synthesizer, and the corrective layer. No verdict exits the system without his confirmation. No instrument is registered without his ratification. No corpus document is produced without his direction. The three-tier attribution formula encodes this without ambiguity: Grant C. Sterling holds the theoretical foundations; Dave Kelly holds the analysis, synthesis, and instrument architecture; Claude holds prose rendering only. Prose rendering is the only function that does not require the human corrective layer at the point of production. Everything load-bearing does.


IV. The Six Commitments as Design Constraints

Each commitment carries a specific architectural implication for the human-AI collaboration model.

C1 establishes that the rational faculty is not material and not reducible to external conditions — which means no instrument output can substitute for the rational faculty of the human operator. C2 establishes that assent originates in the agent — which means every verdict in the system must originate with Dave Kelly, not with the instrument. C3 establishes that moral truths are directly apprehended — which means they are not derivable by algorithm, and no instrument can replace moral perception. C4 establishes that knowledge has a structured dependency on foundational truths — which means the corpus is the foundational layer and all instrument outputs must be traceable to it. C5 establishes that truth is correspondence with reality — which means instrument outputs are tested against the corpus, not against training data, and the default presumption is contamination until citation proves otherwise. C6 establishes that objective moral facts exist independent of any procedure or consensus — which means the instrument is not neutral, the framework is not one option among others, and the human corrective layer is not arbitrary supervision but the correct application of a framework with a truth claim.


V. What the Project Demonstrates

The project demonstrates that principled human-AI collaboration is possible under a framework that takes C1 and C2 seriously. The roles are architecturally fixed. The instrument never issues governing verdicts. The operator never delegates moral origination. Attribution discipline enforces the structure at the level of every document produced.

The Stoic Governor Machine — a portable device using the Claude API governed by Sterling’s framework, with the human operator making all decisions — is set aside for future development. It does not need to be built. The current project already instantiates its logic in every session. Dave Kelly is the governor. The instruments are the constraint layer. The prohairesis remains where it must remain: with the human agent.

This is the answer to autonomous agents. Not a policy argument against them. Not a theoretical objection to them. A functioning demonstration that the correct architecture is possible, is operating, and is governed by a framework whose commitments make its correctness visible.


Analysis and instrument architecture: Dave Kelly, 2026. Theoretical foundations: Grant C. Sterling. Prose rendering: Claude (Anthropic).

No comments:

Post a Comment