LifeOS: A Constitutional Cognitive Operating System for Human–AI Co-Agency
Related: Open Trust Infrastructure (DOI: 10.5281/zenodo.17940484)
Current AI systems conflate language generation, cognitive processing, and world-affecting action into undifferentiated pipelines. This architectural confusion creates fundamental safety, controllability, and trust issues that cannot be resolved through post-hoc alignment techniques.
This thesis proposes LifeOS, a constitutional cognitive operating system that enforces structural separation between linguistic generation, semantic interpretation, and execution. Drawing on Montesquieu's separation of powers, Peircean semiotics, and speech act theory, we demonstrate that reliable human–AI co-agency requires constitutional guarantees rather than behavioral fine-tuning.
Scope clarification: This thesis does not propose a general theory of intelligence, but a design framework for human–AI cognitive systems with explicit constitutional guarantees.
The central invariant is simple: meaning is not causation — no symbol can directly affect the world. This principle, implemented as an architectural constraint, ensures that the system remains trustworthy regardless of the correctness of any individual component.
Modern AI systems, particularly those built on Large Language Models, suffer from a fundamental architectural confusion:
| What is conflated | Why it matters |
|---|---|
| Language generation ↔ Understanding | Models produce text without semantic grounding |
| Prediction ↔ Reasoning | Token prediction is mistaken for logical inference |
| Response ↔ Action | Outputs can trigger world-affecting consequences |
This conflation leads to systems whose safety, controllability, and trustworthiness depend entirely on the correctness of their outputs.
Current "AI safety" approaches attempt to solve these problems through fine-tuning on human preferences (RLHF), Constitutional AI prompting, and output filtering. These approaches share a fatal flaw: they depend on the system being correct.
A system that requires correctness to be safe will eventually fail. The question is not if but when.
How can we design cognitive AI systems that remain structurally trustworthy regardless of the correctness of their linguistic outputs?
"Pour qu'on ne puisse abuser du pouvoir, il faut que, par la disposition des choses, le pouvoir arrête le pouvoir."
— Montesquieu, L'Esprit des lois (1748)
We transpose this principle to cognitive architecture:
| Political | Cognitive (LifeOS) |
|---|---|
| Legislative | Language generation (LLM) |
| Executive | ExecutorOS (action) |
| Judicial | Validation, constraints, traceability |
| Law | Invariant |
| Constitution | Architecture |
Key insight: The power to say is never the power to do.
The semiotic triangle provides the theoretical foundation.
Central principle: the relationship between signifier and referent is arbitrary and mediated. A symbol cannot directly cause world-state changes.
Following Austin and Searle, we distinguish locutionary acts (producing an utterance), illocutionary acts (the intent performed in uttering it), and perlocutionary acts (the effects the utterance produces in the world).
LifeOS architecturally separates these layers, ensuring that locutionary production (LLM output) never directly produces perlocutionary effects (world changes).
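A minimal sketch of this separation in Python (the names `Locution`, `Illocution`, `interpret`, and the validation rule are illustrative assumptions, not LifeOS interfaces): the model's output is inert data, and only a separate mediation step can turn it into something an executor will accept.

```python
from dataclasses import dataclass, field

@dataclass(frozen=True)
class Locution:
    """Raw model output: inert text with no capability to affect anything."""
    text: str

@dataclass(frozen=True)
class Illocution:
    """A validated intent extracted from a Locution by an explicit mediation step."""
    intent: str
    arguments: dict = field(default_factory=dict)

class Executor:
    """The only place where perlocutionary (world-affecting) effects occur."""
    def perform(self, act: Illocution) -> None:
        print(f"executing {act.intent} with {act.arguments}")

def interpret(utterance: Locution) -> Illocution | None:
    """Mediation: text becomes an intent only if it passes explicit checks."""
    if utterance.text.startswith("please "):          # placeholder validation policy
        return Illocution(intent=utterance.text.removeprefix("please "))
    return None                                        # otherwise the text stays inert
```

The point is the type boundary: a `Locution` exposes nothing that touches the world, so the only route from text to effect runs through `interpret` and then `Executor.perform`.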
LifeOS is a constitutional cognitive operating system comprising modular, separable components:
| Module | Function | Key Property |
|---|---|---|
| CognitionOS | Pattern recognition, language generation | No world access |
| MemoryOS | Episodic, semantic, procedural memory | Persistent identity |
| LearningOS | Skill acquisition, adaptation | Supervised updates |
| BehaviorOS | Action planning and sequencing | Intent validation |
| EmotionOS | Salience, orientation, attention | Non-decisive signals |
| Harmonia | Constitutional layer, semiotic firewall | Invariant enforcement |
| ExecutorOS | World-affecting action execution | Sole action point |
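A minimal sketch of how the key properties in the table could be expressed structurally (a sketch under assumed interfaces; the method names are not the published LifeOS APIs):

```python
class ExecutorOS:
    """Sole action point: every world-affecting call goes through this class."""
    def act(self, command: str) -> None:
        print(f"acting on: {command}")

class CognitionOS:
    """Pattern recognition and language generation; constructed with no reference
    to ExecutorOS, so 'no world access' is a property of the object graph."""
    def generate(self, prompt: str) -> str:
        return f"proposed response to: {prompt}"   # text only, no side effects

class Harmonia:
    """Constitutional layer: the only component that holds the executor capability."""
    def __init__(self, executor: ExecutorOS) -> None:
        self._executor = executor

    def submit(self, text: str) -> None:
        if self._conforms_to_invariant(text):
            self._executor.act(text)

    def _conforms_to_invariant(self, text: str) -> bool:
        return "forbidden" not in text             # placeholder for real validation
```

In this sketch, the remaining modules would sit on the non-executing side of the boundary; only Harmonia is handed the `ExecutorOS` capability.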
Harmonia implements the core invariant through a layered processing pipeline in which each layer receives only the output of the layer before it.
Critical constraint: No layer can bypass another. The LLM (layer 1) can never directly invoke ExecutorOS (layer 5).
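A sketch of how the no-bypass constraint could be enforced mechanically (only layers 1 and 5 are named in the text; the intermediate stages below are placeholders): each stage receives only the previous stage's output, and a rejection anywhere halts processing before it reaches the action point.

```python
from typing import Callable, Optional

Stage = Callable[[str], Optional[str]]

def execute(command: str) -> str:
    """Layer 5 stand-in: ExecutorOS, the sole world-affecting step."""
    print(f"EXECUTE: {command}")
    return command

PIPELINE: list[Stage] = [
    lambda s: s,                                    # layer 1: LLM output enters as text
    lambda s: s if s.strip() else None,             # layer 2: placeholder sanity check
    lambda s: s.lower(),                            # layer 3: placeholder normalization
    lambda s: s if "forbidden" not in s else None,  # layer 4: placeholder constraint check
    execute,                                        # layer 5: ExecutorOS
]

def run(text: str) -> None:
    """Stages run strictly in order; a None halts the pipeline, so layer 1 output
    can never reach layer 5 without passing every intermediate layer."""
    value: Optional[str] = text
    for stage in PIPELINE:
        if value is None:
            return
        value = stage(value)
```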
Meaning is not causation.
No symbol acts directly on the world.
The power to say is never the power to do.
This invariant is architectural rather than behavioral: it holds by construction of the system, independently of what any individual component outputs.
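Because the invariant is a property of structure rather than of behavior, it can in principle be audited mechanically. A toy illustration of such an audit (an assumption about how a check could look, not the LifeOS verification method): refuse to start if the language-generating component can reach the executor through any chain of references.

```python
def reachable(root: object) -> set[int]:
    """Ids of objects reachable from `root` via instance attributes."""
    seen: set[int] = set()
    stack = [root]
    while stack:
        obj = stack.pop()
        if id(obj) in seen:
            continue
        seen.add(id(obj))
        stack.extend(vars(obj).values() if hasattr(obj, "__dict__") else [])
    return seen

def assert_separation(cognition: object, executor: object) -> None:
    """Fail fast if the text-producing component holds a path to the action point."""
    if id(executor) in reachable(cognition):
        raise RuntimeError("constitutional violation: CognitionOS can reach ExecutorOS")
```

Applied to the earlier sketch, `assert_separation(CognitionOS(), ExecutorOS())` passes, and it fails the moment an executor reference is wired into the cognition side.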
This work makes the following contributions:
- Redefining AI systems as cognitive operating systems rather than applications or agents.
- Applying political philosophy (the separation of powers) to AI architecture.
- A complete, executable architecture, LifeOS.
- The "alter ego" paradigm for human–AI co-agency. The term "alter ego" is used strictly as a Human–Computer Interaction metaphor, not as a psychological or cognitive duplication model.
- Demonstrating the path from principle to implementation.
Following Hevner et al. (2004), this thesis employs a design science methodology.
Key architectural properties, in particular the no-bypass constraint, will be verified through scoped formal verification and comparative evaluation (see the roadmap below).
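As one illustration of the kind of property in scope (our phrasing, not a formalization taken from the thesis), the no-bypass constraint can be stated as a trace property: every action executed by ExecutorOS must be preceded by a Harmonia validation that authorizes it,

$$\forall a \in \mathrm{Act}_{\mathrm{ExecutorOS}} \;\; \exists v \in \mathrm{Val}_{\mathrm{Harmonia}} :\; v \prec a \;\wedge\; \mathrm{authorizes}(v, a),$$

where $\prec$ denotes precedence in the execution trace.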
| Component | Status |
|---|---|
| Theoretical framework | Complete |
| Core architecture design | Complete |
| Harmonia implementation | Production |
| ExecutorOS | Production |
| MemoryOS | Production |
| EduOS (educational suite) | Production |
| Documentation (Constitution, Invariant) | Complete |
| Task | Estimated Duration |
|---|---|
| Scoped formal verification | 6 months |
| Comparative evaluation | 4 months |
| User studies | 4 months |
| Thesis writing | 6 months |
"Words can contradict each other. The world cannot."
"I build systems that protect humans from their own ideas."
"Trust does not rest on intentions, but on structure."
"I delegate execution, never sovereignty."
Final thesis statement:
This work argues that trustworthy human–AI systems should not rely on correct behavior, but on architectures that make harmful behavior structurally impossible.
Document version: 1.1 — December 2025
ivan-berlocher.github.io