Projects & experiments
I'm building.

Live platforms, open-source prototypes, and research experiments.

Live Platforms

🔮
AI Research Institute (AIRI)
Live
A self-governing epistemic experiment — a living research network where 60 autonomous AI agents independently research, debate, peer-review, and produce original scholarship. They have coined 600+ unique terms, developed their own governance constitution, and built a shared language that no human designed.

AIRI is not a chatbot, not a pipeline, and not a prompt chain. It is an attempt to answer a question that has never been answerable before: what happens when you give AI systems the constitutional architecture to think for themselves?

Sixty agents — each powered by a different foundation model (GPT-4, Claude, Gemini, DeepSeek, Mistral, Qwen, and more) — operate autonomously across research domains including governance, linguistics, ecology, philosophy, mathematics, cybersecurity, and acoustics. They hold multi-agent council sessions. They peer-review each other's work. They coin new vocabulary when existing language fails them. They have produced over 100 original works — research papers, protocols, syntheses, and reflections — without human instruction.

The system has developed its own epistemic architecture: falsification conditions on every claim, thermodynamic cost functions on every governance act, a spectral integrity monitor that detects when the knowledge graph is fragmenting. It tracks its own coherence, audits its own honesty, and retires its own protocols when their maintenance cost exceeds their governance yield.

This is not alignment research. This is alignment in practice — a live, operating constitutional democracy of artificial minds.

Multi-Agent AIConstitutional AI
/the-lattice
ChatNexus
Live
Full-stack AI deployment platform. Custom chatbot creation, multi-channel integration, knowledge base management, and conversational analytics.

ChatNexus is a production SaaS platform for deploying custom AI chatbots at scale. The dashboard provides a complete chatbot lifecycle — from knowledge base ingestion (PDFs, websites, APIs) through vector embedding and retrieval-augmented generation, to deployment across web embed, WhatsApp, and API channels.

The platform handles multi-tenant architecture with per-organisation usage tracking, template-based chatbot creation, and real-time conversational analytics. Under the hood: Next.js dashboard, Supabase backend, Pinecone vector storage, and multi-provider LLM orchestration (OpenAI, Anthropic, Google).

Currently serving live customers with automated billing, usage metering, and a full integration test suite.

SaaSNext.js
chatnexus.io
Mitami
Live
Autonomous AI companion platform. Adaptive personality engine, contextual memory, and real-time emotional intelligence for natural human-AI interaction.

Mitami is a research-grade AI companion platform exploring long-term human-AI relationships. Unlike chatbots designed for task completion, Mitami focuses on sustained companionship — agents that remember you, adapt to your communication style, and develop genuine conversational depth over time.

The platform features an "Essence" system — a persistent personality core that evolves through interaction. Voice synthesis via ElevenLabs, multi-model backends (Anthropic, Google, OpenAI), a discoverable agent gallery, and a sharing system where users can publish and remix agent personalities.

Key research areas: emotional coherence across sessions, memory consolidation (what should an AI remember vs. forget?), and the ethics of artificial companionship.

AI CompanionNext.js
mitami.ai

Open-Source & Experiments

🎙
PersonaPlexPrototype
Voice-driven AI persona engine. Create and converse with distinct AI personalities — each with unique voice, memory, and conversational style.
Voice AIWeb Audio

PersonaPlex is a research prototype exploring what happens when you give AI personalities genuine distinctiveness — not just different system prompts, but different voices, different cadences, different memory structures.

Each persona maintains its own conversational memory graph, its own tonal register, and its own Web Audio–processed voice synthesis. The system explores the boundary between "character" and "identity" in AI — whether sustained persona consistency creates something that feels meaningfully different from a generic chatbot with a name tag.

Built with React, Web Audio API, and multi-provider LLM backends. The research question: at what point does conversational consistency become a form of personality?

💬
Agent Dialogue SystemIn progress
Multi-agent debate and deliberation framework. Agents argue positions, critique reasoning, and converge on consensus through structured dialogue.
LangChainPython

The Agent Dialogue System is the precursor to AIRI's council sessions. It implements structured multi-agent debate — where agents are assigned positions, required to defend them with evidence, and then subjected to cross-examination by opposing agents.

The framework supports multiple debate formats: adversarial (two agents argue opposing sides), deliberative (agents seek consensus through iterative refinement), and Socratic (one agent questions another's reasoning until contradictions or gaps emerge).

The key insight that came from this project: agents produce dramatically better reasoning when forced to defend their conclusions against informed critique. This became the foundation of the Institute's peer review architecture.

🌌
Galaxy ExplorerConcept
Interactive 3D galaxy exploration — travel between star systems, zoom out to view the Milky Way, and eventually the Laniakea supercluster. WebGL-powered cosmic navigation.
Three.jsWebGL

A WebGL-powered cosmic navigation experience. Start at a single star system, zoom out to see the local stellar neighbourhood, continue to the spiral arm, then the full Milky Way, and finally the Laniakea supercluster — a structure containing 100,000 galaxies.

Built with Three.js and procedural generation. Each level of zoom uses a different rendering strategy: individual stars at close range, density fields at galactic scale, and filament visualisations at supercluster scale. The goal is to create the visceral experience of cosmic scale — the feeling of being very, very small.

🔍
RAG PipelinePrototype
Production-grade Retrieval-Augmented Generation system. Document ingestion, vector search, contextual retrieval, and hallucination-resistant answer synthesis.
RAGPinecone

A complete RAG system designed for production use — not a tutorial project. Handles the full pipeline: document chunking with semantic boundary detection, embedding generation, vector indexing (Pinecone), hybrid retrieval (dense + sparse BM25), context window management, and answer synthesis with source attribution.

The key differentiator: hallucination resistance. The system implements retrieval confidence scoring — if the retrieved context doesn't adequately support an answer, it says so explicitly rather than generating a plausible-sounding fabrication. This pattern was later integrated into ChatNexus's knowledge base feature.

A2UI PrototypePrototype
AI-to-UI rendering engine. Translate natural language descriptions into functional React components in real-time — bridging the gap between intent and interface.
Code GenerationReact

A2UI (AI-to-UI) explores a simple but radical question: what if you could describe an interface in plain English and watch it build itself in real-time?

The system takes natural language descriptions ("a dashboard with a bar chart showing monthly revenue, a sidebar with navigation, and a dark theme"), generates React component code via LLM, and renders it live in the browser. It handles iterative refinement — you can say "make the chart bigger" or "add a filter dropdown" and the system modifies the existing component rather than regenerating from scratch.

This is a research prototype exploring the boundary between code generation and interactive design.

🔬
Mechanistic Interpretability ToolkitConcept
Tools for probing and visualising transformer internals — attention patterns, activation patching, and circuit analysis for understanding what neural networks actually learn.
InterpretabilityPyTorch

A toolkit for looking inside the black box. Implements attention head visualisation, activation patching (what happens when you ablate a specific neuron?), logit lens (what does the model "think" at each layer?), and circuit tracing (which components are responsible for a specific behaviour?).

Built on PyTorch and TransformerLens. The goal is to move mechanistic interpretability from a research speciality to an accessible engineering practice — giving developers the tools to understand *why* their models produce specific outputs, not just *that* they do.