Projects & Research

Things I've built and studied, most recent first.

2025 · UC Berkeley × Satlyt

Small Language Models for Edge AI in Space↗

Building an AI agent workflow with tool-calling for satellite edge computing — extracting actionable tasks from unstructured logs and routing them to specialized models.

2025

Voice AI Freight Platform↗

Freight brokerage platform with an autonomous voice agent handling 24/7 inbound carrier calls — FMCSA verification, fuzzy load matching, and a rule-based negotiation engine with margin protection. Real-time dashboard tracks booking rates, revenue, and carrier sentiment. Next.js / FastAPI on Fly.io.

2025

Enterprise Nervous System↗

Multi-agent incident response using AG2 swarms and MCP — parallel retrieval across logs, GitHub, and Jira to automate root-cause analysis and postmortems in under 19 seconds per incident. Evaluated over 21 production-style failure scenarios with deterministic scoring.

2025

Vulcan OmniPro Assistant↗

Full-stack multimodal assistant for the Vulcan OmniPro 220 welder: five specialized agents (orchestrator, retrieval, vision, diagnostic, artifact) behind a central router with WebSocket streaming. The vision agent turns manual images into interactive React components, SVG diagrams, and Mermaid flowcharts; hybrid BM25 + semantic search with cross-encoder reranking. React / FastAPI / PostgreSQL on Fly.io.

2025

AgentProbe↗

A red-teaming framework for AI agent systems: a four-agent adversarial swarm (Recon, Attack, Evaluator, Reporter) that surfaces prompt injection, PII leakage, and policy violations. Hybrid evaluation combines deterministic rules, regex pipelines, and LLM scoring aligned with OWASP LLM risk categories.

2025

CodeBase Agent↗

Agentic code-refactoring system: a multi-agent pipeline that analyzes Python code and validates its own changes.

2025 · CISPA

Smooth Cascade Unlearning via Reversed Self-Distillation

Machine unlearning that escalates in-context → LoRA → full fine-tuning by forget-set size, using reversed self-distillation to align model versions and minimize membership inference risk. Validated on SST-2, AGNews, and MIT Movies with LiRA auditing on OLMo 2 1B.

2024 · Centrale Lyon

Data Analysis for Table Tennis Matches↗

Physics-based modeling of table tennis ball trajectories with player data to analyze bounce uncertainty zones and strike timing — feeding player classification and strategy.

2023 · OCaml

Type System for Secure Information Flow↗

A type system and checker in OCaml that enforces the noninterference property.