Flabubium

Mechanistic Interpretability

Decoding gears, revealing minds, and pushing for safer AI systems.

Activation Patching

Tracing how information flows through a model by swapping activations at specific sites.
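
As a rough illustration of the idea, the sketch below runs a toy two-layer network on a "clean" and a "corrupted" input, then swaps single hidden activations from the clean run into the corrupted run to see how much each site moves the output. The weights, inputs, and the names `forward`, `effects`, and `y_all_patched` are hypothetical choices made for this example, not taken from any particular model or library.

```python
import math

# Hypothetical fixed weights for a toy network: 3 inputs, 3 hidden units, 1 output.
W1 = [[0.5, -0.3, 0.8], [-0.6, 0.1, 0.4], [0.2, 0.9, -0.5]]
W2 = [1.0, -2.0, 0.5]

def forward(x, patch=None):
    """Run the toy network; `patch` maps hidden-unit index -> activation to swap in."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W1]
    for idx, value in (patch or {}).items():
        hidden[idx] = value  # overwrite the activation at this site
    output = sum(w * h for w, h in zip(W2, hidden))
    return output, hidden

x_clean = [1.0, 0.0, -1.0]     # input whose behavior we want to explain
x_corrupt = [-1.0, 0.5, 1.0]   # contrasting input

y_clean, h_clean = forward(x_clean)
y_corrupt, _ = forward(x_corrupt)

# Patch one hidden unit at a time and record how far the output shifts.
effects = [
    forward(x_corrupt, patch={i: h_clean[i]})[0] - y_corrupt
    for i in range(len(W1))
]

# Patching every hidden unit at once should reproduce the clean output.
y_all_patched, _ = forward(x_corrupt, patch=dict(enumerate(h_clean)))
```

Because the output layer here is linear, the per-unit effects add up to the full clean-versus-corrupted difference. In real models the sites interact, so per-site effects are usually only approximate attributions.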

Representation Analysis

Studying polysemantic neurons: individual neurons that respond to many different, unrelated concepts.

Mechanistic Causality

Finding the small subnetworks inside large models that are responsible for specific capabilities.

Exploring architectures in motion, with infinite possibilities.

Energy Functions

Exploring how assigning energy scores to data can lead to richer, more structured representations.
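
One way to picture an energy score is a Hopfield-style energy, used here purely as an assumed stand-in since the text does not name a specific model. The sketch stores one binary pattern and shows that the stored pattern gets a lower energy than a copy with one flipped bit. The helper names `hopfield_weights` and `energy` are hypothetical.

```python
def hopfield_weights(pattern):
    """Hebbian weights for a single stored +/-1 pattern, with a zero diagonal."""
    n = len(pattern)
    return [[0 if i == j else pattern[i] * pattern[j] for j in range(n)]
            for i in range(n)]

def energy(state, weights):
    """Energy E(s) = -1/2 * sum_ij W_ij * s_i * s_j; lower means more compatible."""
    n = len(state)
    return -0.5 * sum(weights[i][j] * state[i] * state[j]
                      for i in range(n) for j in range(n))

stored = [1, -1, 1, -1]
corrupted = [-1, -1, 1, -1]  # same pattern with the first bit flipped

W = hopfield_weights(stored)
e_stored = energy(stored, W)        # -6.0
e_corrupted = energy(corrupted, W)  # 0.0
```

The stored pattern sits at an energy minimum, so data that matches the learned structure scores lower than data that does not. That ordering is the property energy-based approaches build on.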

Hybrid Reasoning

Designing architectures that combine pattern learning with structured, rule-based reasoning.

Truly caring about correctness

Formal Methods

Turning informal mathematical arguments into fully verified, machine-checkable proofs.
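
To make this concrete, take the informal argument "the sum of two even numbers is even, because 2k + 2m = 2(k + m)". A minimal Lean 4 sketch of that statement follows; the theorem name `even_add_even` and the way evenness is spelled out with an existential are choices made for this example, not a reference formalization.

```lean
-- "Even" is written out directly as "equal to 2 * something".
theorem even_add_even (a b : Nat)
    (ha : ∃ k, a = 2 * k) (hb : ∃ m, b = 2 * m) :
    ∃ n, a + b = 2 * n :=
  match ha, hb with
  | ⟨k, hk⟩, ⟨m, hm⟩ =>
    -- The witness is k + m; the rewrite mirrors 2k + 2m = 2(k + m).
    ⟨k + m, by rw [hk, hm, Nat.mul_add]⟩
```

Each step of the informal argument becomes something the checker can verify: the witness `k + m` is given explicitly, and the algebraic identity is discharged by rewriting with the distributivity lemma.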

Proof Assistants

Connecting language models to the Lean proof assistant for interactive, verified reasoning.

Tactic Search

Developing smarter algorithms for navigating the vast space of possible proof strategies.
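
A common starting point for this kind of search is best-first search over proof states. The sketch below is a toy under stated assumptions: a "proof state" is just an integer measuring how much of the goal remains, the two "tactics" (`halve`, `dec`) are made up for the example, and the score is the state itself. A real system would replace these with proof-assistant states and a learned scoring function.

```python
import heapq

def best_first_search(start, is_solved, expand, score, max_steps=1000):
    """Return the list of tactic names that solves `start`, or None if none is found."""
    frontier = [(score(start), 0, start, [])]  # (priority, tie-breaker, state, path)
    seen = {start}
    counter = 1
    for _ in range(max_steps):
        if not frontier:
            break
        _, _, state, path = heapq.heappop(frontier)  # most promising state first
        if is_solved(state):
            return path
        for tactic, nxt in expand(state):
            if nxt not in seen:
                seen.add(nxt)
                heapq.heappush(frontier, (score(nxt), counter, nxt, path + [tactic]))
                counter += 1
    return None

def expand(n):
    """Hypothetical tactics: 'halve' applies to even positive goals, 'dec' to any positive goal."""
    moves = []
    if n > 0 and n % 2 == 0:
        moves.append(("halve", n // 2))
    if n > 0:
        moves.append(("dec", n - 1))
    return moves

path = best_first_search(10, is_solved=lambda n: n == 0, expand=expand, score=lambda n: n)
```

Here `path` comes out as `["halve", "dec", "halve", "halve", "dec"]`, which walks 10 → 5 → 4 → 2 → 1 → 0. The priority queue is what lets a better scoring function steer the search toward promising branches without changing the rest of the loop.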
