Modular Arithmetic as Neural Network Learning Task
The OpenAI research team studying grokking chose modular arithmetic as their experimental task, creating datasets by systematically varying operands and computing remainders after division by a modulus.
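A minimal sketch of how such a dataset might be constructed, assuming modular addition; the modulus and train fraction here are illustrative choices, not the paper's exact settings:

```python
# Build a modular-addition dataset: every pair (a, b) labelled with (a + b) mod p.
# Modulus and train/test split below are illustrative, not the original paper's values.
import itertools
import random

p = 113  # modulus (illustrative)
pairs = list(itertools.product(range(p), repeat=2))    # all operand pairs (a, b)
examples = [((a, b), (a + b) % p) for a, b in pairs]   # label = remainder mod p

random.seed(0)
random.shuffle(examples)
split = int(0.3 * len(examples))                        # small training fraction, typical of grokking setups
train, test = examples[:split], examples[split:]
```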
One-Hot Encoding: How Models Perceive Tokens
Neural networks receive input through one-hot encoding, a sparse representation where each token activates exactly one position in a vector while all others remain zero.
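A quick sketch of the encoding itself (sizes are illustrative):

```python
# One-hot encode a token id over a vocabulary of a given size.
import numpy as np

def one_hot(token_id: int, vocab_size: int) -> np.ndarray:
    vec = np.zeros(vocab_size)
    vec[token_id] = 1.0      # exactly one active position; all others stay zero
    return vec

print(one_hot(5, 10))  # [0. 0. 0. 0. 0. 1. 0. 0. 0. 0.]
```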
Grokking: Delayed Generalization in Neural Networks
The phenomenon was discovered accidentally by an OpenAI research team in 2021, when a researcher left a model training over a vacation; the term “grokking” comes from Robert Heinlein’s 1961 novel Stranger in a Strange Land, where it denotes understanding so profound that the knower merges with the thing understood.
Embedding Matrix Transforms Sparse Inputs to Dense Representations
The embedding matrix multiplies sparse one-hot input vectors to produce dense embedding vectors, serving as the first learned transformation in the network.
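A small sketch showing why this matrix product is equivalent to selecting a single column of the embedding matrix (dimensions are illustrative):

```python
# The embedding layer as a matrix product with a one-hot vector.
# Multiplying W_E (d_model x vocab) by a one-hot column selects exactly one column of W_E,
# which is why embeddings are implemented in practice as a table lookup rather than a matmul.
import numpy as np

vocab_size, d_model = 113, 128          # illustrative sizes
rng = np.random.default_rng(0)
W_E = rng.normal(size=(d_model, vocab_size))

x = np.zeros(vocab_size)
x[7] = 1.0                              # one-hot vector for token 7
dense = W_E @ x                         # dense embedding vector
assert np.allclose(dense, W_E[:, 7])    # identical to looking up column 7
```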
Network Learns Fourier Trigonometric Solution to Modular Addition
Neel Nanda’s team discovered through discrete Fourier analysis that the network learns to represent inputs and operations using sine and cosine functions at specific frequencies.
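A minimal sketch of this kind of analysis, assuming access to the trained embedding matrix (here replaced by a random stand-in); in a grokked model a handful of frequencies dominate the spectrum:

```python
# Look for periodic structure in a learned embedding with a discrete Fourier transform.
# W_E below is a random stand-in; the real analysis would use the trained embedding.
import numpy as np

p, d_model = 113, 128
rng = np.random.default_rng(0)
W_E = rng.normal(size=(p, d_model))          # rows indexed by residue 0..p-1

spectrum = np.abs(np.fft.fft(W_E, axis=0))   # DFT over the residue dimension
power_per_freq = spectrum.sum(axis=1)        # total power at each frequency
top_freqs = np.argsort(power_per_freq[1:])[::-1][:5] + 1   # skip the constant (DC) term
print(top_freqs)                              # in a grokked model: a few sharp key frequencies
```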
Sparse Linear Probes Extract Hidden Trigonometric Signals
Researchers analyzing the grokking model use sparse linear probes as an interpretability technique to extract clean trigonometric signals from noisy distributed embedding representations.
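One way such a probe can be implemented is with L1-regularized regression; this is an assumed re-implementation of the general technique, not the researchers’ exact code, and the target frequency is hypothetical:

```python
# Sparse linear probe via L1-regularized (Lasso) regression: find a small set of
# activation dimensions that linearly predict a hypothetical target signal cos(2*pi*k*a/p).
import numpy as np
from sklearn.linear_model import Lasso

p, d_model, k = 113, 128, 14                  # illustrative modulus, width, and frequency
rng = np.random.default_rng(0)
acts = rng.normal(size=(p, d_model))          # stand-in for per-token activations

target = np.cos(2 * np.pi * k * np.arange(p) / p)
probe = Lasso(alpha=0.05).fit(acts, target)   # L1 penalty drives most weights to zero
print((probe.coef_ != 0).sum(), "active dimensions")
```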
Analog Clocks Physically Implement Modular Arithmetic
Analog clocks serve as everyday physical implementations of modular addition, with circular motion perfectly matching modulo arithmetic’s wraparound behavior.
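A one-line worked example of the wraparound:

```python
# Clock arithmetic is addition modulo 12: five hours after 10 o'clock is 3 o'clock.
print((10 + 5) % 12)  # 3
```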
Diagonal Symmetry in Activations Detects Input Sums
Neurons in the deeper multi-layer perceptron layers develop diagonal wave patterns whose activation crests align with specific values of the input sum, enabling sum detection through geometric structure.
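A small sketch of why a sum-dependent activation looks diagonal when plotted over the two inputs (modulus and frequency are illustrative):

```python
# A neuron whose activation depends only on (a + b) is constant along anti-diagonals
# of the (a, b) grid, so its crests trace diagonal stripes.
import numpy as np

p, k = 113, 14
a = np.arange(p)[:, None]
b = np.arange(p)[None, :]
activation = np.cos(2 * np.pi * k * (a + b) / p)  # constant wherever a + b is constant
print(activation.shape)                            # (113, 113) grid of diagonal bands
```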
Trigonometric Sum Identity Enables Neural Addition
The network discovers and effectively implements the trigonometric identity cos(x)cos(y) - sin(x)sin(y) = cos(x+y), converting products of sines and cosines of the operands into a single cosine of their sum.
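A quick numeric check of the identity at a Fourier frequency ω = 2πk/p, matching how sines and cosines of the two operands compose into a function of their sum (values are illustrative):

```python
# Verify cos(w*a)cos(w*b) - sin(w*a)sin(w*b) == cos(w*(a + b)) with w = 2*pi*k/p.
import numpy as np

p, k = 113, 14
w = 2 * np.pi * k / p
a, b = 17, 95
lhs = np.cos(w * a) * np.cos(w * b) - np.sin(w * a) * np.sin(w * b)
rhs = np.cos(w * (a + b))
assert np.isclose(lhs, rhs)
```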
Excluded Loss Reveals Hidden Progress During Grokking
Neel Nanda and collaborators proposed the excluded loss metric as a clever diagnostic revealing internal structure formation invisible to standard accuracy and loss measurements.
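A minimal sketch of an excluded-loss style metric, written as an assumed re-implementation rather than the authors’ exact code: project the key-frequency trigonometric directions out of the output logits and re-measure cross-entropy; if loss rises sharply, the model is already relying on those components.

```python
# Excluded-loss sketch: remove key Fourier directions from the logits over the answer
# vocabulary, then compute cross-entropy. Frequencies below are illustrative.
import numpy as np

p = 113
key_freqs = [14, 35, 41]
c = np.arange(p)

# Directions to exclude: cos(w*c) and sin(w*c) for each key frequency.
# Over a full period these are mutually (near-)orthogonal, so normalizing each row
# makes the projection below a valid orthogonal projection.
dirs = []
for k in key_freqs:
    w = 2 * np.pi * k / p
    dirs += [np.cos(w * c), np.sin(w * c)]
D = np.stack([d / np.linalg.norm(d) for d in dirs])   # shape (2 * n_freqs, p)

def excluded_loss(logits: np.ndarray, labels: np.ndarray) -> float:
    """Cross-entropy after removing key-frequency components from each logit vector."""
    logits = logits - (logits @ D.T) @ D               # project out the excluded directions
    logits = logits - logits.max(-1, keepdims=True)    # numerical stability
    logp = logits - np.log(np.exp(logits).sum(-1, keepdims=True))
    return float(-logp[np.arange(len(labels)), labels].mean())
```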
Three Training Phases: Memorization, Structure Building, Cleanup
Nanda and collaborators identified three distinct phases networks progress through when grokking: initial memorization, hidden structure building, and final cleanup.
Claude Haiku Represents Character Count on Six-Dimensional Manifold
An Anthropic research team studying Claude 3.5 Haiku discovered clean geometric structure controlling the model’s line-break insertion behavior, representing a rare interpretability success in full-size models.
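A rough sketch of how such geometric structure might be surfaced; this is an assumed analysis using generic dimensionality reduction, not Anthropic’s actual pipeline, and the activations and character counts below are stand-ins:

```python
# Collect activations labelled by running character count, then project onto a few
# principal components and look for a smooth low-dimensional curve.
import numpy as np

rng = np.random.default_rng(0)
char_counts = np.arange(200)                 # hypothetical character count per position
acts = rng.normal(size=(200, 512))           # stand-in for residual-stream activations

centered = acts - acts.mean(axis=0)
U, S, Vt = np.linalg.svd(centered, full_matrices=False)
coords = centered @ Vt[:6].T                 # coordinates in the top six principal components
# In the real model, plotting coords against char_counts traces a smooth, helix-like curve.
```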
QK Twist: Attention Rotates Manifolds to Detect Distance
The Anthropic team discovered what they term a “QK twist” where attention blocks rotate helix-like geometric representations in six-dimensional space to implement distance comparison.
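A simplified two-dimensional toy of the rotation idea, not the six-dimensional structure in Claude 3.5 Haiku itself: counts live on a circle, the query side is rotated by a target offset, and the query-key dot product then peaks at the matching distance.

```python
# Toy "QK twist": rotating the query embedding by a fixed offset makes attention scores
# peak where the key count equals (query count + offset), i.e. at a specific distance.
import numpy as np

period = 100

def embed(n):                       # place a count on the unit circle
    theta = 2 * np.pi * n / period
    return np.array([np.cos(theta), np.sin(theta)])

def rotate(v, offset):              # the "twist": rotate the query by a fixed offset
    phi = 2 * np.pi * offset / period
    R = np.array([[np.cos(phi), -np.sin(phi)],
                  [np.sin(phi),  np.cos(phi)]])
    return R @ v

count, offset = 37, 25
query = rotate(embed(count), offset)
scores = [query @ embed(k) for k in range(period)]
print(int(np.argmax(scores)))       # 62 == count + offset: the score peaks at the target distance
```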
AI as Alien Intelligence: Summoning Ghosts, Not Building Minds
AI researcher Andrej Karpathy suggested that training large language models resembles summoning ghosts more than building animal intelligence, framing AI as fundamentally alien rather than human-like.