Moravec's Paradox
Draft notes on what remains easy for humans and hard for machines.
Draft notes on what remains easy for humans and hard for machines.
A compact walkthrough of bottlenecks, latent spaces, and reconstruction.
Non-greedy acceptance, residual sampling, and why q matters.
Clipping attention logits for MLA with shared rotary keys.
Approximating the serial pre-norm block.
On singular token spaces.
Small but faithful subsets of large point sets.
Play chess alone or with a friend.
Communication deficiencies in optimization.