I work on test-time interpretability, evaluation, and alignment for large language models. Down to play chess and talk about finance anytime.
Currently: building PRISM + evaluation systems.