Activation Space Interventions Can Be Transferred Between Large Language Models Paper • 2503.04429 • Published Mar 6 • 2
TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research Paper • 2503.12730 • Published Mar 17 • 4