Towards eliciting latent knowledge from LLMs with mechanistic interpretability Paper • 2505.14352 • Published May 20 • 9