OrtSAE: Orthogonal Sparse Autoencoders Uncover Atomic Features • Paper 2509.22033 • Published Sep 26
The Rogue Scalpel: Activation Steering Compromises LLM Safety • Paper 2509.22067 • Published Sep 26
IoT-MCP: Bridging LLMs and IoT Systems Through Model Context Protocol • Paper 2510.01260 • Published Sep 25
I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders • Paper 2503.18878 • Published Mar 24