Nikita Balagansky

elephantmipt

https://elephantmipt.github.io

Recent Activity

authored a paper 4 days ago

Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning via Steering Vectors

authored a paper 4 days ago

Steering LLM Reasoning Through Bias-Only Adaptation

authored a paper 4 days ago

Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy

authored 4 papers 4 days ago

Small Vectors, Big Effects: A Mechanistic Study of RL-Induced Reasoning via Steering Vectors

Paper • 2509.06608 • Published Sep 8, 2025

Steering LLM Reasoning Through Bias-Only Adaptation

Paper • 2505.18706 • Published May 24, 2025

Train One Sparse Autoencoder Across Multiple Sparsity Budgets to Preserve Interpretability and Accuracy

Paper • 2505.24473 • Published May 30, 2025

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

Paper • 2606.10029 • Published 7 days ago • 12

upvoted a paper 5 days ago

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

Paper • 2606.10029 • Published 7 days ago • 12

submitted a paper to Daily Papers 5 days ago

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

Paper • 2606.10029 • Published 7 days ago • 12

authored a paper 14 days ago

Trust-Region Behavior Blending for On-Policy Distillation

Paper • 2605.31159 • Published 18 days ago • 66

updated a model about 2 months ago

elephantmipt/sae_uramt7ar

published a model about 2 months ago

elephantmipt/sae_uramt7ar

updated a model about 2 months ago

elephantmipt/sae_wiajygyw

published a model about 2 months ago

elephantmipt/sae_wiajygyw

updated a model about 2 months ago

elephantmipt/sae_edt7oylt

published a model about 2 months ago

elephantmipt/sae_edt7oylt

updated a model about 2 months ago

elephantmipt/sae_k9oz7r8j

published a model about 2 months ago

elephantmipt/sae_2qy4isey

updated a model about 2 months ago

elephantmipt/sae_10d1xu3h

published 4 models about 2 months ago

elephantmipt/sae_k9oz7r8j

elephantmipt/sae_10d1xu3h

elephantmipt/sae_92c2o28x

elephantmipt/sae_z4l3upt1