Lukas Galke Poech

lgalke

https://lgalke.github.io

AI & ML interests

LLM interpretability, agentic/multi-agent safety

Recent Activity

authored a paper 1 day ago

The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment

upvoted a paper 2 days ago

The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment

authored a paper 7 days ago

BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling

lgalke's models (1)

lgalke/Qwen3.5-35B-A3B-psysafe

Image-Text-to-Text • 36B • Updated Mar 25 • 19