Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Lukas Galke Poech's picture
11 23

Lukas Galke Poech

lgalke
namazifard's profile picture EvilScript's profile picture
·
https://lgalke.github.io
  • LukasGalke
  • lgalke
  • lukas-galke-8086b0155
  • lukasgalke.bsky.social

AI & ML interests

LLM interpretability, agentic/multi-agent safety

Recent Activity

authored a paper 1 day ago
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
upvoted a paper 2 days ago
The Arbiter Agent: Continually Monitoring Multi-Agent Conversations to Detect Emergent Misalignment
authored a paper 7 days ago
BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling
View all activity

Organizations

Danish Foundation Models's profile picture MLX Community's profile picture filter with espresso's profile picture RUNE Lab's profile picture Schneider-Kamp Lab's profile picture Machine Ecology Lab's profile picture Inversion Lab for AI Safety's profile picture AI Safety & Interpretability Lab's profile picture

lgalke 's models 1

lgalke/Qwen3.5-35B-A3B-psysafe

Image-Text-to-Text • 36B • Updated Mar 25 • 19
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs