Nikita Kezins
entfane
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
Recent Activity
updated a dataset 3 days ago
entfane/violent_eval published a dataset 4 days ago
entfane/violent_eval updated a model 4 days ago
entfane/gpt2_constitutional_classifier_violence