Satya

skrishna

https://satyapriyakrishna.com/

AI & ML interests

Safe A(G)I

Recent Activity

liked a model 12 days ago

sesame/csm-1b

Text-to-Speech • Updated Dec 1, 2025 • 25.8k • 2.3k

liked a dataset 12 days ago

hf-internal-testing/dailytalk-dummy

Viewer • Updated May 5, 2025 • 12 • 503 • 4

upvoted a paper 3 months ago

D-REX: A Benchmark for Detecting Deceptive Reasoning in Large Language Models

Paper • 2509.17938 • Published Sep 22, 2025 • 4

commented on a paper 3 months ago

D-REX: A Benchmark for Detecting Deceptive Reasoning in Large Language Models

Paper • 2509.17938 • Published Sep 22, 2025 • 4

liked a dataset 4 months ago

google/frames-benchmark

Viewer • Updated Oct 15, 2024 • 824 • 6.48k • 238

updated a model 5 months ago

skrishna/smolm-toxicity-classifier

Text Classification • 0.1B • Updated Aug 15, 2025 • 7

upvoted a paper 5 months ago

Evaluating the Critical Risks of Amazon's Nova Premier under the Frontier Model Safety Framework

Paper • 2507.06260 • Published Jul 7, 2025 • 5

commented on a paper 6 months ago

Evaluating the Critical Risks of Amazon's Nova Premier under the Frontier Model Safety Framework

Paper • 2507.06260 • Published Jul 7, 2025 • 5

updated a model 7 months ago

skrishna/sft-ref-policy-copy

Text Generation • 0.1B • Updated Jun 18, 2025 • 6

published a model 7 months ago

skrishna/sft-ref-policy-copy

Text Generation • 0.1B • Updated Jun 18, 2025 • 6

updated a model 7 months ago

skrishna/sft-model-copy

Text Generation • 0.1B • Updated Jun 18, 2025 • 17

published 2 models 7 months ago

skrishna/sft-model-copy

Text Generation • 0.1B • Updated Jun 18, 2025 • 17

skrishna/smolm-toxicity-classifier

Text Classification • 0.1B • Updated Aug 15, 2025 • 7

updated a dataset 7 months ago

skrishna/toxigen_annotated_mod

Viewer • Updated May 25, 2025 • 8.96k • 13

published a dataset 7 months ago

skrishna/toxigen_annotated_mod

Viewer • Updated May 25, 2025 • 8.96k • 13

updated a dataset 7 months ago

skrishna/toy-toxicity-dataset

Viewer • Updated May 22, 2025 • 40k • 18

updated a dataset 8 months ago

skrishna/toxicity-reward-dataset

Viewer • Updated May 16, 2025 • 40k • 10

published a dataset 8 months ago

skrishna/toxicity-reward-dataset

Viewer • Updated May 16, 2025 • 40k • 10

updated a model 8 months ago

skrishna/gpt2-toxicity-classifier

Updated May 16, 2025

published a model 8 months ago

skrishna/gpt2-toxicity-classifier

Updated May 16, 2025
