Active filters: kto
Aaryan-Nakhat/experiment_117_RL_itr_4_on_exp_105_model_v2
Text Generation
• 3B • Updated
• 2
Aaryan-Nakhat/experiment_119_RL_itr_4_on_exp_105_model_v2
Text Generation
• 3B • Updated
• 1
WokeAI/tankie-kto-v1-adpt
Text Generation
• Updated
AIPlans/Qwen3-0.6B-KTO_trial
Text Generation
• 0.6B • Updated
• 2
• 1
ucrelnlp/PyMUSAS-Neural-Multilingual-Small-BEM
ucrelnlp/PyMUSAS-Neural-Multilingual-Base-BEM
karim12344321/llama2-7b-kto-mental-health_final
Text Generation
• Updated
onnx-community/mmBERT-small-ONNX
Fill-Mask
• Updated
• 11
• 2
developer-lunark/doha-kto
4B • Updated
• 2
4B • Updated
4B • Updated
• 3
developer-lunark/jihu-kto
4B • Updated
4B • Updated
mradermacher/yul-kto-GGUF
4B • Updated
• 214
mradermacher/yul-kto-i1-GGUF
4B • Updated
• 21
Nishef/MiniCPM-1B-sft-bf16-Full_KTO_20251225_185339
Text Generation
• Updated
Nishef/Qwen3-0.6B-Full_DPO_20251225_130318
Text Generation
• Updated
Nishef/Qwen3-0.6B-Full_KTO_20251225_102050
Text Generation
• Updated
Nishef/Qwen3-0.6B-Full_ORPO_20251225_145426
Text Generation
• Updated
Nishef/SmolLM2-360M-Full_DPO_20251225_043457
Text Generation
• Updated
Nishef/SmolLM2-360M-Full_KTO_20251225_020028
Text Generation
• Updated
Nishef/SmolLM2-360M-Full_ORPO_20251225_062447
Text Generation
• Updated
Nishef/SmolLM2-360M-Full_KTO_20251225_020028-merged
Text Generation
• 0.4B • Updated
• 11
Nishef/SmolLM2-360M-Full_DPO_20251225_043457-merged
Text Generation
• 0.4B • Updated
• 1
Nishef/SmolLM2-360M-Full_ORPO_20251225_062447-merged
Text Generation
• 0.4B • Updated
• 7
Nishef/Qwen3-0.6B-Full_KTO_20251225_102050-merged
Text Generation
• 0.6B • Updated
• 3
Nishef/Qwen3-0.6B-Full_DPO_20251225_130318-merged
Text Generation
• 0.6B • Updated
• 36
Nishef/Qwen3-0.6B-Full_ORPO_20251225_145426-merged
Text Generation
• 0.6B • Updated
• 2
Nishef/MiniCPM-1B-sft-bf16-Full_KTO_20251225_185339-merged
Text Generation
• 1B • Updated
• 1
Nishef/SmolLM2-360M-Full_KNOWLEDGE_RETAINING_ENHANCED_KTO_20251227_151509
Text Generation
• Updated
• 17