Alignment Personalization Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas" nbalepur/persona-tailoring Viewer • Updated Dec 17, 2024 • 5.35k • 25 nbalepur/persona-inference Viewer • Updated Dec 17, 2024 • 1.2k • 10 nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic Updated Dec 14, 2024 nbalepur/Llama-3.1-8B-PT-DPO-HHH Updated Dec 14, 2024
Mnemonic Generation Models and datasets for mnemonic generation research nbalepur/LLama-2-70b-Mnemonic-Tokenizer Updated May 11, 2024 nbalepur/LLama-2-70b-Mnemonic-DPO Text Generation • 69B • Updated May 10, 2024 • 2 nbalepur/LLama-2-70b-Mnemonic-SFT Text Generation • 69B • Updated May 10, 2024 • 4 • 1 nbalepur/Mnemonic_Pref Viewer • Updated May 16, 2024 • 472
Alignment Personalization Collection of datasets and models for our paper "Whose Boat Does it Float? Improving Personalization in Preference Tuning via Inferred User Personas" nbalepur/persona-tailoring Viewer • Updated Dec 17, 2024 • 5.35k • 25 nbalepur/persona-inference Viewer • Updated Dec 17, 2024 • 1.2k • 10 nbalepur/Llama-3.1-8B-PT-DPO-Mnemonic Updated Dec 14, 2024 nbalepur/Llama-3.1-8B-PT-DPO-HHH Updated Dec 14, 2024
Mnemonic Generation Models and datasets for mnemonic generation research nbalepur/LLama-2-70b-Mnemonic-Tokenizer Updated May 11, 2024 nbalepur/LLama-2-70b-Mnemonic-DPO Text Generation • 69B • Updated May 10, 2024 • 2 nbalepur/LLama-2-70b-Mnemonic-SFT Text Generation • 69B • Updated May 10, 2024 • 4 • 1 nbalepur/Mnemonic_Pref Viewer • Updated May 16, 2024 • 472