@ariG23498 on Hugging Face: "Tried my hand at simplifying the derivations of Direct Preference…"

ariG23498

posted an update Jan 19, 2025

Tried my hand at simplifying the derivations of Direct Preference Optimization.

I cover how one can reformulate RLHF into DPO. The idea of implicit reward modeling is chef's kiss.

Blog: https://huggingface.co/blog/ariG23498/rlhf-to-dpo

In this post

ariG23498 Aritra Roy Gosthipaty