alcompa

alcompa

AI & ML interests

None yet

Recent Activity

updated a dataset 12 days ago

aimagelab/CHAIR-DPO_preference_datasets

updated a collection 19 days ago

CHAIR-DPO

updated a collection 19 days ago

CHAIR-DPO

View all activity

Organizations

upvoted an article 3 months ago

Article

There is no such thing as a tokenizer-free lunch

Sep 25

•

upvoted an article 4 months ago

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8

•

upvoted an article 6 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Jun 3

•

upvoted 2 articles 7 months ago

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023

•

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

•

255

upvoted an article 10 months ago

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020

•

270

upvoted an article about 1 year ago

Article

Decoding Strategies in Large Language Models

Oct 29, 2024

•

alcompa

AI & ML interests

Recent Activity

Organizations

alcompa's activity

There is no such thing as a tokenizer-free lunch

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

The N Implementation Details of RLHF with PPO

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

How to generate text: using different decoding methods for language generation with Transformers

Decoding Strategies in Large Language Models