Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation Paper • 2510.04373 • Published Oct 5
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models Paper • 2510.04618 • Published Oct 6 • 123
Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use Paper • 2509.12867 • Published Sep 16
A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning Paper • 2510.01132 • Published Oct 1 • 5