In a Training Loop 🔄

13 20 41

Honglin Guo

KYLN24

KYLN24

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

upvoted a paper 3 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

upvoted a paper 5 days ago

OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment

View all activity

Organizations

authored a paper 3 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published 7 days ago • 62

upvoted a paper 3 days ago

ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development

Paper • 2601.11077 • Published 7 days ago • 62

upvoted a paper 5 days ago

OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment

Paper • 2601.01576 • Published 19 days ago • 17

authored a paper 7 days ago

OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding

Paper • 2601.10343 • Published 8 days ago

liked a dataset 12 days ago

nex-agi/agent-sft

Preview • Updated Dec 9, 2025 • 189 • 105

authored a paper about 1 month ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 144

upvoted a paper about 1 month ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 144

New activity in nex-agi/agent-sft about 1 month ago

Improve dataset card: Add paper, code, project page links and tags

#3 opened about 2 months ago by

nielsr

Is it normal that there is no thinking process in the content?

#4 opened about 1 month ago by

xianf

liked a dataset about 2 months ago

allenai/Dolci-Think-SFT-Python

Viewer • Updated 18 days ago • 1.09M • 579 • 4

authored 4 papers about 2 months ago

Better Process Supervision with Bi-directional Rewarding Signals

Paper • 2503.04618 • Published Mar 6, 2025

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21, 2025 • 84

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 80

New activity in nex-agi/agent-sft about 2 months ago

Has the agent's trajectory data been verified/validated?

#1 opened about 2 months ago by

Aunderline

Upload Grass

#2 opened about 2 months ago by

CryptoKing8787

updated a collection about 2 months ago

Nex-N1

Collection

7 items • Updated Dec 22, 2025 • 10

upvoted a paper about 2 months ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 80

upvoted a collection about 2 months ago

Nex-N1

Collection

7 items • Updated Dec 22, 2025 • 10

updated a collection about 2 months ago

Nex-N1

Collection

7 items • Updated Dec 22, 2025 • 10

Honglin Guo

AI & ML interests

Recent Activity

Organizations

KYLN24's activity

Improve dataset card: Add paper, code, project page links and tags

Is it normal that there is no thinking process in the content?

Has the agent's trajectory data been verified/validated?

Upload Grass