view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! Jun 6 • 55
view article Article Introducing smolagents: simple agents that write actions in code. +1 Dec 31, 2024 • 1.15k
view article Article Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge +1 Oct 28, 2024 • 29
view article Article Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge +1 Oct 28, 2024 • 29
view article Article Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖 Jun 20, 2024 • 26