BENCHMARKS UC Berkeley Expands BFCL Benchmark to v4, Adding Agentic Evaluation for LLM Function Calling 4/10 2 min read 2 months ago
TOOL UPDATES Hubcap: A 25-Line PHP Script That Exposes the Minimal Architecture of Autonomous AI Agents 4/10 3 min read 2 months ago