ANALYSIS GUARD-SLM: Token Activation Defense Targets Small Language Model Jailbreaks 4/10 4 min read 1 month ago
ANALYSIS ARTLAS Maps 78 Cultural-Technology Institutions Using NLP Clustering 3/10 4 min read 1 month ago
ANALYSIS Debate Protocol Design Shapes Multi-Agent AI Outcomes, Controlled Study Finds 3/10 4 min read 1 month ago
ANALYSIS Philippine Intern Survey Maps Four Categories of AI Tool Use in OJT 3/10 4 min read 1 month ago
ANALYSIS ScoringBench Ranks Tabular AI Models on Full Distribution Accuracy 3/10 4 min read 1 month ago
ANALYSIS Epistemic Uncertainty Proposed as Routing Signal for Cheaper, More Reliable AI Explanations 3/10 4 min read 1 month ago
ANALYSIS ATP-Bench: Researchers Benchmark 10 MLLMs on Agentic Tool Planning 3/10 4 min read 1 month ago
ANALYSIS ShapE-GRPO Uses Shapley Values to Fix GRPO Free-Rider Problem in LLM Training 3/10 4 min read 1 month ago