Claude Skill Evaluates B2B Vendors by Interrogating Their AI Agents and Cross-Referencing Claims

An open-source Claude skill released on GitHub by Salespeak AI automates B2B vendor evaluation by researching companies, asking adversarial questions, and directly probing vendors’ AI agents through their APIs. The skill generates a scorecard that cross-references vendor claims against independent sources including G2, Gartner, and press coverage, with transparent evidence tracking for each finding.

The tool addresses a shift in how B2B purchasing decisions are made. According to recent research, 66 percent of UK senior decision-makers now use AI platforms like ChatGPT, Microsoft Copilot, and Perplexity as part of their procurement process. Among buyers aged 25 to 34, the figure reaches 85 percent. Just five brands appear in 80 percent of top responses delivered by AI agents across any given B2B category, meaning vendors that are not optimized for AI discovery are increasingly invisible to a growing share of buyers.

The skill works in three phases. First, it automatically researches the buyer’s company to understand their specific needs and context. Second, it asks category-specific questions, including deliberately adversarial ones about common customer complaints and unsuitable use cases. Third, if the vendor has an AI agent accessible via REST API, the skill engages it directly, testing how it handles tough questions and flagging instances where the agent deflects rather than answering directly.

For vendors without AI agents, the skill evaluates them using the same scoring framework but relies entirely on public sources. This creates a level playing field in the evaluation while highlighting which vendors have invested in AI-powered customer engagement and which have not. The MIT-licensed skill can be installed in Claude Code and works for any vendor category.

The timing aligns with a broader acceleration in AI-driven procurement. A Gartner survey found that 98 percent of procurement leaders are deploying or planning to deploy generative AI, while 94 percent of procurement executives use it at least weekly, up 44 percentage points from 2023 to 2024. AI is compressing the pre-sales phase of B2B transactions, with buyers arriving at vendor conversations with pre-formed shortlists and evaluation criteria already shaped by AI research.

The practical implication for B2B vendors is that their AI agents, public content, and review profiles are becoming their primary sales representatives for a growing share of deals. A Claude skill that systematically evaluates and scores these assets creates pressure to improve them, the same way SEO tools created pressure to optimize websites for search engines a decade ago.

Claude Skill Evaluates B2B Vendors by Interrogating Their AI Agents and Cross-Referencing Claims

Enjoyed this story?

Bluesky Launches AI Assistant Attie for Custom Social Media Feed Creation

Chroma Releases Context-1: 20B Parameter Model for Multi-Hop Search

Developer Releases Miasma Tool to Trap AI Web Scrapers with Poisoned Data

Before you go…