AI agents expose martech’s critical weakness

▼ Summary
– A new SaaStr dataset grading 152 B2B software APIs found an overall average score of 72 out of 100 (C+), with marketing APIs averaging just 63.6.
– Only 5 out of 57 marketing-relevant APIs scored 80 or higher, with HubSpot and Lightfield at 80 (A-), while Marketo scored 50 and Gainsight scored 47.
– The weakest dimensions for marketing APIs are agent readiness (average 6.1/10) and rate limits (average 6.6/10), meaning most platforms lack safe testing environments and can’t handle high-volume automated calls.
– Top-performing APIs like Stripe (97) and Anthropic (90) highlight the gap, as marketing and sales platforms generally lack webhooks and event support, forcing agents to repeatedly poll for updates.
– Practitioners should check if their CRM can retry actions without duplicates, if their marketing platform pushes real-time updates, and if their sales tool alerts on data changes to identify AI readiness gaps.
The rapid rise of autonomous AI agents is exposing a critical weakness in the martech landscape that many organizations have long ignored. While these agents promise to execute marketing workflows at machine speed, the reality is that most enterprise platforms are structurally unprepared to support them. The core problem isn’t the AI itself, but the brittle data integration infrastructure that underpins modern marketing stacks.
For the first time, quantifiable evidence of this gap has emerged. The SaaStr AI Agent API Report Card, a new public dataset from the SaaS founder community led by Jason Lemkin, evaluates 152 B2B software APIs across six key criteria that matter when an AI agent takes the wheel: API design, events and streaming support, authentication, rate limits, SDK quality and documentation, and agent readiness , whether the API is built to be safely operated by AI. Each criterion scores 0–10, for a maximum of 100, with letter grades from A+ to F. Three independent AI models , Claude, GPT, and Gemini , conducted the evaluations.
The results paint a sobering picture. The overall average across all categories is 72 out of 100, a C+. But that number is artificially inflated by strong showings in infrastructure and developer tools. When you zoom in on the categories marketers actually rely on, the scores drop sharply.
The marketing API gap is real. Marketing platforms average just 63.6 out of 100. Customer success tools score 62.9. Sales intelligence averages 65.8. Even CRM, the most mature category in business software, manages only 68.5. Compare that to AI and LLM APIs (80.8), authentication and identity (78.8), DevTools (76.9), and infrastructure (77.6). The AI tools are ready. The platforms they are supposed to work inside are not.
Out of 57 marketing-relevant APIs in the report card, only five score 80 or higher , an A- or better. That’s just 9%. HubSpot and Lightfield both hit 80 (A-). Salesforce earns a 75 (B+). After that, the marketing stack drops off quickly: Klaviyo at 75, Customer.io at 70, Beehiiv at 70, Braze at 67, and Iterable at 66.
Then comes the tail. Marketo scores a 50 out of 100 , a C grade, tied for the lowest score of any API in any category on the entire report card. ActiveCampaign scores 53. Mailchimp scores 57. Gainsight scores 47. “The bottom of the list is the real story,” Lemkin wrote. “These are the budget categories most directly under threat from agent-driven workflows.” Keep in mind that these letter grades are generous; typically, an 80 translates to a B-, and anything below 60 is failing.
What’s dragging these scores down? The report card breaks each score into six sub-criteria. The weakest dimension overall is rate limits (average 6.6 out of 10). Most APIs were built for humans clicking around a dashboard, not for software making thousands of automated calls per minute. But for marketing platforms specifically, the weakest dimension is agent readiness , 6.1 out of 10. This includes sandbox environments for safe testing, standardized error messages, and consistent API behavior that prevents duplicate records when an action is retried. Without these, an AI agent cannot safely test, detect failures, or repeat operations without accidentally creating duplicate contacts, leads, or records.
Webhooks and event support are also a major pain point. Sales intelligence tools average just 5.9 out of 10 on webhooks, meaning agents must repeatedly check for updates rather than being notified automatically. Hunter.io scores 7 out of 10 on webhooks, while Apollo scores 4. The contrast with the top of the overall leaderboard is stark. Stripe scores 97 (A+) with perfect 10s across API design, webhooks, auth, SDKs, docs, and agent readiness. GitHub scores 92 (A). Anthropic and OpenAI each score 90 (A).
There are bright spots. HubSpot’s spring 2026 release shipped updated API versioning and dedicated developer APIs for its Breeze AI Agents, improvements that earned it an A- (80). Salesforce’s Agentforce 360 and Agent Scripting Toolkit keep it competitive with a B+ (75). But these are exceptions. The majority of marketing and sales platforms sit in the B range , functional enough for basic automation, but with meaningful gaps that will frustrate more complex AI workflows.
For practitioners evaluating their stacks, the report card offers a straightforward framework. Can your CRM safely retry a failed action without creating duplicate records? Can your marketing automation platform push real-time updates instead of waiting to be asked? Does your sales intelligence tool automatically alert you when prospect data changes, or does your team have to proactively look for it? If the answer to more than one is “no,” your stack has an AI readiness gap , even if the product looks great in a demo. A polished dashboard doesn’t mean the underlying API is built for agents.
(Source: MarTech)