Back to blog
/8 min read/how to calculate share of voice in ai search (without fooling yourself)
Abstract visualization: flowing green nodes on dark background — how to calculate share of voice in ai search (without fooling yourself)

How to Calculate Share of Voice in AI Search Without Fooling Yourself (2026)

AI share of voice tells you what percentage of AI-generated responses mention your brand across a defined set of prompts, relative to your competitors. The formula itself is simple: brand mentions divided by total mentions across all tracked prompts, multiplied by 100. The hard part is building a prompt set that actually reflects how your buyers search, so the number you calculate means something.

What Is AI Share of Voice?

AI share of voice measures how often your brand appears in AI-generated answers compared to competitors, across a set of queries relevant to your category. It's the AI-era equivalent of share of search, except instead of counting keyword rankings, you're counting brand mentions inside synthesized responses from ChatGPT, Perplexity, Google AI Overviews, Claude, and Gemini.

The reason this matters now is scale. Google's AI Overviews alone reaches 2 billion monthly users as of May 2026. ChatGPT processes over 2 billion daily queries. When someone asks any of these systems for the best tool in your category, you're either in the answer or you're not. There's no page two.

The AI Share of Voice Formula

The calculation has two versions, depending on what you want to measure.

Basic mention rate

Run a set of prompts across your target AI engines. For each prompt, record whether your brand was mentioned. Then:

AI SoV = (Prompts where your brand appears / Total prompts run) × 100

If you run 100 prompts and your brand appears in 34 of them, your AI SoV is 34%.

Competitive share of voice

This version measures your slice of a fixed pie. Add up all brand mentions across all competitors for a given prompt set. Your share is your mentions divided by the total.

AI SoV = (Your brand mentions / Total brand mentions across all competitors) × 100

This is what Semrush uses in its AI Visibility Toolkit. As they note, AI share of voice across all brands in a category adds up to 100%. That means every point you gain, a competitor loses. It's useful for competitive benchmarking, though it requires you to define the competitor set first.

Both formulas are valid. Use the basic mention rate for tracking your own progress over time. Use the competitive version for positioning conversations with stakeholders who want to know where you stand in the market.

What Does a Specific Share of Voice Score Actually Mean?

A 50% AI share of voice means your brand appears in half of all prompts tracked within your competitive set. At that level, you're likely the dominant brand in AI-generated answers for your category, but there's still meaningful visibility going to competitors. A 100% share of voice would mean you appear in every tracked response and no competitor does, which is extremely rare outside of highly niche categories with weak competition.

More practically: a low score on a well-constructed prompt set is a precise signal about where you're losing. If you appear in 60% of category prompts but only 15% of problem-solution prompts, your content is getting you recognised as a brand but not as a solution to the specific problems your buyers have. That's a content gap you can act on.

Why Most AI SoV Numbers Are Wrong Before You Start Calculating

The formula is not the problem. The prompt set is. As Alex Birkett puts it, the hard part of measuring AI share of voice is "building a prompt set that reflects how your customers think and speak." Most teams either use too few prompts, or they bias heavily toward branded queries like "[brand] reviews" or "[brand] vs [competitor]" while ignoring the category-level and use-case queries where brand discovery actually happens.

There are four ways a prompt set can fool you:

  • Too few prompts means random variation in AI responses produces noise that looks like signal. We recommend a minimum of 30-50 prompts per topic-market combination before you trust any trend line.
  • Branded-query overload inflates your score artificially. Of course you appear when someone asks "tell me about [your brand]." That's not how buyers discover you.
  • Single-engine measurement misses the variation between platforms. A brand that appears consistently in Perplexity may barely register in Claude. Platform behaviour differs because retrieval mechanisms differ.
  • Static prompt sets age out. Search patterns shift. New use cases emerge. A prompt set that was accurate in Q1 can be misleading by Q3.

How to Build a Prompt Set That Gives You Accurate Data

A reliable prompt set covers multiple intent types, not just branded queries. Structure your prompts across these six categories, and you'll capture the full range of moments where your brand could appear.

Intent Type Example Prompt What It Measures
Category "What's the best project management tool for remote teams?" Baseline brand awareness in AI training data
Use-case "What CRM should I use for a 10-person sales team?" Contextual relevance in specific scenarios
Comparison "How does [brand] compare to [competitor]?" Competitive positioning
Recommendation "Can you recommend an email tool for a DTC brand?" Likelihood of being named when asked directly
Problem-solution "How do I reduce churn in a SaaS product?" Whether you appear in solution contexts
Feature-specific "Which analytics platform has the best attribution modelling?" Feature association and differentiation

Good prompts read like real user queries. "List CRM software brands" is a test string. "I'm running a 15-person sales team and we're outgrowing spreadsheets - what should we use?" is how a buyer actually thinks. AI engines respond differently to these, and your share of voice score will look very different depending on which version you track.

If you want prompts built from real search data rather than intuition, BrandPrompts generates research-backed prompt sets using keyword volumes, People Also Ask patterns, and trend data, then tags every prompt by intent type, market, and competitor relevance. The output imports directly into tracking platforms like Peec AI, Profound, and Searchable.

Which Platforms Should You Track?

The answer is all of them, because visibility varies greatly by platform. ChatGPT uses Bing's index for live retrieval. Claude searches via Brave. Google AI Overviews draws from Google's own index. Perplexity uses its own crawler plus third-party APIs. The same brand, with the same content, can have meaningfully different visibility scores across these four platforms.

AI search is no longer a one-platform market. Gemini's AI Overviews reaches 2 billion monthly users. ChatGPT has 900 million weekly active users as of February 2026. Perplexity crossed 100 million monthly active users across its search and agent products as of April 2026. Claude's website recorded 823.5 million visits in April 2026 alone. You need visibility data across all four, not just the one your team uses internally.

Practically, this means tracking in Peec AI, Profound, or a similar platform that queries multiple AI engines and records mention rates per prompt. Manual testing across platforms is useful for spot checks, but it doesn't scale to a statistically meaningful prompt set.

Interpreting Your AI SoV Data Correctly

Once you have a score, the instinct is to benchmark it against some industry standard. There are no reliable industry standards yet. Benchmarks are still emerging and vary enormously by category, competitive density, and prompt construction. Don't let anyone sell you a "good" score in the abstract.

What you can do is compare against your own baseline over time, and compare against specific competitors across specific intent types. A score broken down by prompt category is far more useful than a single aggregate number. If your category-prompt score is strong but your problem-solution score is weak, that tells you exactly where your content strategy has gaps.

One thing worth noting: AI share of voice captures something rankings and traffic can't. It reflects your overall web presence within a category, including earned media, community mentions, and third-party coverage, not just your own site's performance. A brand with modest SEO rankings but strong PR coverage can outperform its organic position in AI share of voice. The reverse is also true.

Frequently Asked Questions

How do I measure AI share of voice?

Run a defined set of prompts across your target AI engines. Record whether your brand appears in each response. Divide the number of prompts where you appear by the total prompts run, then multiply by 100. For competitive benchmarking, divide your mentions by the total mentions across all tracked brands. Tools like Semrush's AI Visibility Toolkit, Peec AI, and Profound automate this at scale.

How do I calculate share of voice?

The formula is: (your brand mentions / total brand mentions across all competitors) × 100. The output is a percentage of a shared pie that adds up to 100%. The most important variable is not the formula itself but the prompt set you run it against. A poorly constructed prompt set produces a number that looks precise but doesn't reflect how buyers actually search.

What does 50% share of voice mean in AI search?

It means your brand appears in half of all tracked prompts within your competitive set. At 50%, you're likely the leading brand in AI responses for your category, though competitors are still capturing significant visibility. The score only means something relative to the quality of your prompt set and the competitors you've defined.

What does 100% share of voice mean?

Your brand appears in every tracked response and no other brand in your defined competitor set does. This is rare and almost never stable, because it typically reflects a narrow prompt set or a niche with very low competition. A 100% score on a small, branded-query-heavy prompt set is not a good signal. A 100% score across a broad, intent-varied prompt set would be genuinely extraordinary.

How many prompts do I need for reliable AI SoV data?

We recommend a minimum of 30-50 prompts per topic-market combination. Below that threshold, the non-deterministic nature of AI responses means random variation can swing your score greatly between measurement periods. More prompts give you more statistical confidence that what you're seeing is a real trend, not noise.

The Measurement Trap to Avoid

The most common mistake we see is tracking a small set of branded queries, seeing a high score, and concluding that AI visibility is fine. It's not fine if you're only measuring prompts where you were already likely to appear. The prompts that matter most are the ones buyers use before they've decided which brand to consider. That's the category-level, use-case, and problem-solution layer, and most brands are far less visible there than they think.

Build the prompt set first. Then calculate the score. In that order. If you need help with the prompt research step, BrandPrompts has a one-off pricing model starting at $29, with no recurring subscription, for teams that want research-backed prompt sets ready to import into their tracking platform of choice.

The formula won't fool you. The prompt set will, if you let it.

Track your brand's AI search visibility

BrandPrompts monitors how your brand appears across ChatGPT, Perplexity, Gemini, and Google AI Overviews. Know where you stand before your competitors do.

Get started freeOr calculate how many prompts you need to track →