May 28, 2026/7 min read/how chatgpt actually chooses citations: bing, authority, and the 87% rule

Abstract visualization: flowing green nodes on dark background — how chatgpt actually chooses citations: bing, authority, and the 87% rule

How ChatGPT Actually Chooses Citations: Bing, Authority, and the 87% Rule in 2026

ChatGPT's citation selection is not random and it's not a mystery. In web browsing mode, ChatGPT retrieves live content through Bing's search index, and research shows that roughly 87% of SearchGPT citations match Bing's top-ranked pages. If your site doesn't rank in Bing, ChatGPT almost certainly won't cite it. Understanding that single fact changes how you should think about GEO entirely.

Why ChatGPT Has Two Completely Different Citation Modes

ChatGPT operates in two distinct modes, and which one is active determines everything about how sources get selected. Most marketers don't know this, which is why their GEO strategy is often aimed at the wrong target.

In its default mode, ChatGPT generates responses from patterns in its training data. According to Sudha Solutions, that training data is roughly 570 GB of text collected before the model's knowledge cutoff. In this mode, ChatGPT isn't retrieving any live sources. When it appears to cite something, it's constructing a plausible-sounding reference from memory. Those citations can be fabricated. They're not pulled from a real-time index.

Web browsing mode is different. When a user's query benefits from current information, ChatGPT fetches live web content through Bing. It then synthesises that retrieved content into a response and cites the actual pages it read. This is the mode that matters most for brand visibility, and it's the one where the 87% Bing correlation holds.

The practical implication: your Bing SEO is your ChatGPT GEO. The two are not separate workstreams.

What Is the 87% Rule and Why Does It Matter?

The 87% figure refers to the alignment between ChatGPT's cited sources and Bing's top search results for the same queries. When ChatGPT searches the web, it uses Bing's index as its retrieval pool. It doesn't have its own separate web crawler for real-time queries. Bing is the gate.

This matters because most brands optimise exclusively for Google. Their Bing presence is an afterthought: not verified in Bing Webmaster Tools, not checked for crawl errors, not monitored for ranking positions. If that describes your situation, you have a blind spot that's costing you ChatGPT visibility every day.

What this means in practice:

Verify your site is indexed in Bing Webmaster Tools. Submit your sitemap if it isn't.
Check that OAI-SearchBot is allowed in your robots.txt. If it's blocked, ChatGPT's real-time retrieval can't reach you.
Monitor your Bing ranking positions for category-level queries, not just branded ones. That's where discovery happens.
Ensure your pages load and render without JavaScript dependency. Bing's crawler is less patient with JS-heavy content than you might expect.
Watch for content behind login walls, bot detection, or click-to-expand sections. ChatGPT can't cite what it can't read.

Which Sources Does ChatGPT Actually Prefer?

Beyond Bing ranking position, ChatGPT has clear patterns in the types of sources it favours. Research by Discovered Labs found that Wikipedia accounts for 47.9% of ChatGPT's top citations. That's a number worth sitting with. Nearly half of ChatGPT's citations come from a single source that most brands can't publish on directly.

The implication is that ChatGPT is heavily biased toward earned authority: third-party encyclopedic coverage, editorial mentions, established institutional sources. Your brand's own blog is competing against Wikipedia for citation slots, and Wikipedia usually wins.

This is why the off-page dimension of GEO matters more than most people realise. Getting cited in authoritative third-party content, appearing in industry roundups, being mentioned on sites with genuine editorial standards, these are the signals ChatGPT's retrieval system weights heavily. A well-written page on your own domain is a starting point, not a destination.

How ChatGPT's Citation Selection Compares Across AI Engines

ChatGPT's citation behaviour is distinct from Perplexity's and Claude's. Understanding those differences tells you where to invest, and how to structure that investment differently per platform.

Platform	Primary Index	Top Citation Source	Key Differentiator
ChatGPT	Bing (real-time retrieval)	Wikipedia (47.9% of top citations)	87% alignment with Bing top results
Perplexity	Own crawler + search APIs	Reddit (46.7% of top citations)	Average 8.2 cited sources per answer
Claude	Brave Search index	Earned/editorial media	Conservative citation habits; skews technical
Google AI Overviews	Google Search index	Distributed (Reddit, YouTube prominent)	Appears in ~18% of Google searches
Gemini	Google ecosystem	Medium, Reddit, YouTube	More brand-owned content than peers

The Discovered Labs research also found that only 12% of AI citations overlap with Google's top 10 results. Strong Google rankings don't transfer to AI citation visibility. Each engine has its own retrieval logic, its own index, and its own source preferences. A GEO strategy built around Google performance alone leaves most of the citation opportunity uncaptured.

What Makes Content Actually Citable by ChatGPT?

Getting into Bing's index is necessary but not sufficient. Once ChatGPT retrieves a set of candidate pages, it still has to decide which ones to cite in its response. That selection is based on how well the content answers the specific query, how clearly it's structured, and how much authority the source carries.

Content structure is a bigger factor than most brands appreciate. ChatGPT's retrieval pipeline chunks pages into passages. It doesn't read your article like a human does, top to bottom, building context as it goes. It retrieves chunks and evaluates them independently. If your most relevant information is buried in paragraph seven after three paragraphs of scene-setting, ChatGPT may never get to it. Put the answer in the first sentence of every section. Make every paragraph independently understandable.

Authority signals matter too. Pages that demonstrate genuine expertise, carry real author credentials, link to primary sources, and earn mentions from independent third-party sites are weighted more heavily. Generic thought leadership without supporting data or external validation tends to get ignored.

One finding worth noting: Discovered Labs' research showed that citation rates from well-optimised content can rise from the 5-15% range to 40-50% within six months when structured correctly for AI retrieval. That's a real range, not a guaranteed outcome, but it suggests the gap between optimised and unoptimised content is substantial.

The Monitoring Problem: How Do You Know If You're Being Cited?

This is where most brands are flying blind. ChatGPT processes over 2 billion queries daily (per the source data above), and there's no Search Console equivalent for AI citations. You can't see your "AI ranking position." You either appear in a response or you don't, and the same query can produce different answers on different runs because AI responses are non-deterministic.

The only way to measure your ChatGPT citation visibility systematically is to run a structured set of prompts across the engine repeatedly and track whether and how your brand appears. That requires two things: the right prompts, and a consistent testing methodology.

Most brands tracking GEO visibility start with too few prompts, almost always biased toward branded queries like "[brand] review" or "[brand] vs [competitor]." Those aren't the queries where brand discovery happens. The high-value prompts are category-level: "best CRM for small businesses," "what project management tool should I use for remote teams," "how do I solve [problem my product solves]." Those are the queries where ChatGPT decides whether your brand exists in the answer or not.

If you're building out a prompt tracking programme, BrandPrompts generates research-backed prompt sets from real search data specifically designed for import into GEO tracking platforms like Peec AI, Profound, and Otterly.AI. The prompts are structured across intent types so you're measuring category, comparison, recommendation, and problem-solution queries, not just branded ones.

Frequently Asked Questions

Does ChatGPT use Google for its web search?

No. ChatGPT uses Bing's search index for real-time web retrieval, not Google. This is why Bing ranking positions are directly relevant to ChatGPT citation visibility. Research shows roughly 87% of SearchGPT citations align with Bing's top results for the same queries.

Why does Wikipedia appear so often in ChatGPT citations?

Wikipedia accounts for 47.9% of ChatGPT's top citations according to Discovered Labs research. Wikipedia carries extremely high authority signals: it's heavily referenced across the web, updated frequently, written in structured encyclopedic format, and indexed prominently in Bing. ChatGPT's retrieval system weights these signals heavily. Brands can't publish directly on Wikipedia, but earning mentions that eventually make their way into Wikipedia entries is a long-term GEO play worth considering.

If my site ranks well on Google, will ChatGPT cite me?

Not necessarily. Only 12% of AI citations overlap with Google's top 10 results, according to Discovered Labs. Google SEO and ChatGPT citation visibility are largely separate outcomes because they rely on different indexes and different retrieval logic. Bing performance is the more direct lever for ChatGPT visibility.

How many queries should I be tracking to measure ChatGPT citation visibility?

For statistically reliable visibility measurement, you need at least 30-50 prompts per topic-market combination. Fewer than that and natural variation in AI responses makes the data too noisy to act on. A tracking programme with fewer than 30 prompts per topic is likely producing misleading results.

Does ChatGPT cite brand-owned content or only third-party sources?

ChatGPT will cite brand-owned content if it appears in Bing's top results for the query. But its citation patterns skew heavily toward earned, third-party coverage. Wikipedia, editorial publications, and authoritative third-party sites are systematically preferred. Brand-owned pages compete but generally need stronger external authority signals to win citation slots against well-established third-party sources.

Where to Start

The 87% Bing correlation is the most actionable single fact in ChatGPT GEO. It tells you exactly where to start: Bing Webmaster Tools, robots.txt configuration for OAI-SearchBot, and Bing ranking positions for your category-level queries. Most brands haven't done this work because they've assumed Google performance transfers to AI visibility. It doesn't.

Beyond Bing, the citation patterns point clearly toward earned authority. Wikipedia's dominance in ChatGPT citations is a signal that the AI is looking for the kinds of trust markers that third-party coverage provides. Your GEO strategy needs an off-page component, not just content optimisation on your own domain.

The measurement piece is where brands stall. Without a structured prompt set and a consistent testing methodology, you're guessing at your own visibility. Check out the BrandPrompts pricing page if you want to see what a research-backed prompt programme looks like before you commit to building one from scratch. The tracking platforms are only as useful as the prompts you feed them.

Track your brand's AI search visibility

BrandPrompts monitors how your brand appears across ChatGPT, Perplexity, Gemini, and Google AI Overviews. Know where you stand before your competitors do.

Get started free Or calculate how many prompts you need to track →