Microsoft has introduced a new AI Performance Report inside Bing Webmaster Tools. In my view, this marks one of the first real steps toward measuring visibility in AI experiences, not just traditional search rankings.
In this article, I want to summarise what I explained in my video: what the report shows, how the metrics work, where the gaps still are, and why this matters if you care about Generative Engine Optimization (GEO).
Why Microsoft Released the AI Performance Report
For years we measured success using clicks, impressions, and rankings. That model starts to break down once AI answers summarize content directly inside Copilot or AI summaries.
The new report introduces an analytics layer focused on AI citations instead of classic SERP performance.
From my perspective, the goal is clear:
- Help publishers understand when their content is used as a source in AI answers
- Provide visibility into which topics trigger citations
- Move measurement closer to influence, not just traffic
Microsoft describes this as giving publishers insight into how content appears across AI experiences within Bing.
What the New Metrics Actually Mean
Inside the report, there are a few core metrics that matter.
Total Citations
This shows how often pages from your website appear as sources in AI-generated responses during a selected time period.
This is not a ranking signal and it is not traffic. It is simply confirmation that Bing’s AI systems referenced your content.
Average Cited Pages
This metric represents the average number of unique pages cited per day.
I see this as a rough indicator of topical depth. If more pages are cited, it often means Bing recognizes broader authority around a subject.
Page-Level Citation Data
You can drill down to see:
- Which URLs are cited
- How frequently they appear
- The query themes connected to those citations
One important detail: Bing does not show the actual prompts. Instead, it shows the “fan-out” search queries that likely contributed to the AI response.
The Biggest Limitation: No Prompt Data
One thing I was really hoping for was access to the actual prompts.
Right now:
- You do not see the original AI question
- You do not see click-through rate
- You do not see user engagement from the AI answer itself
Instead, Bing exposes the expanded queries derived from prompts.
This is useful, but it means analysts still need to reverse-engineer intent rather than measure it directly.
How This Differs From Traditional Search Performance
Here is how I personally separate the two reporting models.
| Classic Search Performance | AI Performance Report |
|---|---|
| Focus on clicks and rankings | Focus on citations |
| Measures SERP behavior | Measures AI usage |
| Keyword-driven analysis | Prompt fan-out analysis |
| Visibility tied to traffic | Visibility tied to influence |
In short, we are moving from measuring Did someone click? to Was my content used as a source?
That is a major shift in how discovery works.
Why Citations Matter Even Without Clicks
One of the key points I make in the video is that influence now happens even when there is no visit.
If your content is cited:
- Your brand or expertise shapes the answer
- Your information influences user decisions
- But analytics may show zero traffic
This is exactly why GEO is becoming critical. Visibility is no longer limited to blue links.
How This Connects to Adobe LLM Optimizer and GEO Workflows
Even with this new report, I still see tools like Adobe LLM Optimizer as highly relevant.
Why?
Because Bing still does not provide:
- Prompt data
- Cross-platform visibility (ChatGPT, Gemini, etc.)
- Deep competitive insights
In my opinion, the real opportunity is combining Bing’s citation data with:
- Log file analysis
- Prompt simulations
- LLM monitoring tools
My team is already exploring how to ingest these grounded queries and use them to better understand prompt behavior.
Practical Takeaways From the Report
If you are working on GEO or AI visibility, here is how I would approach this new data:
- Identify URLs with high citation counts and expand those topic clusters.
- Look at fan-out queries to understand how prompts branch into multiple searches.
- Compare citation activity with crawl logs to validate AI usage patterns.
- Treat citations as an influence metric, not a traffic metric.
What This Report Does Not Cover (Yet)
It is important to set expectations.
Right now the report only reflects:
- Bing Copilot and Bing AI experiences
- Bing’s own ecosystem
It does not include:
- ChatGPT
- Perplexity
- Gemini
- Other LLM platforms
So while it is a big step forward, it is still just one piece of the AI visibility puzzle.
My Conclusion
I see this release as the first official GEO-style reporting feature from a major search platform.
It shows that measurement is shifting away from rankings and toward AI usage and citations.
But we are still early.
Without prompts, cross-platform data, or CTR visibility, we need to combine this report with external tooling and deeper analysis.
Still, this is a strong signal of where search analytics is heading next.