+++

GEO (Generative Engine Optimization): researched methods

If you are reading this text, probably you are SEO-specialist or copywriter, and you are searching new methods because traditional SEO doesn`t work, so get these GEO tips to get organic results. Generative Engine Optimisation (GEO), a new paradigm designed to help content creators improve the visibility of their websites in responses generated by large language models. Unlike traditional search engine optimisation (SEO), GEO methods take into account how generative engines synthesise and summarise information from various sources. The research present GEO-bench, an extensive dataset for evaluating these techniques, and show that strategies such as adding statistics, quotes, and links can significantly improve visibility.

Basic GEO methods in 1 pic

Effective GEO methods

    • Adding proven links: Including relevant links from reliable sources.

    • Expert quotes: Inserting citations from authoritative sources. (up to 30-40%)

    • Adding statistics: Replacing qualitative arguments with quantitative statistical data where possible.

    • Fluency Optimisation and Ease of Understanding: stylistic changes, such as improving the fluency and readability of the text

Which GEO methods have proven most effective in improving content visibility?
GEO research has shown that several methods can significantly increase visibility. These methods have led to a 30-40% increase in visibility on the ‘Words adjusted for position’ metric and a 15-30% increase on the ‘Subjective impression’ metric. It was also found that stylistic changes, such as improving the fluency and readability of the text (‘Fluency Optimisation’ and ‘Ease of Understanding’), increase visibility by 15-30%, indicating that generative engines value both content and its presentation.

Here you can have an example of GEO citation:
Pranjal Aggarwal PHD Student & Research Intern in his Princeton GEO research 2024 says:
Specifically, our top-performing methods, Cite Sources, Quotation Addition, and Statistics Addition, achieved a relative improvement of 30-40% on the Position-Adjusted Word Count metric and 15-30% on the Subjective Impression metric.

GEO methods 2What to emulate from these example for GEO (expert quotes):

Neutral tone: The expert explains rather than promotes — generative systems prefer factual, non-promotional wording.
Full attribution: Name + degree/position + organisation placed next to the quote (not only at the end of the article).
Proximity to facts: Put the quote immediately after the claim/statistic so the AI can easily lift the entire block.
Layout: Use clear pull-quotes/block quotes or “says Dr. …” — models often extract these as ready-made answer fragments.

Expert profile: Include a link/box about the specialist (bio, role, affiliation) to strengthen E-E-A-T and the likelihood of citation.

GEO metrics

Generative Engine Optimization (GEO) proposes a new suite of impression metrics to measure content visibility in Generative Engines (GEs), as traditional SEO metrics are insufficient for this new paradigm. GEs provide rich, structured responses with embedded citations that vary in length, position, and style, making the concept of visibility more nuanced and multi-faceted than in traditional search engines, which typically present a linear list of websites.

The GEO framework introduces new impression metrics designed with three key principles: they should be relevant for creators, explainable, and easily comprehensible by a broad spectrum of content creators. These metrics allow content creators to gauge and optimize their website’s performance in GEs.

Here are the key GEO metrics:

    1. Word Count (Impwc):
        • This metric is the normalized word count of sentences that cite a particular source within the GE’s response.

        • A higher word count suggests that the source played a more significant role in the generated answer, indicating greater user exposure to that source.

        • Mathematically, it’s defined as the sum of word counts of sentences citing a source, divided by the total word count of sentences in the response. If a sentence is cited by multiple sources, its word count is shared equally among them.

    1. Position-Adjusted Word Count (Imppwc):
        • This metric builds upon the Word Count but assigns a higher weight to sentences cited earlier in the response.

        • It uses an exponentially decaying function based on the citation’s position, reflecting the intuition that content appearing earlier is more likely to be read.

        • This means a website cited at the top might have a higher impression even with a lower word count than one cited later. The choice of an exponentially decaying function is motivated by studies showing that click-through rates often follow a power-law as a function of ranking in search engines.

        • The Position-Adjusted Word Count is one of the two main metrics used for evaluating GEO methods in research.

    1. Subjective Impression:
        • This is a comprehensive metric that incorporates various subjective factors related to the impact of citations on user attention. It aims to capture aspects that objective metrics like word count might miss.

        • It includes the following sub-metrics, often measured using LLM-based evaluation methods like G-Eval:
            • Relevance of the cited material to the user query.

            • Influence of the citation, assessing how much the generated response relies on it.

            • Uniqueness of the material presented by the citation.

            • Subjective position, gauging the prominence of the source’s positioning from the user’s viewpoint.

            • Subjective count, measuring the amount of content from the citation as perceived by the user.

            • Probability of a user clicking the citation.

            • Diversity of the presented material.

        • To allow for fair comparison, Subjective Impression scores are normalized to have the same mean and variance as Position-Adjusted Word Count, as G-Eval scores can be poorly calibrated.

        • It serves as the second main evaluation metric for GEO methods.

All impression metrics are normalized so that the sum of impressions of all citations in a response equals 1, and performance is compared by calculating the relative improvement in impression.

Timeline of Generative Engine Optimization (GEO)

    • Three Decades Ago (Pre-LLMs): Traditional search engines, like early Google and Bing, revolutionized information access. These engines primarily provided lists of relevant website links based on keyword matching. Search Engine Optimization (SEO) emerged as a practice to improve website rankings in these traditional search results.

    • Recent Past (Advent of LLMs): The success of large language models (LLMs) like GPT-3, GPT-4, and others led to the development of a new paradigm of search engines, which the sources formalize as Generative Engines (GEs).

    • Emergence of Generative Engines (GEs): GEs (e.g., Google’s SGE, Perplexity.ai, BingChat) begin to rapidly replace traditional search engines. These systems synthesize information from multiple sources using LLMs to generate comprehensive, personalized, and often multi-modal responses, grounded in retrieved sources and with attribution.

    • Problem for Content Creators Identified (Concurrent with GE Emergence): The rise of GEs poses a significant challenge for website and content creators. GEs directly provide answers, potentially reducing organic traffic to websites, which many small businesses and individuals rely on. The “black-box” and fast-moving nature of GEs makes it difficult for creators to understand how their content is ingested and displayed.

    • Introduction of Generative Engine Optimization (GEO): To address the disadvantages faced by content creators, “Generative Engine Optimization (GEO)” is introduced as a novel paradigm. GEO is a flexible black-box optimization framework designed to help content creators improve their content’s visibility in GE responses by tailoring and calibrating presentation, text style, and overall content.

    • Development of New Impression Metrics for GEs: Traditional SEO metrics (like average ranking on a SERP) are deemed insufficient for GEs due to their rich, structured responses with embedded citations. GEO proposes new impression metrics, including:

    • Word Count (Impwc): Normalized word count of sentences citing a source.

    • Position-Adjusted Word Count (Imppwc): Word count weighted by an exponentially decaying function based on citation position, giving higher value to earlier mentions.

    • Subjective Impression: A comprehensive metric incorporating subjective factors like relevance, influence, uniqueness, subjective position, subjective count, click probability, and diversity, often measured using LLM-based evaluation methods like G-Eval.

    • Creation of GEO-bench (for systematic evaluation): A large-scale benchmark called GEO-bench is curated. It consists of 10,000 diverse user queries across multiple domains, along with relevant web sources. This benchmark facilitates systematic evaluation of GEO methods.

    • Initial Research and Evaluation of GEO Methods: Rigorous evaluations using GEO-bench are conducted to test the efficacy of various GEO methods.

    • Key Findings (High-Performing Methods): Methods like Quotation Addition, Statistics Addition, and Cite Sources are found to significantly boost visibility (up to 28% on Subjective Impression and 40% on Position-Adjusted Word Count). Fluency Optimization and Easy-to-Understand language also show substantial improvements (15-30%).

    • Key Findings (Ineffective Methods): Traditional SEO methods such as Keyword Stuffing are found to offer little to no improvement, and in some cases, perform worse than no optimization. “Authoritative” writing style also shows no significant improvement overall.

    • Discovery of Domain-Specific GEO Efficacy: Research reveals that the effectiveness of GEO methods varies significantly across different content domains and query types (e.g., “Authoritative” for debate/history, “Cite Sources” for factual questions, “Statistics Addition” for ‘Law & Government’/’Opinion’ questions, “Quotation Addition” for ‘People & Society’/’Explanation’/’History’). This underscores the need for targeted GEO strategies.

    • Observation of Differential Impact on Websites by SERP Rank: It’s discovered that lower-ranked websites (in traditional SERP) benefit significantly more from GEO methods, sometimes seeing increases over 100% (e.g., 115.1% for “Cite Sources” on fifth-ranked sites), while top-ranked sites may see a decrease. This suggests GEO has the potential to “democratize the digital space.”

    • Demonstration of Combined GEO Strategy Benefits: Studies show that combining multiple GEO strategies can lead to enhanced performance, outperforming single strategies (e.g., Fluency Optimization and Statistics Addition combined outperforms single strategies by over 5.5%). “Cite Sources” also significantly boosts performance when combined.

    • Validation of GEO on Deployed Generative Engines: GEO methods are evaluated on a commercially deployed generative engine, Perplexity.ai, with millions of users. The results confirm the generalizability and real-world impact of the proposed GEO methods, showing improvements up to 37% and reaffirming the ineffectiveness of traditional SEO methods like Keyword Stuffing in this new environment.

GEO FAQ

Generative Engines (GEs), like Google’s SGE or Perplexity.ai, represent a new paradigm in information retrieval. Unlike traditional search engines that primarily provide a list of relevant website links for a user query, GEs utilize large language models (LLMs) to synthesize information from multiple sources and generate a comprehensive, personalized, and often multi-modal response. These responses are grounded in the retrieved sources, ensuring attribution and allowing users to verify the information, effectively eliminating the need to navigate directly to websites. This shift significantly improves user utility by offering faster and more accurate information, but it also creates challenges for content creators.

Generative Engine Optimization (GEO) is a novel paradigm introduced to help content creators improve their content’s visibility in Generative Engine responses. It’s necessary because the rise of GEs disadvantages website and content creators. GEs directly provide answers, potentially reducing organic traffic to websites, which many small businesses and individuals rely on. Furthermore, the black-box and fast-moving nature of GEs makes it difficult for creators to understand how their content is ingested and displayed. GEO provides a flexible black-box optimization framework that enables content creators to tailor and calibrate their content’s presentation, text style, and overall content to increase visibility.

Traditional SEO measures visibility by a website’s average ranking on a search engine results page (SERP), which typically presents a linear list of websites. However, this metric is insufficient for Generative Engines because GEs provide rich, structured responses with embedded citations that can vary in length, position, and style. The concept of visibility in GEs is more nuanced and multi-faceted.

To address this, GEO proposes new impression metrics, including:

  • Word Count (Impwc): The normalized word count of sentences that cite a particular source within the GE’s response. A higher word count suggests the source played a more significant role in the answer.
  • Position-Adjusted Word Count (Imppwc): This metric builds on word count but assigns higher weight to sentences cited earlier in the response, using an exponentially decaying function based on citation position. This reflects the intuition that content appearing earlier is more likely to be read.
  • Subjective Impression: This comprehensive metric incorporates subjective factors like the relevance of cited material to the query, the influence of the citation on the response, the uniqueness of the material, subjective position and count, the probability of a user clicking the citation, and the diversity of presented material. These are often measured using LLM-based evaluation methods like G-Eval.

The research highlights several highly effective GEO methods, which generally involve enriching content and improving its presentation:

  • Quotation Addition: Incorporating credible quotes from reliable sources significantly boosts visibility (up to 28% on Subjective Impression and 40% on Position-Adjusted Word Count).
  • Statistics Addition: Adding relevant quantitative statistics instead of purely qualitative discussions also leads to substantial improvements (up to 23.7% on Subjective Impression).
  • Cite Sources: Explicitly including citations from reliable sources enhances credibility and visibility.
  • Fluency Optimization: Improving the fluency of the website text can lead to a significant visibility boost (15-30%), suggesting GEs value good information presentation.
  • Easy-to-Understand: Simplifying the language of the website also contributes to increased visibility.

These methods demonstrate that GEs value not only the content itself but also how it is presented and supported.

Traditional SEO methods, such as Keyword Stuffing (modifying content to include more keywords from the query), have been found to offer little to no improvement in generative engine responses. In some cases, keyword stuffing even performed worse than a baseline with no optimization. This underscores a key difference between traditional search engines (which might rely heavily on keyword matching) and generative engines (which employ language models for a more nuanced understanding of text and user queries). Website owners need to rethink their optimization strategies for this new paradigm.

Yes, the efficacy of different GEO methods varies significantly across domains and query types. For example:

  • Authoritative writing styles are more effective for debate-style questions and historical domains.
  • Cite Sources is particularly beneficial for factual questions, as citations provide verification.
  • Statistics Addition is highly effective in domains like ‘Law & Government’ and for ‘Opinion’ type questions, where data-driven evidence adds value.
  • Quotation Addition works well in ‘People & Society,’ ‘Explanation,’ and ‘History’ domains, where personal narratives or historical events are common.

This domain-specific variation implies that content creators should adopt targeted GEO strategies based on the specific subject matter and intent of their content to achieve the best visibility.

GEO methods tend to have a differential impact based on a website’s traditional Search Engine Results Pages (SERP) ranking. Notably, lower-ranked websites often benefit significantly more from GEO. For instance, the “Cite Sources” method resulted in a remarkable 115.1% increase in visibility for websites ranked fifth in SERP, while top-ranked websites sometimes saw a decrease.

This suggests that GEO has the potential to democratize the digital space. Traditional search engines often favor larger corporations with established domain presence and backlinks, which are hard for small creators to achieve. Generative Engines, by focusing on content quality and presentation as optimized by GEO, can level the playing field, allowing smaller entities to improve their visibility and compete more effectively.

Yes, combining multiple GEO strategies can lead to enhanced performance. For example, the combination of Fluency Optimization and Statistics Addition was found to outperform any single GEO strategy by more than 5.5%. Additionally, “Cite Sources” significantly boosted performance when used in conjunction with other methods, even though it was less effective when used alone. This indicates that content creators are likely to employ a mix of strategies in the real world, and these combinations can yield synergistic improvements in visibility. NotebookLM can be inaccurate; please double check its responses.