Arcalea Glossary
The language of marketing intelligence, clearly defined.
Answer Engine Optimization (AEO)
AEO is the practice of structuring content, schema markup, and brand entity signals so that AI systems (ChatGPT, Perplexity, Google AI Overviews, Gemini, and Copilot) cite your brand when answering questions relevant to your business. AEO differs from SEO in that the goal is citation inside a generated answer, not a ranked link in search results. It requires three foundations: entity clarity (AI knows what your brand is), content extractability (AI can pull direct answers from your pages), and authority signals (AI trusts your brand as a source).
AEO Audit
An AEO Audit is a structured assessment of the infrastructure factors that determine whether and how a brand is cited by AI answer engines. Arcalea's AEO Audit framework evaluates seven layers: entity clarity, schema completeness, content extractability, authority signals, citation source analysis, accuracy monitoring, and competitive AI share of voice. The output is a prioritized remediation plan with specific infrastructure improvements, ranked by expected impact on citation rate.
AEO Index
The AEO Index is Arcalea's proprietary platform for tracking brand citation frequency, accuracy, and competitive presence across ChatGPT, Perplexity, Google AI Overviews, Gemini, and Microsoft Copilot. It benchmarks AI search visibility at the brand, category, and keyword level. Arcalea also publishes original industry-level AEO Index research across verticals including higher education, healthcare, and professional services, providing competitive benchmarks for brands in each sector.
AI Overviews
AI Overviews (formerly Search Generative Experience, or SGE) is Google's AI-generated summary that appears at the top of search results pages for many queries. AI Overviews synthesize information from multiple sources into a direct answer and include source citations. Appearing in AI Overviews is distinct from traditional SERP ranking: it requires AEO infrastructure, specifically schema markup, entity clarity, answer-first content structure, and strong E-E-A-T signals. Click-through rates from AI Overviews citations are meaningfully lower than traditional position-1 rankings.
AI Share of Voice
AI Share of Voice is a metric that measures how frequently a brand appears in AI-generated answers relative to competitors within a defined topic or category. Expressed as a percentage, it is calculated by running a standardized set of queries across AI platforms and recording citation frequency by brand. Arcalea measures AI Share of Voice through its AEO Index platform, tracking both overall citation rate and share relative to the competitive set. It is the AI-search equivalent of traditional search share of voice.
Answer-First Content
Answer-first content is a writing structure in which the direct answer to a question appears in the first one to two sentences, before context, supporting evidence, or qualifications. This structure increases the probability that AI systems will extract and cite the content, because AI answer engines favor passages that deliver the answer immediately. A page that begins a section with "AEO stands for Answer Engine Optimization. It is the practice of…" is answer-first; one that begins "When thinking about how AI systems work, it's important to understand…" is not.
Architecture of Authority
Architecture of Authority is Arcalea's proprietary framework for building the entity infrastructure that causes AI systems to recognize and cite a brand as an authoritative source in its category. It encompasses six layers: structured data completeness (Organization, Person, FAQPage schemas), entity disambiguation (Wikidata/Knowledge Graph presence), citation source quality (backlink authority, Wikipedia coverage), content extractability (answer-first structure), E-E-A-T signals (credentials, author attribution), and sameAs cross-referencing across authoritative platforms.
Attribution Window
An attribution window is the time period during which a marketing touchpoint can receive credit for a conversion. For example, a 30-day click attribution window credits a paid ad click for any purchase that occurs within 30 days of that click. Window length significantly affects reported channel performance: shorter windows undercount upper-funnel activity; longer windows can over-credit early touchpoints in long sales cycles. Platforms like Google and Meta set default windows that favor their own channels, which is one reason cross-channel attribution tools like Galileo produce meaningfully different results than platform-native reporting.
Customer Acquisition Cost (CAC)
Customer Acquisition Cost (CAC) is the total sales and marketing cost required to acquire one new customer, calculated as total acquisition spend divided by new customers acquired in a given period. CAC is a core unit economics metric for evaluating channel efficiency and determining whether growth is sustainable. The ratio of LTV to CAC (LTV:CAC) is a standard indicator of long-term business health; a 3:1 or higher ratio is typically considered healthy. CAC should be tracked by acquisition channel, not just in aggregate, to identify which programs are building a sustainable business vs. burning cash at above-sustainable rates.
Citation Rate
Citation rate is the percentage of relevant queries, within a defined topic set, for which an AI answer engine includes a brand's name, product, or content in its response. A 60% citation rate means the brand appears in 60 out of 100 queries on relevant topics. Citation rate is the primary AEO performance metric, analogous to click-through rate in traditional SEO. It is measured separately by AI platform (ChatGPT citation rate, Perplexity citation rate, etc.) since citation behavior varies significantly across platforms.
Cookieless Attribution
Cookieless attribution refers to measurement methods that track marketing performance without relying on third-party browser cookies, which have been deprecated by Safari, Firefox, and effectively phased out by Chrome's SameSite changes. Cookieless approaches include first-party data modeling, server-side tracking, data clean rooms, probabilistic matching, and platform-reported conversions. Galileo uses a first-party, CRM-integrated attribution model that is inherently cookieless: it connects to closed revenue data in the CRM rather than relying on browser-level tracking cookies.
Data-Driven Attribution
Data-driven attribution is a model that uses machine learning to assign fractional credit to each touchpoint based on the actual conversion patterns in your data, rather than applying a fixed rule (like last-click or linear). Google's data-driven attribution model is built into Google Ads and GA4, but it only attributes credit to Google's own touchpoints and uses self-reported conversion data. Galileo builds a client-specific, cross-channel data-driven model that includes all paid, organic, and offline touchpoints, then connects to CRM revenue data, producing materially different (and more accurate) results than platform-native data-driven attribution.
E-E-A-T
E-E-A-T stands for Experience, Expertise, Authoritativeness, and Trustworthiness. These are the quality signals Google uses to evaluate content for both traditional search rankings and AI Overviews inclusion. E-E-A-T is particularly important for YMYL (Your Money or Your Life) topics. In AEO, E-E-A-T signals are operationalized through Person schema (author credentials), author attribution on content, documented expertise (degrees, certifications, professional history), and entity cross-referencing in knowledge graphs like Wikidata. A brand with no named authors, no credential documentation, and no external entity validation has structurally weak E-E-A-T regardless of content quality.
Entity Recognition
Entity recognition is the process by which search engines and AI systems identify real-world objects (brands, people, places, products) and distinguish them from generic terms. When an AI system "recognizes" a brand as an entity, it can associate structured facts with it (location, leadership, products, credentials) and surface those facts in generated answers. Entity clarity, achieved through consistent name usage, schema markup, Wikidata presence, and Wikipedia coverage, is foundational AEO infrastructure. Without entity recognition, AI systems either ignore a brand or describe it inaccurately.
EVPI (Expected Value of Perfect Information)
EVPI stands for Expected Value of Perfect Information. In marketing attribution, EVPI quantifies the maximum dollar value a marketer would gain if they knew with certainty which marketing activities drove revenue. It sets a theoretical upper bound on how much it's worth investing in better measurement. If EVPI for a given channel is low, the current attribution approach is accurate enough; if EVPI is high, better measurement infrastructure has a demonstrable financial return that justifies the investment. EVPI is used by Arcalea to make the financial case for attribution investment before engagement.
FAQPage Schema
FAQPage schema is a structured data markup type (from Schema.org) that tells search engines and AI systems that a page contains a list of questions and their answers. FAQPage markup is among the highest-value AEO schema types because it delivers pre-structured Q&A pairs that AI systems can extract and cite directly, without needing to parse prose. Each Question entry should use answer-first structure: the direct answer in the first sentence, supporting context after. Pages with FAQPage schema see meaningfully higher citation rates in AI answer engines than equivalent pages without it.
Galileo
Galileo is Arcalea's proprietary multi-touch revenue attribution platform, built over eight years and refined across production deployments in complex B2B, higher education, and enterprise accounts. It ingests data from paid media, organic search, CRM, and offline channels; uses machine learning to assign fractional revenue credit across the full conversion path; and connects to CRM data to measure closed revenue rather than proxy conversions or platform-reported events. The Galileo methodology has been qualified as expert-grade analysis in federal court and was named a Fortune Most Innovative Product in 2024.
Generative Engine Optimization (GEO)
Knowledge Graph
A Knowledge Graph is a structured database of entities (people, organizations, places, products) and the relationships between them. Google's Knowledge Graph powers the information boxes (Knowledge Panels) in search results and informs AI-generated answers. Wikidata is the largest open Knowledge Graph and a primary source for AI training data. Being represented in Google's Knowledge Graph, and cross-referenced in Wikidata via sameAs schema, significantly increases the probability that AI systems will recognize a brand as an established entity and include it in relevant responses.
Last-Click Attribution
Last-click attribution is a measurement model that assigns 100% of conversion credit to the final touchpoint before a sale. It is the default model in most ad platforms and the most widely used attribution approach. Last-click systematically over-credits bottom-funnel channels (brand search, retargeting, direct) and under-credits upper-funnel channels (display, content, social, podcast) that initiate or nurture the customer relationship. For complex B2B sales cycles with 30+ day consideration periods, last-click attribution produces structurally distorted budget decisions that consistently underinvest in awareness and overinvest in channels that close deals already won.
LTV:CAC Ratio
LTV:CAC is the ratio of customer lifetime value (LTV) to customer acquisition cost (CAC). It is the core unit economics metric for evaluating marketing program sustainability. A 3:1 ratio is typically considered healthy, meaning you earn three times the cost to acquire each customer over their lifetime. Below 1:1, the business loses money on each customer. Above 5:1 may indicate underinvestment in growth. LTV:CAC should be calculated by acquisition channel to identify which programs are building a sustainable business vs. which are producing cheap first purchases with low retention.
Marketing Mix Modeling (MMM)
Marketing Mix Modeling (MMM) is a statistical technique that uses regression analysis to estimate the contribution of each marketing channel to overall sales, without requiring user-level tracking. MMM operates at the aggregate level (weekly spend and revenue by channel) and is inherently privacy-compliant. It is most useful for measuring the contribution of hard-to-track channels (TV, radio, out-of-home, sponsorships) and for informing strategic budget allocation decisions. MMM is less accurate for individual campaign decisions, real-time optimization, or B2B contexts with long sales cycles, where multi-touch attribution tools like Galileo provide more actionable signal.
Multi-Touch Attribution (MTA)
Multi-touch attribution (MTA) is a measurement approach that distributes conversion credit across multiple touchpoints in a customer's path to purchase. Unlike single-touch models (first-click or last-click) that give 100% credit to one touchpoint, MTA methods include linear (equal credit to all touches), time-decay (more credit to recent touches), position-based (more credit to first and last), and data-driven (ML-based fractional credit calibrated to actual conversion patterns). Galileo uses a data-driven MTA model connected to closed CRM revenue, producing channel-level attribution that reflects how revenue is actually generated rather than how ad platforms report it.
Organic Share of Voice
Organic Share of Voice (SOV) measures a brand's visibility in organic search results relative to the total available visibility across a defined keyword set. Calculated as the sum of a brand's organic click potential across tracked keywords divided by the total click potential of the full set, organic SOV is a competitive benchmark for SEO performance that accounts for both rankings and click-through rates simultaneously. As AI-generated answers displace organic clicks for an increasing share of informational queries, AI Share of Voice (measured by citation rate) is becoming a parallel and increasingly important companion metric.
Retrieval-Augmented Generation (RAG)
Retrieval-Augmented Generation (RAG) is an AI architecture in which a large language model retrieves relevant documents or web content in real time before generating a response, rather than relying solely on knowledge baked into its training data. Most AI answer engines use RAG, including Perplexity, Bing Copilot, and Google AI Overviews. This is critical for AEO: it means the content on your website, if it is crawlable, structured, and trustworthy, can directly influence what an AI system says about your brand in real time. Schema markup, answer-first content, and strong E-E-A-T signals all make content more likely to be retrieved and cited.
Revenue Attribution
Revenue attribution is the practice of connecting marketing activity to actual closed revenue, not proxy metrics like clicks, impressions, leads, or MQLs. True revenue attribution requires integration with CRM data (Salesforce, HubSpot, or equivalent) to track prospects from first marketing touchpoint through to won deals. This is distinct from conversion attribution, which typically measures form fills or demo requests rather than signed contracts or completed purchases. Galileo is built specifically for revenue attribution, connecting every marketing channel to CRM-verified closed revenue across B2B sales cycles of any length.
ROAS (Return on Ad Spend)
ROAS (Return on Ad Spend) is the revenue generated per dollar spent on advertising, calculated as ad-attributed revenue divided by ad spend. Platform-reported ROAS (from Google Ads, Meta Ads) uses last-touch, platform-only attribution that typically overstates true performance by 2x to 5x for most accounts. True ROAS requires cross-channel attribution tied to closed CRM revenue, not platform-reported conversions. The gap between platform-reported ROAS and actual revenue-attributed ROAS is one of the most common and most costly measurement errors in paid media. It consistently leads to over-investment in channels that look good in platform dashboards but underdeliver in actual revenue.
Return on Marketing Investment (ROMI)
Return on Marketing Investment (ROMI) measures the net revenue generated by marketing spend as a ratio to that spend. Unlike ROAS (which measures ad-attributed revenue divided by ad cost alone), ROMI is a broader metric that incorporates all marketing costs (agency fees, technology, creative production, and media) against full-funnel revenue contribution. ROMI is the CFO-level metric that connects marketing programs to business outcomes. It is most useful for executive-level budget decisions and annual planning; less useful for campaign-level optimization, where ROAS or CAC by channel is more actionable.
Schema Markup / Structured Data
Schema markup (also called structured data) is code added to a webpage that explicitly tells search engines and AI systems what the content means, not just what it says. It uses the Schema.org vocabulary and is typically written in JSON-LD format embedded in the page's <head>. High-value AEO schema types include FAQPage (Q&A pairs for direct citation), Organization (brand entity facts for Knowledge Graph), Person (author credentials for E-E-A-T), Article (publish date, author, dateModified), and SoftwareApplication (product descriptions). Without schema markup, AI systems must infer meaning from content, a process with significantly higher error rates than reading explicit structured data.
Search Intelligence
Search intelligence is the practice of applying data science and machine learning to organic search data (keyword rankings, SERP share, content performance, competitive positioning) in order to quantify what search investments are worth and where opportunity is going uncaptured. Arcalea's Compass platform applies search intelligence to treat SEO as a financial asset rather than a content calendar: every ranking is assigned a dollar value based on traffic potential and conversion rate, making it possible to defend organic investment with the same financial rigor used for paid media. Compass was one of the first platforms to apply ML to search ranking factor decomposition.
The concepts are clear. The measurement is the work.
Understanding AEO and attribution terminology is the starting point. Building the infrastructure that makes your brand citable and your spend accountable is what Arcalea does.