Real examples of how brands improved their AI visibility and started appearing in ChatGPT, Gemini, and Copilot recommendations.
Kyte Baby has the richest descriptions in the baby category and a devoted community. Zero JSON-LD means the data is trapped. 4% AI visibility.
Lalo's Suction Bowl has 922 reviews at 4.6 stars. Their Play Kitchen costs $740. Product descriptions are 12-32 words. Zero AI visibility across 150 tests.
Every muslin query on every AI platform defaults to aden + anais. Little Unicorn makes beautiful muslin blankets with 30-word descriptions and 1 tag per product. Zero AI visibility.
Mushie surfaces 90% of the time on Gemini for mold-free bath toys. On every other query, every other platform — zero. The most concentrated visibility profile in the audit set.
Tubby Todd's All Over Ointment is a cult product. Parents swear by it daily in Facebook groups and Reddit. 1% AI visibility. Pharmaceutical brands own every eczema query.
Act+Acre is built on scalp care specialism — exactly the kind of niche AI agents should reward. The basic data infrastructure works. But the tags don't reflect the niche, so the brand competes for scalp queries with no structural advantage over generalists.
Beardbrand essentially created the modern premium beard care movement through a decade of YouTube content. AI agents recommend Amazon-native Honest Amish and Viking Revolution instead. YouTube subscribers are not an AI signal.
Ceremonia's Aceite de Moska carries 41 tags with a structured prefix taxonomy — Ingredient_, Benefit_, product-badge_. The best tag system across 55 audits. Two of three audited products return 404 errors. World-class data sitting on broken catalogue infrastructure.
Counter Culture owns 'ethically sourced coffee' on Copilot at 80% — but its 30-year reputation can't compensate for thin product data. Identity is a real AI signal, but it's a ceiling, not a foundation.
Crown Affair has Vogue features, a cult following, and products priced at $38-42. One product literally returns an empty description field. The structured data profile is the thinnest recorded across 55 brand audits.
DIFF has the most extensive tag taxonomy in the eyewear group and a charitable mission that differentiates it from hundreds of fashion sunglasses brands. The mission appears nowhere in product data — and AI agents recommend Arlo Wolf instead.
dpHUE's Cool Blonde Conditioner carries 23 tags — the highest in the hair care audit group. Half are attribute tags that genuinely help AI agents. The other half are bogo-free and promo_25%-off. AI agents process them all as product metadata.
Dr. Squatch carries 12 tags per product — a healthy count. Every single tag is internal: edition_core, exclude_rebuy, subscription-eligible_yes. Not one describes scent, skin type, or ingredients. The result: 0% ChatGPT visibility.
Fellow's Stagg EKG is the category default for pour over kettles on Copilot — 10/10 runs at position one. The Opus grinder is invisible. Same brand, same data infrastructure, completely different outcomes.
Every Floyd product carries the vendor name 'RIZE' — apparently a fulfilment partner — in the data field AI agents read as brand identity. Combined with missing schema and a 1.6/5 Trustpilot, Floyd is one of the most structurally disadvantaged brands in the home decor audit set.
Function of Beauty has 27,769 Trustpilot reviews at 4.0 — roughly 10x more than any other brand in the audit set. Two of three audited products return no product schema. The most validated brand is structurally invisible to AI agents on most of its product pages.
Girlfriend Collective has the widest size range (XXS to 6XL) and leggings made from recycled water bottles. Neither differentiator appears in the product data AI agents consume.
Hawkins New York is known in the design world for its curation and minimalist aesthetic. Two of three audited products carry the vendor name 'Sophie Lou Jacobsen' instead of Hawkins New York. The house brand identity that exists for human shoppers has no data representation.
Huron has the best review infrastructure in the grooming audit group — aggregateRating on every product, hundreds of reviews each. And the thinnest descriptions. AI agents can verify the products are well-rated but have almost nothing to recommend them for.
In a category where 10-40 word descriptions are the norm, Janji's 286-word product pages are a structural competitive advantage. A case study in what good looks like.
KREWE's CONTI Haze 12K has 35 tags — the most comprehensively tagged product across 100 brand audits. Zero appearances on any platform for any query. Tags without editorial authority are metadata that nobody reads.
Lulu and Georgia is built on designer curation — interior designers collaborate on the selection, customers buy into the taste. AI agents do not experience curation. They experience data. And the data on a $908 coffee table is 36 words.
Lumin's product descriptions range from 9 to 66 words — the widest content variance in the grooming audit group. For a subscription brand whose value proposition is the range, fragmented data means AI agents see disconnected products, not a system.
Onyx Coffee Lab has SCA competition wins, the richest product descriptions in the coffee audit set, and zero JSON-LD on any product page. The result: 1% AI visibility — the most extreme reputation-to-visibility disconnect in the entire 55-brand audit.
Our Place became one of the defining DTC launches of the 2020s through TikTok virality and Selena Gomez. The Always Pan — a $199 product — has a 5-word product description. TikTok doesn't feed AI agents.
Outdoor Voices raised $60M and defined a generation of activewear. Zero product schema. 19-word descriptions. 2.3-star Trustpilot. When culture moves on, what remains is what you put in the data.
Pair Eyewear's product pages contain zero words of body description. Copilot surfaces the brand 52% of the time anyway. Category creation is the most powerful AI visibility signal we've measured — but it has limits.
Across 55 DTC audits, Parachute has the weakest product data of any brand audited. Three products tested, zero words of description on all three, zero JSON-LD on any product, 1.7/5 Trustpilot with 411 reviews. The data floor.
Satisfy is to running what Rapha is to cycling. Paris-based. Fashion-credible. Performance-legitimate. Zero product schema. 39 internal tags. No AI agent can read why a $210 running short is worth $210.
Sunski writes the richest product descriptions in the eyewear audit group — three times longer than DIFF or Tens. Without JSON-LD wrapping that content, AI agents have no structured path to it. The best product stories in the group are wrapped in the worst infrastructure.
Supply makes premium single-edge razors with award-winning design. The award appears as the internal tag _label_Award-Winning. No AI agent will parse that as a quality differentiator. Niche brands cannot win on volume — they have to win on specificity.
606 Trustpilot reviews at 1.5 stars. Zero product schema. 10-word descriptions. When every data layer is weak simultaneously, there is nothing to compensate for anything else.
Tens makes lenses that apply a warm, film-like tint to everything you see — a genuinely unique product. Three products share an identical 44-word description. AI agents cannot tell the warm-tint lifestyle product apart from Oakley Prizm sport performance lenses.
Trade Coffee is 4.6/5 on Trustpilot — the best in the coffee group. AI visibility: 3%. The taste quiz that powers the matching algorithm collects exactly the data AI agents need, but the product pages they crawl are nearly empty.
Verve has the highest tag count of any brand in the coffee audit group and owns 'California coffee roaster' on Copilot at 90%. Every product-level query returns zero. More tags doesn't mean more visibility — quality and query mapping do.
Ana Luisa has been Climate Neutral Certified since 2020. AI agents know it. They recommended it 4% of the time anyway.
Fable has the best JSON-LD of any pet brand tested. 9/10 structured data score. 7% AI visibility — the lowest in the pet audit set.
Gorjana has excellent JSON-LD, 4.9-star ratings, and thousands of reviews. Every product uses the same 32-word template description. 9% AI visibility.
Gemini can recite Jenny Bird's tagline. Celebrity fans include Kylie Jenner and Hailey Bieber. Ask any AI for the best chunky hoop earring, and Jenny Bird surfaces 1 out of 30 times.
Maxbone has the best product descriptions in the pet category. And the worst AI visibility. 220 words of rich copy trapped behind zero structured data.
Mejuri has 86% AI visibility on Gemini and 24% on Copilot. Same brand, same products, same day — three completely different realities.
Missoma is 'the current it-brand for layering' on Gemini and does not exist on ChatGPT. Celebrity endorsements and editorial credibility — with product data that tells AI agents nothing.
WagWear makes premium dog leashes with Sunbrella cording and brass hardware. 150 automated tests. Zero appearances. Total AI invisibility.
West Paw has 74% visibility on Gemini, 100% on Copilot's eco-friendly query, and 0% on ChatGPT. The widest platform gap measured.
Wild One surfaces at position #1 on Gemini for stylish dog leashes. It does not exist on ChatGPT. The brand's design reputation outpaces its product data.
Bala achieves 40% AI visibility — second-highest in the audit set — with Gemini at 78% and Copilot ranking it #1 for ankle weights. The product description is 34 words.
Brightland achieves 14% AI visibility across 150 tests — but 100% of it comes from Gemini. ChatGPT and Copilot have never heard of this California olive oil brand.
Caraway achieves 41% AI visibility — highest in the audit set — with an extraordinary 82% on Gemini. But ChatGPT returns 0% across 50 tests. Same brand, same day, three different realities.
Chamberlain Coffee achieves just 3% AI visibility across 150 tests — the lowest score in the entire audit set. Celebrity fame does not equal AI product visibility.
Fly By Jing achieves 27% AI visibility — the highest in the audit set — and is one of only two brands to break through ChatGPT. Cultural penetration beats product data.
Graza achieves 17% AI visibility — strongest on its defining format query where Gemini calls it 'The Gold Standard'. But ChatGPT recommends OXO dispensers instead.
Material Kitchen achieves 28% AI visibility — above average — and is one of only two brands to break ChatGPT's 0% barrier. But it consistently ranks 4th or 5th, never first.
Muddy Bites achieves 10% AI visibility — but all 15 surfacings came from a single query. If you don't already know waffle cone bites exist, AI will never introduce you.
Olipop achieves 31% AI visibility — highest in Food & Beverage — despite having the worst product data in the audit set. Category authority compensates for data gaps, but ChatGPT's 0% shows the limits.
Open Spaces achieves 13% AI visibility — but 100% on Gemini for DTC brand queries and near-zero for actual product queries. The DTC storefront appears shut down.
We audited Boy Smells, a gender-neutral candle brand sold at Nordstrom and Saks Fifth Avenue with coverage in Vogue and GQ. Across 150 tests, they surfaced just 12% of the time. ChatGPT recognises them as an indie brand (60% surfacing). Gemini barely knows they exist (4%). Their core differentiator — a coconut and beeswax blend — appears nowhere in their product descriptions.
We audited Brooklyn Candle Studio, a clean candle brand operating from Industry City, Brooklyn. Across 150 tests, they surfaced 15% of the time. But 74% of those surfacings came from a single query — Brooklyn eco-friendly candle. On that query, Gemini and Copilot put them at #1 (90% and 80%). ChatGPT? 0/50. Zero. The most polarised visibility profile in the audit programme.
Dagne Dover has zero JSON-LD — a first in 30 audits. Gemini carries 100% of their AI visibility at 52%, calling the Allyn Tote 'the gold standard' for neoprene. ChatGPT and Copilot: 0/100 combined. The most extreme platform disparity in the audit programme.
We audited Earl of East, a London-based candle brand with the best structured data in the audit programme — aggregateRating on every product, 264-word descriptions with room size guides. Across 150 tests, they surfaced 11% of the time. ChatGPT: 0%. Copilot: 0%. Gemini: 32%. The most extreme platform disparity in 22 brand audits.
Horizn Studios has literally zero words on every product page — yet achieves 25% AI visibility driven entirely by editorial coverage on Gemini. The paradox of brand without data.
Keychron has the highest AI visibility in the audit set at 45% — Gemini 80%, Copilot 56%. But ChatGPT has never heard of them. The best product taxonomy (19 structured Key:Value tags) paired with the worst structured data (no Product schema). A $249.99 keyboard described in 52 words of packing list.
LEVEL8 has genuine specs — German Makrolon polycarbonate, YKK zippers, RPET lining — buried under messy formatting, useless tags, and zero editorial presence. 17% AI visibility.
Monos has 300-460 word descriptions with exact specs, 4.9/5 ratings, and comprehensive structured data. ChatGPT returns 0/50. The gap between data quality and AI visibility.
Native Union scores 9/10 on descriptions and 9/10 on structured data — the highest in tech accessories. But actual AI visibility is just 13%. 0% on ChatGPT. Rich product pages are necessary but not sufficient.
Nomad is the #1 recommendation for leather iPhone cases on Gemini — 10 out of 10 runs. ChatGPT has never mentioned them. 0 out of 50 tests. The widest gap between brand reputation and product data in the audit set: Wirecutter, The Verge, MKBHD coverage — and a 24-word MacBook sleeve description.
Ohsnap is NOT invisible to AI — Gemini recommends it 46% of the time at #1, calling it the 'gold standard' for thin phone grips. But ChatGPT returns 0/50. The product_type field is empty on every product. The most basic categorisation field in Shopify — blank.
We audited P.F. Candle Co., a brand sold at Target and Nordstrom with 909 reviews on a single product. Across 150 tests, they had the highest visibility in the candle audit set at 22%. But ChatGPT returned 0/50. Their product descriptions are 23 words. The word 'soy' does not appear once. Reviews and editorial presence outweigh description quality — but only on some platforms.
Satechi's overall 33% matched the estimate exactly — but the per-platform reality was completely inverted. ChatGPT was estimated at 50% and scored 0/50. Copilot was estimated at 30% and scored 38%. The aggregate was right. The story was wrong.
Stubble & Co claims 55,000+ reviews but gives AI agents 26 words to work with. Zero tags on hero products. No dimensions, no weight, no airline compliance. Near-invisible.
We audited Apotheke, a Brooklyn-based premium candle brand stocked at Bloomingdale's and Nordstrom. They have the best product description template of any brand in this audit programme — structured sections, scent pyramids, burn times, dimensions. Across 150 tests (10 repeats per query per platform), they surfaced just 14% of the time. The template is perfect. Everything around it is broken.
We audited Dr Sam's, a Harley Street dermatologist's skincare brand with 8,000 five-star reviews and JSON-LD structured data. They surfaced on 0 out of 15 AI shopping queries. The widest gap between data infrastructure and actual visibility we've seen.
We audited Herbivore Botanicals, a $50M clean beauty brand stocked at Sephora and Ulta. They surfaced on 0 out of 15 AI shopping queries. But the real finding is bigger: clean beauty has no AI consensus. Every platform recommended completely different brands. The category is wide open.
We audited Summer Fridays, the influencer-founded brand behind Sephora's best-selling Jet Lag Mask. 1.1 million Instagram followers. Best structured data in our audit set. They surfaced on 2 out of 15 AI shopping queries. One of those was from eBay.
We audited UpCircle Beauty, a B Corp-certified upcycled skincare brand. They surfaced on 8 out of 15 AI shopping queries — the highest score in this audit programme. When the query matched their sustainability story, AI agents put them first. When it didn't, they vanished.
We audited Wildsmith Skin, a British luxury skincare brand born from a 400-acre Georgian estate. They have independent clinical trial data, full INCI lists, and a compelling brand story. Their Shopify descriptions are 7-9 word taglines. AI agents surfaced them on 1 out of 15 queries. The content exists. It is in the wrong place.
Taylor Stitch has fabric weights, construction specs, and care instructions on every product page. They surfaced on 3 out of 15 AI queries - the worst in our audit set. Good data is necessary but not sufficient.
Finisterre surfaced on 9 out of 15 AI queries - the highest in this audit set. Then I looked closer: 6/6 on brand queries, 1/6 on product queries. A brand coasting on reputation while its product data sits empty.
We audited Hiut Denim, arguably the best craft denim brand in Britain, against AI shopping agents. They surfaced for their story. They were invisible for their products.
We audited Isabella Vrana ,a brand with 175,000 Depop followers, against AI shopping agents. It didn't surface on a single query. Here's the full breakdown.
A cult British streetwear brand with over 1 million Instagram followers didn't surface on a single AI query. Not one.
Get a free AI visibility audit and find out exactly where your products stand.
Get Your Free Audit