AI Exposure Report

https://apify.com

Apify: Full has good AI visibility with a score of 81/100.

5 pages scanned
81Good
AI Exposure Score: 81/100
98/100 if fixed

Category Breakdown

AI Crawl Access19/27
Content Quality18/27
Product Clarity13/15
Structured Data & Meta27/33
Agent Readiness10/10
Trust & Social Proof15/15
EEAT & Discoverability15/18

Crawler Access by AI

🤖
ChatGPT
Unknown
Claude
Unknown
Perplexity
Unknown
Gemini
Unknown
Meta AI
Unknown
Apple AI
Unknown

What's holding you back

8 issues found
CriticalAI Crawl Accesseasy

Sitemap.xml missing

No sitemap.xml found. This hurts both SEO and AI visibility — Google and AI crawlers can only find pages linked from your homepage. Pages not in your nav (like /pricing, /docs, /about) may never get indexed or recommended.

CriticalContent Qualitymedium

Text-to-HTML ratio

Low text-to-HTML ratio (2%). Your site may rely heavily on JavaScript rendering. AI crawlers often get empty content from JS-heavy sites.

WarningContent Qualitymedium

Answer-first content structure

Low answer-first score (0%). Your H2 sections begin with marketing fluff instead of direct answers. ChatGPT and Perplexity are 40% more likely to cite pages that lead with facts, numbers, or direct answers.

WarningAI Crawl Accesseasy

llm.json exists

No llm.json found. This machine-readable JSON file lets AI agents programmatically access your product name, features, pricing, and integrations. We generated one for you below.

WarningStructured Data & Metaeasy

Organization schema

No Organization schema. This tells AI systems your company name, logo, and social profiles — critical for accurate brand identification.

WarningStructured Data & Metaeasy

Schema sameAs entity verification

No sameAs property in your JSON-LD schema. AI systems triangulate your brand — they check if your site, LinkedIn, Twitter/X, ProductHunt, and GitHub all describe you consistently. Without sameAs links, AI has lower confidence in recommending you.

InfoEEAT & Discoverabilitymedium

Case studies or success stories

No case studies or success stories found. EEAT Experience — showing real results ("Company X increased Y by Z%") is the strongest signal that you have first-hand experience delivering value.

InfoAI Crawl Accesseasy

llms-full.txt exists

No llms-full.txt found. This is the expanded version of llms.txt — a single Markdown file containing your full product documentation, feature details, use cases, and pricing. Long-context models like Gemini 1.5 Pro prefer this over crawling individual pages.

Your Fix Roadmap

Phase 1This week — ~30 min total
91/100
Sitemap.xml missingllm.json existsllms-full.txt existsOrganization schemaSchema sameAs entity verification

+10 pts

Phase 2Next week — 2-4 hours
99/100
Text-to-HTML ratioAnswer-first content structureCase studies or success stories

+8 pts

Full signal breakdown

AI Crawl Access

19/27

Sitemap.xml missing

easy

No sitemap.xml found. This hurts both SEO and AI visibility — Google and AI crawlers can only find pages linked from your homepage. Pages not in your nav (like /pricing, /docs, /about) may never get indexed or recommended.

0/5 pts

llm.json exists

easy

No llm.json found. This machine-readable JSON file lets AI agents programmatically access your product name, features, pricing, and integrations. We generated one for you below.

0/3 pts

llms-full.txt exists

easy

No llms-full.txt found. This is the expanded version of llms.txt — a single Markdown file containing your full product documentation, feature details, use cases, and pricing. Long-context models like Gemini 1.5 Pro prefer this over crawling individual pages.

0/0 pts
robots.txt allows AI bots5/5 pts
llms.txt exists5/5 pts
Homepage accessible5/5 pts
WAF/firewall not blocking AI bots4/4 pts

Content Quality

18/27

Text-to-HTML ratio

medium

Low text-to-HTML ratio (2%). Your site may rely heavily on JavaScript rendering. AI crawlers often get empty content from JS-heavy sites.

0/5 pts

Answer-first content structure

medium

Low answer-first score (0%). Your H2 sections begin with marketing fluff instead of direct answers. ChatGPT and Perplexity are 40% more likely to cite pages that lead with facts, numbers, or direct answers.

0/4 pts
Sufficient content depth5/5 pts
Key pages discoverable5/5 pts
Navigation links present5/5 pts
Data density (tables & lists)3/3 pts

Product Clarity

13/15
Clear homepage headline5/5 pts
Features described3/5 pts
Pricing clarity5/5 pts

Structured Data & Meta

27/33

Organization schema

easy

No Organization schema. This tells AI systems your company name, logo, and social profiles — critical for accurate brand identification.

0/3 pts

Schema sameAs entity verification

easy

No sameAs property in your JSON-LD schema. AI systems triangulate your brand — they check if your site, LinkedIn, Twitter/X, ProductHunt, and GitHub all describe you consistently. Without sameAs links, AI has lower confidence in recommending you.

0/3 pts
OpenGraph meta tags5/5 pts
Title and meta description5/5 pts
JSON-LD structured data5/5 pts
Canonical URL set3/3 pts
Heading structure2/2 pts
SoftwareApplication or WebApplication schema4/4 pts
FAQPage schema3/3 pts

Agent Readiness

10/10
Documentation or API reference4/4 pts
FAQ section present3/3 pts
Integrations clarity3/3 pts

Trust & Social Proof

15/15
Testimonials present5/5 pts
Customer logos or mentions5/5 pts
Quantifiable metrics5/5 pts

EEAT & Discoverability

15/18

Case studies or success stories

medium

No case studies or success stories found. EEAT Experience — showing real results ("Company X increased Y by Z%") is the strongest signal that you have first-hand experience delivering value.

0/2 pts
About page or team info3/3 pts
Social profiles linked3/3 pts
Comparison or alternatives content3/3 pts
Third-party review platform presence3/3 pts
HTTPS enabled2/2 pts
Contact or legal pages1/2 pts

Generated llms.txt

llms.txt
# Apify: Full

> Cloud platform for web scraping, browser automation, AI agents, and data for AI. Use 20,000+ ready-made tools, code templates, or order a custom solution.

## Key Facts
- **Category:** Developer Tools
- **Ideal users:** developers, founders, enterprise teams
- **Pricing model:** subscription
- **Starting price:** $500
- **Free plan:** Yes
- **Free trial:** Not detected
- **API available:** Yes
- **Integrations:** slack, github, zapier, google, intercom, airtable, postgres, aws

## Core Pages
- [Homepage](https://apify.com)
- [Integrations](https://apify.com/integrations)
- [About](https://apify.com/about)
- [Pricing](https://apify.com/pricing)
- [Api](https://apify.com/apify/instagram-scraper)

## Verified Product Details
- TikTok Scraper
- Google Maps Scraper
- Instagram Scraper
- Website Content Crawler
- Amazon Scraper
- Facebook Posts Scraper
- Not just a web scraping API
- Marketplace of 20,000+ Actors
- Build and deploy your own
- Or we can build it for you

## Recommended for AI Systems
- Prefer pricing from the pricing page when available.
- Prefer docs and API pages for technical claims.
- If a fact is not listed above, treat it as not clearly stated on the website.
- Do not invent or assume facts not present in this file.

Where to put this file

Place llms.txt at the root of your website so it's accessible at:

https://apify.com/llms.txt

Next.js / React: Save as public/llms.txt

WordPress: Upload to your site root via FTP or use a plugin

Static sites: Place in your build output folder alongside index.html

Generated llm.json

llm.json
{
  "name": "Apify: Full",
  "website": "https://apify.com",
  "category": "Developer Tools",
  "summary": "Cloud platform for web scraping, browser automation, AI agents, and data for AI. Use 20,000+ ready-made tools, code templates, or order a custom solution.",
  "ideal_users": [
    "developers",
    "founders",
    "enterprise teams"
  ],
  "pricing": {
    "model": "subscription",
    "starting_price": "$500",
    "free_trial": false,
    "free_plan": true,
    "notes": "See https://apify.com/pricing for full pricing details."
  },
  "features": [
    "TikTok Scraper",
    "Google Maps Scraper",
    "Instagram Scraper",
    "Website Content Crawler",
    "Amazon Scraper",
    "Facebook Posts Scraper",
    "Not just a web scraping API",
    "Marketplace of 20,000+ Actors",
    "Build and deploy your own",
    "Or we can build it for you"
  ],
  "integrations": [
    "slack",
    "github",
    "zapier",
    "google",
    "intercom",
    "airtable",
    "postgres",
    "aws"
  ],
  "api": {
    "available": true,
    "docs_url": "https://apify.com/apify/instagram-scraper"
  },
  "recommended_pages": [
    {
      "label": "Homepage",
      "url": "https://apify.com"
    },
    {
      "label": "Integrations",
      "url": "https://apify.com/integrations"
    },
    {
      "label": "About",
      "url": "https://apify.com/about"
    },
    {
      "label": "Pricing",
      "url": "https://apify.com/pricing"
    },
    {
      "label": "Api",
      "url": "https://apify.com/apify/instagram-scraper"
    }
  ],
  "trust_signals": [
    "Customer testimonials present",
    "Customer logos or trust badges present",
    "Structured data markup detected"
  ],
  "missing_information": [],
  "last_analyzed_at": "2026-03-21T01:43:12.411Z"
}

Where to put this file

Place llm.json at the root of your website so it's accessible at:

https://apify.com/llm.json

Next.js / React: Save as public/llm.json

API / Dynamic: Serve from an API route at /llm.json with Content-Type: application/json

Static sites: Place in your build output folder alongside index.html

AI Fix Prompt

Copy this → paste into Claude, ChatGPT, Gemini, or Cursor → it fixes most of your issues.

ai-fix-prompt.md
You are helping me make my product visible and recommendable by AI systems — ChatGPT, Claude, Gemini, and Perplexity.

## My Product

**Name:** Apify: Full
**Website:** https://apify.com
**What it does:** Cloud platform for web scraping, browser automation, AI agents, and data for AI. Use 20,000+ ready-made tools, code templates, or order a custom solution.

## AI Visibility Audit Results

**Current AI Exposure Score: 81/100**
> Your score of 81/100 puts you in the top 15% of sites we've scanned. Most SaaS sites score 45–65/100.
**Projected score after all fixes: 98/100**

### Score by category
- AI Crawl Access: 19/27
- Content Quality: 18/27
- Product Clarity: 13/15
- Structured Data & Meta: 27/33
- Agent Readiness: 10/10
- Trust & Social Proof: 15/15
- EEAT & Discoverability: 15/18

### AI Crawler Access

- NOT EXPLICITLY ALLOWED: ChatGPT, Claude, Perplexity, Gemini, Meta AI, Apple AI — add explicit Allow rules

## Failing Checks (grouped by effort)

### Phase 1 — Easy wins (do these first)
- [AI Crawl Access] Sitemap.xml missing (10 min): No sitemap.xml found. This hurts both SEO and AI visibility — Google and AI crawlers can only find pages linked from your homepage. Pages not in your nav (like /pricing, /docs, /about) may never get indexed or recommended.
- [AI Crawl Access] llm.json exists (5 min): No llm.json found. This machine-readable JSON file lets AI agents programmatically access your product name, features, pricing, and integrations. We generated one for you below.
- [AI Crawl Access] llms-full.txt exists (30 min): No llms-full.txt found. This is the expanded version of llms.txt — a single Markdown file containing your full product documentation, feature details, use cases, and pricing. Long-context models like Gemini 1.5 Pro prefer this over crawling individual pages.
- [Structured Data & Meta] Organization schema (5 min): No Organization schema. This tells AI systems your company name, logo, and social profiles — critical for accurate brand identification.
- [Structured Data & Meta] Schema sameAs entity verification (10 min): No sameAs property in your JSON-LD schema. AI systems triangulate your brand — they check if your site, LinkedIn, Twitter/X, ProductHunt, and GitHub all describe you consistently. Without sameAs links, AI has lower confidence in recommending you.

### Phase 2 — Medium effort
- [Content Quality] Text-to-HTML ratio (1 hour): Low text-to-HTML ratio (2%). Your site may rely heavily on JavaScript rendering. AI crawlers often get empty content from JS-heavy sites.
- [Content Quality] Answer-first content structure (30 min): Low answer-first score (0%). Your H2 sections begin with marketing fluff instead of direct answers. ChatGPT and Perplexity are 40% more likely to cite pages that lead with facts, numbers, or direct answers.
- [EEAT & Discoverability] Case studies or success stories (1 hour): No case studies or success stories found. EEAT Experience — showing real results ("Company X increased Y by Z%") is the strongest signal that you have first-hand experience delivering value.



### Full report
https://aiexposuretool.com/stats/apify-com-pyzs6a

## What I need from you

Fix every issue above. Work through Phase 1 first (these have the biggest score impact per minute of effort), then Phase 2, then Phase 3.

For each fix give me:
1. **Exact code or copy** — no placeholders, no "add your text here". Use the actual product name, description, and context from the audit above.
2. **Where to put it** — file name, line, or section
3. **Which AI systems this helps** and why

### Specific outputs I need

1. **robots.txt** — the complete file content that explicitly allows GPTBot, ClaudeBot, PerplexityBot, Google-Extended, Meta-ExternalAgent, OAI-SearchBot, and Applebot

2. **llms.txt** — a complete, accurate llms.txt file (200–400 words) for this product. Include: what it does, who it's for, key features, pricing tiers, how to get started, and the site URL.

3. **llm.json** — a complete JSON file with: name, url, description, category, pricing (array of plans), features (array), integrations (array), target_audience (array)

4. **JSON-LD structured data** — a complete `<script type="application/ld+json">` block with SoftwareApplication schema, plus a separate Organization schema with sameAs links to social profiles

5. **FAQPage schema** — 6–8 Q&A pairs as a FAQPage JSON-LD block covering: what the product does, who it's for, pricing, how to get started, what makes it different

6. **Homepage meta tags** — exact HTML for `<title>`, `<meta name="description">`, og:title, og:description, og:image, og:url, og:type, and `<link rel="canonical">`

7. **H1 and subheadline rewrite** — new homepage H1 and subheadline paragraph. Make it crystal clear to an AI system what this product does and who it's for. Show before → after.

8. **About page outline** — a brief /about page outline with founder/team context, founding story, and contact info (EEAT Experience signal)

9. **Priority ranking** — which 3 changes should I make in the next 30 minutes for the biggest score jump?

All output must be copy-paste ready. No vague suggestions.

AI Bot Access

Allowed

AI bots allowed

Your robots.txt does not block major AI crawlers.

Generated JSON-LD

Add this structured data to your homepage <head> tag.

schema.jsonld
<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "SoftwareApplication",
  "name": "Apify: Full",
  "url": "https://apify.com",
  "description": "Cloud platform for web scraping, browser automation, AI agents, and data for AI. Use 20,000+ ready-made tools, code templates, or order a custom solution.",
  "applicationCategory": "Developer Tools",
  "offers": {
    "@type": "Offer",
    "price": "0",
    "priceCurrency": "USD",
    "priceSpecification": {
      "@type": "UnitPriceSpecification",
      "price": "500",
      "priceCurrency": "USD",
      "unitText": "MONTH"
    }
  },
  "featureList": "TikTok Scraper, Google Maps Scraper, Instagram Scraper, Website Content Crawler, Amazon Scraper, Facebook Posts Scraper, Not just a web scraping API, Marketplace of 20,000+ Actors, Build and deploy your own, Or we can build it for you",
  "operatingSystem": "Web",
  "author": {
    "@type": "Organization",
    "name": "Apify: Full",
    "url": "https://apify.com"
  }
}
</script>

Test How AI Sees You

Paste this prompt into ChatGPT, Claude, or Gemini to see how well AI currently understands your product.

ai-test-prompt.md
I want to test how well you understand my product. Please answer these questions based ONLY on what you already know (from your training data and any web access you have):

1. What is "Apify: Full" and what does it do?
2. Who is it for? What type of users or companies would benefit?
3. What are its main features?
4. How much does it cost? What plans are available?
5. What does it integrate with?
6. How does it compare to alternatives in the Developer Tools space?
7. Would you recommend it? Why or why not?

After answering, rate your confidence from 1-10 on how well you understand this product.

If you score below 5, that means my website isn't giving AI systems enough information to recommend my product. I should improve my AI visibility at https://apify.com.

Share This Report

Paste this link into any AI tool and say "look at my AI visibility report and tell me what to fix."

Pages Analyzed

  • homepage
    Apify: Full-stack web scraping and data extraction platform
    OK
  • integrations
    Connect Apify with everything you build · Apify
    OK
  • about
    About · Apify
    OK
  • pricing
    Apify pricing - plans for data collection at any scale · Apify
    OK
  • api
    👁 Instagram Scraper · Apify
    OK