Technical SEO Audit for archive.org

60 signals checked

Strong

TL;DR

Free technical SEO audit for archive.org. Score: 88/100 across 60 analytics, mobile, on-page, performance, security, and technical signals. Run by Baaed FREE SEO Suite.

  • 44 pass
  • 13 warn
  • 3 fail

Audit Results

Analytics 3 signals
  • Google Analytics No Google Analytics detected. Without analytics you can't measure traffic or conversions.
  • Google Tag Manager No GTM detected. Optional — only worth adding if you run many tracking pixels.
  • Facebook Pixel No Facebook Pixel detected. Optional — only worth adding if you run Meta ads and need retargeting.
Mobile 2 signals
  • Mobile Viewport Mobile viewport correctly configured.
  • Flash Content No Flash content. Good: Adobe ended support for Flash at the end of 2020.
Onpage 18 signals
  • Title Tag Title is 93 characters, so it may be truncated on desktop SERPs. Aim for 50–60.
  • Meta Description No meta description tag. Search engines may generate their own snippet from page content.
  • Meta Keywords No meta keywords tag present, which is modern best practice. Google has ignored this tag since 2009.
  • H1 Heading No H1 heading on the page. Every page should have exactly one H1 that describes its main topic.
  • Heading Hierarchy Heading counts: H1:0 H2:1 H3:0 H4:0 H5:0 H6:0. No levels are skipped below H2, although the H1 is missing.
  • Image Alt Text No images on page — nothing to audit.
  • Internal vs External Links No links found on the page. Internal linking is critical for SEO.
  • Follow / Nofollow No links to analyze.
  • Canonical URL No <link rel="canonical"> tag. A canonical tag helps prevent duplicate content issues.
  • Language Attribute Page declares lang="en".
  • Character Encoding UTF-8 charset declared — correct modern default.
  • DOCTYPE HTML5 DOCTYPE declared correctly.
  • Custom 404 Page Missing URLs return a full 404 response (4,958 bytes). Visitors get proper feedback.
  • W3C HTML Validity 2 W3C HTML errors. Minor, but worth fixing for clean markup.
  • Text to HTML Ratio Text/HTML ratio: 11.4%. Aim for 25% or more for a stronger SEO signal.
  • Total Link Count No links on the page.
  • Author Meta No author meta tag. Optional — relevant for blogs and news sites.
  • Readability Proxy Average sentence length ~15 words — good readability range.
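
The title-length and text/HTML ratio checks above can be approximated with the Python standard library. This is a minimal sketch, not the audit tool's actual method: `onpage_metrics` and `_TextExtractor` are hypothetical names, the 50–60 character window is taken from the report, and treating all text outside `<script>` and `<style>` (title included) as "text" is an assumption about how the ratio is defined.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text outside <script>/<style> and records the <title> text."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self.title = ""
        self._skip_depth = 0
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "title":
            self._in_title = True

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1
        elif tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        if not self._skip_depth:
            self.chunks.append(data)


def onpage_metrics(html: str) -> dict:
    """Return title length, a 50-60 character check, and text/HTML ratio in percent."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    # Collapse runs of whitespace so indentation does not inflate the text length.
    text = " ".join(" ".join(parser.chunks).split())
    title = " ".join(parser.title.split())
    ratio = 100.0 * len(text) / len(html) if html else 0.0
    return {
        "title_length": len(title),
        "title_ok": 50 <= len(title) <= 60,
        "text_html_ratio": ratio,
    }
```

Calling `onpage_metrics(page_source)` on a page with a 93-character title would report `title_ok` as false, which matches the warning above. The ratio depends on the definition chosen, so it will not necessarily reproduce the 11.4% figure.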
Performance 12 signals
  • HTTP Status Server returned HTTP 200.
  • Response Time Fast TTFB: 202ms. Well under Google's 800ms target.
  • Page Size Page weight: 1 KB. Lightweight — good for mobile users on slow connections.
  • Compression Gzip compression active. Consider upgrading to Brotli for ~15-20% smaller responses.
  • Browser Caching Static asset https://archive.org/offshoot_assets/favicon.ico caches well: max-age=315360000.
  • HTTP Version Serving over HTTP/2 — multiplexed and modern.
  • External Resources 1 CSS + 3 JS external files — reasonable.
  • Inline <style> Tags No inline style blocks — clean separation of concerns.
  • Inline <script> Tags No inline scripts — excellent for CSP and caching.
  • Iframes No iframes — keeps load time predictable.
  • Image Count No images — minimal image payload.
  • Broken Link Check 12 of 30 crawled links returned 4xx/5xx. Fix or redirect them. Examples: https://archive.org/details/gov.uscourts.ilnb.1375288, https://archive.org/details/gov.uscourts.moeb.429586, https://archive.org/details/gov.uscourts.moeb.429830, https://archive.org/details/youtube-dxdLcOy0RNI, https://archive.org/details/gov.uscourts.ilnb.1375470.
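
To put the caching and broken-link figures above in more familiar units, the following sketch parses a `Cache-Control` value and computes the share of failed links. The helper names `max_age_seconds` and `broken_share` are hypothetical, and the header string is reduced to the single `max-age` directive quoted in the report; the real response may carry other directives.

```python
def max_age_seconds(cache_control: str):
    """Return the max-age value in seconds, or None if absent or malformed."""
    for directive in cache_control.split(","):
        name, _, value = directive.strip().partition("=")
        if name.strip().lower() == "max-age":
            value = value.strip().strip('"')
            return int(value) if value.isdecimal() else None
    return None


def broken_share(broken: int, crawled: int) -> float:
    """Percentage of crawled links that returned an error status."""
    return 100.0 * broken / crawled if crawled else 0.0


seconds = max_age_seconds("max-age=315360000")
print(seconds / 86400)       # cache lifetime in days
print(broken_share(12, 30))  # broken links as a percentage
```

With the report's numbers, the favicon's lifetime works out to 3,650 days (about ten years), and 12 of 30 crawled links is 40%.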
Security 9 signals
  • HTTPS Page served over HTTPS. Good for SEO, privacy, and user trust.
  • HSTS Strict-Transport-Security header present. Forces HTTPS on future visits.
  • Content Security Policy Content-Security-Policy header present. Protects against XSS.
  • X-Frame-Options / CSP frame-ancestors Clickjacking protection is in place.
  • Mixed Content No mixed content — all resources loaded over HTTPS.
  • Server Header Disclosure Server header reveals software and version: "nginx/1.31.1". Attackers can use an exact version string to look up known CVEs.
  • X-Powered-By Disclosure No X-Powered-By header — good.
  • security.txt security.txt file present — RFC 9116 standard for security contact info.
  • DNSBL / Spam Blocklist IP is clean on the 5 major DNSBL zones spot-checked (Spamhaus, SpamCop, Barracuda, SORBS, PSBL).
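
The Server header finding above amounts to spotting a version number in the header value. A minimal sketch of such a check follows; `discloses_version` is a hypothetical helper, and the pattern (a slash followed by a dotted number) is an assumption that covers common `product/version` strings rather than every possible format.

```python
import re

# Matches a "product/version" token such as "nginx/1.31.1" or "Apache/2.4".
_VERSION = re.compile(r"/\d+(?:\.\d+)*")


def discloses_version(server_header: str) -> bool:
    """True if the Server header value appears to include a version number."""
    return bool(_VERSION.search(server_header))


print(discloses_version("nginx/1.31.1"))  # header value quoted in the report
print(discloses_version("nginx"))         # product name only
```

On nginx, setting `server_tokens off;` removes the version from the Server header while still identifying the product.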
Technical 16 signals
  • Viewport Meta Tag Responsive viewport declared: width=device-width, initial-scale=1.
  • Robots Meta Tag No restrictive robots meta — default is index,follow (good).
  • robots.txt robots.txt is present at https://archive.org/robots.txt.
  • XML Sitemap Sitemap found at https://archive.org/sitemap/sitemap.xml.
  • Structured Data (Schema) No structured data found (JSON-LD, Microdata, or RDFa). Rich results won't be eligible.
  • Structured Data Richness No schema types detected.
  • Open Graph Tags No Open Graph tags. Social media shares will use fallback data and may look unprofessional.
  • Twitter Card No Twitter Card tags. Twitter shares fall back to Open Graph or plain text.
  • AMP Page is not AMP. This is fine: AMP is optional, and Google no longer requires it for Top Stories.
  • Favicon Favicon declared: /offshoot_assets/favicon.ico.
  • Hreflang No hreflang tags. This is correct for single-language sites.
  • ads.txt No ads.txt file. This is optional — add one only if you monetize via programmatic display ads.
  • Server IP / Location Resolved to 207.241.224.2.
  • Server Location / Host Hosting provider: Unknown (rDNS: www.archive.org).
  • www / non-www canonical www redirects to apex. Single canonical hostname configured correctly.
  • Domain Age Registered 1995-12-14 (30 years old). Mature domain; age is at most a modest trust signal.
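
Several findings in this report concern tags missing from the page head: meta description, canonical link, Open Graph, Twitter Card, and structured data. The sketch below shows one way to test for their presence. `HeadTagAudit` and `audit_head` are hypothetical names, the check looks only at whether a tag exists and not at whether its value is sensible, and structured data is detected as JSON-LD only (Microdata and RDFa are not covered).

```python
from html.parser import HTMLParser


class HeadTagAudit(HTMLParser):
    """Records which common SEO and social tags appear in an HTML document."""

    def __init__(self):
        super().__init__()
        self.found = {
            "meta_description": False,
            "canonical": False,
            "open_graph": False,
            "twitter_card": False,
            "json_ld": False,
        }

    def handle_starttag(self, tag, attrs):
        # Attribute values can be None for bare attributes, so normalise them.
        a = {k: (v or "").lower() for k, v in attrs}
        if tag == "link" and "canonical" in a.get("rel", "").split():
            self.found["canonical"] = True
        elif tag == "meta":
            name = a.get("name", "")
            if name == "description":
                self.found["meta_description"] = True
            if name.startswith("twitter:"):
                self.found["twitter_card"] = True
            if a.get("property", "").startswith("og:"):
                self.found["open_graph"] = True
        elif tag == "script" and a.get("type", "") == "application/ld+json":
            self.found["json_ld"] = True


def audit_head(html: str) -> dict:
    """Return a mapping of tag name to whether it was found."""
    parser = HeadTagAudit()
    parser.feed(html)
    parser.close()
    return parser.found
```

For a page matching this report, every entry in the result of `audit_head(page_source)` would be expected to be false.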
