← Back to Content Test Hub

Document Inventory Fixtures

This page exercises the document inventory feature of the content_strategist scanner profile. The scanner sweeps <a href> tags, classifies same-origin links by file extension (.pdf / .doc / .docx / .ppt / .pptx / .xls / .xlsx), then issues an HTTP HEAD request to each unique URL to capture Content-Length, Content-Type, and Last-Modified.

Cross-page duplicate: /content/inventory/files/annual-report.pdf is also linked from /content/home, so its source_pages_total count should be ≥ 2 in the inventory results. The second link below on this same page adds a third reference.

Downloadable Documents

LinkTypeApprox sizeLast-ModifiedNotes
Annual Report PDF~20 KB Yesterday (2026-04-26) Linked from /content/home too — exercises source_pages_total > 1
Product Brochure PDF~500 KB 1 year ago (2025-04-27)
Technical Whitepaper PDF~5 MB 3 years ago (2023-04-27) Largest fixture — exercises size column formatting
Fact Sheet DOCX~15 KB Yesterday (2026-04-26)
Slide Deck PPTX~25 KB 1 year ago (2025-04-27)
Data Table XLSX~10 KB unset No Last-Modified header — exercises the “—” fallback in the UI
Annual Report (second link) PDF~20 KB Yesterday (2026-04-26) Duplicate <a> to same URL on this page — dedup by canonical URL

Extension-less URL (not inventoried)

The link below points to /content/inventory/files/whitepaper-download, which serves Content-Type: application/pdf but has no file extension. The scanner classifies documents by extension only, so this URL will not appear in the Document Inventory tab. It is included here to document and verify that behaviour.

Whitepaper (no extension)