Can Swiftype index PDF documents?


#1

Yes! We now offer crawler-based indexing of PDFs (up to ~10MB in size) to our Pro and Enterprise level plans.

To detect PDFs for indexing, the URLs will need to be discoverable within your site’s HTML pages or included in an installed sitemap.

Swiftype is able to extract text within the body of the PDF document, as well as any values within the PDF files standard metadata fields:

  • title
  • author
  • subject
  • keywords

Indexing of image based content or OCR processing is not currently supported.