Can Swiftype index PDF documents?

system · December 21, 2015, 10:30pm

Yes! We now offer crawler-based indexing of PDFs (up to ~10MB in size) to our Pro and Enterprise level plans.

To detect PDFs for indexing, the URLs will need to be discoverable within your site’s HTML pages or included in an installed sitemap.

Swiftype is able to extract text within the body of the PDF document, as well as any values within the PDF files standard metadata fields:

Indexing of image based content or OCR processing is not currently supported.

Topic		Replies	Views
Searching the body of a PDF plus having custom fields API api , crawler	1	5262	May 15, 2018
Can I index a PDF document using the API? API	1	7291	December 18, 2019
How to return only PDF and Word files?	2	3370	April 5, 2018
I am using Swiftype site search. I am wondering if it's possible to download the indexing file?	0	6787	November 7, 2019
How to index private pay-wall content Helpline	1	5235	April 14, 2017