How to return only PDF and Word files?


#1

I have a search engine crawling the whole site. On some pages of the site, there are links to PDF or Word files.

How can I only get the PDF or Word files in my search results? Or can I do a faceted search for PDF and Word files?


#2

Does the crawler currently index the PDF and Word files already, or are you also trying to figure out how to grab them?


#3

The crawler has indexed the files already. All the files’ url starts with /-/media/files.