Version 5
supported
This version of Silverstripe CMS is still supported though will not receive any additional features.
Go to documentation for the most recent stable version.
Text extraction
This module provides a framework for extracting text content from various file formats, such as PDFs and Office documents. The extracted content can be used programmatically or made available directly on your File
objects.
Installation
composer require silverstripe/textextraction
GitHub repository
https://github.com/silverstripe/silverstripe-textextraction