Document processing made easy with this document parsing REST API and Python SDK

Extract data from over than 40 popular document formats. Parse documents applying a template in a batch process.

Start Free Trial
GroupDocs.Parser Cloud SDK for Python

GroupDocs.Parser Cloud SDK for Python


GroupDocs.Parser Cloud for Python allows to parse data from over 50 popular document types. Text extraction, image extraction, parsing data by template are most valuable features of GroupDocs.Parser and available in our Cloud REST API service. With our SDK it is possible to solve many document processing automation tasks and extract data easily just applying a predefined template.

Along with the regular documents, you can use the supported features on password protected files and containers like ZIP archives, OST/PST mail data files, eBooks, markups, and PDF portfolios in your Python applications.

Previous Next

Document Parser REST API Features



Parse PDF, word processing documents, spreadsheets & presentations


Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files


Parse Macro-Enabled document & templates for MS Word, Excel & PowerPoint


Extract text content from the whole document


Extract text and images from specific pages


Extract formatted text by setting extraction mode


Extract text from the documents in ZIP archives or Emails


Retrieve documents from Emails, PDF portfolios & MS Outlook storages


Get document page count and other information

Extract Data from a Document applying e template

GroupDocs.Parser Cloud API, allows to extract data from documents by a template. You may define a template for your typical documents such as invoices, receipts, reports and applying the template you can easily extract data for further processing. Following example shows how to extract data by template.

Extract data from documents by template - Python

# For complete examples and data files, please go to
import groupdocs_parser_cloud
app_sid = "XXXX-XXXX-XXXX-XXXX" # Get AppKey and AppSID from
app_key = "XXXXXXXXXXXXXXXX" # Get AppKey and AppSID from
parseApi = groupdocs_parser_cloud.ParseApi.from_keys(app_sid, app_key)
options = groupdocs_parser_cloud.ParseOptions()
options.file_info = groupdocs_parser_cloud.FileInfo()
options.file_info.file_path = "words-processing/docx/companies.docx"
options.template_path = "templates/companies.json"
request = groupdocs_parser_cloud.ParseRequest(options)
result = parseApi.parse(request)

Support and Learning Resources


GroupDocs.Parser Cloud also offers individual document parser SDKs for other popular languages as listed below: