Extract data from over than 40 popular document formats. Parse documents applying a template in a batch process.Start Free Trial
GroupDocs.Parser Cloud for Python allows to parse data from over 50 popular document types. Text extraction, image extraction, parsing data by template are most valuable features of GroupDocs.Parser and available in our Cloud REST API service. With our SDK it is possible to solve many document processing automation tasks and extract data easily just applying a predefined template. Along with the regular documents, you can use the supported features on password protected files and containers like ZIP archives, OST/PST mail data files, eBooks, markups, and PDF portfolios in your Python applications.
An overview of the features supported by the document parser Cloud API.
Document parser Cloud API supported formats.
Supported Operating Systems and Frameworks
Check our Cloud SDKs for Python at GitHub if you are looking for the source code to parse files in the Cloud.
You can try GroupDocs.Parser Low-Code Python APIs without any limitations.
GroupDocs.Parser Cloud is also available as Docker image which can be used to self-host the service. Or you may build your own services using GroupDocs.Parser High-code APIs which currently drive both our Free Apps and REST APIs.
GroupDocs.Parser Cloud API, allows to extract data from documents by a template. You may define a template for your typical documents such as invoices, receipts, reports and applying the template you can easily extract data for further processing. Following example shows how to extract data by template.
# For complete examples and data files, please go to https://github.com/groupdocs-parser-cloud/groupdocs-parser-cloud-python-samples import groupdocs_parser_cloud app_sid = "XXXX-XXXX-XXXX-XXXX" # Get AppKey and AppSID from https://dashboard.groupdocs.cloud app_key = "XXXXXXXXXXXXXXXX" # Get AppKey and AppSID from https://dashboard.groupdocs.cloud parseApi = groupdocs_parser_cloud.ParseApi.from_keys(app_sid, app_key) options = groupdocs_parser_cloud.ParseOptions() options.file_info = groupdocs_parser_cloud.FileInfo() options.file_info.file_path = "words-processing/docx/companies.docx" options.template_path = "templates/companies.json" request = groupdocs_parser_cloud.ParseRequest(options) result = parseApi.parse(request) ```