GroupDocs.Parser Cloud for cURL allows parsing and data extraction for almost all of the common business document formats. Using simple cURL commands; text and images can be extracted from the whole document or from the specific pages range. REST API also supports archives like ZIP to be parsed and extract images, document information and the details of the container items.

At a
Glance
Supported
File Formats
Platform
Independence

An overview of the features supported by the document parser Cloud API.‎

Features

Parse documents
Parse document by template
Extract text
Extract images
Extract metadata

Extract from Documents

Emails
eBooks
PDF files
PDF Portfolio
Files within ZIP archives
Text and Markup documents
Microsoft & OpenDocument

GroupDocs.Parser

Document parser Cloud API supported formats.

Microsoft Office Formats

Word Processing: DOC, DOCX, DOCM, DOT, DOTX, DOTM
Spreadsheets: XLS, XLSX, XLSM, XLSB, XLT, XLTX, XLTM, XLA, XLAM
Presentations: PPT, PPTX, PPTM, PPS, PPSX, PPSM, POT, POTX, POTM
OneNote: ONE

OpenDocument & Other Formats

OpenDocument Formats: ODT, OTT, ODS, OTS, ODP, OTP
Text: TXT, RTF
Markup: HTML, XHTML, MHTML, MD, XML
Portable Formats: PDF
Email: PST, OST, EML, EMLX, MSG
eBook Formats: EPUB, FB2, CHM
Other: NUMBERS, CSV, ZIP

GroupDocs.Parser

Supported Operating Systems and Frameworks

Document Parser REST API Features

Parse PDF, word processing documents, spreadsheets & presentations

Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files

Parse Macro-Enabled document & templates for MS Word, Excel & PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting extraction mode

Extract text from the documents in ZIP archives or Emails

Retrieve documents from Emails, PDF portfolios & MS Outlook storages

Get document page count and other information

Extract Text from a Document

Using GroupDocs.Parser Cloud API, you can quickly extract images, text, and metadata from the document placed in your Cloud storage. Following simple cURL command shows how to extract text from a document.

Parse MS Word Document to Extract Text - cURL

curl -X POST "https://api.groupdocs.cloud/v1.0/parser/text" \
-H "accept: application/json" \
-H "authorization: Bearer TOKEN-HERE" \
-H "Content-Type: application/json" \
-d "{ \"FileInfo\": { \"FilePath\": \"foldername/document.docx\", }}"

Parse Documents & Extract Data using cURL

GroupDocs.Parser
Cloud for cURL

Document Parser REST API Features

Parse PDF, word processing documents, spreadsheets & presentations

Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files

Parse Macro-Enabled document & templates for MS Word, Excel & PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting extraction mode

Extract text from the documents in ZIP archives or Emails

Retrieve documents from Emails, PDF portfolios & MS Outlook storages

Get document page count and other information

Extract Text from a Document

Parse MS Word Document to Extract Text - cURL

Extract images from Documents using GroupDocs.Parser Free App

Extract images from Documents using GroupDocs.Parser Free App

Support and Learning Resources

Learning Resources

Product Support

GroupDocs.Parser Cloud also offers individual document parsing SDKs for other languages as listed below:

GroupDocs.Parser
Cloud SDK for .NET

GroupDocs.Parser
Cloud SDK for Java

GroupDocs.Parser
Cloud SDK for PHP

GroupDocs.Parser
Cloud SDK for Python

GroupDocs.Parser
Cloud SDK for Ruby

GroupDocs.Parser
Cloud SDK for Node.js

Parse Documents & Extract Data using cURL

GroupDocs.ParserCloud for cURL

Document Parser REST API Features

Parse PDF, word processing documents, spreadsheets & presentations

Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files

Parse Macro-Enabled document & templates for MS Word, Excel & PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting extraction mode

Extract text from the documents in ZIP archives or Emails

Retrieve documents from Emails, PDF portfolios & MS Outlook storages

Get document page count and other information

Extract Text from a Document

Parse MS Word Document to Extract Text - cURL

Extract images from Documents using GroupDocs.Parser Free App

Extract images from Documents using GroupDocs.Parser Free App

Support and Learning Resources

Learning Resources

Product Support

GroupDocs.Parser Cloud also offers individual document parsing SDKs for other languages as listed below:

GroupDocs.ParserCloud SDK for .NET

GroupDocs.ParserCloud SDK for Java

GroupDocs.ParserCloud SDK for PHP

GroupDocs.ParserCloud SDK for Python

GroupDocs.ParserCloud SDK for Ruby

GroupDocs.ParserCloud SDK for Node.js

Cookies

Cookies Settings

GroupDocs.Parser
Cloud for cURL

GroupDocs.Parser
Cloud SDK for .NET

GroupDocs.Parser
Cloud SDK for Java

GroupDocs.Parser
Cloud SDK for PHP

GroupDocs.Parser
Cloud SDK for Python

GroupDocs.Parser
Cloud SDK for Ruby

GroupDocs.Parser
Cloud SDK for Node.js