Parse Documents & Extract Data using cURL

Parse PDF, Word, Excel, web, eBook, email, and ZIP archive files to extract text, images, and metadata using a REST API.


GroupDocs.Parser Cloud for cURL

 

GroupDocs.Parser Cloud for cURL supports parsing and data extraction for most common business document formats. Using simple cURL commands, you can extract text and images from the whole document or from a specific page range. The REST API can also parse archives such as ZIP to extract images, document information, and details of the container items.


Document Parser REST API Features

 

 

Parse PDF, word processing documents, spreadsheets, and presentations

Parse Microsoft Word, Excel, PowerPoint, and OpenDocument template files

Parse macro-enabled documents and templates for MS Word, Excel, and PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting the extraction mode

Extract text from documents inside ZIP archives or emails

Retrieve documents from emails, PDF portfolios, and MS Outlook storage files

Get document page count and other information

Extract Text from a Document

Using the GroupDocs.Parser Cloud API, you can quickly extract images, text, and metadata from a document stored in your cloud storage. The following cURL command shows how to extract text from a document.

Parse MS Word Document to Extract Text - cURL


curl -X POST "https://api.groupdocs.cloud/v1.0/parser/text" \
-H "accept: application/json" \
-H "authorization: Bearer TOKEN-HERE" \
-H "Content-Type: application/json" \
-d "{ \"FileInfo\": { \"FilePath\": \"foldername/document.docx\" }}"
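Hand-escaping the JSON body inside a double-quoted string is easy to get wrong, so it can help to build the body in a small helper. The following is a minimal sketch, not an official client: the function names `build_text_payload` and `extract_text` and the `PARSER_TOKEN` environment variable are hypothetical names chosen for illustration, while the endpoint, headers, and `FileInfo`/`FilePath` fields are taken from the command above.

```shell
#!/bin/sh

# Build the JSON request body for the text endpoint.
# $1 is the path of the document inside cloud storage.
build_text_payload() {
  # Escape backslashes and double quotes so the path stays a valid JSON string.
  escaped=$(printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')
  printf '{"FileInfo": {"FilePath": "%s"}}' "$escaped"
}

# Send the request shown above for the given storage path.
# PARSER_TOKEN is an assumed variable name holding the bearer token.
extract_text() {
  curl -X POST "https://api.groupdocs.cloud/v1.0/parser/text" \
    -H "accept: application/json" \
    -H "authorization: Bearer ${PARSER_TOKEN}" \
    -H "Content-Type: application/json" \
    -d "$(build_text_payload "$1")"
}
```

With the token exported as PARSER_TOKEN, calling extract_text "foldername/document.docx" should send the same request as the command above. The helper only escapes backslashes and double quotes, so paths containing control characters would need a proper JSON encoder.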

Support and Learning Resources

 
  

GroupDocs.Parser Cloud also offers individual document parser SDKs for other popular languages as listed below: