Parse Documents & Extract Data using cURL

Parse PDF, Word, Excel, web, eBook, email, and ZIP archive formats to extract text, images, and metadata using a REST API.

  • GroupDocs.Parser Cloud SDK for .NET
  • GroupDocs.Parser Cloud SDK for Java
  • GroupDocs.Parser Cloud SDK for PHP
  • GroupDocs.Parser Cloud SDK for Python
  • GroupDocs.Parser Cloud SDK for Ruby
  • GroupDocs.Parser Cloud SDK for Node.js

GroupDocs.Parser Cloud for cURL supports parsing and data extraction for most common business document formats. With simple cURL commands, text and images can be extracted from the whole document or from a specific page range. The REST API can also parse archives such as ZIP to extract images, document information, and details of the container items.

Document Parser REST API Features

Parse PDF, word processing documents, spreadsheets & presentations

Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files

Parse macro-enabled documents & templates for MS Word, Excel & PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting extraction mode

Extract text from documents in ZIP archives or emails

Retrieve documents from Emails, PDF portfolios & MS Outlook storages

Get document page count and other information
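The page-level extraction listed above could be requested with a JSON body along the following lines. This is a minimal sketch, not the documented request format: the field names `StartPageNumber` and `CountPagesToExtract` are assumptions that should be checked against the API reference, and `build_page_range_payload` is a hypothetical helper name. Only the `FileInfo.FilePath` field and the endpoint in the commented usage line come from the cURL example on this page.

```shell
#!/bin/sh
# build_page_range_payload PATH START COUNT
# Prints a JSON request body asking for COUNT pages beginning at page START.
# ASSUMPTION: "StartPageNumber" and "CountPagesToExtract" are the field names
# the service expects; verify them in the API reference before use.
build_page_range_payload() {
    # Accept only plain decimal integers without leading zeros,
    # so the values are always valid JSON numbers.
    for n in "$2" "$3"; do
        case "$n" in
            ''|*[!0-9]*|0?*)
                echo "START and COUNT must be non-negative integers" >&2
                return 1
                ;;
        esac
    done

    # Escape backslashes and double quotes so the path is a valid JSON string.
    escaped=$(printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')

    printf '{"FileInfo": {"FilePath": "%s"}, "StartPageNumber": %s, "CountPagesToExtract": %s}' \
        "$escaped" "$2" "$3"
}

# Possible usage (performs a network call, so it is left commented out):
# curl -X POST "https://api.groupdocs.cloud/v1.0/parser/text" \
#   -H "accept: application/json" \
#   -H "authorization: Bearer TOKEN-HERE" \
#   -H "Content-Type: application/json" \
#   -d "$(build_page_range_payload "foldername/document.docx" 1 2)"
```

Generating the body in a helper keeps the quoting in one place, which is easier to get right than hand-escaping every quote inside the `-d` argument.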

Extract Text from a Document

Using the GroupDocs.Parser Cloud API, you can quickly extract images, text, and metadata from a document stored in your Cloud storage. The following cURL command shows how to extract text from a document.

Parse MS Word Document to Extract Text - cURL

curl -X POST "https://api.groupdocs.cloud/v1.0/parser/text" \
-H "accept: application/json" \
-H "authorization: Bearer TOKEN-HERE" \
-H "Content-Type: application/json" \
-d "{ \"FileInfo\": { \"FilePath\": \"foldername/document.docx\" } }"
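
For repeated use, the command above can be wrapped in a small shell script. The sketch below reuses the same endpoint, headers, and `FileInfo.FilePath` field. The function names `build_text_payload` and `extract_text` and the `GROUPDOCS_TOKEN` environment variable are hypothetical names chosen for illustration, not part of the API.

```shell
#!/bin/sh
# Endpoint taken from the cURL example above.
PARSER_TEXT_URL="https://api.groupdocs.cloud/v1.0/parser/text"

# build_text_payload PATH
# Prints the JSON request body for the document stored at PATH.
build_text_payload() {
    # Escape backslashes and double quotes so the path is a valid JSON string.
    escaped=$(printf '%s' "$1" | sed -e 's/\\/\\\\/g' -e 's/"/\\"/g')
    printf '{"FileInfo": {"FilePath": "%s"}}' "$escaped"
}

# extract_text PATH
# Sends the text extraction request for PATH and prints the response.
# GROUPDOCS_TOKEN is a hypothetical variable holding a valid access token.
extract_text() {
    if [ -z "${GROUPDOCS_TOKEN:-}" ]; then
        echo "GROUPDOCS_TOKEN is not set" >&2
        return 1
    fi
    curl -sS -X POST "$PARSER_TEXT_URL" \
        -H "accept: application/json" \
        -H "authorization: Bearer $GROUPDOCS_TOKEN" \
        -H "Content-Type: application/json" \
        -d "$(build_text_payload "$1")"
}

# Example (performs a network call, so it is left commented out):
# extract_text "foldername/document.docx"
```

Keeping the token in an environment variable avoids pasting it into the command line, and building the body with `printf` avoids the nested backslash escapes that make hand-written JSON easy to break.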

Extract images from Documents using GroupDocs.Parser Free App

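PPTX, ODP, XLS, EML, BZ2, XLT, GIF, TAR, MHTML, PPSX, PPTM, PNG, ZIP, DOTX, EPUB, MD, ONE, PPSM, OTS, DOC, HTML, FB2, RAR, XLSB, DOTM, PPT, DOCX, XLSX, JP2, TIFF, TIF, PST, GZ, CSV, OTT, DOCM, DOT, OTP, RTF, CHM, PPS, XLTX, XML, BMP, XLTM, XLA, XLAM, EMLX, MSG, POTM, JPEG, POTX, XHTML, ODT, XLSM, OST, JPG, POT, TXT, PDF, ODS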


Support and Learning Resources

GroupDocs.Parser Cloud also offers individual document parsing SDKs for .NET, Java, PHP, Python, Ruby, and Node.js.
