Extract data from your documents with this REST API & Node.js Cloud SDK

Extract text, images, metadata and other data from 40+ document formats. Apply template to extract tables or specific data fields.

  • GroupDocs.Parser Cloud for cURL
  • GroupDocs.Parser Cloud SDK for .NET
  • GroupDocs.Parser Cloud SDK for Java
  • GroupDocs.Parser Cloud SDK for PHP
  • GroupDocs.Parser Cloud SDK for Python
  • GroupDocs.Parser Cloud SDK for Ruby
Start Free Trial

GroupDocs.Parser Cloud for Node.js is a wrapper around document data extraction REST API to parse over 50 document types. Extract text in various modes, extract images, parsing documents with predefined templates are the most popular features of GroupDocs.Parser Cloud API. The API supports password protected files and containers like ZIP archives, OST/PST mail data files, eBooks, markups, and PDF portfolios in your Node.js applications.

Frequently Asked Questions

I want to create my own Node.js application that can parse files online?

Check our Cloud SDKs for Node.js at GitHub if you are looking for the source code to parse files in the Cloud.

Can I try GroupDocs.Parser REST APIs on Node.js for free?

You can try GroupDocs.Parser Low-Code Node.js APIs without any limitations.

I do not want to upload my confidential files anywhere? What are my options?

GroupDocs.Parser Cloud is also available as Docker image which can be used to self-host the service. Or you may build your own services using GroupDocs.Parser High-code APIs which currently drive both our Free Apps and REST APIs.

Document Parser REST API Features

Parse PDF, word processing documents, spreadsheets & presentations

Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files

Parse Macro-Enabled document & templates for MS Word, Excel & PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting extraction mode

Extract text from the documents in ZIP archives or Emails

Retrieve documents from Emails, PDF portfolios & MS Outlook storages

Get document page count and other information

Extract Formatted text from a Document

GroupDocs.Parser Cloud API, provides several ways to extract text from documents. Following example shows how to extract formatted text from a document.

Extract formatted text from a document - Node.js

// For complete examples and data files, please go to https://github.com/groupdocs-parser-cloud/groupdocs-parser-cloud-node-samples
global.parser_cloud = require("groupdocs-parser-cloud");

global.appSid = "XXXX-XXXX-XXXX-XXXX"; // Get AppKey and AppSID from https://dashboard.groupdocs.cloud
global.appKey = "XXXXXXXXXXXXXXXX"; // Get AppKey and AppSID from https://dashboard.groupdocs.cloud

global.parseApi = parser_cloud.ParseApi.fromKeys(appSid, appKey);

let fileInfo = new parser_cloud.FileInfo();
fileInfo.filePath = "words-processing/docx/formatted-document.docx";

let options = new parser_cloud.TextOptions();
options.fileInfo = fileInfo;

let formattedTextOptions = new parser_cloud.FormattedTextOptions();
formattedTextOptions.mode = "Markdown";
options.formattedTextOptions = formattedTextOptions;

let request = new parser_cloud.TextRequest(options);
let response = await parseApi.text(request);
    ```

OTTPDFFB2PSTTARXLTMDOCDOTMBMPPOTXLAMODPXLTXGIFRARXLSBJP2ONEJPEGJPGODSPOTXEMLXTIFFPNGXLSXODTOSTCHMXLTXLAPOTMRTFOTPXHTMLXMLMSGDOTXTXTTIFDOTEPUBPPSXBZ2PPSMPPTXOTSDOCXMHTMLPPTMGZEMLHTMLXLSMCSVDOCMPPSXLSMDPPTZIP

JPEGPOTXBMPXLSBDOCXXLSMXLTMODTPSTPPTONEDOTMMSGMHTMLHTMLXLSDOTFB2EPUBXLAGZPNGXLTXBZ2PPSXXLTXMLZIPTIFEMLXJPGRAROTSOTPPOTMCSVDOCMXLAMGIFTIFFPPSMXLSXPPTMPPTXOTTRTFPDFDOCTXTXHTMLODSODPPOTMDDOTXTARCHMPPSEMLJP2OST

GroupDocs.Parser Cloud also offers individual document parsing SDKs for other languages as listed below:

  English
4b2bd44