Extract data from your documents with this REST API & Node.js Cloud SDK

Extract text, images, metadata and other data from 40+ document formats. Apply template to extract tables or specific data fields.

Start Free Trial

GroupDocs.Parser Cloud for Node.js is a wrapper around document data extraction REST API to parse over 50 document types. Extract text in various modes, extract images, parsing documents with predefined templates are the most popular features of GroupDocs.Parser Cloud API. The API supports password protected files and containers like ZIP archives, OST/PST mail data files, eBooks, markups, and PDF portfolios in your Node.js applications.

Document Parser REST API Features

Parse PDF, word processing documents, spreadsheets & presentations

Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files

Parse Macro-Enabled document & templates for MS Word, Excel & PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting extraction mode

Extract text from the documents in ZIP archives or Emails

Retrieve documents from Emails, PDF portfolios & MS Outlook storages

Get document page count and other information

Extract Formatted text from a Document

GroupDocs.Parser Cloud API, provides several ways to extract text from documents. Following example shows how to extract formatted text from a document.

Extract formatted text from a document - Node.js

// For complete examples and data files, please go to https://github.com/groupdocs-parser-cloud/groupdocs-parser-cloud-node-samples
global.parser_cloud = require("groupdocs-parser-cloud");

global.appSid = "XXXX-XXXX-XXXX-XXXX"; // Get AppKey and AppSID from https://dashboard.groupdocs.cloud
global.appKey = "XXXXXXXXXXXXXXXX"; // Get AppKey and AppSID from https://dashboard.groupdocs.cloud

global.parseApi = parser_cloud.ParseApi.fromKeys(appSid, appKey);

let fileInfo = new parser_cloud.FileInfo();
fileInfo.filePath = "words-processing/docx/formatted-document.docx";

let options = new parser_cloud.TextOptions();
options.fileInfo = fileInfo;

let formattedTextOptions = new parser_cloud.FormattedTextOptions();
formattedTextOptions.mode = "Markdown";
options.formattedTextOptions = formattedTextOptions;

let request = new parser_cloud.TextRequest(options);
let response = await parseApi.text(request);
    ```

Support and Learning Resources

GroupDocs.Parser Cloud also offers individual document parsing SDKs for other languages as listed below: