Parse & Extract with Cloud API & .NET SDK

.NET Parser SDK communicate with the REST API enabling you to build document parsing and data extraction tools on .NET platform.

  • GroupDocs.Parser Cloud for cURL
  • GroupDocs.Parser Cloud SDK for Java
  • GroupDocs.Parser Cloud SDK for PHP
  • GroupDocs.Parser Cloud SDK for Python
  • GroupDocs.Parser Cloud SDK for Ruby
  • GroupDocs.Parser Cloud SDK for Node.js
Start Free Trial

GroupDocs.Parser Cloud SDK facilitates .NET developers to parse any document to extract text, extract images and metadata within .NET based applications. SDK (REST API Client) is the easiest and quick way for the developer to speed up the development. This lets the developer focus on just to write the project-specific code and not to worry about the low-level details of making requests and handling the responses. Documents can be parsed by easy to use user-defined templates with data field definitions and table definitions. Then it’s simple to extract data such as text fields, numbers, tables from the typical documents and even from containers like ZIP archives, OST/PST mail data files, eBooks, markups, and PDF portfolios.

Frequently Asked Questions

I want to create my own .NET application that can parse files online?

Check our Cloud SDKs for .NET at GitHub if you are looking for the source code to parse files in the Cloud.

Can I try GroupDocs.Parser REST APIs on .NET for free?

You can try GroupDocs.Parser Low-Code .NET APIs without any limitations.

I do not want to upload my confidential files anywhere? What are my options?

GroupDocs.Parser Cloud is also available as Docker image which can be used to self-host the service. Or you may build your own services using GroupDocs.Parser High-code APIs which currently drive both our Free Apps and REST APIs.

Document Parser REST API Features

Parse PDF, word processing documents, spreadsheets & presentations

Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files

Parse Macro-Enabled document & templates for MS Word, Excel & PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting extraction mode

Extract text from the documents in ZIP archives or Emails

Retrieve documents from Emails, PDF portfolios & MS Outlook storages

Get document page count and other information

Extract Text from a Document by Page Number Range

GroupDocs.Parser Cloud API, provides several ways to extract text from the documents. You can extract only text, extract formatted text by setting extraction modes, extract from the specific pages by setting pages range. Following example shows how to extract text from a PDF document by setting page range.

Parse PDF document to extract text from specific pages - C# .NET

// Get AppKey and AppSID from https://dashboard.groupdocs.cloud
  string MyAppKey = "";
  string MyAppSid = "";

  var configuration = new Configuration(MyAppSid, MyAppKey);

  var apiInstance = new ParseApi(configuration);
  var fileInfo = new FileInfo
  {
  FilePath = "directory/document.pdf"
  };

  var options = new TextOptions
  {
  FileInfo = fileInfo,
  StartPageNumber = 1,
  CountPagesToExtract = 2
  };

  var request = new TextRequest(options);
  var response = apiInstance.Text(request);

  // For complete examples, visit https://github.com/groupdocs-parser-cloud/groupdocs-parser-cloud-dotnet-samples

Extract images from Documents using GroupDocs.Parser Free App

PSTODTTIFFEPUBMDXLSMPPTMPDFJP2EMLXTARONEMHTMLXHTMLEMLDOCXXLTXXLSXLSXCHMJPEGPNGCSVDOTDOTMXMLXLTPPSMXLSBMSGPPSXHTMLOTTPPTDOCDOTXFB2TXTOTSPOTTIFJPGPOTMXLAOSTBZ2XLTMDOCMZIPPPSBMPGIFXLAMOTPODSRTFPOTXRARODPGZPPTX

Extract images from Documents using GroupDocs.Parser Free App

ZIPPOTXEMLTARPPTXMSGJP2XLSMMHTMLEMLXBZ2OSTGIFDOTPPSMFB2XLSBDOCXHTMLDOCMOTPXLSPPTTIFPDFJPGXLTMRTFONEODSPNGPSTJPEGOTTODTDOCXLTMDRARXHTMLXMLXLAMDOTXXLTXPPSXEPUBPPSGZBMPPPTMPOTCSVCHMXLSXTIFFXLATXTODPPOTMDOTMOTS

Support and Learning Resources

GroupDocs.Parser Cloud also offers individual document parsing SDKs for other languages as listed below:

  English
4e77447