Parse your documents & automate document processing via Ruby SDK

Build document processing solutions using Ruby REST API. Use our unified API with different document types without worrying about file format specifics.

  • GroupDocs.Parser Cloud for cURL
  • GroupDocs.Parser Cloud SDK for .NET
  • GroupDocs.Parser Cloud SDK for Java
  • GroupDocs.Parser Cloud SDK for PHP
  • GroupDocs.Parser Cloud SDK for Python
  • GroupDocs.Parser Cloud SDK for Node.js
Start Free Trial

GroupDocs.Parser Cloud SDK for Ruby and is a powerful REST API document data parsing solution that allows to parse over 50 document types. The SDK allows to build document processing automation solutions in a unified way. No matter what document format you receive - PDF or DOCX or any other supported one, you are allowed to extract data with the unified API. Parsing by template is a valuable feature, but the API also provides other methods to extract images and text.

Frequently Asked Questions

I want to create my own Ruby application that can parse files online?

Check our Cloud SDKs for Ruby at GitHub if you are looking for the source code to parse files in the Cloud.

Can I try GroupDocs.Parser REST APIs on Ruby for free?

You can try GroupDocs.Parser Low-Code Ruby APIs without any limitations.

I do not want to upload my confidential files anywhere? What are my options?

GroupDocs.Parser Cloud is also available as Docker image which can be used to self-host the service. Or you may build your own services using GroupDocs.Parser High-code APIs which currently drive both our Free Apps and REST APIs.

Document Parser REST API Features

Parse PDF, word processing documents, spreadsheets & presentations

Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files

Parse Macro-Enabled document & templates for MS Word, Excel & PowerPoint

Extract text content from the whole document

Extract text and images from specific pages

Extract formatted text by setting extraction mode

Extract text from the documents in ZIP archives or Emails

Retrieve documents from Emails, PDF portfolios & MS Outlook storages

Get document page count and other information

Extract Images from a Document

GroupDocs.Parser Cloud API, provides several ways to extract images from the documents. You can extract all the images from the whole document, extract images from the specific pages by setting pages range. Following example shows how to extract images from a document by a page number range.

Extract images by a page number range - Ruby

# For complete examples and data files, please go to https://github.com/groupdocs-parser-cloud/groupdocs-parser-cloud-ruby-samples
require 'groupdocs_viewer_cloud'

$app_sid = "XXXX-XXXX-XXXX-XXXX" # Get AppKey and AppSID from https://dashboard.groupdocs.cloud
$app_key = "XXXXXXXXXXXXXXXX" # Get AppKey and AppSID from https://dashboard.groupdocs.cloud

parseApi = GroupDocsParserCloud::ParseApi.from_keys($app_sid, $app_key)

options = GroupDocsParserCloud::ImagesOptions.new
options.file_info = GroupDocsParserCloud::FileInfo.new
options.file_info.file_path = "slides/three-slides.pptx"

options.start_page_number = 1
options.count_pages_to_extract = 2

request = GroupDocsParserCloud::ImagesRequest.new(options)
response = parseApi.images(request)
    ```

PSTODPTIFRTFPPSXTXTCSVPPTMPOTXPDFGIFXMLPPTDOCXPPSMTIFFXLSMXLAXHTMLONEEPUBXLSBXLSXMHTMLJPEGXLAMOSTDOCMOTSEMLPPTXPOTMODTDOTXDOTPOTXLTGZXLTXBZ2RARMSGDOTMOTTXLTMMDPNGCHMOTPBMPPPSEMLXHTMLJPGFB2JP2ZIPDOCODSTARXLS

BMPPNGPPSMXLAODPODTDOCPSTOTSXLTMZIPPPSXODSOSTMHTMLXHTMLTXTBZ2PPTXJPGXLSXJP2XLSJPEGPOTXDOTXDOCXPPTMHTMLOTPTIFDOTMXLSMXLTXEMLPOTMRARTIFFEMLXTARMDCSVPDFMSGGZPPTOTTEPUBDOTRTFDOCMFB2GIFXLSBPOTPPSONEXMLXLTXLAMCHM

GroupDocs.Parser Cloud also offers individual document parsing SDKs for other languages as listed below:

  English
4b2bd44