Parse your documents & automate document processing via Ruby SDK
Build document processing solutions using the Ruby REST API. Use our unified API with different document types without worrying about file format specifics.
GroupDocs.Parser Cloud SDK for Ruby
GroupDocs.Parser Cloud SDK for Ruby is a powerful REST API solution for parsing document data that supports over 50 document types. The SDK lets you build document processing automation solutions in a unified way. Whether you receive a PDF, a DOCX, or any other supported format, you can extract data through the same unified API.
Parsing by template is a valuable feature, but the API also provides other methods to extract images and text.
Document Parser REST API Features
Parse PDF, word processing documents, spreadsheets & presentations
Parse Microsoft Word, Excel, PowerPoint and OpenDocument template files
Parse macro-enabled documents & templates for MS Word, Excel & PowerPoint
Extract text content from the whole document
Extract text and images from specific pages
Extract formatted text by setting extraction mode
Extract text from documents in ZIP archives or emails
Retrieve documents from emails, PDF portfolios & MS Outlook storages
Get document page count and other information
Extract Images from a Document
The GroupDocs.Parser Cloud API provides several ways to extract images from documents. You can extract all images from the whole document, or only the images on specific pages by setting a page range. The following example shows how to extract images from a document by page number range.
Extract images by a page number range - Ruby
# For complete examples and data files, please go to https://github.com/groupdocs-parser-cloud/groupdocs-parser-cloud-ruby-samples
require 'groupdocs_parser_cloud'
# Get AppKey and AppSID from https://dashboard.groupdocs.cloud
app_sid = "XXXX-XXXX-XXXX-XXXX"
app_key = "XXXXXXXXXXXXXXXX"
# Create the API client from the credentials
parse_api = GroupDocsParserCloud::ParseApi.from_keys(app_sid, app_key)
# Describe the source file and the pages to extract images from
options = GroupDocsParserCloud::ImagesOptions.new
options.file_info = GroupDocsParserCloud::FileInfo.new
options.file_info.file_path = "slides/three-slides.pptx"
options.start_page_number = 1
options.count_pages_to_extract = 2
# Send the request and receive the extracted images
request = GroupDocsParserCloud::ImagesRequest.new(options)
response = parse_api.images(request)
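The example above selects pages with a start page and a page count rather than a first and last page. As a minimal sketch, the conversion between the two could look like the following. The `page_window` helper is hypothetical and not part of the GroupDocs SDK. It assumes both page numbers use whatever numbering the API expects for `start_page_number`, which this page does not specify.

```ruby
# Hypothetical helper (not part of the GroupDocs SDK): converts an inclusive
# first/last page pair into the start/count pair used by the example above.
# Assumption: both arguments use the same page numbering as start_page_number.
def page_window(first_page, last_page)
  raise ArgumentError, "last_page must be >= first_page" if last_page < first_page

  {
    start_page_number: first_page,
    count_pages_to_extract: last_page - first_page + 1
  }
end

window = page_window(1, 2)
puts window[:start_page_number]      # prints 1
puts window[:count_pages_to_extract] # prints 2
```

The two values can then be assigned to `options.start_page_number` and `options.count_pages_to_extract` before the request is built.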
Support and Learning Resources
- Learning Resources
- Documentation
- Source Code
- API References
- Product Support
- Free Support
- Blog