Docling Parse PDF Extractor

Native C++ PDF parsing engine used by Docling to extract text with precise coordinates from programmatic (non-scanned) PDF files. Distributed as a Python extension.
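
To make "text with precise coordinates" concrete, here is a minimal sketch of how a consumer might use such output. It does not call docling-parse: the `TextCell` type, the `group_into_lines` helper, and the sample values are all hypothetical, and the bottom-left origin is an assumption borrowed from default PDF user space.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TextCell:
    """Hypothetical text cell: a string plus its bounding box in PDF points."""
    text: str
    x0: float  # left edge
    y0: float  # bottom edge (assumes a bottom-left origin)
    x1: float  # right edge
    y1: float  # top edge


def group_into_lines(cells, y_tolerance=2.0):
    """Group cells with nearly equal bottom edges into lines, top to bottom."""
    lines = []
    # Visit cells from the top of the page down, then left to right.
    for cell in sorted(cells, key=lambda c: (-c.y0, c.x0)):
        # Compare against the first cell of the current line.
        if lines and abs(lines[-1][0].y0 - cell.y0) <= y_tolerance:
            lines[-1].append(cell)
        else:
            lines.append([cell])
    # Order each line left to right and join the words with spaces.
    return [
        " ".join(c.text for c in sorted(line, key=lambda c: c.x0))
        for line in lines
    ]


# Made-up sample cells, not output from a real PDF.
SAMPLE_CELLS = [
    TextCell("parsing", 130.0, 700.0, 170.0, 712.0),
    TextCell("Native", 72.0, 700.5, 105.0, 712.5),
    TextCell("engine", 72.0, 684.0, 108.0, 696.0),
    TextCell("PDF", 108.0, 700.0, 127.0, 712.0),
]

if __name__ == "__main__":
    for line in group_into_lines(SAMPLE_CELLS):
        print(line)
```

Because every cell carries its own coordinates, reading order can be rebuilt even when the cells arrive out of sequence, which is the practical benefit of coordinate-level extraction.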

Docling Parse PDF Extractor is one of 16 APIs that Docling publishes on the APIs.io network.

It is tagged PDF, Parsing, and C++. The artifacts published on APIs.io are a documentation link and a source-code link, both pointing to the project's GitHub repository.

API entry from apis.yml

aid: docling:docling-parse
name: Docling Parse PDF Extractor
tags:
- PDF
- Parsing
- C++
humanURL: https://github.com/docling-project/docling-parse
properties:
- url: https://github.com/docling-project/docling-parse
  type: Documentation
- url: https://github.com/docling-project/docling-parse
  type: SourceCode
description: Native C++ PDF parsing engine used by Docling to extract text with precise coordinates from
  programmatic (non-scanned) PDF files. Distributed as a Python extension.