Solutions

Every PDF operation your pipeline needs.

Automate PDF document processing at scale with PDFluent: extract text, split/merge pages, apply redactions, and convert to PDF/A — all in pure Rust.

Start evaluation Read the docs

Code example

rust

use pdfluent::PdfDocument;

fn main() -> pdfluent::Result<()> {
    let doc = PdfDocument::open("invoice.pdf")?;

    // Positioned text blocks: text + bounding box + page
    for block in doc.text_with_layout()? {
        println!("p{} {:?} {}", block.page, block.bbox, block.text);
    }
    Ok(())
}

Run cargo add [email protected] to get started.

What it does

Text extraction

Extract text with full layout context: font names, sizes, coordinates, reading order, and paragraph boundaries. Useful for search indexing, redaction detection, and content migration.

OCR — local and cloud

Local OCR via ocrs (WASM-compatible, no server upload). Cloud adapters for Mistral OCR, Google Document AI, AWS Textract, and Azure Form Recognizer. Same API, swappable backend.

Redaction

Find and permanently remove sensitive content — PII, account numbers, legal references. Redaction burns through to the content stream, not just the visual layer.

Merge and split

Split by page range, bookmark, or content pattern. Merge multiple PDFs with bookmark and page label preservation. Works on linearised and encrypted files.

Page manipulation

Rotate, crop, resize, and reorder pages. Add watermarks, headers, footers, and overlays. Flatten annotations and form fields.

Format conversion

Convert PDF to DOCX, XLSX, and PPTX with layout fidelity. Convert Office documents to PDF. Render pages to PNG, JPEG, or SVG at any DPI.

Deployment options

Server-side (Rust binary)Docker containerAWS LambdaAzure FunctionsKubernetes

Frequently asked questions

Related how-to guides

Extract text from a PDF in Rust Merge PDFs in Rust Redact a PDF in Rust

View pricing