Package com.extractpdf4j.parsers


Implements the primary PDF parsing strategies and extraction components used to convert document content into structured tabular output.
  • Classes
    Class              Description
    BaseParser
    HybridParser
    LatticeParser
    OcrStreamParser    (header-aware) Removes horizontal and vertical rules before OCR.
    StreamParser
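
The OcrStreamParser entry above mentions removing horizontal and vertical rules before OCR. The sketch below illustrates that general idea on a binarized page and is not the library's implementation. It assumes the page is already available as a boolean grid (true = ink) and treats any straight run of ink at least minRun pixels long as a rule. The class name RuleRemovalSketch, the method removeRules, and the minRun parameter are all hypothetical names introduced here for illustration.

```java
/**
 * Illustrative sketch only: clears long straight runs of ink (table rules)
 * from a binarized page so that only text-like pixels remain for OCR.
 * All names here are hypothetical and not part of com.extractpdf4j.parsers.
 */
final class RuleRemovalSketch {

    private RuleRemovalSketch() {}

    /**
     * Returns a copy of {@code ink} in which every horizontal or vertical run
     * of at least {@code minRun} consecutive ink pixels has been cleared.
     * The input grid is not modified.
     */
    static boolean[][] removeRules(boolean[][] ink, int minRun) {
        int h = ink.length;
        int w = (h == 0) ? 0 : ink[0].length;
        boolean[][] rule = new boolean[h][w];

        // Horizontal pass: mark runs along each row.
        for (int y = 0; y < h; y++) {
            int start = 0;
            for (int x = 0; x <= w; x++) {
                if (x == w || !ink[y][x]) {
                    if (x - start >= minRun) {
                        for (int i = start; i < x; i++) rule[y][i] = true;
                    }
                    start = x + 1;
                }
            }
        }

        // Vertical pass: mark runs along each column.
        for (int x = 0; x < w; x++) {
            int start = 0;
            for (int y = 0; y <= h; y++) {
                if (y == h || !ink[y][x]) {
                    if (y - start >= minRun) {
                        for (int i = start; i < y; i++) rule[i][x] = true;
                    }
                    start = y + 1;
                }
            }
        }

        // Both passes read the original grid, so pixels where a horizontal
        // and a vertical rule cross are removed along with both rules.
        boolean[][] out = new boolean[h][w];
        for (int y = 0; y < h; y++) {
            for (int x = 0; x < w; x++) {
                out[y][x] = ink[y][x] && !rule[y][x];
            }
        }
        return out;
    }
}
```

A caller would pick minRun larger than the longest stroke expected in a glyph, for example RuleRemovalSketch.removeRules(page, 40) on a scan where characters are about 20 pixels tall. Marking both directions before clearing anything keeps the two passes independent of each other.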