Uses of Class
com.extractpdf4j.parsers.HybridParser
Packages that use HybridParser
Package
Description
Implements the primary PDF parsing strategies and extraction components used to convert
document content into structured tabular output.
-
Uses of HybridParser in com.extractpdf4j.parsers
Methods in com.extractpdf4j.parsers that return HybridParserModifier and TypeMethodDescriptionHybridParser.debug(boolean on) Enables or disables debug outputs for lattice/OCR strategies.Directory where debug artifacts should be written (lattice + OCR).HybridParser.dpi(float dpi) Sets DPI for image-based parsing (used by lattice + OCR strategies).HybridParser.keepCells(boolean on) Whether to preserve empty cells when reconstructing grids (lattice only).HybridParser.minScore(double score) Sets the minimum allowed average score across a list of tables.HybridParser.stripText(boolean strip) Enables or disables text normalization for stream-style extraction across all underlying strategies.