Uses of Class
com.extractpdf4j.parsers.BaseParser
Packages that use BaseParser
Package
Description
com.extractpdf4j.annotations
Defines annotations used throughout ExtractPDF4J for configuration, metadata declaration,
and extension points.
com.extractpdf4j.parsers
Implements the primary PDF parsing strategies and extraction components used to convert
document content into structured tabular output.
Uses of BaseParser in com.extractpdf4j.annotations
Methods in com.extractpdf4j.annotations that return BaseParser
Modifier and Type    Method    Description
static BaseParser    ExtractPdfAnnotations.parserFrom(Class<?> type)    Builds a parser instance (no filepath) from the ExtractPdfConfig annotation on a class.
static BaseParser    ExtractPdfAnnotations.parserFrom(Class<?> type, String filepath)    Builds a parser instance from the ExtractPdfConfig annotation on a class.
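The two parserFrom overloads build a parser from a class-level annotation. The sketch below illustrates that general pattern with reflection. It is not the library's implementation: SketchConfig, its strategy attribute, SketchParser and its subclasses, and InvoiceJob are all hypothetical stand-ins, since the attributes of ExtractPdfConfig are not listed on this page.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;

// Hypothetical stand-in for ExtractPdfConfig; the "strategy" attribute is an assumption.
@Retention(RetentionPolicy.RUNTIME)
@Target(ElementType.TYPE)
@interface SketchConfig {
    String strategy() default "stream";
}

// Hypothetical stand-in for BaseParser: it only remembers the filepath it was given.
abstract class SketchParser {
    final String filepath; // null when the parser is built without a filepath

    SketchParser(String filepath) {
        this.filepath = filepath;
    }
}

class SketchStreamParser extends SketchParser {
    SketchStreamParser(String filepath) { super(filepath); }
}

class SketchLatticeParser extends SketchParser {
    SketchLatticeParser(String filepath) { super(filepath); }
}

final class SketchAnnotations {
    private SketchAnnotations() {}

    // Mirrors the one-argument overload: no filepath is supplied.
    static SketchParser parserFrom(Class<?> type) {
        return parserFrom(type, null);
    }

    // Reads the class-level annotation and picks a parser implementation from it.
    static SketchParser parserFrom(Class<?> type, String filepath) {
        SketchConfig cfg = type.getAnnotation(SketchConfig.class);
        if (cfg == null) {
            throw new IllegalArgumentException(
                type.getName() + " is not annotated with @SketchConfig");
        }
        switch (cfg.strategy()) {
            case "stream":
                return new SketchStreamParser(filepath);
            case "lattice":
                return new SketchLatticeParser(filepath);
            default:
                throw new IllegalArgumentException("Unknown strategy: " + cfg.strategy());
        }
    }
}

// Example configuration holder (hypothetical).
@SketchConfig(strategy = "lattice")
class InvoiceJob {}
```

With these stand-ins, SketchAnnotations.parserFrom(InvoiceJob.class, "invoice.pdf") yields a SketchLatticeParser bound to that path, while the one-argument overload leaves the filepath unset.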
Uses of BaseParser in com.extractpdf4j.parsers
Subclasses of BaseParser in com.extractpdf4j.parsers
Modifier and Type    Class    Description
class    HybridParser
class    LatticeParser
class    OcrStreamParser    (header-aware) Removes horizontal *and* vertical rules before OCR.
class    StreamParser
Methods in com.extractpdf4j.parsers that return BaseParser
Modifier and Type    Method    Description
Sets the pages to parse.
Sets the page selection for this parser and propagates the same selection to all underlying strategies.
BaseParser.stripText(boolean strip)    Enables or disables text normalization for stream-style extraction.
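Methods that return BaseParser allow configuration calls to be chained, and a composite parser can forward a page selection to the strategies it wraps. The following is a minimal sketch of that pattern using stand-in classes. DemoParser and its subclasses are hypothetical, selectPages is an illustrative name (the page-setting method's signature is not shown on this page), and the default values are assumptions.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for BaseParser; not the library's class.
abstract class DemoParser {
    protected String pageSelection = "1"; // assumed default
    protected boolean strip = true;       // assumed default

    // Illustrative name: returns this so that calls can be chained.
    DemoParser selectPages(String selection) {
        this.pageSelection = selection;
        return this;
    }

    // Same shape as stripText(boolean strip) in the summary: a fluent setter.
    DemoParser stripText(boolean strip) {
        this.strip = strip;
        return this;
    }

    String describe() {
        return getClass().getSimpleName()
            + "[pages=" + pageSelection + ", strip=" + strip + "]";
    }
}

class DemoStreamParser extends DemoParser {}

class DemoLatticeParser extends DemoParser {}

// Composite parser that forwards its page selection to every wrapped strategy.
class DemoHybridParser extends DemoParser {
    private final List<DemoParser> strategies = new ArrayList<>();

    DemoHybridParser() {
        strategies.add(new DemoStreamParser());
        strategies.add(new DemoLatticeParser());
    }

    @Override
    DemoParser selectPages(String selection) {
        super.selectPages(selection);
        for (DemoParser strategy : strategies) {
            strategy.selectPages(selection);
        }
        return this;
    }

    List<String> describeStrategies() {
        List<String> out = new ArrayList<>();
        for (DemoParser strategy : strategies) {
            out.add(strategy.describe());
        }
        return out;
    }
}
```

Calling new DemoHybridParser().selectPages("2-3").stripText(false) returns the same hybrid instance, and both wrapped strategies then report the "2-3" selection. Only the page selection is forwarded in this sketch, because that is the only propagation the summary describes.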