PyPDFParser#
- class langchain_community.document_loaders.parsers.pdf.PyPDFParser(password: str | bytes | None = None, extract_images: bool = False, *, extraction_mode: str = 'plain', extraction_kwargs: Dict[str, Any] | None = None)[source]#
Load PDF using pypdf
Methods
__init__
([password,Β extract_images,Β ...])lazy_parse
(blob)Lazily parse the blob.
parse
(blob)Eagerly parse the blob into a document or documents.
- Parameters:
password (Optional[Union[str, bytes]]) β
extract_images (bool) β
extraction_mode (str) β
extraction_kwargs (Optional[Dict[str, Any]]) β
- __init__(password: str | bytes | None = None, extract_images: bool = False, *, extraction_mode: str = 'plain', extraction_kwargs: Dict[str, Any] | None = None)[source]#
- Parameters:
password (str | bytes | None) β
extract_images (bool) β
extraction_mode (str) β
extraction_kwargs (Dict[str, Any] | None) β