Docugami converts business documents into a Document XML Knowledge Graph, generating forests of XML semantic trees representing entire documents. This is a rich representation that includes the semantic and structural characteristics of various chunks in the document as an XML tree.
Installation and Setup#
pip install lxml
See a usage example.
from langchain.document_loaders import DocugamiLoader