Skip to main content


Docugami converts business documents into a Document XML Knowledge Graph, generating forests of XML semantic trees representing entire documents. This is a rich representation that includes the semantic and structural characteristics of various chunks in the document as an XML tree.

Installation and Setup

pip install lxml

Document Loader

See a usage example.

from langchain.document_loaders import DocugamiLoader

API Reference: