Skip to main content


arXiv is an open-access archive for 2 million scholarly articles in the fields of physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics.

Installation and Setup​

First, you need to install arxiv python package.

pip install arxiv

Second, you need to install PyMuPDF python package which transforms PDF files downloaded from the site into the text format.

pip install pymupdf

Document Loader​

See a usage example.

from langchain_community.document_loaders import ArxivLoader
API Reference:ArxivLoader


See a usage example.

from langchain.retrievers import ArxivRetriever
API Reference:ArxivRetriever

Was this page helpful?

You can also leave detailed feedback on GitHub.