Skip to main content


Atlas is a platform by Nomic made for interacting with both small and internet scale unstructured datasets. It enables anyone to visualize, search, and share massive datasets in their browser.

This notebook shows you how to use functionality related to the AtlasDB vectorstore.

%pip install --upgrade --quiet  spacy
!python3 -m spacy download en_core_web_sm
%pip install --upgrade --quiet  nomic

Load Packages

import time

from langchain.text_splitter import SpacyTextSplitter
from langchain_community.document_loaders import TextLoader
from langchain_community.vectorstores import AtlasDB

Prepare the Data

loader = TextLoader("../../modules/state_of_the_union.txt")
documents = loader.load()
text_splitter = SpacyTextSplitter(separator="|")
texts = []
for doc in text_splitter.split_documents(documents):

texts = [e.strip() for e in texts]

Map the Data using Nomic’s Atlas

db = AtlasDB.from_texts(
name="test_index_" + str(time.time()), # unique name for your vector store
description="test_index", # a description for your vector store
index_kwargs={"build_topic_model": True},

Here is a map with the result of this code. This map displays the texts of the State of the Union.