NVIDIA

NVIDIA provides an integration package for LangChain: langchain-nvidia-ai-endpoints.

NVIDIA AI Foundation Endpoints

NVIDIA AI Foundation Endpoints give users easy access to NVIDIA hosted API endpoints for NVIDIA AI Foundation Models like Mixtral 8x7B, Llama 2, Stable Diffusion, etc. These models, hosted on the NVIDIA API catalog, are optimized, tested, and hosted on the NVIDIA AI platform, making them fast and easy to evaluate, further customize, and seamlessly run at peak performance on any accelerated stack.
With NVIDIA AI Foundation Endpoints, you can get quick results from a fully accelerated stack running on NVIDIA DGX Cloud. Once customized, these models can be deployed anywhere with enterprise-grade security, stability, and support using NVIDIA AI Enterprise.

A selection of NVIDIA AI Foundation models is supported directly in LangChain with familiar APIs.

The supported models can be found in build.nvidia.com.

These models can be accessed via the langchain-nvidia-ai-endpoints package, as shown below.

Setting up

Create a free account with NVIDIA, which hosts NVIDIA AI Foundation models
Click on your model of choice
Under Input select the Python tab, and click Get API Key. Then click Generate Key.
Copy and save the generated key as NVIDIA_API_KEY. From there, you should have access to the endpoints.

export NVIDIA_API_KEY=nvapi-XXXXXXXXXXXXXXXXXXXXXXXXXX

Install a package:

pip install -U langchain-nvidia-ai-endpoints

Chat models

See a usage example.

from langchain_nvidia_ai_endpoints import ChatNVIDIA

llm = ChatNVIDIA(model="mixtral_8x7b")
result = llm.invoke("Write a ballad about LangChain.")
print(result.content)

API Reference:

ChatNVIDIA

Embedding models

See a usage example.

from langchain_nvidia_ai_endpoints import NVIDIAEmbeddings

API Reference:

NVIDIAEmbeddings

NVIDIA

NVIDIA AI Foundation Endpoints​

Setting up​

Chat models​

API Reference:

Embedding models​

API Reference:

Help us out by providing feedback on this documentation page:

NVIDIA AI Foundation Endpoints

Setting up

Chat models

Embedding models