ChatOCIGenAI

This notebook provides a quick overview for getting started with OCIGenAI chat models. For detailed documentation of all ChatOCIGenAI features and configurations head to the API reference.

Oracle Cloud Infrastructure (OCI) Generative AI is a fully managed service that provides a set of state-of-the-art, customizable large language models (LLMs) that cover a wide range of use cases, and which is available through a single API. Using the OCI Generative AI service you can access ready-to-use pretrained models, or create and host your own fine-tuned custom models based on your own data on dedicated AI clusters. Detailed documentation of the service and API is available here and here.

Overview

Integration details

Class	Package	Local	Serializable	JS support
ChatOCIGenAI	langchain-oci	❌	❌	❌

Model features

Tool calling	Structured output	JSON mode	Image input	Audio input	Video input	Token-level streaming	Native async	Token usage	Logprobs
✅	✅	✅	✅	❌	❌	✅	❌	❌	❌

Setup

To access OCIGenAI models you'll need to install the oci and langchain-oci packages.

Credentials

The credentials and authentication methods supported for this integration are equivalent to those used with other OCI services and follow the standard SDK authentication methods, specifically API Key, session token, instance principal, and resource principal.

API key is the default authentication method used in the examples above. The following example demonstrates how to use a different authentication method (session token)

Installation

The LangChain OCIGenAI integration lives in the langchain-oci package and you will also need to install the oci package:

%pip install -qU langchain-oci

Instantiation

Now we can instantiate our model object and generate chat completions:

from langchain_oci.chat_models import ChatOCIGenAI

chat = ChatOCIGenAI(
    model_id="cohere.command-r-plus-08-2024",
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
    compartment_id="compartment_id",
    model_kwargs={"temperature": 0, "max_tokens": 500},
    auth_type="SECURITY_TOKEN",
    auth_profile="auth_profile_name",
    auth_file_location="auth_file_location",
)

Invocation

response = chat.invoke("Tell me one fact about Earth")

print(response.content)

Chaining

We can chain our model with a prompt template like so:

from langchain_core.prompts import PromptTemplate
from langchain_oci.chat_models import ChatOCIGenAI

llm = ChatOCIGenAI(
    model_id="cohere.command-r-plus-08-2024",
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
    compartment_id="compartment_id",
    model_kwargs={"temperature": 0, "max_tokens": 500},
    auth_type="SECURITY_TOKEN",
    auth_profile="auth_profile_name",
    auth_file_location="auth_file_location",
)
prompt = PromptTemplate(input_variables=["query"], template="{query}")
llm_chain = prompt | llm
response = llm_chain.invoke("what is the capital of france?")
print(response)

API Reference:PromptTemplate

API reference

For detailed documentation of all ChatOCIGenAI features and configurations head to the API reference: https://pypi.org/project/langchain-oci/

Chat model conceptual guide
Chat model how-to guides

Overview​

Integration details​

Model features​

Setup​

Credentials​

Installation​

Instantiation​

Invocation​

Chaining​

API reference​

Related​