AirbyteCDKLoader#
- class langchain_community.document_loaders.airbyte.AirbyteCDKLoader(config: Mapping[str, Any], source_class: Any, stream_name: str, record_handler: Callable[[Any, str | None], Document] | None = None, state: Any | None = None)[source]#
Load with an Airbyte source connector implemented using the CDK.
Initializes the loader.
- Parameters:
config (Mapping[str, Any]) β The config to pass to the source connector.
source_class (Any) β The source connector class.
stream_name (str) β The name of the stream to load.
record_handler (Callable[[Any, str | None], Document] | None) β A function that takes in a record and an optional id and returns a Document. If None, the record will be used as the document. Defaults to None.
state (Any | None) β The state to pass to the source connector. Defaults to None.
Attributes
last_state
Methods
__init__
(config,Β source_class,Β stream_name)Initializes the loader.
A lazy loader for Documents.
aload
()Load data into Document objects.
A lazy loader for Documents.
load
()Load data into Document objects.
load_and_split
([text_splitter])Load Documents and split into chunks.
- __init__(config: Mapping[str, Any], source_class: Any, stream_name: str, record_handler: Callable[[Any, str | None], Document] | None = None, state: Any | None = None) None [source]#
Initializes the loader.
- Parameters:
config (Mapping[str, Any]) β The config to pass to the source connector.
source_class (Any) β The source connector class.
stream_name (str) β The name of the stream to load.
record_handler (Callable[[Any, str | None], Document] | None) β A function that takes in a record and an optional id and returns a Document. If None, the record will be used as the document. Defaults to None.
state (Any | None) β The state to pass to the source connector. Defaults to None.
- Return type:
None
- async alazy_load() AsyncIterator[Document] #
A lazy loader for Documents.
- Return type:
AsyncIterator[Document]
- lazy_load() Iterator[Document] [source]#
A lazy loader for Documents.
- Return type:
Iterator[Document]
- load_and_split(text_splitter: TextSplitter | None = None) List[Document] #
Load Documents and split into chunks. Chunks are returned as Documents.
Do not override this method. It should be considered to be deprecated!
- Parameters:
text_splitter (Optional[TextSplitter]) β TextSplitter instance to use for splitting documents. Defaults to RecursiveCharacterTextSplitter.
- Returns:
List of Documents.
- Return type:
List[Document]
Examples using AirbyteCDKLoader