GoogleSpeechToTextLoader

class langchain_community.document_loaders.google_speech_to_text.GoogleSpeechToTextLoader(project_id: str, file_path: str, location: str = 'us-central1', recognizer_id: str = '_', config: RecognitionConfig | None = None, config_mask: FieldMask | None = None)

Deprecated since version 0.0.32: Use langchain_google_community.SpeechToTextLoader instead.

Loader for Google Cloud Speech-to-Text audio transcripts.

It uses the Google Cloud Speech-to-Text API to transcribe audio files and loads the transcribed text into one or more Documents, depending on the specified format.

To use this loader, install the google-cloud-speech Python package.

Audio files can be specified as either a Google Cloud Storage URI or a local file path.

For a detailed explanation of Google Cloud Speech-to-Text, refer to the product documentation: https://cloud.google.com/speech-to-text
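As a rough illustration of the description above, the following sketch wraps the loader in a small helper. The helper name `transcribe_audio` and its `language_code` parameter are hypothetical and not part of the library. The `RecognitionConfig` fields and the `"chirp"` model name are assumptions about the google-cloud-speech v2 API that should be checked against its reference, and the sketch assumes that Google Cloud credentials are already configured.

```python
from typing import Any, List, Optional


def transcribe_audio(
    project_id: str,
    file_path: str,
    language_code: Optional[str] = None,
) -> List[Any]:
    """Transcribe one audio file and return the resulting Documents.

    `file_path` may be a Google Cloud Storage URI or a local path.
    """
    # Imported lazily so this module can be imported even when the optional
    # dependencies are missing; they must be installed before calling.
    from langchain_community.document_loaders.google_speech_to_text import (
        GoogleSpeechToTextLoader,
    )

    config = None  # None lets the loader fall back to its own default config
    if language_code is not None:
        # Assumption: these v2 types and field names match the installed
        # google-cloud-speech version, and the model exists in the location.
        from google.cloud.speech_v2 import (
            AutoDetectDecodingConfig,
            RecognitionConfig,
        )

        config = RecognitionConfig(
            auto_decoding_config=AutoDetectDecodingConfig(),
            language_codes=[language_code],
            model="chirp",
        )

    loader = GoogleSpeechToTextLoader(
        project_id=project_id,
        file_path=file_path,
        config=config,
    )
    return loader.load()
```

A call might look like `transcribe_audio("my-project", "gs://my-bucket/audio.wav")`, where both the project ID and the bucket path are placeholders.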

Methods

__init__(project_id, file_path[, location, ...])

Initializes the GoogleSpeechToTextLoader.

alazy_load()

A lazy loader for Documents.

aload()

Load data into Document objects.

lazy_load()

A lazy loader for Documents.

load()

Transcribes the audio file and loads the transcript into documents.

load_and_split([text_splitter])

Load Documents and split into chunks.

__init__(project_id: str, file_path: str, location: str = 'us-central1', recognizer_id: str = '_', config: RecognitionConfig | None = None, config_mask: FieldMask | None = None)

Initializes the GoogleSpeechToTextLoader.

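Parameters:

project_id (str) – Google Cloud project ID.

file_path (str) – Google Cloud Storage URI or local path of the audio file.

location (str) – Speech-to-Text recognizer location. Defaults to 'us-central1'.

recognizer_id (str) – Speech-to-Text recognizer ID. Defaults to '_'.

config (Optional[RecognitionConfig]) – Recognition options and features. Defaults to None.

config_mask (Optional[FieldMask]) – Fields in config that override the recognizer's default_recognition_config for this request. Defaults to None.

async alazy_load() → AsyncIterator[Document]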

A lazy loader for Documents.

Return type:

AsyncIterator[Document]

async aload() → List[Document]

Load data into Document objects.

Return type:

List[Document]

lazy_load() → Iterator[Document]

A lazy loader for Documents.

Return type:

Iterator[Document]

load() → List[Document]

Transcribes the audio file and loads the transcript into documents.

It uses the Google Cloud Speech-to-Text API to transcribe the audio file and blocks until the transcription is finished.

Return type:

List[Document]
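Because load() blocks until the transcription is finished, calling it directly inside a coroutine would stall the event loop. The sketch below shows one generic Python pattern for this, running the blocking call in a worker thread with `asyncio.to_thread` (Python 3.9+). `SlowLoader` is a hypothetical stand-in rather than the real loader, and the pattern is not necessarily how the library's own aload() works.

```python
import asyncio
import time
from typing import Any, List


class SlowLoader:
    """Hypothetical stand-in whose load() blocks, like a real transcription."""

    def load(self) -> List[str]:
        time.sleep(0.1)  # simulates waiting for the remote transcription
        return ["transcript"]


async def load_without_blocking(loader: Any) -> List[Any]:
    """Run a blocking load() in a worker thread and await its result."""
    return await asyncio.to_thread(loader.load)


async def main() -> List[Any]:
    # The short sleep runs concurrently while load() blocks in its thread.
    docs, _ = await asyncio.gather(
        load_without_blocking(SlowLoader()),
        asyncio.sleep(0.01),
    )
    return docs


result = asyncio.run(main())
```

Any object that exposes a blocking load() method can be passed to `load_without_blocking` in the same way.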

load_and_split(text_splitter: TextSplitter | None = None) → List[Document]

Load Documents and split into chunks. Chunks are returned as Documents.

Do not override this method. It should be considered deprecated.

Parameters:

text_splitter (Optional[TextSplitter]) – TextSplitter instance to use for splitting documents. Defaults to RecursiveCharacterTextSplitter.

Returns:

List of Documents.

Return type:

List[Document]
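Since load_and_split() should be considered deprecated, one possible alternative is to load and split in two explicit steps. The helper below is a minimal sketch of that idea. `load_then_split` is a hypothetical name, and it only assumes that the loader has a load() method and the splitter has a split_documents() method, as LangChain text splitters do.

```python
from typing import Any, List


def load_then_split(loader: Any, splitter: Any) -> List[Any]:
    """Load all Documents, then split them into chunked Documents."""
    documents = loader.load()  # blocks until loading has finished
    return splitter.split_documents(documents)


# Possible usage with real objects (requires langchain-community,
# langchain-text-splitters, and configured Google Cloud credentials;
# the chunk sizes are illustrative values only):
#
#   from langchain_text_splitters import RecursiveCharacterTextSplitter
#   chunks = load_then_split(
#       loader,
#       RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100),
#   )
```

Keeping the two steps separate makes the choice of splitter explicit instead of relying on the default.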