Skip to main content

Google Cloud Text-to-Speech

Google Cloud Text-to-Speech enables developers to synthesize natural-sounding speech with 100+ voices, available in multiple languages and variants. It applies DeepMindโ€™s groundbreaking research in WaveNet and Googleโ€™s powerful neural networks to deliver the highest fidelity possible.

This notebook shows how to interact with the Google Cloud Text-to-Speech API to achieve speech synthesis capabilities.

First, you need to set up an Google Cloud project. You can follow the instructions here.

%pip install --upgrade --quiet  google-cloud-text-to-speech

Usageโ€‹

from langchain_community.tools import GoogleCloudTextToSpeechTool

text_to_speak = "Hello world!"

tts = GoogleCloudTextToSpeechTool()
tts.name

We can generate audio, save it to the temporary file and then play it.

speech_file = tts.run(text_to_speak)

Help us out by providing feedback on this documentation page: