Google Speechtotext API，InvalidArgument:400必须使用单通道（单声道）

2条回答

网友

1楼 · 编辑于 2024-04-27 23:51:41

假设您使用的是google-cloud-speech库，那么可以使用recognitionConfig中的audio_channel_count属性，并指定输入音频数据中的频道数（默认为一个频道（mono））。你可以这样做：

from google.cloud import speech
client = speech.SpeechClient()
results = client.recognize(
    audio=speech.types.RecognitionAudio(
        uri='gs://your-bucket/recording.wav',
    ),
    config=speech.types.RecognitionConfig(
        encoding='LINEAR16',
        language_code='en-US',
        sample_rate_hertz=44100,
        audio_channel_count=2,
    ),
)

有关详细信息，请参阅API doc。在

网友

2楼 · 编辑于 2024-04-27 23:51:41

您应该使用下面的函数动态返回音频通道和帧速率它获取音频文件路径并返回帧速率和通道数

def frame_rate_channel(audio_file_name): print(audio_file_name) with wave.open(audio_file_name, "rb") as wave_file: frame_rate = wave_file.getframerate() channels = wave_file.getnchannels() return frame_rate,channels

相关问题更多 >

编程相关推荐

热门问题

热门文章

Google Speechtotext API，InvalidArgument:400必须使用单通道（单声道）

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >