Google Speech to Text API 很慢

问题描述

我正在使用 "Transcribing long audio files using a Google Cloud Storage file" here. 上的指南

但是使用带有“enable_automatic_punctuation”的“增强型视频”模型时速度很慢。每 10 分钟的音频持续时间大约需要 2.5 分钟。

这是我的配置：

config = speech.RecognitionConfig(
  encoding=speech.RecognitionConfig.AudioEncoding.FLAC,audio_channel_count=2,language_code="en-US",use_enhanced=True,# Use enhanced model.
  model='video',# Enhanced model for videos
  enable_word_time_offsets=True,# Add the word video times (seconds)
  enable_automatic_punctuation=True  # Add punctuation to the transcript.
)
operation = client.long_running_recognize(config=config,audio=audio)

有什么方法可以让我们得到更快的响应。对于 10 分钟的音频文件，语音识别模型 + NLP 标点模型真的需要这么长时间吗？ 2 分钟？

解决方法

暂无找到可以解决该程序问题的有效方法，小编努力寻找整理中！

如果你已经找到好的解决方法，欢迎将解决方案带上本链接一起发送给小编。

小编邮箱:dio#foxmail.com (将#修改为@）

google-api google-api-python-client google-speech-api google-speech-to-text-api