python - OGG_OPUS 入力オーディオストリームを Google Speech-to-Text API で受け入れ可能なバイトストリーム形式に変換する方法は?

翻译自：https://stackoverflow.com/questions/65436521 2020-12-24T09:40:01.493

75 次

コンテキスト:元は OGG_OPUS 形式の録音済みオーディオストリームの URL があります。URL からの音声をバイトストリームに変換しています (Google API で必要な場合 - https://cloud.google.com/speech-to-text/docs/reference/rest/v1p1beta1/RecognitionAudio )。

このバイトストリームを Google Speech-to-text API に提供すると、null 応答が返されます。

質問：

ここで Google API が Null 応答を返すのはなぜですか?
Google API は音声入力の OGG_OPUS 形式を本当にサポートしていますか?

コードブロック

import time
from urllib.request import urlopen
from io import BytesIO
import requests
import base64

# Imports the Google Cloud client library
from google.cloud import speech

# Instantiates a client
client = speech.SpeechClient()
url = "https://drive.google.com/file/d/1zlJaptJYJe0ge_SkpB52N6uRTsEKUGG4/view?usp=sharing"

response  = requests.get(url,stream=True)
output = base64.b64encode(BytesIO(response.content).read())

audio = speech.RecognitionAudio(content=output)

config = speech.RecognitionConfig(
   encoding=speech.RecognitionConfig.AudioEncoding.OGG_OPUS,
   sample_rate_hertz=16000,
   language_code="en-US",
)

# Detects speech in the audio file
response = client.recognize(config=config, audio=audio)
print("response: ", response)
'''

python - OGG_OPUS 入力オーディオ ストリームを Google Speech-to-Text API で受け入れ可能なバイトストリーム形式に変換する方法は?

0 に答える 0

Related

Reference

python - OGG_OPUS 入力オーディオストリームを Google Speech-to-Text API で受け入れ可能なバイトストリーム形式に変換する方法は?