使用AT&T语音转文本API与Python

3 投票

1 回答

1002 浏览

提问于 2025-04-18 02:33

我正在尝试使用AT&T的语音转文字API。目前，我已经获取到了访问令牌。

def get_access_token(client_id, client_secret):
headers = {'Content-Type': 'application/x-www-form-urlencoded', 'Accept': 'application/json'}

data = {'client_id': client_id, 'client_secret': client_secret, 'scope': 'SPEECH',
        'grant_type': 'client_credentials'}

response = requests.post(oauth_url, data=data, headers=headers)
return response.text

到目前为止，这是我用来发送音频文件以获取json响应的代码：

def get_text_from_file(file, access_token):
headers = {'Authorization': 'Bearer ' + access_token, 'Accept': 'application/json', 'Content-Type': 'audio/wav',
           'X-SpeechContext': 'Generic', 'Connection': 'Keep-Alive'}

但是我不太确定怎么发送这个文件。有没有人能帮帮我？

API集成 json处理语音识别音频文件上传

1 个回答

这是我刚刚搞定的，使用了requests库，还有一些其他的资源，我会在下面给链接。

import json
import requests

class ATTSpeech:
    CLIENT_ID = "SOME"
    CLIENT_SECRET = "ID"
    TOKEN = None

    def __init__(self, *args, **kwargs):
        self.get_token()


    def get_token(self):
        # Get Access Token via OAuth.
        # https://matrix.bf.sl.attcompute.com/apps/constellation-sandbox
        response = requests.post("https://api.att.com/oauth/token", {
            "client_id": self.CLIENT_ID,
            "client_secret": self.CLIENT_SECRET,
            "grant_type": "client_credentials",
            "scope": "SPEECH,STTC"
        })
        content = json.loads(response.content)
        self.TOKEN = content["access_token"]


    def text_from_file(self, path):

        with open(path, 'rb') as f:
            response = requests.post("https://api.att.com/speech/v3/speechToText",
                headers = {
                    "Authorization": "Bearer %s" % self.TOKEN,
                    "Accept": "application/json",
                    "Content-Type": "audio/wav",
                    "X-SpeechContext": "Generic",
            }, data=f)
        content = json.loads(response.content)
        return content

https://sites.google.com/site/brssbrss/attspeechapi

http://changingjasper.blogspot.com/2014/06/making-jasper-use-at-speech-api.html

使用方法大概是这样的，假设你把这个文件保存为ATTEngine。

from ATTEngine import ATTSpeech
a = ATTSpeech()
a.text_from_file('/Users/issackelly/Desktop/here.wav')

回答于 2025-04-18 由 Python大师

分享举报

使用AT&T语音转文本API与Python

1 个回答

撰写回答