
Sharing: a configuration example for Asterisk with TTS/ASR speech recognition


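The following is a fairly complete example that the author came across. It combines Asterisk with the Google ASR/TTS API, and the call is handled according to the results the API calls return. The processing flow is walked through below.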

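First, let us look at speech recognition based on Google. Start by installing the dependency packages the AGI script requires; the project page listed under the references documents them.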

Then copy the AGI file speech-recog.agi to /var/lib/asterisk/agi-bin/

After copying it, set execute permission on the file so that the AGI script can run. The script itself implements the calls to the API.
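The copy-and-permission step can be sketched in Python. This is only a minimal sketch, not part of the original project: the helper name `install_agi` is hypothetical, and the default directory is the one given above.

```python
import os
import shutil
import stat


def install_agi(script_path, agi_dir="/var/lib/asterisk/agi-bin"):
    """Copy an AGI script into agi_dir and mark it executable.

    install_agi is a hypothetical helper used for illustration only.
    """
    # shutil.copy returns the path of the newly created file.
    dest = shutil.copy(script_path, agi_dir)
    mode = os.stat(dest).st_mode
    # Add execute permission for owner, group and others (like chmod a+x).
    os.chmod(dest, mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)
    return dest
```

Calling `install_agi("speech-recog.agi")` as a user with write access to the AGI directory would copy the script and make it executable in one step.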

Usage syntax:

agi(speech-recog.agi,[lang],[timeout],[intkey],[NOBEEP])
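Asterisk passes the arguments of an agi() call to the script as command-line arguments, so the optional positions in the syntax above can be handled as in the sketch below. It reads the four positions as recognition language, timeout, interrupt key, and a flag that suppresses the beep; that reading, the `RecogArgs` and `parse_args` names, and all default values are assumptions for illustration, not taken from the original script.

```python
from dataclasses import dataclass


@dataclass
class RecogArgs:
    lang: str = "en-US"   # assumed default, not confirmed by the article
    timeout: int = 10     # assumed default, in seconds
    intkey: str = "#"     # assumed default interrupt key
    beep: bool = True     # beep before recording unless NOBEEP is given


def parse_args(argv):
    """Map positional arguments [lang],[timeout],[intkey],[NOBEEP] to fields.

    Empty or missing positions keep their (assumed) defaults.
    """
    args = RecogArgs()
    if len(argv) > 0 and argv[0]:
        args.lang = argv[0]
    if len(argv) > 1 and argv[1]:
        args.timeout = int(argv[1])
    if len(argv) > 2 and argv[2]:
        args.intkey = argv[2]
    if len(argv) > 3 and argv[3] == "NOBEEP":
        args.beep = False
    return args
```

In a real script the list would come from `sys.argv[1:]`; leaving a position empty, as in `agi(speech-recog.agi,en-US,,#)`, simply keeps that field's default.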

In the dialplan, speech recognition and TTS are invoked through the agi application:

;Simple speech recognition
exten => 1234,1,Answer()
exten => 1234,n,agi(speech-recog.agi,en-US)
exten => 1234,n,Verbose(1,The text you just said is: ${utterance})
exten => 1234,n,Verbose(1,The probability to be right is: ${confidence})
exten => 1234,n,Hangup()

;Speech recognition demo:
exten => 1235,1,Answer()
exten => 1235,n,agi(googletts.agi,"Say something in English, when done press the pound key.",en)
exten => 1235,n(record),agi(speech-recog.agi,en-US)
exten => 1235,n,Verbose(1,Script returned: ${confidence} , ${utterance})

;Check the probability of a successful recognition:
exten => 1235,n(success),GotoIf($["${confidence}" > "0.8"]?playback:retry)

;Playback the text:
exten => 1235,n(playback),agi(googletts.agi,"The text you just said was...",en)
exten => 1235,n,agi(googletts.agi,"${utterance}",en)
exten => 1235,n,Goto(end)

;Retry in case speech recognition wasn't successful:
exten => 1235,n(retry),agi(googletts.agi,"Can you please repeat more clearly?",en)
exten => 1235,n,Goto(record)

exten => 1235,n(fail),agi(googletts.agi,"Failed to get speech data.",en)
exten => 1235,n(end),Hangup()

;Voice dialing example
exten => 1236,1,Answer()
exten => 1236,n,agi(googletts.agi,"Please say the number you want to dial.",en)
exten => 1236,n(record),agi(speech-recog.agi,en-US)
exten => 1236,n,GotoIf($["${confidence}" > "0.8"]?success:retry)
exten => 1236,n(success),Goto(${utterance},1)
exten => 1236,n(retry),agi(googletts.agi,"Can you please repeat?",en)
exten => 1236,n,Goto(record)
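The dialplan above reads the channel variables `utterance` and `confidence` after the AGI script returns. How a script hands such values back over the AGI protocol can be sketched as follows: Asterisk first sends a block of `agi_key: value` lines ending in an empty line, and the script then writes commands such as `SET VARIABLE` to standard output and reads a one-line reply for each. This is a minimal sketch and not the original script; the function names are hypothetical.

```python
import sys


def read_agi_env(stream):
    """Read the 'agi_key: value' header block Asterisk sends at startup.

    The block ends at the first empty line.
    """
    env = {}
    for line in stream:
        line = line.strip()
        if not line:
            break
        key, _, value = line.partition(":")
        env[key.strip()] = value.strip()
    return env


def set_variable(name, value, out, inp):
    """Send SET VARIABLE for one channel variable and return the reply line."""
    # Escape backslashes and double quotes so the value stays one argument.
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
    out.write(f'SET VARIABLE {name} "{escaped}"\n')
    out.flush()
    return inp.readline().strip()


def report_result(utterance, confidence, out=sys.stdout, inp=sys.stdin):
    """Expose a recognition result to the dialplan under the names used above."""
    set_variable("utterance", utterance, out, inp)
    set_variable("confidence", confidence, out, inp)
```

A script would call `read_agi_env(sys.stdin)` once at startup, run its recognition step, and then call `report_result(text, score)` before exiting.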

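The above covers the ASR interface; TTS can be called in the same way. First create the TTS AGI file (googletts.agi in the examples below), copy it into the default AGI directory, and set execute permission so that it can run.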

Usage syntax:

agi(googletts.agi,text,[language],[intkey])

Test examples for TTS with Asterisk:

;GoogleTTS Demo
exten => 1234,1,Answer()
  ;;Play message in English:
exten => 1234,n,agi(googletts.agi,"This is a simple Google text to speech test in English.",en)
  ;;Play message in Spanish:
exten => 1234,n,agi(googletts.agi,"esta es una simple prueba en español.",es)
  ;;Play message in Greek:
exten => 1234,n,agi(googletts.agi,"αυτό είναι ένα απλό τέστ στα ελληνικά.",el)
  ;;Play message in Japanese:
exten => 1234,n,agi(googletts.agi,"これは、日本の簡単なテストです。良い一日を。",ja)
  ;;Play message in Simplified Chinese:
exten => 1234,n,agi(googletts.agi,"这是一个简单的测试,在中国。有一个愉快的一天。",zh-CN)

;A simple dynamic IVR using GoogleTTS
[my_ivr]
exten => s,1,Answer()
exten => s,n,Set(TIMEOUT(digit)=5)
exten => s,n,agi(googletts.agi,"Welcome to my small interactive voice response menu.",en)
  ;;Wait for digit:
exten => s,n(start),agi(googletts.agi,"Please dial a digit.",en,any)
exten => s,n,WaitExten()
  ;;Playback the name of the digit and wait for another one:
exten => _X,1,agi(googletts.agi,"You just pressed ${EXTEN}. Try another one please.",en,any)
exten => _X,n,WaitExten()

exten => i,1,agi(googletts.agi,"Invalid extension.",en)
exten => i,n,Goto(s,start)

exten => t,1,agi(googletts.agi,"Request timed out.",en)
exten => t,n,Goto(s,start)

exten => h,1,Hangup()
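The IVR above passes `any` in the [intkey] position, which is read here as "any key may interrupt the prompt" (an assumption based on the parameter name). One way an AGI script can play a prompt interruptibly is the AGI `STREAM FILE` command, whose reply carries the ASCII code of the key that was pressed, 0 if none was pressed, or -1 on failure. The sketch below builds the command and decodes the reply; it does not claim to show how googletts.agi itself is written, and the function names are hypothetical.

```python
import re

# A successful AGI reply starts with "200 result=<number>".
_RESULT = re.compile(r"^200 result=(-?\d+)")


def stream_file_command(filename, escape_digits):
    """Build a STREAM FILE command; escape_digits are the keys that interrupt."""
    return f'STREAM FILE {filename} "{escape_digits}"\n'


def pressed_key(reply):
    """Return the DTMF key from a STREAM FILE reply, or None if none was pressed.

    Raises ValueError on a failed or malformed reply.
    """
    match = _RESULT.match(reply.strip())
    if match is None:
        raise ValueError(f"unexpected AGI reply: {reply!r}")
    code = int(match.group(1))
    if code < 0:
        raise ValueError("playback failed")
    # The result is the ASCII code of the key, e.g. 35 for '#'.
    return chr(code) if code > 0 else None
```

With `escape_digits` set to `0123456789#*`, any key press stops the prompt and is returned to the script, which can then decide where the call goes next.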

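The examples above are open-source code shared by a developer abroad. The author has not tested them, because access to Google is still inconvenient in many respects. The developer also provides a speech synthesis interface implemented on top of Microsoft's translation tool, which readers can explore further. Following the general approach these ASR and TTS interfaces illustrate, readers can adapt the scripts to the APIs of domestic ASR/TTS vendors (for example Baidu or iFLYTEK) to build their own ASR/TTS/IVR flows.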
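One way to keep the dialplan unchanged while swapping the recognition vendor is to hide each vendor behind a small common interface. The sketch below illustrates that idea only: `Recognizer`, `Result`, `FakeRecognizer` and `next_step` are hypothetical names, no real vendor API is called, and the 0.8 threshold is the one used in the dialplan examples.

```python
from typing import NamedTuple, Protocol


class Result(NamedTuple):
    utterance: str
    confidence: float


class Recognizer(Protocol):
    """Interface every vendor backend would implement (hypothetical)."""

    def recognize(self, audio: bytes, lang: str) -> Result: ...


class FakeRecognizer:
    """Stand-in backend; a real one would call the vendor's API here."""

    def __init__(self, utterance: str, confidence: float) -> None:
        self._result = Result(utterance, confidence)

    def recognize(self, audio: bytes, lang: str) -> Result:
        return self._result


def next_step(recognizer: Recognizer, audio: bytes, lang: str,
              threshold: float = 0.8) -> str:
    """Mirror the dialplan's GotoIf: accept above the threshold, else retry."""
    result = recognizer.recognize(audio, lang)
    return "success" if result.confidence > threshold else "retry"
```

A backend for a specific vendor would implement `recognize` against that vendor's documented API and return the same `Result` shape, so the rest of the flow does not need to change.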

References and source code download:

http://zaf.github.io/asterisk-speech-recog/
