[RavenclawDev 202] Re: adding ASR
Svetlana Stenchikova
svetastenchikova at gmail.com
Tue Nov 28 17:02:19 EST 2006
We think the easiest approach would be to wrap the Sphinx 4 engine in a
socket interface that accepts the connection from AudioServer and sends
the result back the same way Sphinx 2 does now.
Then we could reuse your AudioServer without changes.
Does this sound reasonable to you?
We need to know the format of the data exchanged over sockets between
AudioServer and Sphinx, in both directions.
It looks like AudioServer sends raw audio data to Sphinx. Is that right?
And what is the format of the information returned from the ASR engine?
Is it a frame encoded as a string? Which code implements the
Sphinx 2 wrappers?
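
A minimal sketch of the wrapper idea described above, in Java. Everything
about the wire format here is an assumption made only for illustration (a
4-byte length prefix followed by raw audio, answered by one tab-separated
text line); the real format used between AudioServer and the Sphinx 2
engines is exactly what the questions above are asking about. The Decoder
interface is a hypothetical stand-in for the Sphinx 4 call, not the actual
Sphinx 4 API, and the confidence value is a dummy placeholder.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.net.ServerSocket;
import java.net.Socket;
import java.nio.charset.StandardCharsets;

public class AsrSocketWrapper {

    /** Hypothetical stand-in for the recognizer call; NOT the real Sphinx 4 API. */
    public interface Decoder {
        String decode(byte[] rawAudio);
    }

    /** Placeholder confidence, used until the engine supplies a real score. */
    static final double DUMMY_CONFIDENCE = 1.0;

    /**
     * Handles one utterance. ASSUMED wire format (not the real AudioServer protocol):
     *   request  = 4-byte big-endian length, then that many bytes of raw audio
     *   response = "<hypothesis>\t<confidence>\n" encoded as UTF-8
     * Returns false once the client has closed the stream.
     */
    public static boolean handleUtterance(InputStream in, OutputStream out, Decoder decoder)
            throws IOException {
        DataInputStream data = new DataInputStream(in);
        int length;
        try {
            length = data.readInt();
        } catch (EOFException e) {
            return false; // no more utterances from this client
        }
        if (length < 0) {
            throw new IOException("negative audio length: " + length);
        }
        byte[] audio = new byte[length];
        data.readFully(audio);
        String reply = decoder.decode(audio) + "\t" + DUMMY_CONFIDENCE + "\n";
        out.write(reply.getBytes(StandardCharsets.UTF_8));
        out.flush();
        return true;
    }

    /** Accepts one client at a time on the given port and decodes until it disconnects. */
    public static void serve(int port, Decoder decoder) throws IOException {
        try (ServerSocket server = new ServerSocket(port)) {
            while (true) {
                try (Socket client = server.accept()) {
                    InputStream in = client.getInputStream();
                    OutputStream out = client.getOutputStream();
                    while (handleUtterance(in, out, decoder)) {
                        // keep decoding until the client closes the connection
                    }
                }
            }
        }
    }

    /** In-memory round trip with a fake decoder, so no socket or engine is needed. */
    public static void main(String[] args) throws IOException {
        Decoder fake = audio -> "decoded " + audio.length + " bytes";
        byte[] request = {0, 0, 0, 3, 10, 20, 30}; // length 3, then 3 audio bytes
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        handleUtterance(new ByteArrayInputStream(request), out, fake);
        System.out.print(out.toString("UTF-8"));
    }
}
```

In a real setup, serve(9990, decoder) would listen on the port that the
SPHINX_MALE entry quoted below passes via -port, with the Decoder
implemented on top of Sphinx 4. Keeping the per-utterance logic in
handleUtterance, separate from the socket loop, lets the format be swapped
for the real one once it is known.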
Thank you,
Svetlana
On 11/27/06, Svetlana Stenchikova <svetastenchikova at gmail.com> wrote:
>
> Thank you Antoine.
>
> If there is documentation for the AudioServer somewhere, could you
> please point me to it?
>
> Does AudioServer communicate with Sphinx male and female via sockets?
>
> Did Sphinx 2 provide a socket interface, or did you write a wrapper for it?
>
> It seems that running 3 separate servers for ASR adds complexity to the
> application (unless Sphinx already provides a socket API).
> Alternatively, we could (as we did with the back end) have a single Java
> (or C) server which runs as a wrapper around Sphinx.
> However, if we are running multiple versions of Sphinx (male/female) we
> would still need a single server to combine the output from both, right?
>
> Could you please comment on this design issue?
>
>
> thank you,
> Svetlana
>
>
> On 11/27/06, Antoine Raux <antoine at cs.cmu.edu> wrote:
> >
> > Hi Svetlana,
> >
> >
> >
> > Yes, AudioServer is the Galaxy server for ASR. It is however a little
> > more than just a wrapper since it runs as an interface to audio inputs and
> > can stream the audio to several engines in parallel (hence the SPHINX_MALE
> > and SPHINX_FEMALE processes). AudioServer then collects the hypotheses and
> > sends them to the hub (all the hypotheses are then parsed and Helios picks
> > one for RavenClaw).
> >
> > The code for AudioServer is in Agents/MultiDecoder/Audio_Server
> >
> >
> >
> > To answer your other question, yes you can use TTY-server as a template
> > for your ASR agent. Don't forget to handle confidence scores (potentially
> > dummy ones if Sphinx4 doesn't give you real ones).
> >
> >
> >
> > Hope this helps,
> >
> > antoine
> >
> >
> > ------------------------------
> >
> > *From:* ravenclaw-developers-bounces at LOGANBERRY.srv.cs.cmu.edu [mailto:
> > ravenclaw-developers-bounces at LOGANBERRY.srv.cs.cmu.edu] * On Behalf Of *Svetlana
> > Stenchikova
> > *Sent:* Monday, November 27, 2006 1:26 PM
> > *To:* Dan Bohus
> > *Cc:* Sarah Hoffman; ravenclaw-developers at cs.cmu.edu
> > *Subject:* [RavenclawDev 199] Re: adding ASR
> >
> >
> >
> > Hi,
> > we are now starting to add sphinx ASR to our calendar system.
> >
> > We had some general questions about it.
> >
> > The roomline version is set up to work with sphinx 2, and there are 3
> > servers that are started for it.
> >
> >
> >
> > {
> > title SPHINX_AUDIO
> > dir .
> > path ..\..\Bin\x86-nt\Audio_Server.exe -engine_list sphinx_engines.txt
> > server_name sphinx
> > }
> >
> > {
> > title SPHINX_MALE
> > dir ..\..\Resources\DecoderConfig\
> > path ..\..\Bin\x86-nt\Sphinx_Engine -name male -argfn male-8khz.arg -port 9990
> > server_name sphinx_male
> > }
> >
> > {
> > title SPHINX_FEMALE
> > dir ..\..\Resources\DecoderConfig\
> > path ..\..\Bin\x86-nt\Sphinx_Engine -name female -argfn female-8khz.arg -port 9991
> > server_name sphinx_female
> > }
> >
> >
> > Is Audio_Server.exe the Galaxy wrapper? Where is the code for it?
> >
> > We plan to use sphinx 4, so we may want to write a java wrapper for it.
> > If I understand correctly, we could just use the TTY wrapper that we have
> > and replace the calls to output and input text with calls to sphinx. Is this
> > correct?
> >
> > thank you.
> >
> > Svetlana
> >
>
>