johnyjj2 0 Light Poster

Hello :-)!

SUMMARY: How to run pocketsphinx_continuous on mobile phone with Symbian, and how to do it with simulator of mobile phone with Symbian. PocketSphinx can be downloaded for free here, for both Linux and Windows: .



I'd like to create application for mobile phone and server such that:

1. User runs application on mobile phone with Symbian, this application uses PocketSphinx. It is Automatic Speech Recognition software for devices like mobile phones, written in C. Somebody told me that "Pocketsphinx is used successfully on Symbian with minimal effort". My knowledge about Symbian is limited but I know something about CMU Sphinx. Can you help me, please, to run this PocketSphinx on Symbian? I also need to find simulator of mobile phone with Symbian. I have already found Wireless Toolkit and I tried to use this emulator, however I think it doesn't allow to run simulation of mobile phones with Symbian. (By the way I am limited in my project to mobile phone of the price up to about 160 euro; which Symbian version would be best for my application?).

2. This application communicates with the user. The users speaks digits and some other words (dictionary is about fifteen words, I can create language and acoustic models, as well as formal grammar in CMU Sphinx, with the use of SphinxTrain). The application recognizes those digits.

3. Based on this talk, mobile phone sends information about results of recognition to the server. I can do it in Java for CLDC/MIDP with httpconnection, POST method on the mobile phone side and with Tomcat on server. I think this httpconnection must be possible not only in Java ME, but also in Symbian C. But this is thing which will bother me later, now I'd like to run pocketsphinx_continuous on Symbian and try to modify it according to my needs.

Summing up, what I'd like to know is how to run demo pocketsphinx_continuous on mobile phone with Symbian, and how to do it with simulator of mobile phone with Symbian. (I've got Wireless Toolkit but I think it is not capable of running Symbian). PocketSphinx can be downloaded for free here, for both Linux and Windows: . There are two ways of installing it in Ubuntu. First is to unpack it (it is tar.gz), enter its directory and type "./configure", "make", "sudo make install". The other, different, is written in README file, i.e. run, "./configure", "make clean all", "make test", "make install". The result should be, among many other files, javadoc in doc directory.



What is required to give me some advices is written above. So if you don't want / don't have time to read the whole text, just above explanation is enough. But let me say about other possible approaches which I try to do. Those are worse than what I explained above, but I also tried them.

CMU Sphinx contains SphinxTrain to create acoustic model. It has got some different recognizers (also called decoders), those are PocketSphinx, Sphinx4 and some other. PocketSphinx is written in C and is for devices like mobile phones. Sphinx4 is Java application for PCs because it requires better hardware. There is good tutorial about using SphinxTrain with Sphinx4. It is here: . There is no similar tutorial for PocketSphinx. However with some knowledge about Symbian and C it is possible to run it on Symbian. The other approach than PocketSphinx, is using this Sphinx4. However it requires connection between mobile phone and server, sending audio data from cellular phone to server. It can be done with Skype, but there is still problem on server side with redirecting sound from Skype to Sphinx4. So you see there are two different approaches. One is to run speech recognition on mobile phone with PocketSphinx and Symbian. The other is speech recognition on server with Sphinx4. There are some ways to follow this second option. One is Skype on both server and mobile phone. (But there is problem with redirecting speech from Skype to Sphinx4). The other is too expensive Digium card. The third one is IVR, but I still look for good open-source IVR software. All of those three ways of second option involve Sphinx4. It was much easier to create application with Sphinx4. It is for PCs, not embedded devices, because it requires better device, extensive floating point math and some other things. But it has much better documentation than PocketSphinx. But I want to use PocketSphinx because I don't need to pay to anybody for access to mobile internet in order to send data through internet for application like Skype. The access to internet is required only for short time to send little text data with httpconnection, POST method. This is why I'd like you to help me, please, with running pocketsphinx_continuous on simulator of PocketSphinx on mobile phone with Symbian. (By the way, I see I've got installed S60 Developer Tools -> 3rd Edition FP1 SDK -> MIDP -> Emulator).

Summing up in a list, those are in CMU Sphinx:

I. SphinxTrain, which I can use to create files needed for speech recognition

II. decoders which use speech input and some files which I create with SphinxTrain, to follow speech recognition (input speech + files from SphinxTrain = are used by decoder)

II-1. PocketSphinx, written in C, for devices like mobile phones

II-2. Sphinx3, written in C, for PCs, actually the best developed

II-3. Sphinx4, written in Java, for PCs, with the best documentation

II-4. Some other, older versions

Those are possible approaches of solving my task:

I. Speech recognition on mobile phone with PocketSphinx. This is much better way than [II].

Problems: 1. running PocketSphinx on mobile phone with Symbian. 2. simulating mobile phone with Symbian on PC. I also thought that maybe I should consider Meamo, what do you think about it? Or use things different than CMU Sphinx, e.g. Simon, HTK, Julius. I looked at those other ASR engines and I think PocketSphinx may be best choice.

II. Speech recognition on server with Sphinx4. There are some different ways of establishing voice connection between mobile phone and server.

II-1. ordinary call from mobile phone to server; server has got Digium card. Disadvantage: Digium card is expensive.

II-2. to use Skype on both mobile phone and server. Problem: redirecting of sound from Skype to Sphinx4 and vice versa.

II-3. can you think about any other ways? I found that Asterisk may be useful.

Greetings :-)!