Thread: Aura H2O Turn pages by voice
View Single Post
Old 08-05-2017, 09:31 AM   #12
Ecallan
Member
Ecallan began at the beginning.
 
Posts: 18
Karma: 10
Join Date: Jul 2017
Device: Kobo H2O
Thank you for this fantastic amount of work. I don't really have anybody to assist me with this project at the moment, but I will follow directions step-by-step and get there in the end!

A question on installing the kobo start menu (KSM) when I followed the link in your post, there was a line in red at the top saying that the ksm 08 doesn't work on the kobo aura H2O. Could you provide a link directly to the file that I need to download and copy to the .kobo folder please?

Eugene

Quote:
Originally Posted by tshering View Post
Sure. I am currently in my office. I will post the code this night when I am at home (after cleaning the code a little bit, and preparing some explanations) or maybe tomorrow. But do not expect to much. It is working, but it is far from perfect. And there is still some preparatory work to do. I give you the details:

One can (or has to) "record" the necessary touch events, so that one can later "play" them. I did this for the Kobo Touch, but cannot do it for the H2O, since I do not have access to one. When I post the code here, I will also ask whether somebody can record the touch events. I hope some people might be interested in this too. This recorded touch events will still be useful even if we decide to change other parts of our approach.

On the pc
  • you will have to enable the windows speech recognition, if you did not do this already (Since then I learned that you are using Nuance NaturallySpeaking. I have still to google for it. I just read "the program transcribes the words into the active window at the location of the cursor." Can it also close the input box by confirming [saying "OK"] or cancelling [saying "Cancel"]? Or something like that? If yes you could just use this instead of the Windows inbuilt speech recognition.)
  • you will also have to enable telnet
  • and you will also have to prepare your pc for usbnet (If you wish we could avoid this for the moment, and do our first tests with wife instead of telnet. But this has some disadvantages.)
  • I think all windows pcs can run visual basic scripts out of the box. Right?

On the Kobo side, you would have to
  • enable telnet (You can simply copy the KoboRoot.tgz that you can find here to the directory .kobo of you device and restart the device)
  • when you decide for using usbnet, I recommend installing the Kobo start menu (KSM) for easily enabling/disabling usbnet.
  • copy the files with the recorded touch events to the kobo (once we have them)

A short description of the script:
  • It reads a text file that associates certain commands like "next page", "previous page" with the names of files that contain the appropriate recorded touch events.
  • It waits for your input (by speaking or typing [so that people can also use it without having to enable speech recognition, or the English language package]).
    • In test mode, if the scripts can identify the input phrase with a command, it confirms this, or it tells you that it does not recognize the command (both acoustically with the Windows inbuilt text to speech).
    • In normal mode, the script sends a command with telnet to the Kobo device in order to trigger the touch event, and gives acoustic information (you can disable this).
  • You terminate the script by saying/typing "exit."

Things that do not work so well:
  • 1) Once you start the script (in normal mode) you should not change to another window or application. If you do, the script looses its connection with the telnet window.
  • 2) After terminating the script, you have to close the telnet session, exit telnet and close the cmd window yourself.
  • 3) One has to get used to a certain way of speaking to make the speech recognition recognize what one is saying.

So maybe as a first step you could just try the script in test mode to get a feeling of how it works, and see whether you think it is usable at all. To do this no changes are necessary on the Kobo device, it is not involved at all.

At a later stage, I might try to replace the use of the vb script interpreter and the windows inbuilt telnet program by a stand alone application. That should make some things easier (points 1 and 2 from above) and be easier portable to other platforms. I am however not sure about the integration of the speech recognition.

What is currently working:
  • page forwards and backwards in a book
  • open the main menu
  • go into the library or the collections, page through them and open a book
  • close a book
  • open a book from "home"
This is done with the more recent firmwares that have the "new home." Having the menu easily available makes things much easier.
Ecallan is offline   Reply With Quote