Bumping up this recent thread on the bookreader list about text-to-speech. Mike and Gregor, in case you haven't seen what's currently possible: <br><br>I believe James S's Read Etexts uses speech-dispatcher to read selected text. Aleksey and others may have done further work with espeak... I've included some old threads from the Sugar list this past spring below.<br>
<br>SJ<br><br><br>On Thu, Oct 29, Mike McCabe <span dir="ltr"><<a href="mailto:mccabe@archive.org">mccabe@archive.org</a>></span> wrote:<br><br>
I also think this is a great idea. I've worked with several<br>
text-to-speech readers recently, as part of my effort to make the<br>
Internet Archive books available to print disabled people.<br>
<br>
They're very useful, and I think that this mode of reading could be of<br>
use to a very broad range of users. I suspect we'll see more of it soon.<br>
<br>
I'm also curious to hear about specific experiences with<br>
linux-compatible free TTS, as we may be producing audio books with this<br>
to work with the new Library of Congress audio players.<br>
<br>
Best regards -<br>
Mike<br>
<br>
<br>
<br><br>== [1] old note from James Simmons ==<br>( in response to this speech-synthesis summer of code proposal:<br><a href="http://wiki.sugarlabs.org/go/speech-synthesis" target="_blank">http://wiki.sugarlabs.org/go/<span class="il">speech</span>-synthesis</a> )<br>
<br>Chirag,<br>
<br>
Since you have been working with Aleksey Lim you probably know about<br>
<span class="il">text</span> <span class="il">to</span> <span class="il">speech</span> with highlighting in Read Etexts. I wrote the original<br>
TTS code that used <span class="il">speech</span>-dispatcher with some assistance from <span class="il">Hemant</span><br>
Goyal and the folks on the <span class="il">speech</span>-dispatcher project. Aleksey<br>
refactored my code so it could work with either <span class="il">speech</span>-dispatcher or his<br>
own gstreamer espeak plugin. Not only does his plugin need no<br>
configuration <span class="il">to</span> work, it also does a LOT better in producing timely<br>
callbacks as it reads each word.<br>
<br>As you point out in your proposal, highlighting the word as it is spoken<br>
is a big part of the benefit of what you're proposing. If all you<br>
wanted <span class="il">to</span> do was capture some highlighted <span class="il">text</span> in the clipboard and have<br>
it spoken in a voice you can configure in a control panel, that would be<br>
easy, even trivial. It's the highlighting that's difficult. When I<br>
added <span class="il">speech</span> <span class="il">to</span> Read Etexts I deliberately tried for the simplest<br>
approach that would get the job done. It reads only the current page.<br>
It always starts either at the first word on the page, or if <span class="il">speech</span> has<br>
been paused, it resumes with the last word spoken. You can't choose the<br>
word <span class="il">to</span> start on. The Activity itself receives the callbacks as each<br>
word is spoken and takes care of doing the highlight and scrolling the<br>
textarea so the highlighted word stays on the screen.<br>
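<br>
A minimal sketch of the bookkeeping described above, assuming the speech backend reports the index of each word as it is spoken. The names PageReader, on_word and remaining_text are hypothetical and are not taken from the Read Etexts source; the sketch only tracks character offsets and leaves the actual highlighting and scrolling to the Activity.<br>

```python
import re


def word_spans(page_text):
    """Return (start, end) character offsets for each space-delimited word."""
    return [(m.start(), m.end()) for m in re.finditer(r"\S+", page_text)]


class PageReader:
    """Tracks which word of the current page is being spoken.

    Hypothetical helper: on_word() stands in for whatever per-word
    callback the speech backend delivers to the Activity.
    """

    def __init__(self, page_text):
        self.page_text = page_text
        self.spans = word_spans(page_text)
        self.current = 0  # index of the last word spoken

    def on_word(self, index):
        """Record the spoken word and return the span to highlight."""
        self.current = index
        return self.spans[index]

    def remaining_text(self):
        """Text to hand back to the speech engine when resuming after a pause."""
        if not self.spans:
            return ""
        start = self.spans[self.current][0]
        return self.page_text[start:]
```

After a pause, remaining_text() starts again at the last word spoken, which matches the resume behaviour described above.<br>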
<br>
If I had <span class="il">to</span> write a facility that did what Read Etexts does outside of<br>
the Activity I wouldn't know how <span class="il">to</span> do it. It seems <span class="il">to</span> me that<br>
highlighting is best done by the Activity itself. I can't deny that it<br>
would be useful <span class="il">to</span> have all this work done as you have described without<br>
the Activity knowing anything about it, but it doesn't seem feasible.<br>
You'd have <span class="il">to</span> have something that could work with gtk textareas, the<br>
evince component Read uses, Abiword, and everything else that came along.<br>
<br>
Another thing you'd have <span class="il">to</span> deal with is PDFs composed of scanned in<br>
book pages. There are a lot of these around (the Internet Archive is<br>
full of them) and somehow the kid trying <span class="il">to</span> select words on a scanned in<br>
page would have <span class="il">to</span> be clued in that these words are not selectable.<br>
<br>
I suppose you could make an Activity that grabbed whatever <span class="il">text</span> was in<br>
the clipboard, displayed it in a textarea, and highlighted the words in<br>
that textarea as it spoke them. I'm pretty sure that wasn't what you<br>
had in mind.<br>
<br>
Splitting sentences into separate words will be a challenge. I just
use spaces as delimiters and filter out characters like asterisks,
vertical bars, etc. That works OK for English but not for other
languages. If I wanted Read Etexts <span class="il">to</span>
do highlighting on the Bhagavad-Gita in the original Sanskrit it
wouldn't work. Even in English I get tripped up by double hyphens
(--). It would be nice if Gutenberg etexts put spaces around double
hyphens but they don't.<br>
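<br>
A minimal sketch of that kind of word splitting, with one extension: runs of two or more hyphens are also treated as word boundaries. This is an illustration, not the Read Etexts code; the function name split_words and the exact set of filtered characters in NOISE are assumptions. It still relies on whitespace, so it does not address languages that are not space-delimited.<br>

```python
import re

# Characters dropped before speaking; the exact set is an assumption.
NOISE = "*|_#"


def split_words(line):
    """Split a line into words on whitespace and on double hyphens."""
    cleaned = line.translate({ord(ch): None for ch in NOISE})
    # Treat a run of two or more hyphens as a word boundary, so that
    # "again--and" yields two words instead of one. Single hyphens,
    # as in "well-known", are left alone.
    pieces = re.split(r"\s+|-{2,}", cleaned)
    return [p for p in pieces if p]
```

Because characters are removed before splitting, the resulting words no longer line up with offsets in the original text; a highlighter would need to keep the original offsets alongside the cleaned words.<br>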
<br>
It looks like you've picked a challenging project, and I would love <span class="il">to</span> be proven wrong about everything I've mentioned here. Good luck with this,<br>
<br>
James Simmons<br>
<br><br>
== 2: SynPhony and reading assistance ==<br><br><div class="gmail_quote">On Tue, Feb 17, 2009 at 12:48 PM, Carol Farlow Lerche <span dir="ltr"><<a href="mailto:cafl@msbit.com" target="_blank">cafl@msbit.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">I'd like to call your attention again to <a href="http://synphony.wiki.sourceforge.net/" target="_blank">SynPhony</a>.
We are close to a base release (probably this week) of a 44,000-word
English word database that has a very rich array of information helpful
to the teaching of English, especially reading. A 10,000-word Spanish
lexicon and a 50,000-word German one will follow. Norbert Rennert, who
compiled these, would like very much to work with other language
experts to extend this effort to other languages. Some highlights of
the English lexicon: it is screened from the CMU Sphinx
corpus for accessibility to children, and each word entry includes
frequency data from analysis of a large corpus of text,
phoneme breakdown (used by reading curricula to decide the order in
which words should be introduced or deemed decodable), etymology,
semantic domain (categorization), IPA coding, syllabification and
stress marking.<br>
<br>The second release will merge in many images, though we don't
expect to have a complete image-to-word mapping without a volunteer
effort. We plan to create an API and a way to define a curriculum
sequence for word groups once the basic database is released, to allow
integration of the word bank across all the activities that are
literacy related, as well as create more. We also hope to use the word
bank to score texts for reading level and assist in the creation of
simplified versions of extant texts suitable for use by emergent
readers. Please read our design documents at the above site.<br>
<br><div class="gmail_quote"><div><div>On Tue, Feb 17, 2009 at 2:02 AM, Tomeu Vizoso <span dir="ltr"><<a href="mailto:tomeu@sugarlabs.org" target="_blank">tomeu@sugarlabs.org</a>></span> wrote:<br>
</div></div><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div><div>
Aleksey has started a very interesting new path:<br>
<br>
<a href="http://lists.sugarlabs.org/archive/sugar-devel/2009-February/011470.html" target="_blank">http://lists.sugarlabs.org/archive/sugar-devel/2009-February/011470.html</a><br>
<br>
</div></div></blockquote></div></blockquote></div><br>
<br><br><div class="gmail_quote"><br><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div><div class="h5">Gregor Kervina wrote:<br>
> Hi Sayamindu,<br>
> thanks for quick reply!<br>
> There is a lot of text to speech software out there - I use<br>
> <a href="http://www.bytecool.com/coolspch.htm" target="_blank">http://www.bytecool.com/coolspch.htm</a>, which has a trial version and<br>
> additional voices to download, just to get a feeling, but it is not free and not for<br>
> Linux. Many other programs are more complicated, and some of<br>
> them use very complex voice engines that in my opinion don't sound<br>
> very good. (I use the Mary voice with CoolSpeech.)<br>
><br>
> OK I spent some time to find all TTS software that is free for linux and<br>
> here are some links:<br>
><br>
> <a href="http://linux-sound.org/speech.html" target="_blank">http://linux-sound.org/speech.html</a><br>
> <a href="http://linuxhelp.blogspot.com/2006/01/festival-text-to-speech-synthesis.html" target="_blank">http://linuxhelp.blogspot.com/2006/01/festival-text-to-speech-synthesis.html</a><br>
> <a href="http://larswiki.atrc.utoronto.ca/wiki/Software" target="_blank">http://larswiki.atrc.utoronto.ca/wiki/Software</a> - see the links under<br>
> Speech section<br>
> <a href="http://www.xenocafe.com/tutorials/php/festival_text_to_speech/index.php" target="_blank">http://www.xenocafe.com/tutorials/php/festival_text_to_speech/index.php</a><br>
> <a href="http://www.wikihow.com/Convert-Text-to-Speech-on-Linux" target="_blank">http://www.wikihow.com/Convert-Text-to-Speech-on-Linux</a><br>
> <a href="http://www.cstr.ed.ac.uk/projects/festival/" target="_blank">http://www.cstr.ed.ac.uk/projects/festival/</a><br>
> <a href="http://www.cstr.ed.ac.uk/projects/festival/onlinedemo.html" target="_blank">http://www.cstr.ed.ac.uk/projects/festival/onlinedemo.html</a> - listen to<br>
> some demo voices<br>
> <a href="http://sourceforge.net/projects/dhvani/" target="_blank">http://sourceforge.net/projects/dhvani/</a> - this one not english<br>
> <a href="http://sourceforge.net/projects/tts-cubed/" target="_blank">http://sourceforge.net/projects/tts-cubed/</a><br>
> <a href="http://www.speech.cs.cmu.edu/hephaestus.html" target="_blank">http://www.speech.cs.cmu.edu/hephaestus.html</a> - click the links in Speech<br>
> Synthesis section<br>
> <a href="http://www.speech.cs.cmu.edu/comp.speech/Section5/Synth/rsynth.html" target="_blank">http://www.speech.cs.cmu.edu/comp.speech/Section5/Synth/rsynth.html</a><br>
> <a href="http://www.linux.com/archive/feature/122197" target="_blank">http://www.linux.com/archive/feature/122197</a> - two readers - plug-ins for<br>
> firefox.<br>
><br>
> I cannot test them because I'm not a Linux user. Maybe you can modify<br>
> some of this software (probably Festival) for more user-friendly<br>
> reading and maybe program a specific button on the XO keyboard that will<br>
> automatically read the selected text no matter what program is used for<br>
> opening the text.<br>
><br>
> Judging from Google search results for DTBooks, this technology is not<br>
> widespread at all. The other problem is that it sometimes uses recorded audio,<br>
> and the size of that is too large for the XO... I think the most important thing<br>
> is that TTS works with the reader that will open the 1.6M e-books from the Internet<br>
> Archive<br>
</div></div>> <<a href="http://www.xconomy.com/boston/2009/10/24/internet-archive-opens-1-6-million-e-books-to-olpc-laptops/" target="_blank">http://www.xconomy.com/boston/2009/10/24/internet-archive-opens-1-6-million-e-books-to-olpc-laptops/</a>>(are<br>
<div class="im">> you in this team?).<br>
><br>
> Also one important thing is to include cheap headphones with the laptop so<br>
> children could listen to reading without disturbing others and in<br>
> noisy environments ... another advantage of audio reading is much longer<br>
> battery life because you can turn off the LCD monitor and audio alone does<br>
> not consume much energy.<br>
><br>
> Let me know what you think.<br>
> All the best,<br>
> Gregor<br>
><br>
><br>
><br>
> On Mon, Oct 26, 2009 at 4:08 PM, Sayamindu Dasgupta <<a href="mailto:sayamindu@gmail.com">sayamindu@gmail.com</a>> wrote:<br>
</div><div class="im">
><br>
> Hi Gregor,<br>
> Thanks a lot for jumping in :-)<br>
><br>
> On Mon, Oct 26, 2009 at 2:38 AM, Gregor Kervina<br>
</div><div><div></div><div class="h5">> <<a href="mailto:gregor.kervina@gmail.com">gregor.kervina@gmail.com</a>> wrote:<br>
> > Dear Sayamindu Dasgupta, SJ Klein and other members of this list,<br>
> ><br>
> > I'm a student of electrical engineering from Europe and would like to share<br>
> > with you my very positive experience with text to speech technology that can<br>
> > in my opinion significantly increase the educational potential of XO if used<br>
> > in the right way.<br>
> ><br>
> > For the past 12 years (since I was 15 years old) I have been learning daily from<br>
> > e-books and the internet using text to speech software. I know this software is<br>
> > unpopular in the developed world; many people don't even know that it exists. On<br>
> > the other hand many people (including me) don't like reading long texts on<br>
> > LCD screens - that's why e-books are also not very popular.<br>
> ><br>
> > But unlike my friends I read 50+ e-books every year and also daily news on<br>
> > the internet - I just select the text, copy it, and CoolSpeech software<br>
> > (using the Mary voice) reads me all the text at speeds of 300 to 500 words per<br>
> > minute. In this way I can browse other sites or look at photos or just lie<br>
> > down and listen while my laptop is reading to me.<br>
> > Other people don't understand what is being read because it is too fast for<br>
> > them, but it can be learned quickly by starting with slower speeds.<br>
> ><br>
> > I think XO laptops should definitely have such software pre-installed, along with a<br>
> > video introduction on how to use it and what reading speeds users can expect<br>
> > after some practice.<br>
> > It is also ideal for children with poor eyesight.<br>
> ><br>
><br>
> This sounds awesome. Could you let us know if the text to speech<br>
> software you have in mind is free/open source and if it works on Linux?<br>
> I am also looking at DTBooks specifications for digital talking books<br>
> - do you know how useful/widespread this technology is?<br>
><br>
> Thanks,<br>
> Sayamindu<br>
><br>
><br>
> --<br>
> Sayamindu Dasgupta<br>
> [<a href="http://sayamindu.randomink.org/ramblings" target="_blank">http://sayamindu.randomink.org/ramblings</a>]<br>
><br>
><br>
><br>
</div></div>> ------------------------------------------------------------------------<br>
<div><div></div><div class="h5">><br>
> _______________________________________________<br>
> Bookreader mailing list<br>
> <a href="mailto:Bookreader@lists.laptop.org">Bookreader@lists.laptop.org</a><br>
> <a href="http://lists.laptop.org/listinfo/bookreader" target="_blank">http://lists.laptop.org/listinfo/bookreader</a><br>
</div></div></blockquote></div><br>