[Sugar-devel] [DESIGN] Design Team agenda item: TTS, e-speak and voices

Chris Leonard cjlhomeaddress at gmail.com
Fri Jul 20 10:53:18 EDT 2012


On Thu, Jul 19, 2012 at 9:21 PM, Gonzalo Odiard <gonzalo at laptop.org> wrote:
> Thanks Chris for putting this topic on the table.
> Overall, I am not sure this is a topic for the Design Team,
> but anyway, I will add a few comments below...

Well, I'd argue that a slow drift towards "lock-in" on a core
technology involving an aspect of the user interface (TTS) merits some
Design Team consideration, though I will not dispute that the action
items fall more to core developers than to "graphic design".

> We need research:
> * Can we use mbrola voices? (http://espeak.sourceforge.net/mbrola.html)
>
> * Gnome-speech: When I implemented the TTS feature in Sugar, I
> tried to use gnome-speech to have a layer where we could use espeak or
> festival voices,
> but at the time I tested it, festival voices were not recognized,
> so I decided to use espeak only. We can change that if we find a better solution.
> We should test again (a quick test in F17 shows only a festival voice)

Yes, part of my purpose in posting this was to raise awareness and
perhaps get others interested in working with you on the speech engine
question.  It seems like too big a task to drop on one person, plus we
want the benefit of any other experience that might be out there.

> * Move speech to sugar-toolkit: we have a lot of code copy-pasted in
> the activities
> to implement tts. Having a central implementation should solve a lot
> of problems.

From an L10n perspective, I'd be happy to see the e-speak voice lists
from Speak and Memorize collapsed into a single place.
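
As a rough sketch of what a single shared voice list could look like
(the table contents, VOICES, DEFAULT_LOCALE and voice_for_locale are
all hypothetical names for illustration, not existing sugar-toolkit
code, and the voice identifiers would need checking against the
installed e-speak):

```python
# Hypothetical sketch: one shared table of e-speak voices keyed by locale.
# Every name and table entry here is an illustrative assumption, not
# existing sugar-toolkit, Speak, or Memorize code.

# locale code -> (e-speak voice identifier, translatable display name)
VOICES = {
    'en': ('en', 'English'),
    'en_US': ('en-us', 'English (America)'),
    'es': ('es', 'Spanish'),
    'es_419': ('es-la', 'Spanish (Latin America)'),
    'pt_BR': ('pt', 'Portuguese (Brazil)'),
}

DEFAULT_LOCALE = 'en'


def voice_for_locale(locale_code):
    """Return the (voice id, display name) pair for a locale.

    Tries the full code first (e.g. 'pt_BR'), then the bare language
    (e.g. 'es' for 'es_AR'), then falls back to DEFAULT_LOCALE.
    """
    # Strip an encoding suffix such as '.UTF-8' if present.
    code = locale_code.split('.')[0]
    if code in VOICES:
        return VOICES[code]
    language = code.split('_')[0]
    if language in VOICES:
        return VOICES[language]
    return VOICES[DEFAULT_LOCALE]
```

With one table like this, activities would look voices up instead of
carrying their own copies, and the display names would only need to be
translated once.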

> I think the problem is not espeak, but the gstreamer espeak plugin.

I stand corrected, but it is still a packaging issue, right?

re: new voice files
>
> I don't know how difficult this can be, but quidam said
> it is very difficult to create good voice files.
>
> I think we need to find other groups of experts in the topic
> and try to create alliances. Many of the voices are created by universities.

Agreed, this can't be a Sugar Labs solo effort.  One criterion for
anything that we lean on as heavily as e-speak for TTS should be an
assessment of the vitality of the upstream and the potential for
developing a collaborative effort (as opposed to taking on support of
the upstream package).

cjl

