Linux-Compatible Natural-Sounding Text-to-Speech Synthesizer
2015-08-22
I am looking for a text-to-speech synthesizer that sounds more natural than eSpeak, which is otherwise very reliable and easy to use in a Linux script. So far I haven't been able to find one. I've tried several Wine-based TTS programs and found them hard to use and disappointing. I don't mind paying a reasonable sum. Any help will be greatly appreciated. Julianvb |
You AND Ken Starks (blog of Helios, and FOSS Force writer).
You might consider sending him a note; it would be nice if more people banded together to help show the market need. |
Quote:
I guess so! I tried listening to a sample of eSpeak here but I could barely understand what was being said. :( Have you taken a look at Festival? You can try their online demo here. I found the voice of "Tom (English American male)" pleasant and easy to understand. I'm not sure how to download the software from their download page, but I did find Festvox, which I guess is a program that incorporates Festival's software, here. :) Let us know how it goes... Regards... |
According to Jonathan Nadeau, a blind Linux user and maintainer of Sonar Linux, an improved screen reader is sorely needed for Linux. One of his hopes is to help provide one.
He uses Orca in Sonar. |
Hi, LinuxUser42, ardvark71 and frankbell,
Thank you all for your helpful input. I've just received an e-mail from Alan W. Black, a TTS expert at Carnegie Mellon University, recommending Festival 2.4 and CMU Flite, a faster and more portable C version of Festival. Today I installed Flite via Synaptic and tried it out on two of my Linux Mint computers. I found the (U.S. English) voices quite natural and its syntax very similar to that of eSpeak, so I hardly need to modify my existing Linux show-and-tell scripts. In case anyone is interested in an excellent free Chinese-language TTS, I highly recommend Ekho. I've been using it for 3 years. It covers Mandarin, Cantonese and other major Chinese languages, and its sound quality is more than sufficient for all my current needs. By the way, Ekho was originally designed as a TTS for the blind in China, but now it benefits the entire society. I'll definitely get in touch with Ken Starks to remind the commercial software world that there's room for it to contribute and benefit even in Linux. Julianvb |
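For anyone who wants one script to work on machines that have either engine, here is a minimal sketch. It assumes only that flite takes its text through the -t option and that espeak takes it as a plain argument. The function names pick_engine and speak are made up for this example and are not part of either program.

```shell
#!/bin/sh
# pick_engine (hypothetical helper): print the first command from the
# argument list that is actually installed, or fail if none is.
pick_engine() {
    for candidate in "$@"; do
        if command -v "$candidate" >/dev/null 2>&1; then
            printf '%s\n' "$candidate"
            return 0
        fi
    done
    return 1
}

# speak (hypothetical helper): say the text with flite if available,
# otherwise fall back to eSpeak.
speak() {
    engine=$(pick_engine flite espeak) || {
        echo "speak: neither flite nor espeak is installed" >&2
        return 1
    }
    case "$engine" in
        flite)  flite -t "$1" ;;   # flite takes the text via -t
        espeak) espeak "$1" ;;     # espeak takes the text as an argument
    esac
}

# Example call, left commented out so sourcing this file makes no sound:
# speak "Hello from my show-and-tell script"
```

A script that sources these two functions can call speak everywhere instead of naming an engine, so switching between eSpeak and Flite needs no further changes.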
Quote:
If you would, please mark this thread as "SOLVED" by clicking on "Thread Tools" directly above your initial post. Thanks! Regards... |
Hi, ardvark71,
I thought I did mark this thread as [Solved] yesterday from the first post, namely Aug 25. Thanks. Julianvb |
There's also the combination of espeak + mbrola voices, but this sounds even worse, especially when run from gespeak, because it inserts unwanted gaps between words.
(BTW: if your gespeak does not see any installed mbrola voices, symlink the espeak-data folder to another location:
ln -s /usr/lib/i386-linux-gnu/espeak-data/ /usr/share/espeak-data
The original folder might also be in /usr/lib/x86_64-linux-gnu/espeak-data; see bugs.launchpad.net for details.) |
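Since the data folder can sit in either of the two locations mentioned above, the symlink step can be sketched as a small function that tries each candidate in turn. This is only an illustration: link_espeak_data is a made-up name, and the paths in the usage comment are simply the ones from the post above.

```shell
#!/bin/sh
# link_espeak_data TARGET CANDIDATE...   (hypothetical helper)
# Create TARGET as a symlink to the first CANDIDATE directory that exists.
# If TARGET already exists, it is left untouched.
link_espeak_data() {
    target=$1
    shift
    if [ -e "$target" ] || [ -L "$target" ]; then
        echo "$target already exists, leaving it alone"
        return 0
    fi
    for src in "$@"; do
        if [ -d "$src" ]; then
            ln -s "$src" "$target" && echo "linked $target -> $src"
            return
        fi
    done
    echo "no espeak-data directory found" >&2
    return 1
}

# Usage (run as root, because /usr/share is normally not user-writable):
# link_espeak_data /usr/share/espeak-data \
#     /usr/lib/i386-linux-gnu/espeak-data \
#     /usr/lib/x86_64-linux-gnu/espeak-data
```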
Hi, newsgrabber,
Thanks very much for your interesting input. Please let me know when you come across a reasonably priced natural-sounding TTS for Linux. Julianvb |
just for kicks i compiled flite from here and it sounds much better than the example from here.
http://iki.fi/dt/stuff/theraven.ogg (you must also download & use the voices, but even without them it sounds better than the espeak example) |
2017-01-06
I am happy to report that I recently came across Cepstral's Swift TTS, a commercial TTS compatible with Windows, OS X and Linux. I've briefly tested its Linux David voice, which sounds quite natural, and I found the syntax very user-friendly. About a dozen voices are available, priced from $10 to $45. According to Cepstral, a licensed copy of swift may be used on only one computer, and output files created with it may not be used on any computer that does not have its own Cepstral license. I like the firm's policy of letting the public test its TTS products for free before purchase. I hope my information will be helpful to Linux users who are still searching for a natural-sounding commercial TTS. Anyone interested in the Swift TTS will benefit from visiting http://www.cepstral.com and reading its very informative pages. Julianvb |
Hi Julian...
Thank you for your update, I'm glad you found another product that fits your criteria. Perhaps your information will be helpful for others looking for this kind of software. :) Regards... |
not sure why i had to compile it myself then, but flite is in the repos for at least archlinux, ubuntu and debian.
probably most distros. so after installing with package management it comes with a basic male voice, and 'slt', a very soft (and more natural imo) female voice:
Code:
$ flite -h
Code:
$ cd
but it's also possible to use the voices without downloading them first:
Code:
echo "Hello World!" | flite -voice http://www.festvox.org/flite/packed/flite-2.0/voices/cmu_us_axb.flitevox
|
Quote:
https://code.google.com/archive/p/open-sapi/ Google Open SAPI can be installed on any Ubuntu-compatible machine, but there were some issues related to Wine versions. Therefore I made a VirtualBox machine which is ready to use, with all required libraries/versions on board, in case you are still interested. |
Hi, ardvark,
I wish I had known Cepstral's purchase policy before I decided to buy its Swift TTS. Unless a huge fee in addition to the purchase price has been paid in advance, the Cepstral software automatically stops the purchaser from using any voice files he/she has created with the TTS. I learned too late about this unacceptable and disappointing restriction and therefore would like to warn any potential buyers of this matter. I think I do understand the reason behind their policy, which they should have clearly declared to me at the outset. Julianvb |