LinuxQuestions.org
Welcome to the most active Linux Forum on the web.
Home Forums Tutorials Articles Register
Go Back   LinuxQuestions.org > Forums > Linux Forums > Linux - Software
User Name
Password
Linux - Software This forum is for Software issues.
Having a problem installing a new program? Want to know which application is best for the job? Post your question in this forum.

Notices


Reply
  Search this Thread
Old 08-22-2015, 05:16 AM   #1
julianvb
Member
 
Registered: Apr 2009
Posts: 390

Rep: Reputation: 3
Linux-Compatible Natural-Sounding Text-to-Speech Synthesizer


2015-08-22

I am looking for a more natural-sounding text-to-speech synthesizer than eSpeak, which actually is very reliable and easy to use in a Linux script. Thus far I haven't been able to find such a product. I've tried several Wine-based TTS and found them hard to use and disappointing even though I don't mind paying a reasonable sum. Any help will be greatly appreciated.

Julianvb
 
Old 08-22-2015, 02:59 PM   #2
LinuxUser42
Member
 
Registered: Nov 2010
Distribution: Lubuntu, Raspbian, Openelec, messing with others.
Posts: 143

Rep: Reputation: 19
You AND Ken Starks (blog of Helios, and FOSS Force writer).
You might consider sending him a note, it would be nice if more people bonded together to help show the market need.
 
Old 08-22-2015, 03:47 PM   #3
ardvark71
LQ Veteran
 
Registered: Feb 2015
Location: USA
Distribution: Lubuntu 14.04, 22.04, Windows 8.1 and 10
Posts: 6,282
Blog Entries: 4

Rep: Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842
Quote:
Originally Posted by julianvb View Post
I am looking for a more natural-sounding text-to-speech synthesizer than eSpeak, which actually is very reliable and easy to use in a Linux script.
Hi...

I guess so! I tried listening to a sample of eSpeak here but I barely could understand what was being said.

Have you taken a look at Festival? You can try their online demo here. I found the voice of "Tom (English American male)" pleasant and easy to understand. I'm not sure how you can download the software from their download page but I did find Festvox, which I guess is a program that incorporates Festival's software, here.

Let us know how it goes...

Regards...
 
1 members found this post helpful.
Old 08-22-2015, 08:31 PM   #4
frankbell
LQ Guru
 
Registered: Jan 2006
Location: Virginia, USA
Distribution: Slackware, Ubuntu MATE, Mageia, and whatever VMs I happen to be playing with
Posts: 19,314
Blog Entries: 28

Rep: Reputation: 6137Reputation: 6137Reputation: 6137Reputation: 6137Reputation: 6137Reputation: 6137Reputation: 6137Reputation: 6137Reputation: 6137Reputation: 6137Reputation: 6137
According to Jonathan Nadeau, a blind Linux user and maintainer or Sonar Linux, an improved screenreader is sorely needed for Linux. One of his hopes is to help provide one.

He uses Orca in Sonar.
 
Old 08-24-2015, 11:10 PM   #5
julianvb
Member
 
Registered: Apr 2009
Posts: 390

Original Poster
Rep: Reputation: 3
Hi, LinuxUser42, ardvark71 and frankbell,

Thank you all for your helpful inputs. I've just received an e-mail from Alan W. Black, a TTS expert at Carnegie-Mellon University recommending Festival 2.4 and CMU Flite, a more portable and faster C version of Festival. Today I installed Flite via synaptic and tried it out on two of my Linux Mint computers. I found the (U.S. English) voices quite natural and its syntax very similar to that of eSpeak. Thus there is no need for me to modify my existing Linux show-and-tell scripts.

In case anyone is interested in an excellent free Chinese-language TTS, I recommend Ekho highly. I've been using it for 3 years. It encompasses Mandarin, Cantonese and other major Chinese languages. Its sound quality is more than sufficient for all my current needs. By the way, Ekho was originally designed as a TTS for the blind in China but now it benefits the entire society.

I'll definitely get in touch with Ken Starks to remind the commercial software world that there's room for it to contribute and benefit even in Linux.

Julianvb

Last edited by julianvb; 08-24-2015 at 11:36 PM.
 
Old 08-24-2015, 11:42 PM   #6
ardvark71
LQ Veteran
 
Registered: Feb 2015
Location: USA
Distribution: Lubuntu 14.04, 22.04, Windows 8.1 and 10
Posts: 6,282
Blog Entries: 4

Rep: Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842
Quote:
Originally Posted by julianvb View Post
Thank you all for your helpful inputs. I've just received an e-mail from Alan W. Black, a TTS expert at Carnegie-Mellon University recommending Festival 2.4 and CMU Flite, a more portable and faster C version of Festival. Today I installed Flite via synaptic and tried it out on two of my Linux Mint computers. I found the (U.S. English) voices quite natural and its syntax very similar to that of eSpeak. Thus there is no need for me to modify my existing Linux show-and-tell scripts.

In case anyone is interested in an excellent free Chinese-language TTS, I recommend Ekho highly. I've been using it for 3 years. It encompasses Mandarin, Cantonese and other major Chinese languages. Its sound quality is more than sufficient for all my current needs. By the way, Ekho was originally designed as a TTS for the blind in China but now it benefits the entire society.

I'll definitely get in touch with Ken Starks to remind the commercial software world that there's room for it to contribute and benefit even in Linux.
You're welcome, glad you found a solution that works.

If you would, please mark this thread as "SOLVED" by clicking on "Thread Tools" directly above your initial post. Thanks!

Regards...
 
Old 08-26-2015, 11:28 AM   #7
julianvb
Member
 
Registered: Apr 2009
Posts: 390

Original Poster
Rep: Reputation: 3
Hi, ardvark71,

I thought I did mark this thread as [Solved] yesterday from the first post, namely Aug 25. Thanks.

Julianvb
 
Old 08-26-2015, 02:23 PM   #8
ardvark71
LQ Veteran
 
Registered: Feb 2015
Location: USA
Distribution: Lubuntu 14.04, 22.04, Windows 8.1 and 10
Posts: 6,282
Blog Entries: 4

Rep: Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842
Quote:
Originally Posted by julianvb
I thought I did mark this thread as [Solved] yesterday from the first post, namely Aug 25. Thanks.
You did, yes. Thank you.

Regards...

Last edited by ardvark71; 08-26-2015 at 02:25 PM. Reason: Changed information.
 
Old 11-05-2015, 07:07 PM   #9
newsgrabber
LQ Newbie
 
Registered: Jan 2006
Posts: 3

Rep: Reputation: 0
There's also a combination of espeak+mbrola voices. But this sounds even worse. Especially when run from gespeak because it makes unwanted gaps between words.

(BTW: if your gespeak does not see any mbrola voices installed symlink the folder espeak data to another location:
ln -s /usr/lib/i386-linux-gnu/espeak-data/ /usr/share/espeak-data
the original folder might be also in /usr/lib/x86_64-linux-gnu/espeak-data, see: bugs.launchpad.net for details).
 
Old 11-06-2015, 12:35 AM   #10
julianvb
Member
 
Registered: Apr 2009
Posts: 390

Original Poster
Rep: Reputation: 3
Hi, newsgrabber,

Thanks very much for your interesting input. Please let me know when you come across a reasonably priced natural-sounding TTS for Linux.

Julianvb
 
Old 11-07-2015, 04:10 AM   #11
newsgrabber
LQ Newbie
 
Registered: Jan 2006
Posts: 3

Rep: Reputation: 0
Quote:
Originally Posted by julianvb View Post
Hi, newsgrabber,

Thanks very much for your interesting input. Please let me know when you come across a reasonably priced natural-sounding TTS for Linux.
write to me @ poczta.onet.pl
 
Old 11-07-2015, 06:23 AM   #12
ondoho
LQ Addict
 
Registered: Dec 2013
Posts: 19,872
Blog Entries: 12

Rep: Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053
just for kicks i compiled flite from here and it sounds much better than the example from here.
http://iki.fi/dt/stuff/theraven.ogg
(you must also download & use the voices, but even without them it sounds better than the espeak example)

Last edited by ondoho; 11-07-2015 at 06:25 AM.
 
Old 01-06-2017, 09:41 PM   #13
julianvb
Member
 
Registered: Apr 2009
Posts: 390

Original Poster
Rep: Reputation: 3
2017-01-06

I am happy to report that I recently came across Cepstral's Swift TTS, a commercial TTS compatible with WIndows, OSX and Linux. I've tested its Linux David voice briefly, which sounds quite natural and I found the syntax very user-friendly. About a dozen voices are available and they are priced from $10 to $45. According to Cepstral, a licensed copy of swift may be used on only one computer and all its user-created swift output files may not be used on any computer not having its own Cepstral license. I like the firm's pre-purchase policy of allowing the public free testing of their TTS products.

I hope my information will be helpful to Linux users who are still searching for a natural-sounding commercial TTS.
Anyone interested in the Swift TTS will benefit from visiting http://www.cepstral.com and reading its very informative pages.

Julianvb
 
1 members found this post helpful.
Old 01-06-2017, 09:52 PM   #14
ardvark71
LQ Veteran
 
Registered: Feb 2015
Location: USA
Distribution: Lubuntu 14.04, 22.04, Windows 8.1 and 10
Posts: 6,282
Blog Entries: 4

Rep: Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842Reputation: 842
Hi Julian...

Thank you for your update, I'm glad you found another product that fits your criteria. Perhaps your information will be helpful for others looking for this kind of software.

Regards...
 
Old 01-07-2017, 07:05 AM   #15
ondoho
LQ Addict
 
Registered: Dec 2013
Posts: 19,872
Blog Entries: 12

Rep: Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053Reputation: 6053
not sure why i had to compile it myself then, but flite is in the repos for at least archlinux, ubuntu and debian.
probably most distros.

so after installing with package management it comes with a basic male voice, and 'slt', a very soft (and more natural imo) female voice:
Code:
$ flite -h
flite: a small simple speech synthesizer
  Carnegie Mellon University, Copyright (c) 1999-2011, all rights reserved
  version: flite-2.0.0-release Dec 2014 (http://cmuflite.org)
usage: flite TEXT/FILE [WAVEFILE]
  Converts text in TEXTFILE to a waveform in WAVEFILE
  If text contains a space the it is treated as a literal
  textstring and spoken, and not as a file name
  if WAVEFILE is unspecified or "play" the result is
  played on the current systems audio device.  If WAVEFILE
  is "none" the waveform is discarded (good for benchmarking)
  Other options must appear before these options
  --version   Output flite version number
  --help      Output usage string
  -o WAVEFILE Explicitly set output filename
  -f TEXTFILE Explicitly set input filename
  -t TEXT     Explicitly set input textstring
  -p PHONES   Explicitly set input textstring and synthesize as phones
  --set F=V   Set feature (guesses type)
  -s F=V      Set feature (guesses type)
  --seti F=V  Set int feature
  --setf F=V  Set float feature
  --sets F=V  Set string feature
  -ssml       Read input text/file in ssml mode
  -b          Benchmark mode
  -l          Loop endlessly
  -voice NAME Use voice NAME (NAME can be filename or url too)
  -voicedir NAME Directory contain voice data
  -lv         List voices available
  -add_lex FILENAME add lex addenda from FILENAME
  -pw         Print words
  -ps         Print segments
  -psdur      Print segments and their durations (end-time)
  -pr RelName Print relation RelName
  -voicedump FILENAME Dump selected (cg) voice to FILENAME
  -v          Verbose mode
$ echo "Hello World!" | flite -voice slt
still i decided to download more voices:
Code:
$ cd
$ mkdir .config/flite && cd .config/flite
$ wget -r --no-parent --no-directories --accept flitevox http://www.festvox.org/flite/packed/flite-2.0/voices/
$ echo "Hello World!" | flite -voice ./cmu_us_axb.flitevox
that's a big download, ~550MB.
but it's also possible to use the voices without downloading them first:
Code:
echo "Hello World!" | flite -voice http://www.festvox.org/flite/packed/flite-2.0/voices/cmu_us_axb.flitevox
 
  


Reply



Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off



Similar Threads
Thread Thread Starter Forum Replies Last Post
Best Commercial Speech Synthesizer? Geremia Linux - Software 2 09-23-2014 10:39 PM
Text to Speech - Speech to Text in Linux & C/C++ aarsh Linux - Newbie 2 05-15-2012 11:46 AM
LXer: This week at LWN: The eSpeak Speech Synthesizer LXer Syndicated Linux News 0 07-07-2006 10:21 AM
Does anyone use festival speech synthesizer? gervin Linux - Software 0 01-25-2004 05:43 PM

LinuxQuestions.org > Forums > Linux Forums > Linux - Software

All times are GMT -5. The time now is 11:05 PM.

Main Menu
Advertisement
My LQ
Write for LQ
LinuxQuestions.org is looking for people interested in writing Editorials, Articles, Reviews, and more. If you'd like to contribute content, let us know.
Main Menu
Syndicate
RSS1  Latest Threads
RSS1  LQ News
Twitter: @linuxquestions
Open Source Consulting | Domain Registration