<!-- X-URL: http://www.gnu.org/software/gnuspeech/gnuspeech.html --> <!-- <BASE HREF="http://www.gnu.org/software/gnuspeech/gnuspeech.html"> --> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN"> <HTML> <HEAD> <TITLE>Gnuspeech - GNU Project - Free Software Foundation (FSF)</TITLE> <LINK REV="made" HREF="mailto:webmasters@www.gnu.org"> <META NAME="keywords" CONTENT="gnuspeech articulatory speech synthesis tube model distinctive region formant sensitivity"> <meta http-equiv="content-type" content='text/html; charset=us-ascii'> </HEAD> <BODY BGCOLOR="#FFFFFF" TEXT="#000000" LINK="#1F00FF" ALINK="#FF0000" VLINK="#9900DD"> <A HREF="http://www.gnu.org"><IMG SRC="http://www.gnu.org/graphics/gnu-head-sm.jpg" ALT=" [image of the Head of a GNU] " WIDTH="129" HEIGHT="122"></A> <P> <H1><A NAME="TOC">gnuspeech</A></H1> <OL> <LI><A NAME="TOCWhatis" HREF="#Whatis">What is gnuspeech?</A> <LI><A NAME="TOCReleases" HREF="#Releases">Releases?</A> <UL> <LI><A NAME="TOCDevelopment" HREF="#Development"> Development &amp; &#147;Coming Soon&#148;</A> <!-- <LI><A NAME="TOCStable" HREF="#Stable">Current Stable Release</A> <LI><A NAME="TOCHistory" HREF="#History">Release History</A> --> </UL> <!-- <LI><A NAME="TOCPlatforms" HREF="#Platforms">Supported Platforms</A> --> <LI><A NAME="TOCWhy" HREF="#Why">Why is it called gnuspeech?</A> <LI><A NAME="TOCObtaining" HREF="#Obtaining">Obtaining gnuspeech</A> <LI><A NAME="TOCHelp" HREF="#Help">Getting help with gnuspeech</A> <UL> <LI><A NAME="TOCManuals" HREF="#Manuals">Manuals</A> <!-- <LI><A NAME="TOCFAQ" HREF="#HelpFAQ">FAQ</A> <LI><A NAME="TOCHelpMailing" HREF="#HelpMailing">Mailing Lists</A> <LI><A NAME="TOCHelpUsenet" HREF="#HelpUsenet">Usenet</A> --> </UL> <LI><A NAME="TOCFindingPackages" HREF="#FindingPackages">Finding additional packages for gnuspeech</A> <LI><A NAME="TOCFurther" HREF="#Further">Further information</A> <LI><A NAME="TOCYouHelp" HREF="#YouHelp"> If you want to help with gnuspeech</A> </OL> <BR> <HR> <H2><A NAME="Whatis" 
HREF="#TOCWhatis">What is gnuspeech?</A></H2> <P> Gnuspeech is an extensible, text-to-speech package, based on real-time, articulatory, speech-synthesis-by-rules. That is, it converts text strings into phonetic descriptions, aided by a pronouncing dictionary, letter-to-sound rules, rhythm and intonation models; transforms the phonetic descriptions into parameters for a low-level articulatory synthesiser; and uses these to drive an articulatory model of the human vocal tract producing an output suitable for the normal sound output devices used by GNU/Linux. The research that provides the foundation of the system was carried out in research departments in France, Sweden, Poland, and Canada. Some of the features of gnuspeech and associated tools include: <UL> <LI>A Tube Resonance Model (TRM) for the human vocal tract (also known as a transmission-line analog, or a waveguide model) that truly represents the physical properties of the tract, including energy balance between the nasal and oral cavities as well as the radiation impedance at lips and nose. <LI>A control model for the TRM based on formant sensitivity analysis that allows accurate specification of the relevant vocal tract configurations for speech and comprising a low-level articulatory model having a small number of parameters and a low bit rate. The model is based on research at KTH in Stockholm, LCTI (ENST) in Paris, and The University of Calgary. <LI>Databases specifying the articulatory postures and control dynamics required to produce English speech from an augmented phonemic input. Some French vowels are also included. <LI>Models of English rhythm and intonation based on research at IPO in The Netherlands, the University of Essex (UK) and the University of Calgary. <LI><I>&#147;Monet&#148;</I>&#8212;a GUI-based database creation and editing system that allows the phonetic data and dynamic rules to be set up and modified for arbitrary languages. 
The MONET real-time engine also translates augmented phonetic strings into synthesiser parameters. <LI>A text-to-augmented-phonetics module to convert arbitrary text, preferably with normal punctuation, into the input required by the MONET engine. This also provides the API for the text-to-speech system. <LI>A 70,000+ word English pronouncing dictionary with rules for derivatives such as plurals and adverbs. The dictionary also provides part-of-speech information for later addition of grammatical parsing and includes 6000 given names. <LI>Sub-dictionaries that allow different user- or application-specific pronunciations to be substituted for the default pronunciations coming from the main dictionary. <LI>Letter-to-sound rules to deal with spellings and words that are not in the dictionaries. <LI>Tools for managing the dictionary and carrying out analysis of speech. <LI><I>&#147;Synthesiser&#148;</I>&#8212;a GUI-based application to allow experimentation with a stand-alone TRM. All parameters may be varied and the output monitored and analysed. It was an important component in the research needed to create the original English speech databases. </UL> <A NAME="diagram"></A> <CENTER> <!-- <IMG SRC="http://savannah.gnu.org/cgi-bin/viewcvs/*checkout*/software/gnuspeech/tts-block-diagram.png?rev=HEAD&cvsroot=www.gnu.org&content-type=image/jpeg"> --> <IMG SRC="http://www.gnu.org/software/gnuspeech/tts-block-diagram.png"> <BR> <BR> <H2> Overview of the main Articulatory Speech Synthesis System </H2> </CENTER> </P> <H2><A NAME="Why" HREF="#TOCWhy">Why is it called gnuspeech?</A></H2> <P> It is a play on words. This is a new approach to speech synthesis from text. It is also a GNU project, aimed at providing high quality text-to-speech output for GNU/Linux. In addition, it provides a comprehensive tool for psychophysical and linguistic experiments. <H2><A NAME="Releases" HREF="#TOCReleases">Releases</A></H2> <P> <B>gnuspeech</B> is currently under development. 
It is being ported from an original NeXTSTEP 3.x version to run under GNU/Linux. No full GNU/Linux release is currently available, but a release of the interactive Monet system for Mac OS/X and GNUstep is available, with some work remaining to be completed for the GNUstep version (as of 2008-05-23&#8212;see next section for details). </P> <UL> <LI><b><A NAME="Development" HREF="#TOCDevelopment"> Development &amp; &#147;Coming Soon&#148;</A></b> <P> <B>gnuspeech</B> is being ported both to GNU/Linux and to the Macintosh under OS/X. There are a number of components/apps/modules which have to be ported. Some have already been ported. Interested persons are invited to contact the authors/developers through the <A HREF="http://savannah.gnu.org/projects/gnuspeech" TARGET="gnu2">gnu project facilities</A>. To join the mailing list, please visit <A HREF="http://mail.gnu.org/mailman/listinfo/gnuspeech-contact" TARGET="gnu3">the subscription page</A>. The current state of the project is as follows: </P> <UL> <LI><B>Monet:</B><BR> The interactive language database and testing tool used to create the original databases for English Text-To-Speech conversion using the new articulatory model of the vocal tract (the &#8220;tube model&#8221;&#8212;basically a wave-guide or lattice filter that emulates the properties of the acoustic tube directly rather than through the use of formant filters etc). <I>Monet</I> translates its symbolic input into a digital waveform representing the &#8220;spoken&#8221; version of the input. <I>Monet</I> was originally designed and developed by Craig Schock (based on an original specification by David Hill, with testing and suggestions for improvements by David Hill and Leonard Manzara) as proprietary research software used in-house for the development of the Trillium Sound Research <I>TextToSpeech</I> package offered on the NeXT computer. It also was available as part of the Trillium <I>Experimenter</I> kit. 
On the demise of NeXT (whose remains were bought by Apple Computer), <I>Monet</I> and all other Trillium software were reconfigured as a GNU project (<I>gnuspeech</I>) and made available to the community under a <A HREF="http://www.gnu.org/copyleft/">General Public Licence</A>. It can be found at the web site <A HREF="http://savannah.gnu.org/projects/gnuspeech/" TARGET="gnuspeech2">http://savannah.gnu.org/projects/gnuspeech/</A>. To access the sources for <I>Monet</I> and other components, click on &#8220;-Browse Sources Repository&#8221; under &#8220;Development Tools&#8221;. Monet is there under &#8220;&#8230;/current/Applications&#8221; but requires the tube model and other components to compile (see &#8220;&#8230;/Frameworks&#8221; and &#8220;&#8230;/Tools&#8221; under &#8220;current&#8221;). At present, complete compilation is only possible under Macintosh OS/X 4.3 or later, though the sources are being modified to compile under GNUstep as well and this may introduce certain minor glitches in the Mac OS/X compilation from time to time. The big hold-up in getting full compilation under GNUstep is the lack of suitable audio output facilities under GNUstep. Compilation under Mac OS/X uses Core Audio and the plan is to implement the needed components of Core Audio for GNUstep. Two people concerned with the ongoing GNUstep development (Greg Casamento&#8212;the Chief GNUstep maintainer&#8212;and Robert Slover) have been considering the problem. Both have been extremely busy&#8212;especially Greg after taking over as Chief on the GNUstep project. The implementation is on Robert's &#8220;to-do&#8221; list. Until then, those wishing to try out Monet and do further development will have to work on the Mac using the source which is designed to compile under either OS/X xcode/interface builder, or under GNUstep. 
The Mac port, done by Steve Nygard following his experience at OmniGroup, is pretty well complete except for a few items such as modifying the intonation patterns for the automatically generated speech. Steve had worked on the original NeXT implementation for Trillium whilst he was at the University of Calgary. Monet&#8217;s emulation of the human vocal tract depends on research carried out by Fant and his colleagues at the Speech Technology Lab at KTH, Stockholm on formant sensitivity analysis, and by René Carré at the ENST Dept. of Signals in Paris on the &#8220;Distinctive Region Model&#8221; for controlling the artificial vocal tract. <LI><B>The tube model:</B><BR> This was originally a &#8216;C&#8217; implementation of the tube model that forms the core of the synthesis system, and was created by Leonard Manzara who also ported it to the DSP56001 signal processor and made it run in real-time. It is based on work by Perry Cook and Julius Smith at the <I>Stanford University Center for Computer Research in Music and Acoustics</I> (CCRMA). The version required to compile <I>Monet</I> is available in the same repository as <I>Monet</I>, but under &#8220;...current/Tools/softwareTRM&#8221;. A copy of the original &#8216;C&#8217; version is available in the repository under &#8220;gnuspeech/gnuspeech/trillium/src/softwareTRM/tube.c&#8221;. <LI><B>Synthesizer:</B><BR> This is not, in fact, a complete synthesizer! It is an interactive application that allows a user (usually a language developer or someone interested in the behaviour of the tube model) to interact directly with the tube model, listen to the output under different static conditions, and analyse the output. It was an important tool used in developing the databases for the original British English <I>TextToSpeech</I> system because it allowed the tube configurations needed to define the speech &#8220;postures&#8221; (of the vocal tract) to be explored and finalised. 
Although it has built-in analysis and display features, it was also used in conjunction with a Kay <I>Sonagraf</I> spectrum analyser that was used to analyse the spectrum of natural speech in order to compare the spectral analyses of putative &#8220;postures&#8221; with what was seen in natural speech in a form that was the same for both. The <I>Sonagraf</I> was also used to check the output of <I>Monet</I> against the same utterances in natural speech. <I>Synthesizer</I> is 70% ported to the Mac under OS/X but none of the new sources is yet available. I (David Hill) am the one working on this, but I keep getting diverted. It should have been finished 6 months ago! Real soon now! The original version of <I>Synthesizer</I> was created (for the NeXT) by Leonard Manzara. <LI><B>PrEditor:</B><BR> This was an application to allow users to create and maintain their own dictionaries. The original <I>TextToSpeech</I> kit looked up several dictionaries in the order User, Application and Main. <I>PrEditor</I> allows the User and Application dictionaries to be created and maintained. An initial port was begun by Eric Zoerner and is in a sub-subdirectory under the same subdirectory as <I>Monet</I>. It is not yet functional. The original <I>PrEditor</I> on the NeXT was written by Vince DeMarco and David Marwood, documented by Leonard Manzara and later upgraded by Michael Forbes. <LI><B>The &#8220;Main&#8221; dictionary:</B><BR> This has not really changed since the original NeXT implementation and is incorporated as a module in the source code for Monet. Its pronunciation is a hybrid of British (RP) English (mainly the vowels and related stuff) and General American (especially the rhotic &#8220;r&#8221; sound). It includes around 70,000 words, plus facilities for creating/checking derivatives such as plurals, adverbs &#8230;, and information concerning word stress, and part-of-speech. The part-of-speech information is still not used. 
The main dictionary was compiled mainly by me, David Hill, after a preliminary version plus creation tools were set up by Craig Schock. <LI><B>BigMouth:</B><BR> (Not to be confused with a different app of a similar name by a different company). This was an application that allowed text-to-speech to be tried out without reference to any particular application on the NeXT and also drove the speech service. It used the <I>TextToSpeech Server</I> that ran as a daemon, started at boot time. It has yet to be ported (see also the next item on <I>Real-time Monet</I>). The original source for <I>BigMouth</I> was created by Leonard Manzara. <LI><B>Real-time Monet and the TextToSpeech Server (TTS Server):</B><BR> Monet incorporates all kinds of interactive interfaces for creating and modifying the databases relating to the language being created or managed. It also has the means to use these databases to create the output speech waveform. The original NeXT-based <I>TextToSpeech Kit</I> came in three versions: the <I>User Kit</I>, which simply provided speech output as a service available to any application; the <I>Developer Kit</I>, which provided the means to incorporate speech into applications directly; and the <I>Experimenter Kit</I>, which allowed full access to all the tools used by Trillium in developing language databases, including dictionaries. All of these used the <I>TextToSpeech Server</I> for the actual conversion of text to speech output. The task was made easier on the NeXT, which was relatively slow, by using the built-in DSP (a Motorola DSP-56001). In the Mac implementation of <I>Monet</I> and <I>Synthesizer</I>, the host computer performs all the computation&#8212;as CPU speeds are two orders of magnitude or more faster than the old NeXT. 
The use of the DSP on the NeXT also gave a certain absolute separation between the tasks associated with creating the event framework for synthesis, and the tasks associated with transforming the event framework into the digital speech waveform (<I>Real-time Monet</I>) and outputting it&#8212;the latter tasks being carried out by the tube model. Thus the tube model ran on the DSP in real-time and communicated by DMA access. There was also a &#8216;C&#8217; version of the tube model which could not run in real-time. It was useful for producing a slightly higher quality of speech since it did not have to be squeezed into the DSP and rigorously optimised because of the marginal ability (even on the DSP) to run in real-time. The &#8216;C&#8217; version of the tube model is what forms the basis of the current port&#8212;possible because of the greatly increased processor speeds these days. <BR> &#8195; <I>Real-time Monet</I> is a stripped-down version of <I>Monet</I>. All the database creation and manipulation components are absent, as are all interactive interfaces. On the NeXT version, the defaults database was used to hold the parameters for controlling static aspects of the synthesis (tube length, mean pitch, and so on&#8212;the so-called &#8220;utterance-rate parameters&#8221;) and <I>Real-time Monet</I> computed the event framework from the input text via an intermediate input syntax which resulted from pre-processing the text. This pre-processing included dictionary look-up to get the correct pronunciation (deficient in the sense that there was no grammatical parsing or attempt to determine meaning, so that different pronunciations of words with the same spelling could not be disambiguated). 
The word stress information from the dictionary was used to determine the rhythmic framework according to the Jones/Abercrombie/Halliday (British) &#8220;tendency-towards-isochrony&#8221; theory of British English speech by placing &#8220;foot&#8221; boundaries before the word stress in words having word-stressed syllables. The punctuation was also used in this process, and allowed a distinction to be made between statements, emphatic statements, questions, and questions expecting a yes/no answer for purposes of selecting different intonation contours (not ever really done totally satisfactorily). Without using knowledge of meaning, it was hard to decide where the tonic (information point) of the phrase or sentence should be marked, which means that the tonic foot was generally placed in phrase/sentence final position by default. This causes some degradation of the speech rhythm and intonation and is the first deficiency that should be corrected. <BR> &#8195; That said, <I>Real-time Monet</I> and the <I>TextToSpeech</I> server have yet to be ported or rewritten for GNU/Linux and the Mac. The current <I>Monet</I> port, like the original <I>Monet</I>, incorporates the tube model to generate output and expects the output of the text pre-processor as input. A new applet (unfortunately named &#8220;GnuSpeech&#8221; and presently residing in the &#8220;gnuspeech/current/Frameworks&#8221; folder) allows plain text to be converted into the syntax needed for the current version of <I>Monet</I>. Steve Nygard recently &#8220;tidied things up&#8221;, following comments from people on the list, and I haven't checked out the resulting new arrangements to see if I can still understand the relationships well enough to compile it all, having many balls in the air. Any time I spend will be finishing <I>Synthesizer</I>. Knowing Steve, I am sure there&#8217;s no problem with compiling <I>Monet</I> and associated modules in their re-arranged form. 
Please communicate your experience on the mailing list (to join, visit <A HREF="http://mail.gnu.org/mailman/listinfo/gnuspeech-contact" TARGET="gnu3">the subscription page</A>). <BR> &#8195; There's a diagram of the relationships between the various TTS components of the complete system <A HREF="#diagram">above</A>. <LI><B>ServerTest and ServerTestPlus:</B><BR> This was an interactive module to allow the functioning of the <I>TextToSpeech Server</I> to be tested as it was running. There were originally two versions (plain and Plus), the latter having a number of &#8220;hidden&#8221; methods that were restricted to Trillium's &#8220;in-house&#8221; use. Now that the whole system is available under a GPL, the restricted &#8220;ServerTest&#8221; version is obsolete and the name <I>ServerTest</I> will refer to a reimplementation of <I>ServerTestPlus</I>. One of the 18 originally-hidden methods allowed plain text to be converted into the intermediate <I>Real-time Monet</I> input syntax. It was hidden to keep the main dictionary material proprietary, as it could have been used to decode the encoded dictionary. This particular function is currently provided by the misleadingly-named <I>GnuSpeech</I> applet (see above). <I>ServerTest</I> will be needed once the <I>TextToSpeech Server</I> has been re-implemented&#8212;something that has not yet been done. The original versions were written by Leonard Manzara. <LI><B>WhosOnFirst:</B><BR> <I>WhosOnFirst</I> was the first publicly available software associated with the Trillium <I>TextToSpeech</I> system and was designed as a bit of a teaser. As issued, it provided indication, on the NeXT console, of remote logins. It also told the user that if they had the Trillium <I>TextToSpeech</I> system, they could get voice alerts not only to remote logins, but other system activity such as application launches. 
The App was written by Craig Schock and was instrumental in catching and identifying a hacker trying to break into our system soon after it was set up. <I>WhosOnFirst</I> has not yet been ported and for best value must await a ported version of the <I>TextToSpeech Server</I>. <LI><B>say:</B><BR> A command line interface to the <I>TextToSpeech Server</I> that can be used from a terminal or in shell scripts. It was written by Craig Schock and has not been ported yet. <LI><B>SpeechManager:</B><BR> The SpeechManager was provided to allow the <I>TextToSpeech Server</I> parameters to be optimised for different systems since no particular setting of priorities, initial silence fill, and so on could be right for all systems. In particular, in networked systems, or systems with a high compute load from other tasks, the speech would sometimes crackle due to interference from other tasks. The App, which could only be run as root, allowed the <I>TextToSpeech Server</I> to be restarted, and the various parameters controlling priority and so on to be set to new values to avoid crackling whilst minimising the use of system resources. It may be that these functions are obsolete these days, given the increased compute power available. Some functions (such as reporting the version of the main dictionary in use, or restarting the <I>TextToSpeech Server</I>) may still be required when the TTS Server is reimplemented. The original App was written by Craig Schock. It has not been ported. <LI><B>SpeechRegistrar:</B><BR> An applet that was provided to allow any of the <I>TextToSpeech Kits</I> to be registered, using a password, and run under the root account. The original function is now obsolete, but may be useful, in revised form, as a way of building user groups for the ported system. It was written by Craig Schock. It has not been ported. 
<LI><B>TrilliumSoundEditor:</B><BR> This was a speech editor and analysis program intended to provide a more versatile replacement for the publicly available <I>Sonagram</I> program written by Hiroshi Momose. Although <I>TrilliumSoundEditor</I> was never completely finished, it provided the basic functionality required for speech development and could be finished/upgraded/ported at some point in the future. The program was written by Craig Schock. None of the App has yet been ported. </UL> <P> As a summary, much of the core software has been or is being ported to the Mac under OS/X, but porting anything that &#8220;speaks&#8221; is blocked from completion under GNU/Linux by the lack of suitable audio output facilities. Thus <I>Monet</I> has been ported to the Mac under OS/X using xcode/InterfaceBuilder and it produces speech from input text as well as providing the development facilities for managing and creating language databases for text-to-speech. The <I>Monet</I> source will also compile, more or less, under GNUstep within the GNU/Linux environment but without built-in speech output facilities. The sources are in the gnuspeech repository (see below). <I>Synthesizer</I> is in the process of being ported to Mac OS/X using xcode/InterfaceBuilder and is about 70% complete. Sources are not yet publicly available. <I>PrEditor</I> is in the process of being ported and the sources are in the gnuspeech repository. Some accessory tools are available. There is an immediate need to port the <I>TextToSpeech Server</I> (the daemon, or stripped version of <I>Monet</I>), and stripping the current <I>Monet</I> is likely a better approach than porting the original for both the Mac and GNU/Linux versions, based on a source that will compile for either using conditional compilation&#8212;as for the current <I>Monet</I>. Other items are as noted in the text above. 
Robert Slover has undertaken to solve the audio output requirement for GNU/Linux; he just needs time beyond that devoted to the work that earns his living! Greg Casamento has simply run out of resources for taking on this task as he is now the chief GNUstep maintainer. </P> <H2><A NAME="Obtaining" HREF="#TOCObtaining">Obtaining gnuspeech</A></H2> <P> Gnuspeech is currently fully available as a NeXTSTEP 3.x version, and partly available (specifically <I>Monet</I>) as a version that compiles for both Mac OS/X and GNU/Linux under GNUstep. These files are available in the CVS repository. </P> <H2><A NAME="Help" HREF="#TOCHelp">Getting Help with gnuspeech</A></H2> <P> Developers should contact the authors/developers through the <A HREF="http://savannah.gnu.org/projects/gnuspeech" TARGET="gnu2">gnu project facilities</A>. To join the mailing list, please visit <A HREF="http://mail.gnu.org/mailman/listinfo/gnuspeech-contact" TARGET="gnu3">the subscription page</A>. Papers and manuals are available on-line (see below). </P> <H2><A NAME="Manuals" HREF="#TOCManuals">Manuals and papers</A></H2> <P> A number of papers and manuals relevant to gnuspeech exist: </P> <UL> <LI><A HREF="http://www.cpsc.ucalgary.ca/~hill/papers/monman/index.html" TARGET="new">The <I>&#147;Monet&#148;</I> manual</A> provides a detailed view of the facilities and screens associated with the MONET subsystem, but does not describe the MONET engine that is used for real-time construction of parameters. <LI><A HREF="http://www.cpsc.ucalgary.ca/~hill/papers/synthesizer/index.html" TARGET="new">The <I>&#147;Synthesizer&#148;</I> manual</A> provides a detailed view of the interactive application that allows access to all the parameters and facilities of the Tube Resonance Model (TRM) synthesiser as a learning tool and a development tool. 
<LI><A HREF="http://www.cpsc.ucalgary.ca/~hill/papers/avios95/index.htm" TARGET="new">A paper presented at the American Voice I/O Society conference</A> in 1995 provides a reasonably detailed explanation of the theory underlying the tube resonance model. <LI><A HREF="http://www.cpsc.ucalgary.ca/~hill/papers/conc/index.htm" TARGET="new">A heavily cross-referenced &#147;conceptionary&#148;</A> is available to provide access to some of the background terms and research in the relevant scientific fields. <LI><A HREF="http://www.cpsc.ucalgary.ca/~hill/papers/pronguid.htm" TARGET="new">A guide to the pronunciation notation used in the text-to-speech work</A> showing the relationship between standard forms (IPA, Websters) and the ASCII-friendly form used in gnuspeech, with examples of actual pronunciations. <LI><A HREF="./trm-write-up.pdf" TARGET="new">The Tube Resonance Model</A>, a write-up of the waveguide model of the acoustic tubes that form the underlying model of the human vocal apparatus. </UL> <H2><A NAME="FindingPackages" HREF="#TOCFindingPackages">Finding packages for gnuspeech</A> </H2> <P> There is the original NeXTSTEP Developer Package, which is available under a GPL, but does not run under GNU/Linux. There is also now a version of the full <I>&#147;Monet&#148;</I> system for Mac OS/X and GNUstep that provides the core of the text-to-speech development facilities and allows arbitrary text to be changed to speech. Note that further work is needed to strip this version to make a daemon-like module for incorporation within applications, or as a service, as noted above. Check out <A HREF="http://savannah.gnu.org/projects/gnuspeech" TARGET="gnu4">the Savannah CVS repository</A> and search for &#147;gnuspeech&#148;. Current work is under the current directory. 
<H2><A NAME="Further" HREF="#TOCFurther">Further information?</A></H2> <P> See the section on <A HREF=#Manuals>Manuals and papers</A> </P> <H2><A NAME="YouHelp" HREF="#TOCYouHelp">How to help with gnuspeech</A></H2> <P> To contact the maintainers of gnuspeech, to report a bug, or to contribute fixes or improvements, to join the development team, or to join the <I>gnuspeech</I> mailing list, please visit<A HREF="http://savannah.gnu.org/projects/gnuspeech" TARGET="gnu4"> the <I>gnuspeech</I> project page</A> and use the facilities provided. <P> <HR> Return to <A HREF="http://www.gnu.org">GNU's home page</A>. <P> Please send FSF &amp; GNU inquiries &amp; questions to <A HREF="mailto:gnu@gnu.org"><EM>gnu@gnu.org</EM></A>. </P> <P> We thank David Hill for writing this page. <P> Please send comments on these web pages to <A HREF="mailto:webmasters@www.gnu.org"><EM>webmasters@www.gnu.org</EM></A>, send other questions to <A HREF="mailto:gnu@gnu.org"><EM>gnu@gnu.org</EM></A>. <P> Copyright (C) 1998, 2001 Free Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA 02111, USA <P> Verbatim copying and distribution of this entire article is permitted in any medium, provided this copyright notice is preserved.<P> <HR> <P> Page last updated 2008-10-16 @ 19:53 PDT </P> <HR> </BODY> </HTML>