AFsp ~ Audio File I/O Routines


The AFsp package is a library of routines for reading and writing
audio files. The emphasis is on providing support for the type of
audio file used by the speech processing research community. The
routines have been designed to be easy to use, yet provide transparent
support the reading of several audio file formats. A secondary
purpose for distributing these routines is to encourage the use of a
standard audio file format for the header information in the output
files.

The following file formats are supported for reading.

  • NIST SPHERE audio files
  • Sun/NeXT audio files
  • DEC audio files
  • IRCAM SoundFiles
  • INRS-Telecom audio files
  • ESPS sampled data feature files
  • Headerless audio files

The audio file open routine automatically senses the file type and
communicates it to the audio file reading routines. Formats are
converted on the fly as the file is read, so the user manipulates
floats and doesn’t need to worry about the underlying data format.

For writing, the routines produce a standard format file, though
options are available to produce headerless files if desired. This
standard format is a compatible with the Sun audio file format. There
is provision for storing extra information in the extensible part of
the header.

Several audio file utilities (for copying, comparing, and filtering
audio files) are included in the package.

www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/speech/systems/afsp/0

opencore-amr ~ Android Audio CODECS


Library of OpenCORE Framework implementation of Adaptive Multi Rate Narrowband and Wideband (AMR-NB and AMR-WB) speech codec. Library of VisualOn implementation of Adaptive Multi Rate Wideband (AMR-WB) encoder and Advanced Audio Coding (AAC) encoder. Modified library of Fraunhofer AAC decoder and encoder.

sourceforge.net/projects/opencore-amr

Balabolka ~ Text To Speech


Balabolka is a Text-To-Speech (TTS) program. All computer voices installed on your system are available to Balabolka. The on-screen text can be saved as a WAV, MP3, MP4, OGG or WMA file. The program can read the clipboard content, view text from documents, customize font and background colour, control reading from the system tray or by the global hotkeys.

www.cross-plus-a.com/balabolka

Simple TTS Reader ~ Windows Text To Speech


Simple TTS Reader is a small clipboard reader. Simply copy any text, and it will be read aloud. You can choose any installed speech engine, e.g. Microsoft Anna. This text-to-speech utility can also be minimized to tray.

Requires .NET Framework 2.0

Features:

  • Supports WinXP’s Sam and Vista’s Anna engines
  • Ability to minimize to tray
  • Small and simple
  • Reads copied to clipboard text
  • Includes an installer

simpletts

simplettsreader.sourceforge.net/
sourceforge.net/projects/simplettsreader/

WaveSurfer ~ Visualize & Manipulate Sound


WaveSurfer is an open source tool for sound visualization and manipulation. Typical applications are speech / sound analysis and sound annotation / transcription. WaveSurfer may be extended by plug-ins as well as embedded in other applications.
Features:

  • Customizable – users can create their own configurations. Localization support.
  • Extensible – new functionality can be added through a plugin architecture.
  • Embeddable – WaveSurfer can be used as a widget in custom applications.
  • Transcription file formats – reads, and writes HTK (and MLF), TIMIT, ESPS/Waves+, and Phondat. Support for encodings and Unicode.
  • Multi-platform – Linux, OSX & Windows.
WaveSurfer

sourceforge.net/projects/wavesurfer
en.wikipedia.org/wiki/WaveSurfer

eSpeak ~ Open Source Speech Synthesizer


eSpeak is a compact open source Linux and Windows speech synthesizer for English and other languages. eSpeak uses a “formant synthesis” method. This allows many languages to be provided in a small size. The speech is clear, and can be used at high speeds, but is not as natural or smooth as larger synthesizers which are based on human speech recordings.

eSpeak is available as:

  • A command line program (Linux and Windows) to speak text from a file or from stdin.
  • A shared library version for use by other programs. (On Windows this is a DLL).
  • A SAPI5 version for Windows, so it can be used with screen-readers and other programs that support the Windows SAPI5 interface.
  • eSpeak has been ported to other platforms, including Android, Mac OSX and Solaris.

speak.sourceforge.net

Sweep ~ Unix Audio Editor


Sweep is an audio editor and live playback tool for GNU/Linux, BSD and compatible systems. It supports many music and voice formats including WAV, AIFF, Ogg Vorbis, Speex and mp3, with multichannel editing and LADSPA effects plugins.

sweep_20060117_mini

www.metadecks.org/software/sweep