Skip to content
/ saga Public

SAGA - Phonetic transcription software for all Spanish variants.

Notifications You must be signed in to change notification settings

TALP-UPC/saga

Folders and files

NameName
Last commit message
Last commit date

Latest commit

2677c91 · Nov 12, 2020
Aug 2, 2017
Mar 28, 2017
Apr 1, 2017
Aug 2, 2017
Apr 1, 2017
Aug 2, 2017
Apr 22, 2017
Aug 2, 2017
Apr 6, 2017
Aug 2, 2017
Aug 2, 2017
Aug 2, 2017
Aug 2, 2017
Jul 22, 2017
Aug 31, 2016
Aug 2, 2017
Aug 9, 2016
Nov 12, 2020
Oct 16, 2016

Repository files navigation

SAGA

Para leer la documentación en español consulte el fichero [README.es.md].

SAGA is an automatic tool for grapheme-to-allophone trasncription in Spanish, considering its multiple dialectal variants. It is distributed under the GNU General Public License (http://www.gnu.org/licenses/gpl-3.0.txt).

Obtain saga:

From git only:

git clone https://github.com/TALP-UPC/saga.git
cd saga

From a downloaded tarball only

tar xzf saga.tgz
cd saga

Installation using meson

Install meson if not available:

# From a Debian-like system
# sudo apt install meson
# Any system with python3 and pip3:
# pip3 --user install meson

Build:

meson --prefix="/$HOME/saga" builddir && cd builddir
ninja test
ninja install # may require sudo, depending on the prefix

Installation (using autotools)

(The autotools build system for saga is deprecated and will be removed in future versions)

Typically you will want to install SAGA to a custom directory, use --prefix= to specify the directory where Saga will be installed.

./configure --prefix="$HOME/saga"
make
make install

Usage

SAGA is able to cover several grapheme-to-allophone transcriptions, covering some of the Spanish dialectal variants:

  • The canonical Spanish from Spain, as stablished in the SAM-A project
  • The latinoamerica dialectal variants defined in the SALA (SpeechDat Across Latin-America) project.

The saga -L argument controls the supported dialectal variants.

Modifying SAGA

The dicc directory contains the dictionaries that change SAGA behaviour for the Spanish variants. These dictionaries can be modified to customize the phonetic transcriptions as desired. Each of the dicc directories contains a README file, describing the additional SAGA modifiers that are used for that dialect.

SAGA source code is available under the src directory.

Using SAGA as a library

In test/saga_example.c there is an example on how to use SAGA in an external program.

If Saga has been installed properly, pkg-config should be able to show the necessary compiler flags:

  • pkg-config --cflags saga
  • pkg-config --libs saga

Further documentation

In the doc directory the following documents are available:

  • SAGA user manual (SAGA_man)
  • Phonetic transcrition rule documentation (Saga_Reglas_Trans)
  • Publications describing SAGA transcriptions:
    • J. Llisterri, José B. Mariño, "Spanish adaptation of SAMPA and automatic phonetic transcription", Esprit project 6819. Report SAM-A/UPC /001/V1 (Feb. 1993). (SAMPA-Spanish-93)
    • A. Moreno, José B. Mariño, "Spanish dialects: phonetic transcription", Proc. ICSLP'98, pp. 189-192, Sydney, Australia (Nov. 1998). (SpanishDialectalVariants-ICSLP98)

Main Authors

  • José B. Mariño, Polytechnic University of Catalonia -- BarcelonaTech
  • Albino Nogueiras Rodríguez, Polytechnic University of Catalonia -- BarcelonaTech

Contributors

  • Sergio Oller Moreno took care of adding autotools support, update documentation, write the SagaEngine API, provide pkg-config support and include functions that replace their POSIX versions for portability.