X-SAMPA
The Extended Speech Assessment Methods Phonetic Alphabet (X-SAMPA) is a variant of SAMPA developed in 1995 by John C. Wells, professor of phonetics at the University of London.[1] It is designed to unify the individual language SAMPA alphabets, and extend SAMPA to cover the entire range of characters in the International Phonetic Alphabet (IPA). The result is a SAMPA-inspired remapping of the IPA into 7-bit ASCII.
SAMPA was devised as a hack to work around the inability of text encodings to represent IPA symbols. Later, as Unicode support for IPA symbols became more widespread, the necessity for a separate, computer-readable system for representing the IPA in ASCII decreased. However, X-SAMPA is still useful as the basis for an input method for true IPA.
Summary
Notes
- The IPA symbols that are ordinary lower-case letters have the same value in X-SAMPA as they do in the IPA.
- X-SAMPA uses backslashes as modifying suffixes to create new symbols. For example, O is a distinct sound from O\, to which it bears no relation. Such use of the backslash character can be a problem, since many programs interpret it as an escape character for the character following it. For example, you cannot use such X-SAMPA symbols in EMU, therefore you need to replace backslash with some other symbol (e.g. an asterisk: '*') when adding phonemic transcription to an EMU speech database.
- X-SAMPA diacritics follow the symbols they modify. Except for ~ for nasalization, = for syllabicity, and ` for retroflexion and rhotacization, diacritics are joined to the character with the underscore character _.
- The underscore character is also used to encode the IPA tiebar.
- The numbers _1 to _6 are reserved diacritics as shorthand for language-specific tone numbers.
Lower case symbols
| X-SAMPA | IPA | IPA image | Description | Examples |
|---|---|---|---|---|
| a | a | | open front unrounded vowel | French dame [dam], Spanish padre ["paD4e] |
| b | b | | voiced bilabial plosive | English bed [bEd], French bon [bO~] |
| b_< | ɓ | | voiced bilabial implosive | Sindhi ɓarʊ [b_<arU] |
| c | c | | voiceless palatal plosive | Hungarian latyak ["lQcQk] |
| d | d | | voiced alveolar plosive | English dig [dIg], French doigt [dwa] |
| d` | ɖ | | voiced retroflex plosive | Swedish hord [hu:d`] |
| d_< | ɗ | | voiced alveolar implosive | Sindhi ɗarʊ [d_<arU] |
| e | e | | close-mid front unrounded vowel | French ses [se], American English mate [met] |
| f | f | | voiceless labiodental fricative | English five [faIv], French femme [fam] |
| g | a | | voiced velar plosive | English game [geIm], French longue [lO~g] |
| g_< | ɠ | | voiced velar implosive | Sindhi ɠəro [g_<@ro] |
| h | h | | voiceless glottal fricative | English house [haUs] |
| h\ | f | | voiced glottal fricative | Czech hrad [h\rat] |
| i | i | | close front unrounded vowel | English be [bi:], French oui [wi], Spanish si [si] |
| j | j | | palatal approximant | English yes [jEs], French yeux [j2] |
| j\ | ʝ | | voiced palatal fricative | Greek γειά [j\a] |
| k | k | | voiceless velar plosive | English scat [sk{t], Spanish carro ["kar:o] |
| l | l | | alveolar lateral approximant | English lay [leI], French mal [mal] |
| l` | ɭ | | retroflex lateral approximant | Svealand Swedish sorl [so:l`] |
| l\ | ɺ | | alveolar lateral flap | Japanese rakuten [l\akM_0teN\] |
| m | ɭ | | bilabial nasal | English mouse [maUs], French homme [Om] |
| n | n | | alveolar nasal | English nap [n{p], French non [nO~] |
| n` | ɳ | | retroflex nasal | Swedish hörn [h2:n`] |
| o | o | | close-mid back rounded vowel | French gros [gRo] |
| p | p | | voiceless bilabial plosive | English speak [spik], French pose [poz], Spanish perro ["per:o] |
| p\ | ɸ | | voiceless bilabial fricative | Japanese fuku [p\M_0kM] |
| q | q | | voiceless uvular plosive | Arabic qasbah ["qQs_Gba] |
| r | r | | alveolar trill | Spanish perro ["per:o] |
| r` | ɽ | | retroflex flap | |
| r\ | ɹ | | alveolar approximant | English red [r\Ed] |
| r\` | ɻ | | retroflex approximant | Malayalam വഴി ["v@r\`i] |
| s | ɳ | | voiceless alveolar fricative | English seem [si:m], French session [se"sjO~] |
| s` | ʂ | | voiceless retroflex fricative | Swedish mars [mas`] |
| s\ | ɕ | | voiceless alveolo-palatal fricative | Polish świerszcz [s\v'erStS] |
| t | t | | voiceless alveolar plosive | English stew [stju:], French raté [Ra"te], Spanish tuyo ["tujo] |
| t` | ʈ | | voiceless retroflex plosive | Swedish mört [m2t`] |
| u | u | | close back rounded vowel | English boom [bu:m], Spanish su [su] |
| v | v | | voiced labiodental fricative | English vest [vEst], French voix [vwa] |
| v\ (or P) | ʋ | | labiodental approximant | Dutch west [v\Est]/[PEst] |
| w | w | | labial-velar approximant | English west [wEst], French oui [wi] |
| x | ɸ | | voiceless velar fricative | Scots loch [lOx] or [5Ox]; German Buch, Dach; Spanish caja, gestión |
| x\ | ɧ | | voiceless palatal-velar fricative | Swedish sjal [x\A:l] |
| y | ɹ | | close front rounded vowel | French tu [ty] German über ["y:b6] |
| z | ɺ | | voiced alveolar fricative | English zoo [zu:], French azote [a"zOt] |
| z` | ʐ | | voiced retroflex fricative | Mandarin Chinese rang [z`aN] |
| z\ | ʑ | | voiced alveolo-palatal fricative | Polish źrebak ["z\rEbak] |
Capital symbols
| X-SAMPA | IPA | IPA image | Description | Example |
|---|---|---|---|---|
| A | ɑ | | open back unrounded vowel | English father ["fA:D@(r\)] (RP and Gen.Am.) |
| B | β | | voiced bilabial fricative | Spanish lavar [la"Ba4] |
| B\ | ʙ | | bilabial trill | Reminiscent of shivering ("brrr") |
| C | ç | | voiceless palatal fricative | German ich [IC], English human ["Cjum@n] (broad transcription uses [hj-]) |
| D | ð | | voiced dental fricative | English then [DEn] |
| E | ɛ | | open-mid front unrounded vowel | French même [mEm], English met [mEt] (RP and Gen.Am.) |
| F | q | | labiodental nasal | English emphasis ["EFf@sIs] (spoken quickly, otherwise uses [Emf-]) |
| G | c | | voiced velar fricative | Greek γωνία [Go"nia], Danish vælge ["vElG@] |
| G\ | b | | voiced uvular plosive | Inuktitut nirivvik [niG\ivvik] |
| G\_< | ʛ | | voiced uvular implosive | Mam ʛa [G\_<a] |
| H | e | | labial-palatal approximant | French huit [Hit] |
| H\ | ʜ | | voiceless epiglottal fricative | |
| I | j | | near-close near-front unrounded vowel | English kit [kIt] |
| I\ | ɻ or ɪ̈ | near-close central unrounded vowel | Polish ryba [rI\bA] | |
| J | r | | palatal nasal | Spanish año ["aJo], English canyon ["k{J@n] (broad transcription uses [-nj-]) |
| J\ | ɟ | | voiced palatal plosive | Hungarian egy [EJ\] |
| J\_< | ʄ | | voiced palatal implosive | Sindhi ʄaro [J\_<aro] |
| K | l | | voiceless alveolar lateral fricative | Welsh llaw [KaU] |
| K\ | n | | voiced alveolar lateral fricative | |
| L | ʎ | | palatal lateral approximant | Italian famiglia [fa"miLLa], Castilian llamar [La"mar], English million ["mIL@n] (broad transcription uses [-lj-]) |
| L\ | ʟ | | velar lateral approximant | |
| M | o | | close back unrounded vowel | Korean 으 (eu) |
| M\ | p | | velar approximant | Spanish fuego ["fweM\o] |
| N | ŋ | | velar nasal | English thing [TIN] |
| N\ | t | | uvular nasal | Japanese san [saN\] |
| O | ɔ | | open-mid back rounded vowel | RP thought [TO:t], American English off [O:f] |
| O\ | ʘ | | bilabial click | |
| P (or v\) | ʋ | | labiodental approximant | Dutch west [PEst]/[v\Est], allophone of English phoneme /r\/ |
| Q | ɒ | | open back rounded vowel | RP lot [lQt] |
| R | ʁ | | voiced uvular fricative | German rein [RaIn] |
| R\ | ʀ | | uvular trill | French roi [R\wa] |
| S | ʃ | | voiceless postalveolar fricative | English ship [SIp] |
| T | θ | | voiceless dental fricative | English thin [TIn] |
| U | ʊ | | near-close near-back rounded vowel | English foot [fUt] |
| U\ | ᵿ or ʊ̈ | | near-close central rounded vowel | English euphoria [jU\"fO@r\i@] |
| V | ʌ | | open-mid back unrounded vowel | RP English strut [str\Vt] |
| W | ʍ | | voiceless labial-velar fricative | Scots when [WEn] |
| X | χ | | voiceless uvular fricative | Klallam sχaʔqʷaʔ [sXa?q_wa?] |
| X\ | ħ | | voiceless pharyngeal fricative | Arabic <ح>ha’ [X\A:] |
| Y | ʏ | | near-close near-front rounded vowel | German hübsch [hYpS] |
| Z | ʒ | | voiced postalveolar fricative | English vision ["vIZ@n] |
Other symbols
| X-SAMPA | IPA | IPA image | Description | Example |
|---|---|---|---|---|
| . | . | | syllable break | |
| " | ˈ | | primary stress | |
| % | ˌ | | secondary stress | |
| ' (or _j) | β | | palatalized | |
| : | ː | | long | |
| :\ | ˑ | | half long | Estonian differentiates three vowel lengths |
| - | separator | Polish trzy [t-S1] vs. czy [tS1] (affricate) | ||
| @ | ə | | schwa | English arena [@"r\i:n@] |
| @\ | ɘ | | close-mid central unrounded vowel | Paicĩ kɘ̄ɾɘ [k@\_M4@\_M] |
| { | æ | | near-open front unrounded vowel | English trap [tr\{p] |
| } | ʉ | | close central rounded vowel | Swedish sju [x\}:]; AuE/NZE boot [b}:t] |
| 1 | h | | close central unrounded vowel | Welsh tu [t1], American English rose's ["r\oUz1z] |
| 2 | ø | | close-mid front rounded vowel | Danish købe ["k2:b@], French deux [d2] |
| 3 | ɜ | | open-mid central unrounded vowel | English nurse [n3:s] (RP) or [n3`s] (Gen.Am.) |
| 3\ | ɞ | | open-mid central rounded vowel | Irish tomhail [t3\:l'] |
| 4 | ɾ | | alveolar flap | Spanish pero ["pe4o], American English better ["bE4@`] |
| 5 | k | | velarized alveolar lateral approximant; also see _e | English milk [mI5k], Portuguese livro ["5iv4u] |
| 6 | ɐ | | near-open central vowel | German besser ["bEs6], Australian English mud [m6d] |
| 7 | d | | close-mid back unrounded vowel | Estonian kõik [k7ik], Vietnamese mơ [m7_M] |
| 8 | u | | close-mid central rounded vowel | Swedish buss [b8s] |
| 9 | ɓ | | open-mid front rounded vowel | French neuf [n9f], Danish drømme [dR9m@] |
| & | v | | open front rounded vowel | Swedish skörd [x\&d`] |
| ? | ʔ | | glottal stop | Danish stød [sd2?], Cockney English bottle ["bQ?l] |
| ?\ | ʕ | | voiced pharyngeal fricative | Arabic ع (`ayn) [?\Ajn] |
| * | undefined escape character, SAMPA's "conjunctor" | |||
| / | indeterminacy in French vowels | |||
| < | begin nonsegmental notation (e.g., SAMPROSA) | |||
| <\ | ʢ | | voiced epiglottal fricative | |
| > | end nonsegmental notation | |||
| >\ | ʡ | | epiglottal plosive | |
| ^ | ꜛ | | upstep | |
| ! | ꜜ | | downstep | |
| !\ | ǃ | | postalveolar click | |
| | | | | | minor (foot) group | |
| |\ | ǀ | | dental click | |
| || | ‖ | | major (intonation) group | |
| |\|\ | ǁ | | alveolar lateral click | |
| =\ | ǂ | | palatal click | |
| -\ | ‿ | | linking mark |
Diacritics
| X-SAMPA | IPA | IPA image | Description |
|---|---|---|---|
| _" | ̈ | | centralized |
| _+ | ̟ | | advanced |
| _- | ̠ | | retracted |
| _/ | ̌ | | rising tone |
| _0 | ̥ | | voiceless |
| _< | implosive (IPA uses separate symbols for implosives) | ||
| = (or _=) | ̩ | | syllabic |
| _> | ʼ | | ejective |
| _?\ | ˤ | | pharyngealized |
| _\ | ̂ | falling tone | |
| _^ | ̯ | | non-syllabic |
| _} | ̚ | | no audible release |
| ` | ˞ | | rhotacization in vowels, retroflexion in consonants (IPA uses separate symbols for consonants, see t` for an example) |
| ~ (or _~) | ̃ | | nasalization |
| _A | ̘ | | advanced tongue root |
| _a | ̺ | | apical |
| _B | ̏ | | extra low tone |
| _B_L | ᷅ | | low rising tone |
| _c | ̜ | | less rounded |
| _d | ̪ | | dental |
| _e | ̴ | | velarized or pharyngealized; also see 5 |
| <F> | ↘ | | global fall |
| _F | ̂ | | falling tone |
| _G | ˠ | | velarized |
| _H | ́ | | high tone |
| _H_T | ᷄ | | high rising tone |
| _h | ʰ | | aspirated |
| _j (or ') | β | | palatalized |
| _k | ̰ | | creaky voice |
| _L | ̀ | | low tone |
| _l | ˡ | | lateral release |
| _M | ̄ | | mid tone |
| _m | ̻ | | laminal |
| _N | ̼ | | linguolabial |
| _n | ᵿ | | nasal release |
| _O | ̹ | | more rounded |
| _o | ̞ | | lowered |
| _q | ̙ | | retracted tongue root |
| <R> | ↗ | | global rise |
| _R | ̌ | | rising tone |
| _R_F | ᷈ | | rising falling tone |
| _r | ̝ | | raised |
| _T | ̋ | | extra high tone |
| _t | ̤ | | breathy voice |
| _v | ̬ | | voiced |
| _w | ʷ | | labialized |
| _X | ̆ | | extra-short |
| _x | ̽ | | mid-centralized |
Charts
Consonants
| Consonants (pulmonic) | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Place of articulation → | Labial | Coronal | Dorsal | Laryngeal | |||||||||||||
| Manner of articulation ↓ | Bilabial | Labio‐ dental |
Dental | Alveolar | Post‐ alveolar |
Retro‐ flex |
Palatal | Velar | Uvular | Pharyn‐ geal |
Epi‐ glottal |
Glottal | |||||
| Nasal | m | F | n | n` | J | N | N\ | ||||||||||
| Plosive | p b | p_d b_d | t d | t` d` | c J\ | k g | q G\ | >\ | ? | ||||||||
| Fricative | p\ B | f v | T D | s z | S Z | s` z` | C j\ | x G | X | R | X\ | ?\ | H\ | <\ | h h\ | ||
| Approximant | B_o | v\ | r\ | r\` | j | M\ | |||||||||||
| Trill | B\ | r | * | R\ | * | ||||||||||||
| Tap or Flap | *† | *† | 4 | r` | * | ||||||||||||
| Lateral Fricative | K K\ | * | * | * | |||||||||||||
| Lateral Approximant | l | l` | L | L\ | |||||||||||||
| Lateral Flap | l\ | * | * | * | |||||||||||||
- Daggers (†) mark IPA symbols that have recently been added to Unicode. Since April 2008, this is the case of the labiodental flap, symbolized by a right-hook v in the IPA:
A dedicated symbol for the labiodental flap does not yet exist in X-SAMPA.
| Coarticulated | |
|---|---|
| W | Voiceless labialized velar approximant |
| w | Voiced labialized velar approximant |
| H | Voiced labialized palatal approximant |
| s\ | Voiceless palatalized postalveolar (alveolo-palatal) fricative |
| z\ | Voiced palatalized postalveolar (alveolo-palatal) fricative |
| x\ | Voiceless "palatal-velar" fricative |
| Affricates and double articulation | |
|---|---|
| ts | voiceless alveolar affricate |
| dz | voiced alveolar affricate |
| tS | voiceless postalveolar affricate |
| dZ | voiced postalveolar affricate |
| ts\ | voiceless alveolo-palatal affricate |
| dz\ | voiced alveolo-palatal affricate |
| tK | voiceless alveolar lateral affricate |
| kp | voiceless labial-velar plosive |
| gb | voiced labial-velar plosive |
| Nm | labial-velar nasal stop |
| Consonants (non-pulmonic) | |||||
|---|---|---|---|---|---|
| Clicks | Implosives | Ejectives | |||
| O\ | Bilabial | b_< | Bilabial | _> | For example: |
| |\ | Laminal alveolar ("dental") | d_< | Alveolar | p_> | Bilabial |
| ǃ\ | Apical (post-) alveolar ("retroflex") | J\_< | Palatal | t_> | Alveolar |
| =\ | Laminal postalveolar ("palatal") | g_< | Velar | k_> | Velar |
| |\|\ | Lateral coronal ("lateral") | G\_< | Uvular | s_> | Alveolar fricative |
Vowels
See also
- International Phonetic Alphabet (IPA)
- International Phonetic Alphabet for English
- Kirshenbaum and WorldBet, similar systems.
- List of phonetics topics
- SAMPA, a language-specific predecessor of X-SAMPA.
- SAMPA chart for English
References
- ↑ Wells, J.C. "Computer-coding the IPA: a proposed extension of SAMPA" (PDF). UCL Phonetics and Linguistics. University College London. Retrieved 16 March 2016.
External links
- Computer-coding the IPA: A proposed extension of SAMPA
- Translate English texts into IPA phonetics with PhoTransEdit. This free software tool allows to export transcriptions to X-SAMPA.
- Online converter between IPA and X-Sampa
- Web-based translator for X-SAMPA documents. Produces Unicode text, XML text, PostScript, PDF, or LaTeX TIPA.
- Z-SAMPA, an extension of X-SAMPA sometimes used for conlangs
- Web-based X-SAMPA to IPA Converter
