CLDF dataset derived from Beijing University's "Chinese Dialect Vocabularies" from 1964

lexibank/beidasinitic

Chinese Dialect Vocabularies

Cite the source dataset as

Běijīng Dàxué 北京大学 (1964): Hànyǔ fāngyán cíhuì 汉语方言词汇 [Chinese dialect vocabularies]. Beijing: Wenzi Gaige.

This dataset is licensed under the GPL-3.0 license.

Available online at https://github.com/digling/cddb/

Conceptlists in Concepticon:

Notes

This dataset, which is well known among Sinologists, comprises 18 dialect varieties collected during the 1950s. It was digitized between 2012 and 2016. We offer the data in morpheme-segmented form, with a slightly adjusted IPA transcription.

Statistics

Glottolog: 100%, Concepticon: 79%, Source: 100%, BIPA: 100%, CLTS SoundClass: 100%

  • Varieties: 18
  • Concepts: 905
  • Lexemes: 18,069
  • Synonymy: 1.11
  • Invalid lexemes: 0
  • Tokens: 121,097
  • Segments: 247 (0 BIPA errors, 0 CLTS sound class errors, 247 CLTS modified)
  • Inventory size (avg): 61.06
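
The figures above can be approximated from the CLDF form table with a few lines of Python. The following is a minimal sketch, not the script that produced the official statistics. It assumes the forms are stored in a CSV file (the path `cldf/forms.csv` is an assumption) with the standard CLDF columns `Language_ID`, `Parameter_ID` and `Segments`, that `Segments` is space-separated, and that `+` marks morpheme boundaries. The helper names `read_forms` and `summarize_forms` are hypothetical, and synonymy is taken here to mean the average number of lexemes per attested variety/concept pair, which may differ from the definition used for the numbers above.

```python
import csv
from collections import defaultdict


def read_forms(path):
    """Yield one dict per row of a CLDF form table stored as CSV."""
    with open(path, newline="", encoding="utf-8") as handle:
        yield from csv.DictReader(handle)


def summarize_forms(rows):
    """Compute simple summary statistics from CLDF form rows (dicts)."""
    varieties = set()
    concepts = set()
    slots = defaultdict(int)  # (variety, concept) -> number of lexemes
    segments = set()
    lexemes = 0
    tokens = 0
    for row in rows:
        lexemes += 1
        varieties.add(row["Language_ID"])
        concepts.add(row["Parameter_ID"])
        slots[(row["Language_ID"], row["Parameter_ID"])] += 1
        # Assumption: "+" is a morpheme boundary marker, not a sound segment.
        segs = [s for s in (row.get("Segments") or "").split() if s != "+"]
        tokens += len(segs)
        segments.update(segs)
    return {
        "varieties": len(varieties),
        "concepts": len(concepts),
        "lexemes": lexemes,
        "synonymy": round(lexemes / len(slots), 2) if slots else 0.0,
        "tokens": tokens,
        "segments": len(segments),
    }


if __name__ == "__main__":
    # Assumed location of the form table inside a checkout of this repository.
    print(summarize_forms(read_forms("cldf/forms.csv")))
```

Because `summarize_forms` accepts any iterable of dicts, it can also be run on a filtered subset of rows, for example the forms of a single variety.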