Cite the source dataset as
Běijīng Dàxué 北京大学 (1964): Hànyǔ fāngyán cíhuì 汉语方言词汇 [Chinese dialect vocabularies]. Beijing: Wenzi Gaige.
This dataset is licensed under the GPL-3.0 license.
Available online at https://github.com/digling/cddb/
Conceptlists in Concepticon:
This dataset, which is well known among Sinologists, comprises 18 dialect varieties. The data were collected during the 1950s and digitized during 2012 and 2016. We offer the data in morpheme-segmented form, with a slightly adjusted IPA transcription.
- Varieties: 18
- Concepts: 905
- Lexemes: 18,069
- Synonymy: 1.11
- Invalid lexemes: 0
- Tokens: 121,097
- Segments: 247 (0 BIPA errors, 0 CLTS sound class errors, 247 CLTS modified)
- Inventory size (avg): 61.06
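
The synonymy figure in the list above can be related to the other counts with a small calculation. The following is a minimal sketch, assuming that synonymy means the mean number of lexemes per attested (variety, concept) pair; this summary does not spell out that definition. The function name `synonymy` and the toy rows are hypothetical and are not taken from the dataset.

```python
from collections import Counter


def synonymy(forms):
    """Mean number of lexemes per attested (variety, concept) pair.

    `forms` is an iterable of (variety, concept, form) triples.
    Assumption: this is the definition behind the reported figure.
    """
    counts = Counter((variety, concept) for variety, concept, _ in forms)
    return sum(counts.values()) / len(counts)


# Hypothetical placeholder rows, not real entries from the dataset.
toy_forms = [
    ("VarietyA", "SUN", "form-1"),
    ("VarietyA", "SUN", "form-2"),  # a second lexeme for the same concept
    ("VarietyA", "MOON", "form-3"),
    ("VarietyB", "SUN", "form-4"),
]

toy_value = synonymy(toy_forms)  # 4 lexemes over 3 attested pairs

# Cross-check against the reported totals, assuming every variety
# attests every concept (full coverage is an assumption here).
reported_estimate = round(18069 / (18 * 905), 2)
```

Under the full-coverage assumption, 18,069 lexemes divided by 18 × 905 variety-concept pairs rounds to 1.11, which agrees with the reported synonymy. If some concepts are missing for some varieties, the denominator would be smaller and the actual definition may differ from this sketch.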