@@ -20,7 +20,7 @@ A collection of resources for natural language processing. Mostly links to datas
20
20
<br > ** Content** :
21
21
- 24 hours _ english_ by 1 voice
22
22
23
- 3 . VCTK Dataset
23
+ 3 . CSTR VCTK Corpus
24
24
<br > ** Link** : https://datashare.ed.ac.uk/handle/10283/3443
25
25
<br > ** Content** :
26
26
- ~ 400 sentences _ english_ each by 110 voices
@@ -41,6 +41,7 @@ A collection of resources for natural language processing. Mostly links to datas
41
41
<br > ** Content** :
42
42
- extracted from LibriVox (see 4.)
43
43
- ~ 1000 hours _ english_
44
+ - ~ 585 hours in higher quality at https://openslr.org/60/
44
45
45
46
6 . Vox Forge
46
47
<br > ** Link** : http://www.repository.voxforge1.org/downloads/SpeechCorpus/Trunk/
@@ -84,3 +85,74 @@ A collection of resources for natural language processing. Mostly links to datas
84
85
- 1529 sentences _ japanese_
85
86
- 198 sentences _ italian_
86
87
- and many more
88
+
89
+ 11 . Spoken Wikipedia Corpora
90
+ <br > ** Link** : https://nats.gitlab.io/swc/
91
+ <br > ** Content** :
92
+ - spoken wikipedia articles
93
+ - 386 hours _ german_ by 339 voices
94
+ - 395 hours _ english_ by 395 voices
95
+ - 224 hours _ dutch_ by 145 voices
96
+
97
+ 12 . M-AILABS Speech Dataset
98
+ <br > ** Link** : https://www.caito.de/2019/01/the-m-ailabs-speech-dataset/
99
+ <br > ** Content** :
100
+ - mostly extracted from LibriVox (see 4.)
101
+ - 237 hours _ german_
102
+ - 45 hours _ british english_
103
+ - 102 hours _ american english_
104
+ - 108 hours _ spanish_
105
+ - 127 hours _ italian_
106
+ - 87 hours _ ukranian_
107
+ - 46 hours _ russian_
108
+ - 190 hours _ french_
109
+ - 53 hours _ polish_
110
+ - contains _ mixed_ data i.e. female and male speakers
111
+
112
+ 13 . VCTK Noisy Speech Database
113
+ <br > ** Link** : https://datashare.ed.ac.uk/handle/10283/2791
114
+ <br > ** Content** :
115
+ - noisy and clean audio files by up to 56 voices
116
+ - includes written transcripts
117
+ - unknown amount of hours
118
+
119
+ 14 . American English Speech Corpus
120
+ <br > ** Link** : https://www.magicdatatech.com/datasets/mdt-tts-e018-american-english-speech-corpus-for-tts-1631179203
121
+ <br > ** Content** :
122
+ - ~ 2 hours _ american english_ by 1 female voice
123
+
124
+ 15 . American Male Voice Dataset
125
+ <br > ** Link** : https://www.magicdatatech.com/datasets/mdt-tts-e009-american-male-voice-tts-dataset
126
+ <br > ** Content** :
127
+ - 15 hours _ american english_ by 1 male voice
128
+
129
+ 16 . Facebook Vox Populi
130
+ <br > ** Link** : https://github.com/facebookresearch/voxpopuli
131
+ <br > ** Content** :
132
+ - download instructions in README of repository
133
+ - in 16 european languages including _ english_ , _ german_ , _ french_ and _ spanish_
134
+ - 1800 hours transcribed audio by unknown amount of voices
135
+
136
+ 17 . Multilingual Libri Speech
137
+ <br > ** Link** : https://openslr.org/94/
138
+ <br > ** Content** :
139
+ - unclear if transcripts provided
140
+ - extracted from LibriVox (see 4.)
141
+
142
+ 18 . Kensho SPGI Speech
143
+ <br > ** Link** : https://datasets.kensho.com/datasets/spgispeech
144
+ <br > ** Content** :
145
+ - transcribed company earnings calls
146
+ - ~ 5000 hours _ international business english_ by ~ 50000 voices
147
+
148
+ 19 . Free Spoken Digit Dataset
149
+ <br > ** Link** : https://github.com/Jakobovski/free-spoken-digit-dataset
150
+ <br > ** Content** :
151
+ - 3000 recordings _ english_ by 6 voices
152
+ - 50 recordings per digit per voice
153
+
154
+ 20 . Flickr Audio Captions Corpus
155
+ <br > ** Link** : https://groups.csail.mit.edu/sls/downloads/flickraudio/index.cgi
156
+ <br > ** Content** :
157
+ - 40000 spoken image captions _ english_ of 8000 images
158
+ - download original captions here https://www.kaggle.com/adityajn105/flickr8k
0 commit comments