Content deleted Content added
Citation bot (talk | contribs) Add: year. Removed URL that duplicated unique identifier. Removed parameters. | You can use this bot yourself. Report bugs here. | Activated by User:Zppix | Category:Artificial intelligence | via #UCB_Category |
m Bot: Removing category Category:Artificial intelligence, bcoz it's already in Category:Applications of artificial intelligence |
||
(7 intermediate revisions by 6 users not shown) | |||
Line 1:
The '''PCVC (Persian Consonant Vowel Combination) Speech Dataset''' is a [[Modern Persian]] [[speech corpus]] for [[speech recognition]] and also [[speaker recognition]]. The dataset contains sound samples of [[Modern Persian]] combination of [[vowel]] and [[consonant]] phonemes from different speakers. Every sound sample contains just one consonant and one vowel So it is somehow labeled in phoneme level. This dataset
Compared to Farsdat speech dataset<ref>Bijankhan, M., Sheikhzadegan, J., Roohani, M. R., Samareh, Y., Lucas, C., & Tebyani, M. (1994). FARSDAT-The Speech Database of Farsi Spoken Language. The Proceedings of the Australian Conference on Speech Science and Technology (Vol. 2, pp. 826–831).</ref> and Persian speech corpus<ref>Halabi, Nawar (2016). Modern Standard Persian Phonetics for Speech Synthesis. University of Southampton, School of Electronics and Computer Science.</ref> it is more easy to use because it is prepared in .mat data files.<ref>{{cite
==Contents==
The corpus is downloadable from its
* .mat data files of sound samples in a 23*6*30000 matrix, in which 23 is number of consonants, 6 is the number of vowels and 30000 is the length of sound sample.
==See also==
Line 19:
[[Category:Datasets in machine learning]]
[[Category:Speech recognition]]
[[Category:Speaker recognition]]
|