The '''PCVC (Persian Consonant Vowel Combination) Speech Dataset''' is a [[Modern Persian]] [[speech corpus]] for [[speech recognition]] and also [[speaker recognition]]. The dataset contains sound samples of [[Modern Persian]] combination of [[vowel]] and [[consonant]] phonemes from different speakers. Every sound sample contains just one consonant and one vowel So it is somehow labeled in phoneme level. This dataset containsconsists of 23 Persian consonants and 6 vowels. The sound samples are all possible combinations of vowels and consonants (138 samples for each speaker). The sample rate of all speech samples is 48000 which means there are 48000 sound samples in every 1 second. Every sound sample isstarts 276with seconds(138consonant twothen secondscontinues samples)with vowel. In each 2s sample, in average, 0.5 second of each sample is speech and the rest is silence. In eachEach sound sample 0.25sends ofwith start and 0.25s of end of it is surely scilencesilence.<ref>{{Cite journal|last1=Malekzadeh|first1=Saber MalekzadeH, |last2=Gholizadeh|first2=Mohammad Hossein Gholizadeh, Seyed|last3=Razavi|first3=Seyyed Naser Razavi {{cite paper |title=Full Persian Vowelphonemes recognition withusing MFCCPPNet|journal=Journal andof ANNSignal on PCVC speech datasetProcessing Systems|urlyear=http://bayanbox2018|arxiv=1812.ir08600|doi=10.13140/download/2723849504007807268/Full-Persian-Vowel-recognition-with-MFCC-and-ANN-on-PCVC-speech-datasetRG.pdf2.2.34836.96647|s2cid=214612057 }}</ref><ref>Malekzadeh, 5th InternationalS., conference of electrical engineeringGholizadeh, computer scienceM.H. and information technologyRazavi, Iran, TehranS.N., 2018.</ref> AlsoPersian inVowel eachrecognition 2swith firstMFCC consonantand phonemeANN pronouncedon andPCVC thenspeech voweldataset. is''arXiv preprint arXiv:1812.06953''.</ref> All of sound samples are denoised with "Adaptive noise reduction" algorithm.<ref>{{cite paperweb |title=PCVC GitHubKaggle page |url=https://githubwww.kaggle.com/S-Maleksabermalek/PCVCpcvcspeech/home }}</ref>
Compared to Farsdat speech dataset<ref>Bijankhan, M., Sheikhzadegan, J., Roohani, M. R., Samareh, Y., Lucas, C., & Tebyani, M. (1994). FARSDAT-The Speech Database of Farsi Spoken Language. The Proceedings of the Australian Conference on Speech Science and Technology (Vol. 2, pp. 826–831).</ref> and Persian Speechspeech Corpuscorpus<ref>Halabi, Nawar (2016). Modern Standard Persian Phonetics for Speech Synthesis. University of Southampton, School of Electronics and Computer Science.</ref> it is more easy to use because it is prepared in .mat data files.<ref>{{cite paperweb |title= Access and change variables directly in MAT-files, without loading into memory. |url=https://uk.mathworks.com/help/matlab/ref/matfile.html }}</ref> Also it is more based on phoneme based separation and alsoall itsamples isare denoised.
==Contents==
The corpus is downloadable from its GitHubKaggle web page, and contains the following:
* .mat data files of sound samples in a 23*6*30000 matrix, in which 23 is number of consonants, 6 is the number of vowels and 30000 is the length of 2s sound sample.
==See also==
==External links==
* [https://githubwww.kaggle.com/S-Maleksabermalek/pcvcspeech/PCVChome The GitHubKaggle page of PCVC speech dataset]
* [https://www.researchgate.net/publication/322298311_Full_Persian_Vowel_recognition_with_MFCC_and_ANN_on_PCVC_speech_dataset PCVC Paper on ResearchGate]
{{Corpus linguistics}}
[[:Category:CorporaDatasets in machine learning]]
[[:Category:Datasets in machineSpeech learningrecognition]]
[[:Category:PersianSpeaker languagerecognition]]
[[Category:Persian language]]
[[Category:Speech synthesis]]
|