Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upGitHub is where the world builds software
Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world.
🇨🇳 Chinese TTS now available 😘 #201
Comments
|
Hello. Thanks for your great work! I'm new to the TTS area and this notebook could be a good start. |
|
cc: @azraelkuan (person in charge) |
|
@dathudeptrai Does it already in master branch for support? |
|
@jinfagang everything is on master branch. (updated content :D.) |
|
@dathudeptrai Any readme on how to train on Biaobei data? |
|
@jinfagang jsut download biaobei data and extract it to
and train it using baker's yaml. |
|
@azraelkuan As I know the sampling rate of the audio in the Biaobei dataset is 48k, but in the baker_preprocess.yaml the sampling rate is set as 24k. I didn't try the preprocess. Is that a mistake or I misunderstand the code? |
|
@IreneZhou2018 the sampling rate in config is target sampling rate, if the dataset's sample rate is 48k so we re-sample it (see code here https://github.com/TensorSpeech/TensorFlowTTS/blob/master/tensorflow_tts/bin/preprocess.py#L194-L196) |
|
@dathudeptrai ok, thanks for your reply and the work is amazing! |
|
TensorflowTTS训练数据生成:拉取文本数据、将文本专为拼音、基于阿里云TTS生成TensorflowTTS训练音频、训练前的preprocess/normalize:https://github.com/MachineLP/TensorFlowTTS_chinese/tree/master/generate_tts_data |
Hi, do you have a more specific Readme? It seems promising and I'd like to have to try :) |
|
|
it seems that fastspeech2 model do not work properly when sentence is long ? like 君不见 黄河之水天上来 奔流到海不复回 君不见 高堂明镜悲白发 朝如青丝暮成雪 人生得意须尽欢 莫使金樽空对月 |
|
|
Thank you . I am still evaluating the model . |
|
Chinese TTS欢迎加微信:lp9628,进入微信群讨论训练测试细节问题。 |
|
@dathudeptrai I try to train get this error:
My command:
|
|
hello, the link to baker dataset was expired , and the officical website show that have no right to access the dataset , i hate to say that ,but can you provide annother way to get the dataset? |
404 error ,you can acess the website ? i got 404 not found |
|
@jucaowei |
i already use VPN with HK node,not working, but my friend access the website right now, really appreciate for you reply |
|
@azraelkuan Hi! Thanks for your work. Compared with some other reproduction projects, your reproduced tacotron2 can synthesize very long sentences without stress or omission. |
|
no, this tacotron is implemented by the author of this project, i just use
cn dataset to train it.
may be i use the phoneme as the input, and each phoneme has a token #0
after it.
yuze <notifications@github.com> 于2020年11月7日周六 下午2:32写道:
… @azraelkuan <https://github.com/azraelkuan> Hi! Thanks for your work.
Compared with some other reproduction projects, your reproduced tacotron2
can synthesize very long sentences without stress or omission. Have you
done any special treatment of long sentences?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#201 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AFSMBODUYYMWSRSPLUYJQ4LSOTSZJANCNFSM4P3B4TPA>
.
--
Kuan Chen (陈宽)
Speech Lab, Shanghai Jiao Tong University
Tel: +86 17621207116
|

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.

Chinese TTS now available, thank @azraelkuan for his support :D. The model used Baker dataset here (https://www.data-baker.com/open_source.htmlt). The pretrained model licensed under CC BY-NC-SA 4.0 (https://creativecommons.org/licenses/by-nc-sa/4.0/) since the dataset is non-commercial :D
Pls check out the colab bellow and enjoy :D.
https://colab.research.google.com/drive/1YpSHRBRPBI7cnTkQn1UcVTWEQVbsUm1S?usp=sharing
Note: this is just init results, there are more things can be done to make the model better.
cc: @candlewill @l4zyf9x @machineko