F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching
Paper โข 2410.06885 โข Published โข 47
F5-TTS finetune on all formosan data (ithuan, fb ilrdf dict, klokah) without samples only one word or no translation, using ipa as input.
g2p from this repo.
please refer source repo
Base model
SWivid/F5-TTS