What are some good losses' values when fine tuning XTTS v2? #3775

Jorvan758 · 2024-06-03T16:38:49Z


I recently came across this tutorial and decided to give XTTS a try (running the code in Colab instead of relying on a third-party app). I managed to generate a dataset with my own voice, following the LJSpeech format, and had no major problems running the base recipe for XTTS v2. However, I have no clue what could be considered a good value for avg_loader_time, avg_loss_text_ce, avg_loss_mel_ce, or avg_loss in this case (it seems this hasn't really been discussed yet). I think I checked most of the few issues and discussions where other people shared these losses, but I wasn't able to draw any meaningful conclusions.
Would you be kind enough to share some of your knowledge/experience with me, pls?

Replies: 0 comments

Select a reply

What are some good losses' values when fine tuning XTTS v2? #3775

Jorvan758 Jun 3, 2024

Replies: 0 comments

Jorvan758
Jun 3, 2024