xVASynth TTS

community

https://www.nexusmods.com/skyrimspecialedition/mods/44184?tab=description


AI & ML interests

xVASynth = Voice Actor Synthesis TTS

xVA-TTS's activity

Pendrokar

posted an update 24 days ago

Post

2162

TTS: Added both Zonos model Spaces to the Arena Fork:
🏆 Pendrokar/TTS-Spaces-Arena

Zonos 🌍: Steveeeeeeen/Zonos

IMHO, while Zonos still has fewer capabilities than xVASynth, I'm giving it the first row in the TTS capabilities table due to its IPA support and instant (zero-shot) voice cloning:
1️⃣ Pendrokar/open_tts_tracker

Pendrokar

posted an update about 1 month ago

Post

3316

TTS: Added Kokoro v1, Parler Large, LlaSa 3B & MARS 6 TTS models to the Arena.
Pendrokar/TTS-Spaces-Arena

I had also added MaskGCT, GPT-SoVITS & OuteTTS a month ago. The OuteTTS devs did say that it is too early for it to be added to TTS Arenas.

MARS 5 does have a Space with open-weight models, but inference is way too slow (over 2 minutes).

2 replies

Pendrokar

posted an update 4 months ago

Post

1053

TTS: Sorry, I just cannot get the hype behind F5 TTS. It has now gathered a thousand votes in the TTS Arena fork and **has remained in the #8 spot** against its _mostly_ open TTS adversaries.

The voice sample used is the same as the one for XTTS. F5 has so far been unstable: it sounds unemotional/monotone/depressed and mispronounces words (_awestruck_).

If you have suggestions, please give feedback in the following thread:
mrfakename/E2-F5-TTS#32

6 replies

Pendrokar

posted an update 4 months ago

Post

1383

Added @amphion's MaskGCT & @hexgrad's fine-tuned StyleTTS model, named Kokoro, to the forked TTS Arena Space. If the preliminary results hold up, these two may end up in the top 5 of all TTS models. 🤞️🍀️

Pendrokar/TTS-Spaces-Arena
Svngoku/maskgct-audio-lab
hexgrad/Kokoro-TTS

I chose @Svngoku's forked HF Space over amphion's because the latter requests an overly high ZeroGPU duration: 300s!

amphion/maskgct

Had to remove @mrfakename's MetaVoice-1B Space from the available models as that Space has been down for quite some time. 🤕️

mrfakename/MetaVoice-1B-v0.1

I'm close to syncing the fork's code with the original Arena's code structure. After that, I'd like to use ASR to validate the generated samples and build synthetic public datasets from them. Then I'd make the Arena multilingual, which will surely attract quite the crowd!
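A minimal sketch of what that ASR validation could look like, assuming the ASR transcript of each generated sample is already available as text. The function names and the 0.1 threshold are hypothetical and not taken from the Arena's code; the check simply compares the prompt with the transcript by word error rate.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference text must not be empty")
    # Classic dynamic-programming edit distance, computed over words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        cur = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            cur.append(min(prev[j] + 1,          # word missing from transcript
                           cur[j - 1] + 1,       # extra word in transcript
                           prev[j - 1] + cost))  # match or substitution
        prev = cur
    return prev[-1] / len(ref)


def is_valid_sample(prompt: str, transcript: str, max_wer: float = 0.1) -> bool:
    """Hypothetical filter: keep a sample only if the ASR transcript stays close to the prompt."""
    return word_error_rate(prompt, transcript) <= max_wer
```

A mispronunciation that the ASR hears as different words, such as "awestruck" transcribed as "or struck", raises the error rate and would drop the sample from the dataset under this rule.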

1 reply

Pendrokar

updated a collection 4 months ago

xVASynth

Collection

1 item • Updated Oct 30, 2024 • 1

Pendrokar

posted an update 5 months ago

Post

641

What the 🗣🏆 leaderboard of the original TTS Arena merged with the 🤗 Spaces fork would look like. These results are somewhat unreliable, as some models in the list have never been matched against each other. Also, the original TTS Arena used only narration-type sentences.

2 replies

Pendrokar

posted an update 5 months ago

Post

1385

Made a notable change to the TTS Arena fork. I do not think anyone is interested in which bottom-feeder TTS is better than the one ranked next to it. So every challenge now always includes one of the top 5 TTS models, which puts them under more scrutiny. Note that this top 5 is taken from preliminary results.
Pendrokar/TTS-Spaces-Arena
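
The matchmaking rule described above can be sketched roughly as follows. This is an illustration only, not the fork's actual code: `pick_challenge`, its parameters, and the requirement that the input list is already sorted best-first are all assumptions.

```python
import random


def pick_challenge(ranked_models, top_n=5, rng=random):
    """Pick two distinct models so that at least one comes from the top `top_n`.

    `ranked_models` is assumed to be sorted best-first (hypothetical input).
    """
    if len(ranked_models) < 2:
        raise ValueError("need at least two models for a challenge")
    top = ranked_models[:top_n]
    first = rng.choice(top)  # guaranteed top-N contender
    second = rng.choice([m for m in ranked_models if m != first])
    pair = [first, second]
    rng.shuffle(pair)  # do not reveal which side holds the top model
    return tuple(pair)
```

The opponent is drawn from the whole list, so two top-5 models can still face each other; only pairs made up entirely of lower-ranked models are excluded.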


Team members 1
