TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5, F5-TTS, ParlerTTS)
rsxdalv.github.io/tts-generation-webui/
Roberts Slisans 8eb7e95e89 Merge pull request #213 from cdrini/minify-png | 11 months ago | |
---|---|---|
.github | 1 year ago | |
.vscode | 1 year ago | |
public | 1 year ago | |
screenshots | 1 year ago | |
src | 1 year ago | |
.eslintrc | 1 year ago | |
.eslintrc.json | 1 year ago | |
.gitignore | 1 year ago | |
LICENSE | 1 year ago | |
LICENSE_for_template | 1 year ago | |
README.md | 1 year ago | |
next.config.js | 1 year ago | |
package-lock.json | 1 year ago | |
package.json | 1 year ago | |
postcss.config.js | 1 year ago | |
tailwind.config.js | 1 year ago | |
tsconfig.json | 1 year ago |
Download || Upgrading || Manual installation
The AI Artist - Stable diffusion for MUSIC ?! tts-generation-webui | The AI Artist - how to use BARK AI voice cloning locally |
---|---|
audiobarkcontinued_generation__2023-05-04_16-07-49_long.webm
audiobarkcontinued_generation__2023-05-04_16-09-21_long.webm
audiobarkcontinued_generation__2023-05-04_16-10-55_long.webm
https://rsxdalv.github.io/bark-speaker-directory/
July 9:
July 5:
July 2:
July 1:
Jun 29:
Jun 27:
Jun 20
Jun 19
June 18:
Jun 14:
June 5:
June 4:
June 3:
May 21:
May 17:
May 16:
May 13:
May 10:
May 4:
May 3:
May 2 Update 2:
May 2 Update 1:
Before:
In case of issues, feel free to contact the developers.
Not exactly, the dependencies clash, especially between conda and python (and dependencies are already in a critical state, moving them to conda is ways off). Therefore, while it might be possible to just replace the old installer with the new one and running the update, the problems are unpredictable and unfixable. Making an update to installer requires a lot of testing so it's not done lightly.
conda install git
)conda install -y -c pytorch ffmpeg
)git clone https://github.com/rsxdalv/tts-generation-webui.git
pip install -r requirements.txt
run using (venv) python server.py
Potentially needed to install build tools (without Visual Studio): https://visualstudio.microsoft.com/visual-cpp-build-tools/
This project utilizes the following open source libraries:
suno-ai/bark - MIT License
tortoise-tts - Apache-2.0 License
ffmpeg - LGPL License
ffmpeg-python - Apache 2.0 License
audiocraft - MIT License
vocos - MIT License
RVC - MIT License
This technology is intended for enablement and creativity, not for harm.
By engaging with this AI model, you acknowledge and agree to abide by these guidelines, employing the AI model in a responsible, ethical, and legal manner.
The codebase is licensed under MIT. However, it's important to note that when installing the dependencies, you will also be subject to their respective licenses. Although most of these licenses are permissive, there may be some that are not. Therefore, it's essential to understand that the permissive license only applies to the codebase itself, not the entire project.
That being said, the goal is to maintain MIT compatibility throughout the project. If you come across a dependency that is not compatible with the MIT license, please feel free to open an issue and bring it to our attention.
Known non-permissive dependencies: | Library | License | Notes | |-------------|-------------------|-------------------------------------------------------------------------------------------| | encodec | CC BY-NC 4.0 | Newer versions are MIT, but need to be installed manually | | diffq | CC BY-NC 4.0 | Optional in the future, not necessary to run, can be uninstalled, should be updated with demucs | | lameenc | GPL License | Future versions will make it LGPL, but need to be installed manually | | unidecode | GPL License | Not mission critical, can be replaced with another library, issue: https://github.com/neonbjb/tortoise-tts/issues/494 |
Model weights have different licenses, please pay attention to the license of the model you are using.
Most notably: