SamuZai
Furkan Gözükara
Furkan Gözükara

patreon


IndexTTS2 SECourses Premium Voice Cloning and Generation App - 1-Click to Install on Windows, RunPod and Massed Compute - Generate Entire Audiobooks With Consistent High Quality Voice

Patreon exclusive posts index to find our scripts easily, Patreon exclusive posts index to see which scripts updated or added last and amazing Patreon special generative scripts list that you can use in any of your task.

Join discord to get help, chat, discuss and also tell me your discord username to get your special rank : SECourses Discord

Please also Star, Watch and Fork our Stable Diffusion & Generative AI  GitHub repository and join our Reddit subreddit and follow me on LinkedIn (my real profile)

=======

Latest installer zip file : Index_TTS_v3_1.zip

Higher quality YouTube video: https://youtu.be/YbgFVKWB7hs

25 September 2025 Update V3

24 September 2025 Update V2

Windows Requirements

Massed Compute (Recommend Cloud) :

RunPod (Cloud):

Patreon exclusive posts index to find our scripts easily, Patreon exclusive posts index to see which scripts updated or added last and amazing Patreon special generative scripts list that you can use in any of your task.

Join discord to get help, chat, discuss and also tell me your discord username to get your special rank : SECourses Discord

Please also Star, Watch and Fork our Stable Diffusion & Generative AI  GitHub repository and join our Reddit subreddit and follow me on LinkedIn (my real profile)

=======

Latest installer zip file : Index_TTS_v3.zip

Higher quality YouTube video: https://youtu.be/YbgFVKWB7hs

25 September 2025 Update V3

24 September 2025 Update V2

Windows Requirements

Massed Compute (Recommend Cloud) :

RunPod (Cloud):

Comments

hi easy. it is not version issue. make a fresh install and send me cmd logs : monstermmorpg@gmail.com probably you dont have c++ tools and accurate python version 3.10.11

Furkan Gözükara

The 3.1 installer will not recognize my rtx5080 and will only use the CPU. I reinstalled all dependencies multiple times and also followed the video instructions to ensure my machine was set up appropriately. No errors are being thrown and ffmpeg is installed. I reinstalled using the version 1 installer and the app utilizes the GPU appropriately.

billgill

you dont have effmpeg follow requirements : https://youtu.be/DrhUHnYfwC0

Furkan Gözükara

This worked this morning, and i did something like clicked on update instead of start, it then failed, i deleted everything and reinstalled v3 and get this error at the end... any idea pls?? >> Be patient, it may take a while to run in CPU mode. >> Text tokenizer loaded for preview functionality Warning: FFmpeg not found in PATH. Video/audio processing will not work. Please install FFmpeg: https://ffmpeg.org/download.html * Running on local URL: http://127.0.0.1:7860 * To create a public link, set `share=True` in `launch()`. Error: FFmpeg not found. Please ensure FFmpeg is installed and in PATH. Emo control mode:0,weight:0.65,vec:None >> Loading models for first synthesis... >> GPT weights restored from: ./checkpoints\gpt.pth preprocessor_config.json: 100%|███████████████████████████████████████████████████████████████| 275/275 [00:00> semantic_codec weights restored from: checkpoints\hub\models--amphion--MaskGCT\snapshots\265c6cef07625665d0c28d2faafb1415562379dc\semantic_codec\model.safetensors cfm loaded length_regulator loaded gpt_layer loaded >> s2mel weights restored from: ./checkpoints\s2mel.pth campplus_cn_common.bin: 100%|█████████████████████████████████████████████████████| 28.0M/28.0M [00:01<00:00, 15.0MB/s] >> campplus_model weights restored from: checkpoints\hub\models--funasr--campplus\snapshots\fb71fe990cbf6031ae6987a2d76fe64f94377b7e\campplus_cn_common.bin config.json: 1.41kB [00:00, ?B/s] Loading weights from nvidia/bigvgan_v2_22khz_80band_256x bigvgan_generator.pt: 100%|█████████████████████████████████████████████████████████| 449M/449M [00:23<00:00, 18.9MB/s] Removing weight norm... >> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x >> All models loaded successfully! >> starting inference... Traceback (most recent call last): File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\queueing.py", line 745, in process_events response = await route_utils.call_process_api( File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\route_utils.py", line 349, in call_process_api output = await app.get_blocks().process_api( File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\blocks.py", line 2123, in process_api result = await self.call_function( File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\blocks.py", line 1630, in call_function prediction = await anyio.to_thread.run_sync( # type: ignore File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\anyio\to_thread.py", line 56, in run_sync return await get_async_backend().run_sync_in_worker_thread( File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 2485, in run_sync_in_worker_thread return await future File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 976, in run result = context.run(func, *args) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\utils.py", line 915, in wrapper response = f(*args, **kwargs) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\webui.py", line 445, in gen_single output = tts.infer(spk_audio_prompt=prompt, text=text, File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\indextts\infer_v2.py", line 563, in infer audio,sr = self._load_and_cut_audio(spk_audio_prompt,max_speaker_audio_length,verbose) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\indextts\infer_v2.py", line 417, in _load_and_cut_audio audio, sr = librosa.load(audio_path) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\librosa\core\audio.py", line 176, in load y, sr_native = __soundfile_load(path, offset, duration, dtype) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\librosa\core\audio.py", line 209, in __soundfile_load context = sf.SoundFile(path) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\soundfile.py", line 690, in __init__ self._file = self._open(file, mode_int, closefd) File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\soundfile.py", line 1261, in _open raise TypeError("Invalid file: {0!r}".format(self.name)) TypeError: Invalid file: None

Neil Rhodes

yes because either your python installation is wrong or you are running as administrators. follow this requirements tutorial and don't run as administrator : https://youtu.be/DrhUHnYfwC0

Furkan Gözükara

it doesn't seem like this installer works correctly? Attempting to launch the windows_install_or_update.bat does not work ERROR: Could not install packages due to an OSError: [Errno 13] Permission denied:[path \\appdata\\local\\pip\\cache\\wheels\\c9\\69\\31\\d56d90b22a1777b0b231e234b00302a55be255930f8bd92dcd\\jieba-0.42.1-py3-none-any.whl'jieba-0.42.1-py3-none-any.whl'] Check the permissions. running as admin results in: Requirement already satisfied: pip in c:\windows\system32\premium_indextts2_secourses\venv\lib\site-packages (25.2) ERROR: Could not open requirements file: [Errno 2] No such file or directory: 'requirements.txt' 'Windows_Model_Download_and_Fix.bat' is not recognized as an internal or external command, operable program or batch file.

zrikz

no worries I found a workaround

Trill OG

hi there, is it possible to create more than a single queue for multiple output files in one whole session . Im wanting to generate individual output files for each chapter of my ebook without having to generate each chapter with a new session.

Trill OG

VibeVoice supporting Turkish and i will publish hopefully soon

Furkan Gözükara

Hello, Is it support Turkish?

Ahmet Inceelli

for Italian there is VibeVoice. hopefully will publish very soon sorry for delay

Furkan Gözükara

Why does it work just in CPU mode? It's said the model works with different languages, and I fed it with an audio in Italian. It generated an output with a weird accent if it were trying to read Italian text with English pronunciation. Maybe, for Italian, there are different models?

Aldo Jones

thanks

Furkan Gözükara

Here is a useful prompt for AI to fix your Text for correct pronunciation, by all means use this in your text notes if you wish :) Goal: Process the provided text to ensure it is read aloud clearly, naturally, and accurately by a basic local Text-to-Speech (TTS) engine, eliminating all potential ambiguities, apostrophes, and punctuation-related errors that cause mispronunciation or awkward pacing. Instructions: Revise the text according to the following strict, non-negotiable rules. The output text must only contain standard letter characters (A-Z, a-z), numbers (0-9), commas (,), periods (.), question marks (?), exclamation points (!), and simple parentheses (). Eliminate All Contractions and Apostrophes: Spell out every single contraction and remove all apostrophes from the text entirely. (e.g., change "can't" to can not, "I'm" to I am, "it's" to it is or it has). Possessives must be handled by context or sentence restructuring. Spell Out Numbers and Abbreviations: Convert all numerical digits and common acronyms or initialisms to their fully spelled-out word form (e.g., change "16" to sixteen, "3:00 am" to three A M, "TBH" to to be honest). Standardize Punctuation and Flow: Correct any typos or instances of run-on words. Replace complex or grammatically ambiguous phrases with a clear, direct, and common alternative to ensure proper TTS cadence. Use Phonetic Respelling for Ambiguous Words: For words that a basic engine might struggle to pronounce clearly (especially slang, proper nouns, or technical terms), use a simple phonetic respelling immediately followed by the original word in parentheses. The Text to be Processed: [INSERT TEXT HERE]

Neil Rhodes

ye i noticed same. i dont know atm sadly

Furkan Gözükara

How do we get around the pronunciation issues like "We'd" has to be "weed" and "I'm" has to be "I am " and "read" in past tense has to be "red" (and many others) is there a tool for fixing this, It kinda breaks the immersion when having stories read back by your favoroite voice, or do we simply have to manually correct everything? anyone know?

Neil Rhodes

yes i know. please follow this video and this post exactly and reinstall. it should be fixed : https://youtu.be/DrhUHnYfwC0 https://www.patreon.com/posts/click-to-open-post-used-in-tutorial-111553210

Furkan Gözükara

Hello, when I run the application it does not use my gpu. The command line says Be patient, it may take a while to run in CPU mode. I have a 5090 and it sits at 0% utilization when running this particular app. Anyone know how to fix this?

RenderDrgn

hello. thanks. yes you need c++ tools and MSVC as well for this. please follow this tutorial and its updated post : https://youtu.be/DrhUHnYfwC0

Furkan Gözükara

Hello, new to Patrion, long time follower, I am trying your installer and I am getting this failed item after install: LLVM ERROR: Symbol not found: __svml_cosf8_ha

James Charleston II

Currently Chinese and English : https://github.com/index-tts/index-tts/issues/418

Furkan Gözükara

How many languages support?

Hoàng Giang Sơn Trương

yep not a good idea at all. probably wont work either. but you can run on runpod or massed compute or computer with 8 gb GPU. 6 may also work but need to be tested

Furkan Gözükara

running Index_TTS with CPU-only on some Lenovo Thinkpad is no good idea?

Christoph Behrmann

hello are you requesting a feature i am confused can you eloborate more

Furkan Gözükara

thanks. i am working on more features right now

Furkan Gözükara

yep thanks

Furkan Gözükara

you are welcome. working on improvements right now

Furkan Gözükara

orjinal türkçe yok ama türkçe karakter kullanmadan okuyabiliyor. deneyebilirsin

Furkan Gözükara

teşekkürler hocam. Bunu türkçe desteği yok değilmi hocam

Cemil Hacimahmutoglu

Just what I needed. Thank you very much. =D

Hockey

It seems that the functionality for automatic voiceover into another language while preserving emotions needs to be improved for automatic use. Lack: - Cutting the original audio into parts while maintaining the integrity of the sounds - Transcription of these parts into text and translation into the desired language - Batch re-sound of sliced texts in accordance with the emotions of the sliced audio

Dmitry

It just works! amazing!

Neil Rhodes

Yep next level

Furkan Gözükara

I wasn't expecting you to cover this based on your focus towards image and video. This is a huge development in voice clone and I've been messing with it for a while now. Nice to see you included it in your busy schedule. Thanks.

Lou

ok this is actually insane, it perfectly emulates human speech with the pauses, uhms, breathing etc. just crazy

Hipno

if authors add yes for sure but i dont know how to

Furkan Gözükara

thanks for info i will update requirements

Furkan Gözükara

For all those who can't get the installer to do it's thing, make sure you've got Cuda covered Pytorch before grabbing your regular packages. Cheat sheet for this is just to run the following two commands in cmd (assuming windows): 1) pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu129 and then 2) pip install huggingface_hub transformers the installer shouldn't hiccup after that, have fun cloning Mike Wazowski👍.

PS

I fixed it. I'll put it here if anyone else has the same problem. Courtesy Grok Expert. To fix this reliably and safely, download and install the official Intel oneAPI DPC++/C++ Compiler Runtime for Windows. https://registrationcenter-download.intel.com/akdlm/IRC_NAS/47a201d7-d4cd-4079-a2d8-0e66b860aaaa/w_dpcpp_cpp_runtime_p_2025.2.1.1001.exe Run as administrator. Restart machine. Next time I ran the TTS webUI it worked like a charm!

DanO..

Hello, good afternoon. Do you think it could be implemented for the Spanish language as well?

Civitaier

It broke my python. Came back with this, "LLVM ERROR: Symbol not found: __svml_cosf8_ha" any ideas? I'm really looking forward to this! (I also ran install/update again just in case.)

DanO..


More Creators