Furkan Gözükara

IndexTTS2 SECourses Premium Voice Cloning and Generation App - 1-Click to Install on Windows, RunPod and Massed Compute - Generate Entire Audiobooks With Consistent High Quality Voice

Added 2025-09-19 23:30:01 +0000 UTC

Patreon exclusive posts index to find our scripts easily, Patreon exclusive posts index to see which scripts were updated or added last, and an amazing list of Patreon special generative scripts that you can use in any of your tasks.

Join discord to get help, chat, discuss and also tell me your discord username to get your special rank : SECourses Discord

Please also Star, Watch and Fork our Stable Diffusion & Generative AI GitHub repository and join our Reddit subreddit and follow me on LinkedIn (my real profile)

=======

Latest installer zip file : Index_TTS_v3_1.zip

Higher quality YouTube video: https://youtu.be/YbgFVKWB7hs

I have significantly improved the app published here : https://github.com/index-tts/index-tts
Hopefully many more features are coming; this is only the initial release
Just run Windows_Install_or_Update.bat for installation
- You only need Python 3.10.11, Git and FFmpeg installed
- I am using fully pre-compiled libraries for both Windows and Linux, so it should work at maximum speed on literally every GPU, such as the RTX 2000, 3000, 4000 and 5000 series, H100, B200, etc.
Read the Gradio app interface extremely carefully, since I added literally every option with detailed information and explanation
Default values are supposed to be good, but you can play with the values to improve results further

25 September 2025 Update V3

Since the official repo was running out of Git LFS quota and causing errors, everything has been uploaded to a new repo
Make a fresh install; you can move your checkpoints folder into the new install

24 September 2025 Update V2

Automatic FFmpeg installation added to RunPod and Massed Compute
Save Used Reference Audio added
Prevent VRAM Accumulation added - useful when a high number of Beam Search Beams is used
- A higher number of Beam Search Beams, like 8, really improves quality
Load from Audio File Path added
- Useful when you want to upload big audio files directly to RunPod or Massed Compute rather than through the Gradio live share link
Just run Windows_Install_or_Update.bat to update
More features are hopefully coming soon

Windows Requirements

Python 3.10.11, CUDA 12.9, C++ tools, MSVC, FFmpeg and Git
If it doesn't work, make sure to watch the tutorial below and install everything exactly as shown in it
https://youtu.be/DrhUHnYfwC0
- Follow this post entirely along with the video : https://www.patreon.com/posts/111553210
  - The above post is fully updated with links and screenshots, so it is easy to follow
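
Before running the installer, a quick pre-flight check can confirm that the command-line tools from the requirements list are reachable. The sketch below is not part of the installer; the function name `check_requirements` and the report keys are made up for illustration. It only checks the Python version, Git and FFmpeg, and does not attempt to detect CUDA, C++ tools or MSVC.

```python
import shutil
import sys

# Tools named in the requirements list; they must be reachable via PATH.
REQUIRED_TOOLS = ("git", "ffmpeg")
# Major/minor Python version the post asks for (3.10.11 is a 3.10 release).
REQUIRED_PYTHON = (3, 10)


def check_requirements(tools=REQUIRED_TOOLS, which=shutil.which):
    """Return a dict mapping each requirement to True (found) or False (missing)."""
    report = {"python 3.10.x": sys.version_info[:2] == REQUIRED_PYTHON}
    for tool in tools:
        # shutil.which returns None when the executable is not on PATH.
        report[tool] = which(tool) is not None
    return report


if __name__ == "__main__":
    for name, ok in check_requirements().items():
        print(f"{name}: {'OK' if ok else 'MISSING'}")
```

If `ffmpeg` shows as MISSING here, the app will most likely print the "FFmpeg not found in PATH" warning reported in the comments.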

Massed Compute (Recommended Cloud) :

Please register via this link : https://vm.massedcompute.com/signup?linkId=lp_034338&sourceId=secourses&tenantId=massed-compute
- Use our coupon SECourses
- Our coupon works on all GPUs now
  - I recommend RTX 6000 PRO but this app works on every GPU
  - Full details here : https://www.patreon.com/posts/26671823
- Then select our image SECourses from Creator dropdown
- Then follow Massed_Compute_Instructions_READ.txt
- Same as any of my other Massed Compute installer scripts
- Example tutorial to learn how to install and use Massed Compute
  - (Starts at 12:58) : https://youtu.be/KW-MHmoNcqo?si=G1WbG-Qw4ujWvOtG&t=778

RunPod (Cloud):

Please register via this link : https://get.runpod.io/955rkuppqv4h
- Then follow Runpod_Instructions_READ.txt
- Same as any of my other RunPod installer scripts
- Use the template written in Runpod_Instructions_READ.txt file
- Example tutorial to learn how to install and use RunPod
  - (starts at 22:03) : https://youtu.be/KW-MHmoNcqo?si=QN8X8Sjn13ZYu-EU&t=1323

Comments

Hi, easy. It is not a version issue. Make a fresh install and send me the CMD logs : monstermmorpg@gmail.com. You probably don't have C++ tools and the exact Python version 3.10.11.

Furkan Gözükara

2025-10-19 10:24:15 +0000 UTC

The 3.1 installer will not recognize my rtx5080 and will only use the CPU. I reinstalled all dependencies multiple times and also followed the video instructions to ensure my machine was set up appropriately. No errors are being thrown and ffmpeg is installed. I reinstalled using the version 1 installer and the app utilizes the GPU appropriately.

billgill

2025-10-19 02:00:21 +0000 UTC

You don't have FFmpeg installed. Follow the requirements tutorial : https://youtu.be/DrhUHnYfwC0

Furkan Gözükara

2025-10-16 20:16:04 +0000 UTC

This worked this morning, but then I clicked on update instead of start and it failed. I deleted everything and reinstalled v3, and now I get this error at the end. Any idea, please?
>> Be patient, it may take a while to run in CPU mode.
>> Text tokenizer loaded for preview functionality
Warning: FFmpeg not found in PATH. Video/audio processing will not work.
Please install FFmpeg: https://ffmpeg.org/download.html
* Running on local URL: http://127.0.0.1:7860
* To create a public link, set `share=True` in `launch()`.
Error: FFmpeg not found. Please ensure FFmpeg is installed and in PATH.
Emo control mode:0,weight:0.65,vec:None
>> Loading models for first synthesis...
>> GPT weights restored from: ./checkpoints\gpt.pth
preprocessor_config.json: 100%|███████████████████████████████████████████████████████████████| 275/275 [00:00> semantic_codec weights restored from: checkpoints\hub\models--amphion--MaskGCT\snapshots\265c6cef07625665d0c28d2faafb1415562379dc\semantic_codec\model.safetensors
cfm loaded
length_regulator loaded
gpt_layer loaded
>> s2mel weights restored from: ./checkpoints\s2mel.pth
campplus_cn_common.bin: 100%|█████████████████████████████████████████████████████| 28.0M/28.0M [00:01<00:00, 15.0MB/s]
>> campplus_model weights restored from: checkpoints\hub\models--funasr--campplus\snapshots\fb71fe990cbf6031ae6987a2d76fe64f94377b7e\campplus_cn_common.bin
config.json: 1.41kB [00:00, ?B/s]
Loading weights from nvidia/bigvgan_v2_22khz_80band_256x
bigvgan_generator.pt: 100%|█████████████████████████████████████████████████████████| 449M/449M [00:23<00:00, 18.9MB/s]
Removing weight norm...
>> bigvgan weights restored from: nvidia/bigvgan_v2_22khz_80band_256x
>> All models loaded successfully!
>> starting inference...
Traceback (most recent call last):
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\queueing.py", line 745, in process_events
    response = await route_utils.call_process_api(
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\route_utils.py", line 349, in call_process_api
    output = await app.get_blocks().process_api(
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\blocks.py", line 2123, in process_api
    result = await self.call_function(
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\blocks.py", line 1630, in call_function
    prediction = await anyio.to_thread.run_sync(  # type: ignore
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\anyio\to_thread.py", line 56, in run_sync
    return await get_async_backend().run_sync_in_worker_thread(
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 2485, in run_sync_in_worker_thread
    return await future
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\anyio\_backends\_asyncio.py", line 976, in run
    result = context.run(func, *args)
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\gradio\utils.py", line 915, in wrapper
    response = f(*args, **kwargs)
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\webui.py", line 445, in gen_single
    output = tts.infer(spk_audio_prompt=prompt, text=text,
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\indextts\infer_v2.py", line 563, in infer
    audio,sr = self._load_and_cut_audio(spk_audio_prompt,max_speaker_audio_length,verbose)
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\indextts\infer_v2.py", line 417, in _load_and_cut_audio
    audio, sr = librosa.load(audio_path)
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\librosa\core\audio.py", line 176, in load
    y, sr_native = __soundfile_load(path, offset, duration, dtype)
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\librosa\core\audio.py", line 209, in __soundfile_load
    context = sf.SoundFile(path)
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\soundfile.py", line 690, in __init__
    self._file = self._open(file, mode_int, closefd)
  File "A:\Index_TTS_v3\Premium_IndexTTS2_SECourses\venv\lib\site-packages\soundfile.py", line 1261, in _open
    raise TypeError("Invalid file: {0!r}".format(self.name))
TypeError: Invalid file: None

Neil Rhodes

2025-10-16 20:12:06 +0000 UTC

Yes, because either your Python installation is wrong or you are running as administrator. Follow this requirements tutorial and don't run as administrator : https://youtu.be/DrhUHnYfwC0

Furkan Gözükara

2025-10-15 21:22:51 +0000 UTC

It doesn't seem like this installer works correctly. Attempting to launch Windows_Install_or_Update.bat does not work:
ERROR: Could not install packages due to an OSError: [Errno 13] Permission denied:[path \\appdata\\local\\pip\\cache\\wheels\\c9\\69\\31\\d56d90b22a1777b0b231e234b00302a55be255930f8bd92dcd\\jieba-0.42.1-py3-none-any.whl'jieba-0.42.1-py3-none-any.whl'] Check the permissions.
Running as admin results in:
Requirement already satisfied: pip in c:\windows\system32\premium_indextts2_secourses\venv\lib\site-packages (25.2)
ERROR: Could not open requirements file: [Errno 2] No such file or directory: 'requirements.txt'
'Windows_Model_Download_and_Fix.bat' is not recognized as an internal or external command, operable program or batch file.

zrikz

2025-10-15 20:53:40 +0000 UTC

no worries I found a workaround

Trill OG

2025-10-15 09:17:26 +0000 UTC

Hi there, is it possible to queue more than one job so that multiple output files are generated in a single session? I want to generate an individual output file for each chapter of my ebook without having to start a new session for each chapter.

Trill OG

2025-10-14 18:49:34 +0000 UTC

VibeVoice supports Turkish, and I will hopefully publish it soon.

Furkan Gözükara

2025-10-14 14:01:26 +0000 UTC

Hello, does it support Turkish?

Ahmet Inceelli

2025-10-14 12:37:53 +0000 UTC

For Italian there is VibeVoice. I will hopefully publish it very soon, sorry for the delay.

Furkan Gözükara

2025-10-03 10:58:23 +0000 UTC

Why does it work only in CPU mode? It's said that the model works with different languages, so I fed it audio in Italian. It generated output with a weird accent, as if it were trying to read Italian text with English pronunciation. Maybe there are different models for Italian?

Aldo Jones

2025-10-02 23:42:07 +0000 UTC

thanks

Furkan Gözükara

2025-10-01 10:16:21 +0000 UTC

Here is a useful prompt for an AI to fix your text for correct pronunciation. By all means use this in your text notes if you wish :)
Goal: Process the provided text to ensure it is read aloud clearly, naturally, and accurately by a basic local Text-to-Speech (TTS) engine, eliminating all potential ambiguities, apostrophes, and punctuation-related errors that cause mispronunciation or awkward pacing.
Instructions: Revise the text according to the following strict, non-negotiable rules. The output text must only contain standard letter characters (A-Z, a-z), numbers (0-9), commas (,), periods (.), question marks (?), exclamation points (!), and simple parentheses ().
- Eliminate All Contractions and Apostrophes: Spell out every single contraction and remove all apostrophes from the text entirely. (e.g., change "can't" to can not, "I'm" to I am, "it's" to it is or it has). Possessives must be handled by context or sentence restructuring.
- Spell Out Numbers and Abbreviations: Convert all numerical digits and common acronyms or initialisms to their fully spelled-out word form (e.g., change "16" to sixteen, "3:00 am" to three A M, "TBH" to to be honest).
- Standardize Punctuation and Flow: Correct any typos or instances of run-on words. Replace complex or grammatically ambiguous phrases with a clear, direct, and common alternative to ensure proper TTS cadence.
- Use Phonetic Respelling for Ambiguous Words: For words that a basic engine might struggle to pronounce clearly (especially slang, proper nouns, or technical terms), use a simple phonetic respelling immediately followed by the original word in parentheses.
The Text to be Processed: [INSERT TEXT HERE]

Neil Rhodes

2025-10-01 09:27:28 +0000 UTC
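
For readers who prefer a local script over an AI prompt, the first rule of the prompt above (expand contractions, then remove apostrophes) can be roughly sketched in Python. This is only an illustration: the `CONTRACTIONS` table and the `expand_contractions` name are invented here, the table is far from complete, and ambiguous forms such as "we'd" (we would / we had) are simply mapped to one reading.

```python
import re

# Small illustrative table; a real one would need many more entries.
# Ambiguous contractions are mapped to a single, arbitrary reading.
CONTRACTIONS = {
    "can't": "can not",
    "won't": "will not",
    "don't": "do not",
    "i'm": "I am",
    "we'd": "we would",
}

# One case-insensitive pattern that matches any known contraction as a whole word.
_PATTERN = re.compile(
    r"\b(" + "|".join(re.escape(key) for key in CONTRACTIONS) + r")\b",
    flags=re.IGNORECASE,
)


def expand_contractions(text):
    """Replace known contractions, then drop any apostrophes that remain."""

    def replace(match):
        word = match.group(0)
        expanded = CONTRACTIONS[word.lower()]
        # Keep a leading capital when the original word was capitalized.
        if word[0].isupper():
            expanded = expanded[0].upper() + expanded[1:]
        return expanded

    text = text.replace("\u2019", "'")  # normalize curly apostrophes first
    text = _PATTERN.sub(replace, text)
    # Possessives are not restructured here; their apostrophe is just removed.
    return text.replace("'", "")


if __name__ == "__main__":
    print(expand_contractions("I'm sure we can't go"))
```

Numbers, abbreviations and phonetic respelling from the other rules are not handled by this sketch and would still need manual editing or the prompt above.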

Yes, I noticed the same. Sadly, I don't know at the moment.

Furkan Gözükara

2025-09-30 22:32:35 +0000 UTC

How do we get around the pronunciation issues? For example, "We'd" has to be written as "weed", "I'm" has to be "I am", and "read" in the past tense has to be "red" (and many others). Is there a tool for fixing this? It kind of breaks the immersion when having stories read back by your favorite voice. Or do we simply have to correct everything manually? Does anyone know?

Neil Rhodes

2025-09-30 08:50:16 +0000 UTC

Yes, I know. Please follow this video and this post exactly and reinstall. It should be fixed : https://youtu.be/DrhUHnYfwC0 https://www.patreon.com/posts/click-to-open-post-used-in-tutorial-111553210

Furkan Gözükara

2025-09-26 22:30:54 +0000 UTC

Hello, when I run the application it does not use my gpu. The command line says Be patient, it may take a while to run in CPU mode. I have a 5090 and it sits at 0% utilization when running this particular app. Anyone know how to fix this?

RenderDrgn

2025-09-26 14:50:47 +0000 UTC

Hello, thanks. Yes, you need C++ tools and MSVC as well for this. Please follow this tutorial and its updated post : https://youtu.be/DrhUHnYfwC0

Furkan Gözükara

2025-09-23 19:19:38 +0000 UTC

Hello, new to Patreon, long-time follower. I am trying your installer and I am getting this failure after install: LLVM ERROR: Symbol not found: __svml_cosf8_ha

James Charleston II

2025-09-23 16:04:32 +0000 UTC

Currently Chinese and English : https://github.com/index-tts/index-tts/issues/418

Furkan Gözükara

2025-09-22 07:48:29 +0000 UTC

How many languages does it support?

Hoàng Giang Sơn Trương

2025-09-22 02:27:37 +0000 UTC

Yep, not a good idea at all. It probably won't work either. But you can run it on RunPod or Massed Compute, or on a computer with an 8 GB GPU. 6 GB may also work, but that needs to be tested.

Furkan Gözükara

2025-09-21 20:06:41 +0000 UTC

Running Index_TTS CPU-only on a Lenovo ThinkPad is not a good idea?

Christoph Behrmann

2025-09-21 20:01:14 +0000 UTC

Hello, are you requesting a feature? I am confused. Can you elaborate more?

Furkan Gözükara

2025-09-21 19:16:02 +0000 UTC

thanks. i am working on more features right now

Furkan Gözükara

2025-09-21 18:08:19 +0000 UTC

yep thanks

Furkan Gözükara

2025-09-21 18:08:05 +0000 UTC

you are welcome. working on improvements right now

Furkan Gözükara

2025-09-21 18:07:38 +0000 UTC

There is no native Turkish support, but it can read Turkish text written without Turkish characters. You can give it a try.

Furkan Gözükara

2025-09-21 18:07:29 +0000 UTC

Thank you, sir. This doesn't have Turkish support, does it?

Cemil Hacimahmutoglu

2025-09-20 18:30:23 +0000 UTC

Just what I needed. Thank you very much. =D

Hockey

2025-09-20 17:52:58 +0000 UTC

It seems that the functionality for automatic voiceover into another language while preserving emotions needs to be improved for automatic use. What is lacking:
- Cutting the original audio into parts while maintaining the integrity of the sounds
- Transcription of these parts into text and translation into the desired language
- Batch re-voicing of the sliced texts in accordance with the emotions of the sliced audio

Dmitry

2025-09-20 15:14:43 +0000 UTC

It just works! amazing!

Neil Rhodes

2025-09-20 12:47:19 +0000 UTC

Yep next level

Furkan Gözükara

2025-09-20 10:35:26 +0000 UTC

I wasn't expecting you to cover this based on your focus towards image and video. This is a huge development in voice clone and I've been messing with it for a while now. Nice to see you included it in your busy schedule. Thanks.

Lou

2025-09-20 10:32:00 +0000 UTC

ok this is actually insane, it perfectly emulates human speech with the pauses, uhms, breathing etc. just crazy

Hipno

2025-09-20 10:25:32 +0000 UTC

If the authors add it, then yes, for sure, but I don't know how to add it myself.

Furkan Gözükara

2025-09-20 07:30:56 +0000 UTC

thanks for info i will update requirements

Furkan Gözükara

2025-09-20 07:30:39 +0000 UTC

For all those who can't get the installer to do its thing, make sure you have a CUDA-enabled PyTorch installed before grabbing your regular packages. The cheat sheet for this is just to run the following two commands in cmd (assuming Windows):
1) pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu129
2) pip install huggingface_hub transformers
The installer shouldn't hiccup after that. Have fun cloning Mike Wazowski👍.

2025-09-20 04:40:06 +0000 UTC
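
Several comments report the app falling back to CPU mode. One way to see whether the installed PyTorch build can actually reach the GPU is a short check like the sketch below. The function name `describe_torch_device` is made up for illustration, and the script is assumed to be run with the Python interpreter of the app's own venv, otherwise it inspects a different PyTorch install.

```python
def describe_torch_device():
    """Return a one-line summary of whether PyTorch can see a CUDA GPU."""
    try:
        import torch
    except ImportError:
        return "PyTorch is not installed in this environment"
    if torch.cuda.is_available():
        # torch.version.cuda is the CUDA version the wheel was built against.
        return f"CUDA build {torch.version.cuda}, GPU: {torch.cuda.get_device_name(0)}"
    return f"PyTorch {torch.__version__} sees no CUDA GPU (CPU mode)"


if __name__ == "__main__":
    print(describe_torch_device())
```

If this reports CPU mode on a machine with an NVIDIA GPU, a CPU-only PyTorch wheel or a driver problem is a likely cause, and reinstalling from the CUDA index URL quoted in the comment above is worth trying.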

I fixed it. I'll put it here if anyone else has the same problem. Courtesy Grok Expert. To fix this reliably and safely, download and install the official Intel oneAPI DPC++/C++ Compiler Runtime for Windows. https://registrationcenter-download.intel.com/akdlm/IRC_NAS/47a201d7-d4cd-4079-a2d8-0e66b860aaaa/w_dpcpp_cpp_runtime_p_2025.2.1.1001.exe Run as administrator. Restart machine. Next time I ran the TTS webUI it worked like a charm!

DanO..

2025-09-20 02:10:37 +0000 UTC

Hello, good afternoon. Do you think it could be implemented for the Spanish language as well?

Civitaier

2025-09-20 00:49:29 +0000 UTC

It broke my python. Came back with this, "LLVM ERROR: Symbol not found: __svml_cosf8_ha" any ideas? I'm really looking forward to this! (I also ran install/update again just in case.)

DanO..

2025-09-20 00:43:06 +0000 UTC
