TGINSIGHT SIMILAR POSTS

Find similar content

Source channel @githubtrending · Post #15421 · Jan 18

#python #audio #deeplearning #minicpm #pytorch #speech #speech_synthesis #text_to_speech #tts #tts_model #voice_cloning VoxCPM is a free, open-source, tokenizer-free TTS tool: it turns text into realistic speech without converting the audio into discrete tokens, producing expressive speech that fits the context and cloning a voice from just 3-10 seconds of sample audio. Download VoxCPM1.5 (800M params) from Hugging Face, install it via pip, and use the Python or CLI commands for fast synthesis (RTF 0.15 on an RTX 4090) or to fine-tune your own voices. This makes it easier to produce natural-sounding audiobooks, podcasts, voice clones, or apps, saving time and cost on voice work. https://github.com/OpenBMB/VoxCPM
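
To help read the RTF figure quoted in the post: real-time factor is conventionally the synthesis time divided by the duration of the audio produced, so a value below 1 means faster than real time. The sketch below only does that arithmetic. The function names are hypothetical and are not part of VoxCPM. The 0.15 value is the one quoted in the post, which reports it for an RTX 4090, so results on other hardware will differ.

```python
def real_time_factor(synthesis_seconds: float, audio_seconds: float) -> float:
    """Return RTF = time spent synthesizing / duration of the generated audio."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def estimated_synthesis_seconds(audio_seconds: float, rtf: float) -> float:
    """Estimate how long synthesis takes for a clip of the given duration."""
    if audio_seconds < 0 or rtf < 0:
        raise ValueError("audio_seconds and rtf must be non-negative")
    return audio_seconds * rtf


if __name__ == "__main__":
    quoted_rtf = 0.15          # figure quoted in the post (RTX 4090)
    one_hour_of_audio = 3600.0  # e.g. a one-hour audiobook chapter, in seconds
    estimate = estimated_synthesis_seconds(one_hour_of_audio, quoted_rtf)
    print(f"Estimated synthesis time: {estimate / 60:.1f} minutes")
```

At the quoted RTF, an hour of audio would take roughly nine minutes to synthesize, assuming the figure holds for long inputs.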

Results

1 similar post found

Search: #openpose

The 2ndDim: That was I talking about!

@The2ndDim · Post #1794 · 03/07/2023, 07:02 PM

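#AI_Generated #StableDiffusion #ControlNet #OpenPose #vtuber #kohaku_nene Stable Diffusion does a really good job of generating images in a specified pose when ControlNet is driven by OpenPose. Specifying the pose has become easy, but tuning the parameters is still a hassle. In particular, once a pose is specified, the odds of a "failed human transmutation" (a mangled body) go up considerably, and fixing it takes all sorts of black-magic fine-tuning.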