
TGINSIGHT SIMILAR POSTS

Find similar content

Source channel @githubtrending · Post #14686 · May 8

#python #asr #deeplearning #generative_ai #large_language_models #machine_translation #multimodal #neural_networks #speaker_diariazation #speaker_recognition #speech_synthesis #speech_translation #tts

NVIDIA NeMo is a platform for building, customizing, and deploying generative AI models such as large language models (LLMs), vision language models, and speech AI. It lets you train and fine-tune models using pre-built code and checkpoints, supports the latest model architectures, and runs in cloud, data center, or edge environments. NeMo 2.0 is more flexible and scalable, with Python-based configuration and a modular design that make it simpler to experiment and scale up. The main benefit is that you can create advanced AI applications faster, with less effort, and at lower cost, while getting high performance and easy deployment options [1][2][3].

https://github.com/NVIDIA/NeMo

Results

1 similar post found

Search: #text2mask

PHYGITAL+CREATIVE

@phygitalcreative · Post #2746 · 04/14/2023, 01:52 PM

SEEM: Segment Everything Everywhere All at Once

SEEM lets users easily segment an image using prompts of various types: points, rough masks, boxes, language prompts (text and audio), and so on. It reportedly also works on video without additional fine-tuning.

GitHub (no code yet) · Demo

#image2mask #video2mask #segmentation #text2mask #audio2mask