@libreware · Post #1085 · 04/05/2022 09:32
WeNet Automatic #Speech #Recognition toolkit. https://github.com/wenet-e2e/wenet https://wenet.org.cn/wenet/
@OnePlusGuide · Post #2525 · 24 May
~ HOW TO ENABLE 960 FPS AND MACRO MODE ON THE ONEPLUS 7 PRO (OOS CAMERA + ROOT) ~ #OP #OOS #OP7PRO
Now that these modes have arrived on the OnePlus 7T, they can also be enabled on the OnePlus 7 Pro. The procedure requires root permissions and the stock OxygenOS camera. Camera version 3.10.17 or later is required (I'll link it), and installing the OOS Open Beta builds is recommended.
• Make sure you have the correct Camera version
• Download Preferences Manager (download in the buttons)
• Set it up and grant it root permissions
• Open the section for the Camera app
• Find the file CameraInfo0.xml
• Add a StringSet (with the + button at the top) whose key is Video960FpsSizes and whose value is 1280x720
• Repeat the same procedure in the file CameraInfo5.xml
• Open the file CameraInfo_3.xml, find the variable IsUWMacroSupported, and change its value from false to true
Done!
Pierre
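The two preference edits described in this post (adding a StringSet and flipping a boolean) can be sketched in Python. This is only an illustration, not the Preferences Manager app's own method: it assumes the camera files follow the usual Android SharedPreferences XML layout (a `<map>` root holding `<set name="...">` and `<boolean name="..." value="..."/>` entries), and the helper names `add_string_set` and `set_boolean` are hypothetical. The sample XML in the usage part is made up; on a device the real files need root access to read and write.

```python
import xml.etree.ElementTree as ET


def add_string_set(xml_text, key, values):
    """Return the prefs XML with a <set name=key> holding the given strings.

    Assumes the SharedPreferences layout: <map><set name="..."><string>...
    The returned text has no XML declaration header.
    """
    root = ET.fromstring(xml_text)
    # Drop any existing set with the same key so it is not duplicated.
    for old in root.findall("set"):
        if old.get("name") == key:
            root.remove(old)
    entry = ET.SubElement(root, "set", {"name": key})
    for value in values:
        ET.SubElement(entry, "string").text = value
    return ET.tostring(root, encoding="unicode")


def set_boolean(xml_text, key, value):
    """Return the prefs XML with <boolean name=key> set to true/false."""
    root = ET.fromstring(xml_text)
    text_value = "true" if value else "false"
    for node in root.findall("boolean"):
        if node.get("name") == key:
            node.set("value", text_value)
            break
    else:
        # Key not present yet: add it.
        ET.SubElement(root, "boolean", {"name": key, "value": text_value})
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # Made-up file contents, for illustration only.
    info = '<map><boolean name="IsUWMacroSupported" value="false" /></map>'
    print(add_string_set("<map />", "Video960FpsSizes", ["1280x720"]))
    print(set_boolean(info, "IsUWMacroSupported", True))
```

Working on a copy of each file and keeping the original as a backup is the safer route, since a malformed preferences file may be discarded by the app.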
@libreware · Post #1084 · 04/05/2022 09:32
Vosk Speech Recognition Toolkit
Vosk is an offline, open-source #speech #recognition toolkit. It enables speech recognition for 20+ languages and dialects: English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino, Ukrainian, Kazakh, Swedish, Japanese, Esperanto, Hindi, Czech. More to come.
Vosk models are small (50 MB) but provide continuous large-vocabulary transcription, zero-latency response with a streaming API, reconfigurable vocabulary, and speaker identification.
Speech recognition bindings are implemented for various programming languages, including Python, Java, Node.js, C#, C++, and others.
Vosk supplies speech recognition for chatbots, smart home appliances, and virtual assistants. It can also create subtitles for movies and transcriptions of lectures and interviews.
Vosk scales from small devices like a Raspberry Pi or an Android smartphone to big clusters.
https://t.me/speech_recognition https://alphacephei.com/vosk https://github.com/alphacep/vosk-api
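As a rough illustration of the streaming API mentioned above, here is a minimal Python sketch. It assumes the `vosk` package is installed and that a model has already been downloaded and unpacked; the helper names `transcribe_chunks` and `transcribe_wav` and both path arguments are hypothetical. The streaming loop only relies on the recognizer's `AcceptWaveform`, `Result`, and `FinalResult` methods, so it can be exercised with a stand-in object as well.

```python
import json


def transcribe_chunks(recognizer, chunks):
    """Feed audio chunks to a Vosk-style recognizer and join the final texts.

    AcceptWaveform() returns a truthy value when an utterance is complete;
    Result() and FinalResult() return JSON strings with a "text" field.
    """
    pieces = []
    for chunk in chunks:
        if recognizer.AcceptWaveform(chunk):
            text = json.loads(recognizer.Result()).get("text", "")
            if text:
                pieces.append(text)
    # Flush whatever is still buffered after the last chunk.
    text = json.loads(recognizer.FinalResult()).get("text", "")
    if text:
        pieces.append(text)
    return " ".join(pieces)


def transcribe_wav(wav_path, model_path):
    """Transcribe a WAV file (expected: mono, 16-bit PCM) with a local model.

    Both paths are placeholders supplied by the caller.
    """
    import wave
    from vosk import Model, KaldiRecognizer  # assumes: pip install vosk

    with wave.open(wav_path, "rb") as wf:
        recognizer = KaldiRecognizer(Model(model_path), wf.getframerate())

        def frames():
            # Read the file in small blocks to mimic a live stream.
            while True:
                data = wf.readframes(4000)
                if not data:
                    break
                yield data

        return transcribe_chunks(recognizer, frames())
```

Keeping the loop separate from the file handling means the same function could consume microphone buffers or network packets instead of WAV frames.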
@libreware · Post #1021 · 09/01/2022 14:56
SongRec
An open-source Shazam client for Linux, written in Rust.
Features:
• Recognize audio from an audio file.
• Recognize audio from the microphone.
• Usage from both GUI and command line.
• Provide a history of the recognized songs.
• Continuous song detection.
• Ability to recognize songs from your speakers rather than your microphone.
Download: https://github.com/marin-m/SongRec#installation
https://github.com/marin-m/SongRec
@foss_desktop #music #shazam #recognition
@libreware · Post #1192 · 06/10/2023 11:18
#Linux desktop application that provides live #captioning
FUTO Fellowship program interview; Linux captions software
👉 Live Captions GitHub: https://github.com/abb128/LiveCaptions
🔵 Q&A with billionaire alt-tech investor/philanthropist Eron Wolf: https://www.youtube.com/watch?v=OJPmbcU-Vzo
🔵 FUTO Fellows program: https://futo.org/fellows/
🔵 FUTO YouTube channel: @futotech
⚠️ Google's breaches of privacy have gone TOO FAR! https://www.youtube.com/watch?v=_vWAF13KigI
#speech #recognition #stt #voice
@djangoproject · Post #448 · 18/09/2017 11:30
https://medium.com/@GalarnykMichael/logistic-regression-using-python-sklearn-numpy-mnist-handwriting-recognition-matplotlib-a6b31e2b166a Logistic Regression using Python (#Sklearn, #NumPy, #MNIST, Handwriting #Recognition, #Matplotlib) #machine_learning.
@libreware · Post #1114 · 09/03/2023 22:58
https://writeout.ai
#Transcribe and #translate any #audio file. 100% free to use.
The site lets you upload any audio file and receive a transcription and/or a text translation. Its source code is available, so it can also be hosted locally. It uses OpenAI's Whisper API on the back end.
Source on GitHub: https://github.com/beyondcode/writeout.ai
#writeout #ai #speech #recognition