We host the OmniVoice application so that it can be run on our online workstations, either with Wine or directly.
Quick description of OmniVoice:
The OmniVoice project is a cutting-edge multilingual text-to-speech system designed to generate high-quality speech across more than 600 languages. Built on a diffusion language model-style architecture, it combines scalability with strong performance, enabling both natural-sounding voice synthesis and efficient inference speeds. One of its most notable capabilities is zero-shot voice cloning, allowing users to replicate a speaker’s voice using only a short reference audio clip. In addition, it supports voice design through configurable attributes such as gender, accent, pitch, and speaking style, giving users fine-grained control over generated speech. The system also includes advanced features like non-verbal expression tags and pronunciation overrides, enabling expressive and precise output. With support for both API-based and command-line usage, it is designed for research, production, and experimentation alike.
Features:
- Support for over 600 languages in text-to-speech generation
- Zero-shot voice cloning using short reference audio
- Voice design through configurable speaker attributes
- Fine-grained control over pronunciation and non-verbal cues
- High-speed inference with real-time performance capabilities
- Multiple interfaces including Python API and CLI tools
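To make the difference between voice design and voice cloning more concrete, here is a minimal sketch in plain Python. It does not use or reproduce the OmniVoice API: the `VoiceDesign` class, the `build_request` function, and every field name are hypothetical, chosen only to mirror the attributes listed in the description (gender, accent, pitch, speaking style) and the short reference clip used for cloning. Rejecting a request that supplies both is a choice made for this sketch, not a documented restriction.

```python
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass(frozen=True)
class VoiceDesign:
    """Hypothetical container for the speaker attributes named in the description."""
    gender: Optional[str] = None
    accent: Optional[str] = None
    pitch: Optional[str] = None
    style: Optional[str] = None


def build_request(text: str, language: str,
                  design: Optional[VoiceDesign] = None,
                  reference_audio: Optional[str] = None) -> dict:
    """Assemble a plain dict describing one synthesis job (illustrative only)."""
    if not text.strip():
        raise ValueError("text must not be empty")
    # Sketch-level design choice: a job either designs a voice or clones one.
    if design is not None and reference_audio is not None:
        raise ValueError("choose voice design or voice cloning, not both")

    request = {"text": text, "language": language}
    if design is not None:
        # Keep only the attributes that were actually set.
        request["voice_design"] = {
            key: value for key, value in asdict(design).items() if value is not None
        }
    if reference_audio is not None:
        # Path to a short reference clip of the speaker to imitate.
        request["reference_audio"] = reference_audio
    return request


designed = build_request("Hello", "en", design=VoiceDesign(gender="female", pitch="high"))
cloned = build_request("Hello", "en", reference_audio="speaker.wav")
```

A real integration would hand these values to the project's own Python API or CLI, whose actual parameter names should be taken from the OmniVoice documentation.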
Programming Language: Python.
©2024. Winfy. All Rights Reserved.
By OD Group OU – Registry code: 1609791 – VAT number: EE102345621.