Speech to Text Python Module PyPI

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

VoiceCraft is a token-infilling neural codec language model that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech (TTS) on in-the-wild data including ...

IEEE

Channel-Time-Frequency Attention Module for Improved Multi-Channel Speech Enhancement

Abstract: Both spatial and tempo-spectral information are essential for multi-channel speech enhancement, a field that has gained significant popularity in recent years. While many studies focus on ...
