OpenAI may launch customized speech engine: voice assistant, translation, generative music

OpenAI's new trademark reveals that it may be launching a speech engine covering a wide range of applications, including voice assistants, generative music and speech translation. The engine will also support voice command recognition, text-to-speech conversion, multi-language translation, and the ability to generate audio content based on various inputs, among other things. The large-scale model trained using AI technology and big data is designed to advance speech services and natural language understanding technology, and it is expected to see if it can cause industry shocks like the video model Sora. (AI Cambrian)

Search