Search results for "AUDIO"

Kimi releases a new general-purpose audio foundation model, Kimi-Audio

Jin10 Data reported on April 26 that Kimi today released a new open-source project: the general-purpose audio foundation model Kimi-Audio. According to the announcement, the model supports a range of tasks, including speech recognition, audio understanding, audio-to-text conversion, and voice dialogue.

Alibaba's Tongyi open-sources audio language model Qwen2-Audio; related paper accepted to top conference ACL 2024

Jin10 Data, August 13: Alibaba's Tongyi continues to open-source its large models, adding the audio language model Qwen2-Audio to the Qwen2 open-source family. Qwen2-Audio can answer spoken questions directly, without text input, and can understand and analyze audio signals supplied by users, including human voice, natural sounds, and music. The model has significantly surpassed the previous best models in multiple authoritative evaluations. The Tongyi team also released a new evaluation benchmark for audio understanding models, and the related paper has been accepted to ACL 2024, a top international conference being held this week.