Any-to-any multimodality
Natural interruption and real-time response
one more thing
References
- Whisper https://arxiv.org/abs/2212.04356
- MMS https://arxiv.org/abs/2305.13516
- AudioPaLM https://arxiv.org/abs/2306.12925
- GPT-4o https://openai.com/index/hello-gpt-4o/
- Moshi chat https://moshi.chat/
- Gemini https://arxiv.org/pdf/2312.11805
- Google Astra https://deepmind.google/technologies/gemini/project-astra/
- Speech ReaLLM https://arxiv.org/abs/2406.09569
- BESTOW https://arxiv.org/abs/2406.19954