Kyutai

Technology, Information and Internet

Build and democratize Artificial General Intelligence through open science.

About us

Website
https://kyutai.org/
Industry
Technology, Information and Internet
Company size
11-50 employees
Type
Nonprofit
Founded
2023

Updates

  • Kyutai

    21,038 followers

    Meet Hibiki, our simultaneous speech-to-speech translation model, currently supporting French-to-English translation. Hibiki produces spoken and text translations of the input speech in real time, while preserving the speaker's voice and optimally adapting its pace based on the semantic content of the source speech. Hibiki comes in two sizes; the smaller variant, Hibiki-M, runs locally on an iPhone 16 Pro, as shown by Neil Zeghidour in this video. Learn more in our blog post https://lnkd.in/eGPSFwMg with links to the code on GitHub and the weights on Hugging Face, and try the model on your laptop now!

  • Kyutai

    Meet Helium-1 preview, our 2B multilingual LLM, targeting edge and mobile devices, released under a CC-BY license. Helium supports 6 languages and will be extended to more shortly. Below is a summary of Helium's performance on multilingual benchmarks. We will also release the full model and a technical report, and open-source the code for training and for reproducing our dataset. We are looking forward to feedback from the community, which will help us drive the development of Helium and make it the best multilingual lightweight model.

    Hugging Face: https://lnkd.in/eKA4biWr
    Blog post: https://lnkd.in/e9xFD3BS

    • [Image: no alternative text provided]
  • Kyutai reposted this

    Last week, we released several Moshi artifacts: a long technical report with all the details behind our model, weights for Moshi and its Mimi codec, along with streaming inference code in PyTorch, Rust and MLX.

    Technical report: https://lnkd.in/eHquXSbF
    Repo: https://lnkd.in/g2U5HtZG
    Hugging Face: https://lnkd.in/ga7m_hth
    Blog post: https://lnkd.in/gSMzrnVT

    You can run it locally. On an Apple Silicon Mac, just run:

        $ pip install moshi_mlx
        $ python -m moshi_mlx.local_web -q 4

    It's all open-source under a permissive license. We can't wait to see what the community will build with it!

  • Kyutai reposted this

    Neil Zeghidour

    Chief Modeling Officer @ Kyutai

    Thanks Nessrine Berrama! Looking forward to speaking at https://www.dotai.io/ and diving deep into the making of Moshi.

    Nessrine Berrama

    CEO @dotConferences | Enabling AI companies' GTM success and spotting emerging AI trends

    In just 6 months, he built an AI that outperforms OpenAI, Amazon and Apple. He is part of a team of 8 French people who are literally making Silicon Valley tremble! He is Neil Zeghidour, Chief Modeling Officer of Kyutai, formerly at Meta and Google, who chose a French lab to advance AI research.

    The Kyutai research center, backed by Xavier Niel, Eric Schmidt and Rodolphe Saadé, is already starting to produce projects. In 6 months. And it is mind-blowing. The proof:

    - The AI, called Moshi, can be tested freely online, which is a world first for a generative voice AI.
    - The conversational AI has an incredible latency of 160 ms, which leaves GPT-4o, Alexa and Siri far behind.
    - Its speech synthesis capabilities are exceptional in terms of emotion and interaction between multiple voices.
    - All of this with a fully open-source approach that does credit to the AI community in Europe.

    In short, Moshi has the potential to revolutionize the use of speech in the digital world. And we are very curious to follow the story. I can't tell you more, because Neil is preparing a keynote called "Multimodel Language Models" for dotAI in October, and we can't wait to hear it! Thank you, Neil, for joining us to share your progress with the community. And you, will you join us? (link in the comments)
