The French public tv released an excellent 4-episode documentary on AI (90mn in total): "L'homme à la machine". In Episode 2, Kyutai opens widely the doors of its vibrant lab: take the opportunity to meet passionate team members. https://lnkd.in/eckhHj_H
关于我们
Build and democratize Artificial General Intelligence through open science.
- 网站
-
https://kyutai.org/
Kyutai的外部链接
- 所属行业
- 科技、信息和网络
- 规模
- 11-50 人
- 类型
- 非营利机构
- 创立
- 2023
Kyutai员工
-
Guillaume Rouzaud
Part-time HR Director | Kyutai ???? | Join us
-
Jennifer Coscas
General Counsel | Legal & Compliance | Data Privacy | Transformative technologies
-
Alexandre Défossez
Chief exploration officer at Kyutai, formerly RS at FAIR Paris
-
Aude Durand
Directrice Générale Déléguée, Groupe iliad - #free #Scaleway #Play #iliaditalia
动态
-
Meet Hibiki, our simultaneous speech-to-speech translation model, currently supporting ??????????. Hibiki produces spoken and text translations of the input speech in real-time, while preserving the speaker’s voice and optimally adapting its pace based on the semantic content of the source speech. Hibiki comes in two sizes, the smaller variant Hibiki-M runs locally on an iPhone 16 Pro as shown by Neil Zeghidour in this video. Learn more on our blog post https://lnkd.in/eGPSFwMg with links to the code on github and the weights on huggingface, and try the model on your laptop now!
-
Meet Helium-1 preview, our 2B multi-lingual LLM, targeting edge and mobile devices, released under a CC-BY license. Helium supports 6 languages ???? ???? ???? ???? ???? ???? and will be extended to more shortly. Below is a summary of Helium's performance on multilingual benchmarks. We will also release the full model, a technical report, and open-source the code for training and for reproducing our dataset. We are looking forward to the feedback from the community, which will help us drive the development of Helium and make it the best multi-lingual lightweight model. ?? HuggingFace: https://lnkd.in/eKA4biWr Blog Post: https://lnkd.in/e9xFD3BS
-
-
Want to contribute to the future of #AI? Our lab is offering #research internships in 2025, providing a unique opportunity to work on groundbreaking projects. Don't miss out ??! #internship
-
We trained Moshi on synthetic dialogues generated with our own TTS system. To learn more about the technical details behind Moshi, check out Neil Zeghidour's talk at dotConferences. Link in comments ??
-
Kyutai转发了
Last week, we've released several Moshi artifacts: a long technical report with all the details behind our model, weights for Moshi and its Mimi codec, along with streaming inference code in Pytorch, Rust and MLX. Technical report: https://lnkd.in/eHquXSbF Repo: https://lnkd.in/g2U5HtZG HuggingFace: https://lnkd.in/ga7m_hth Blog post: https://lnkd.in/gSMzrnVT You can run it locally, on an Apple Silicon Mac just run: $ pip install moshi_mlx $ python -m moshi_mlx.local_web -q 4 It's all open-source under a permissive license, can't wait to see what the community will build with it!
-
Last week, we've released several Moshi artifacts: a long technical report with all the details behind our model, weights for Moshi and its Mimi codec, along with streaming inference code in Pytorch, Rust and MLX. Technical report: https://lnkd.in/eHquXSbF Repo: https://lnkd.in/g2U5HtZG HuggingFace: https://lnkd.in/ga7m_hth Blog post: https://lnkd.in/gSMzrnVT You can run it locally, on an Apple Silicon Mac just run: $ pip install moshi_mlx $ python -m moshi_mlx.local_web -q 4 It's all open-source under a permissive license, can't wait to see what the community will build with it!
-
Kyutai转发了
Thanks Nessrine Berrama! Looking forward to speak at https://www.dotai.io/ and deep dive into the making of Moshi.
En seulement 6 mois, il crée une IA qui surperforme OpenAI, Amazon et Apple. Il fait partie d’une équipe de 8 fran?ais qui font littéralement trembler la Silicon Valley! Lui, c’est Neil Zeghidour, le Chief Modeling Officer de Kyutai, passé par Meta et Google, et qui a choisi un laboratoire fran?ais pour faire avancer la recherche sur l’IA. Le centre de recherche Kyutai – backé par Xavier Niel, Eric Schmidt et Rodolphe Saadé – commence déjà à produire des projets. En 6 mois. Et c’est hallucinant. Pour preuve: - L’IA – qui s’appelle Moshi – peut être testée librement en ligne. Ce qui constitue une première mondiale pour une IA vocale générative. - L' IA conversationnelle possède une latence incroyable à 160ms, qui laisse GPT4-o, Alexa et Siri bien loin derrière. - Ses capacités de synthèse vocale sont exceptionnelles en termes d'émotion et d'interaction entre plusieurs voix.? - Le tout avec approche complètement Open Source qui fait honneur à la communauté AI en Europe. Bref, Moshi a le potentiel de révolutionner l’usage de la parole dans le monde numérique. Et on est super curieux de suivre l’histoire. Je ne saurais vous en dire plus, car Neil nous prépare une keynote appelée “Multimodel Language Models” à dotAI en Octobre, et on a très hate de l’écouter! Merci Neil de nous rejoindre pour partager à la communauté vos avancements. Et vous, vous nous rejoignez? (lien en commentaire)
-
-
"Hippie" Moshi tells its love for Hendrix...but "skeptical" Moshi is less enthusiastic about psychedelic rock. Moshi can play 70+ emotions, will you catch them all? Try now at https://moshi.chat