GenAI Model Updates: Latest Developments from Meta, Mistral, Apple, Google, OpenAI, and more
Meta's Llama 3.1
Not entirely open-source, but close enough. How open it is?
One of the key improvements in Llama 3.1 is its massive training dataset. It boasts a massive training dataset of 15 trillion tokens, a huge leap from the previous 1.5 trillion. This broader range of text data makes it a more knowledgeable and capable model.
The architecture and training objectives have also been enhanced, allowing the model to better understand nuanced language and generate more coherent text.
Plus, it achieves all this with fewer parameters - 70 billion, down from 100 billion - making it a more efficient model overall. Read more
Mistral Large 2
Mistral AI, another company competing with Meta to develop the best open-source models, has released Mistral Large 2.
According to their website, this model boasts 123 billion parameters, making it quite substantial. Their shared documentation shows that Mistral Large 2 outperforms Llama 3.1 70B in math performance.
You can find more benchmarks on their website. Overall, Mistral Large 2 performs on par with the best state-of-the-art closed and open-source models across various benchmarks. Read more
Apple's AI Models
They are a family of AI models, including a 7 billion parameter model and a 1.4 billion parameter model, as part of the DataComp for Language Models project.
Open-source or not: Yep! Apple has released the model weights, training code, and pretraining dataset, making the project truly open-source.
High-Performing: The 7B model has outperformed Mistral-7B and is closing in on other leading open models, including Llama 3 and Gemma.
Commercial Use: The smaller 1.4B model has been released under Apache 2.0, allowing for commercial use, distribution, and modification. Read more
??Bonus: Two free webinars are coming in less than 48 hours!
Google Gemini's Upgrade: Enhanced Conversational AI Capabilities
领英推荐
This represents a substantial improvement, offering enhanced capabilities, faster response times, and expanded access. Its focus on contextual understanding, attention mechanisms, and reinforcement learning makes it a robust and adaptable conversational AI solution. Read more
Not entirely open-source. While Google has released some components and tools related to Gemini, the full model architecture and weights are not publicly available.
SearchGPT Prototype: AI-Powered Search Engine
This is an experimental AI-powered search engine developed by OpenAI, leveraging the capabilities of the GPT model. Some key features and aspects:
Kling AI Video Generator Now Available Globally
Kling AI (proprietary model), developed by Kuaishou, has launched globally, offering a multilingual AI video generator platform that supports text-to-video and image-plus-text-to-video generation. Key features include:
Sakana AI's Image Models
Sakana AI, a Tokyo-based startup, has released two image generation models, Evo-Ukiyoe and Evo-Nishikie, designed to generate traditional Japanese ukiyo-e artwork.
These models use AI to create images from text and image prompts, focusing on Japan's historic art form that flourished between the 17th and 19th centuries. Read more
Open-source or not: Yep, they are open-source. But please note 1) they are not intended for commercial use or deployment in mission-critical environments. 2) The models are considered experimental prototypes, and their performance and outcomes are not guaranteed.
This issue is brought to you in partnership with Auxi.
auxi is an AI-driven solution that streamlines presentation slide creation, offering expert assistance in content suggestions, design, and data analysis.
Trusted by over 350 institutions, including top global consulting and investment banking firms. They currently offer free trials, start here.
Insightful! Good to see the progress of AI so far.
AI is changing the world - I am here to supercharge that change | Connecting HR and Tech | Leading People & Product Initiatives
7 个月thanks for sharing! Do you know if Apple released models are going to be the same powering Apple Intelligence on iOS 18?
Very informative article Alex Wang!! AIxBlock is looking forward to being featured in your future blog!
Consultor de Tecnología | Automatizaciones | Whatsapp | IA Generativa
7 个月Thanks for sharing, Alex Wang!
Join FREE LinkedIn Course & Community on Skool (Click link in Bio)
7 个月You raise a great point about keeping up with AI news. It helps everyone stay informed and find new tools for learning. Thanks for sharing Alex Wang.