Microsoft Phi-4, ChatGPT Vision, Grok Free?—?What’s Changing?
Welcome to This Edition of the DSA Newsletter! In this edition, we’ve got game-changing updates, cutting-edge tools, and exclusive training resources. Ready to master AI and take your skills to the next level? Dive in to explore the best of AI, and don’t forget to check out our AI Training Course below!
What’s new in?AI:
?? Master AI with our exclusive training sessions! Learn to harness cutting-edge tools, build prototypes, and become a leader in the AI space.
?? Explore AI Training Now ?? Have questions or ideas? Reply to this email and let’s chat!
ChatGPT Advanced Voice Mode gains vision capabilities
The Brief: OpenAI just launched a major upgrade to ChatGPT’s Advanced Voice Mode on Day 6 of its live stream event, enabling the AI to analyze and respond to live video input and screen sharing during conversations.
The details:
Why it matters: Seven months after its initial demo, OpenAI is finally delivering on the promise of visual understanding in conversational AI?—?moving ChatGPT beyond text and voice into true multimodal interaction. It’s been a big week for vision, with Gemini and ChatGPT Advanced Voice gaining some extremely powerful new capabilities.
Grok on ??: Faster, Smarter, and Now for?All!
The Brief: Grok AI, now sharper than ever, rolls out to all ?? users for free. Packed with groundbreaking features like real-time web search, citations, and the new Aurora image generator, Grok elevates the AI experience for both casual users and enterprises.
The details:
Why It Matters: Grok’s upgrades democratize access to high-functioning AI, blending creativity, knowledge, and utility. By integrating seamlessly into ??, Grok fosters smarter interactions, creative explorations, and deeper engagement.?
Try Grok on ?? for free and explore its powerful features like unfiltered reasoning, coding assistance, and stunning image generation. Premium+ users unlock even more capabilities! ?? Learn More & Sign Up
Anthropic’s Claude 3.5 Haiku is now generally available
The Brief: Anthropic quietly rolled out its fastest AI model, Claude 3.5 Haiku, to all Claude users on web and mobile platforms, expanding from its previous API-only availability?—?though no official announcement has been made.
The details:
Why it matters: It’s been a relatively quiet holiday season of releases for Anthropic compared to rivals. Although Haiku is impressive compared to previous generations, it doesn’t feel like a huge needle mover during a big week of AI releases?—?and it might take a launch of a top-tier 3.5 Opus to steal the spotlight from Google and OpenAI.
Microsoft releases small, powerful?Phi-4
The Brief: Microsoft just released Phi-4, a 14B parameter small language model that outperforms massive competitors like GPT-4o and Gemini Pro 1.5 in areas like mathematical reasoning despite a drastic size difference.
The details:
Why it matters: Microsoft’s Phi models continue to challenge the ‘bigger is better’ trend in AI, showing that smaller models can match or exceed the capabilities of larger ones?—?particularly in specialized areas. The AI future may not be about raw size but smarter architecture and training approaches that do more with less.
OpenAI’s Canvas goes public with new?features
The Brief: OpenAI just made Canvas available to all users, with the collaborative split-screen writing and coding interface gaining new features like Python execution and usability inside custom GPTs.
The details:
Why it matters: While this Canvas release may not be as hyped as the Sora launch, it represents a powerful shift in how users interact with ChatGPT, bringing more nuanced collaboration into conversations. Canvas’ Custom GPT integration is also a welcome sight and could breathe life into the somewhat forgotten aspect of the platform.
Apple Intelligence gets a big upgrade with iOS?18.2
The Brief: Apple just rolled out its biggest Apple Intelligence update yet, AI-powered emoji creation, image generation capabilities, Visual Intelligence with camera control, and more?—?alongside the broader integration of ChatGPT.
The details:
Why it matters: Apple Intelligence has been underwhelming so far, to say the least, but the ChatGPT integration brings the system closer to what users likely envisioned when upgrading their iPhones for the new AI tools. However, we’ll have to wait until 2025 for agentic Siri capabilities that can handle more complex actions.
Cognition launches Devin AI developer assistant
The Brief: Cognition Labs has officially launched Devin, its AI developer assistant, targeting engineering teams and offering capabilities ranging from bug fixes to automated PR creation.
The details:
Why it matters: Devin’s early demos felt like the start of a new paradigm, but the AI coding competition has increased heavily since. It’s clear that the future of development will largely be a collaborative effort between humans and AI, and $500/m might be a small price to pay for enterprises offloading significant work. Try it here.
Pika drops major 2.0 video?upgrade
The Brief: Pika Labs just released version 2.0 of its AI video generator, introducing a new ‘Ingredients’ tool that lets users incorporate their own images into AI-generated videos?—?alongside improved motion, prompting, and animation features.
The details:
Why it matters: Pika’s new upgrades are wild, continuing to move video outputs out of the ‘slot machine’ luck phase into a more customizable, personalized experience. While we patiently waited for Sora, the AI video scene leveled up in a major way?—?with Pika, Luma, Runway, Kling, Hailuo, and others dulling the impact of OpenAI’s latest release.
Anthropic analyzes real-world AI use with?Clio
The Brief: Anthropic introduced Clio, a new system that reveals patterns in how people actually use AI assistants worldwide, providing detailed insights into real-world AI adoption while maintaining user privacy.
The details:
Why it matters: AI assistants are becoming increasingly integrated into our daily lives, but each person leverages them in a different way?—?making this a fascinating window into how the tech is being used. Understanding the dominant real-world use cases can both help improve user experience and align development with actual user needs.
One orange gift that’s fit for?all
The Brief: Bring the best gift to your holiday festivities this year with the rabbit r1?—?an AI companion designed to grow more delightful over time thanks to continuous updates and new features that keep it fresh and engaging.
领英推荐
Whether young, young at heart, or in-between, gift r1 to:
Take advantage of the only discount of the year and secure r1s for everyone this holiday season.
Replit launches ‘Assistant’ for?coding
The Brief: Replit just officially launched its upgraded AI development suite, removing its Agent from early access and introducing a new Assistant tool, alongside a slew of other major platform improvements.
The details:
Why it matters: The competition in AI development has gotten intense, and tools like Replit continue to erase barriers, with builders able to create anything they can dream up. Both beginners and experienced devs now have no shortage of AI-fueled options to bring ideas to life and streamline existing projects.
ChatGPT gains ‘Projects’ for chat organization
The Brief: OpenAI launched Projects for ChatGPT on Day 7 of its ’12 Days of OpenAI’ event, a new organizational system that lets users group conversations, files, and custom instructions into individual workspaces with shared context.
The details:
Why it matters: While this isn’t the most groundbreaking feature (Anthropic released Projects for Claude in June), it’s important for user workflows?—?avoiding the dreaded need to refresh entire context and instruction prompts when starting new chats.
AI TRAINING
Turn your screenshots into working prototypes
The Brief: Claude Artifacts lets you create functional prototypes directly from screenshots or photos, bringing your ideas to life in minutes.
Step-by-step:
Pro tip: Use high-contrast images and break down complex interfaces into smaller components for better results.
>>.<<
Transform words into cinematic magic with?Sora
The Brief: OpenAI’s newly launched Sora AI video generator allows you to turn your text descriptions into realistic videos without cameras, actors, or editing software.
Step-by-step:
Pro tip: Test your concepts with shorter durations and lower resolutions first, then upgrade settings for your final version.
>>.<<
Practice job interviews with?ChatGPT
The Brief: ChatGPT’s Advanced Voice Mode can be turned into a personalized interview coach, conducting mock interviews and providing real-time feedback.
Step-by-step:
Pro tip: If you need more time to formulate your responses, you can customize how the AI responds in Custom Instructions.
<<.>>
Turn AI passion into a consulting career
The Brief: Innovating with AI’s new program, AI Consultancy Project, transforms AI enthusiasts into professional consultants?—?tapping into a market projected to reach $54.7B by 2032.
The 6-month program delivers:
NEW TOOLS &?JOBS
Trending AI?Tools
New AI Job Opportunities
QUICK HITS
Google announced Android XR, a new Gemini-powered operating system for mixed reality systems, with Samsung set to launch the first compatible headset codenamed ‘Project Moohan’ in 2025.
ChatGPT head of product Nick Turley discussed the platform’s future in an interview with The Verge, saying that chat-based interactions may soon feel as “outdated as ’90s instant messaging.”
Amazon Prime Video launched a new ‘AI Topics’ beta feature, using machine learning to group and recommend content based on viewers’ interests and watching habits.
xAI rolled out an upgraded version of Grok-2 to all X platform users, featuring tripled speed, improved multilingual capabilities, and integration of web search and advanced image generation features.
Meta’s FAIR released a suite of new AI research projects including Meta Motivo for embodied agent control and Meta Video Seal for video watermarking, alongside improved models for memory scaling and social intelligence.
OpenAI cofounder Ilya Sutskever warned that AI has reached ‘peak data’ during the NeurIPS conference, predicting a shift from current training methods to more autonomous, reasoning-based systems that will become increasingly unpredictable.
Google unveiled NotebookLM Plus with interactive audio features and Gemini 2.0 Flash integration, allowing users to verbally engage with AI hosts during Audio Overviews and access expanded enterprise capabilities.
OpenAI published new email correspondences and a timeline of events with Elon Musk, claiming that Musk initially wanted the company to be a for-profit entity despite active lawsuits.
DeepSeek released VL2, a new vision-language model family leveraging MoE architecture that performs similarly to rival models despite smaller sizes.
Anonymous-chatbot has returned to the LM Arena, which was previously used to test GPT 4o, sparking rumors of a potential GPT 4.5 or upgraded OpenAI model coming soon.
SPONSOR US
Get your product in front of over 800k+ AI enthusiasts
Our newsletter is read by thousands of tech executives, investors, engineers, managers, and business owners around the world. Get in touch today.
Want to sponsor us and get in front of 750k+ AI enthusiasts? Get in touch.
Looking for our Expert ChatGPT Prompt Guide? Download free.
Interested in podcasts? Check out ours here.
Go deeper? Join the TD8 AI University.