Tech Insights 2024 Week 33
Would you trust a robot to fix your dental issues? Maybe I am the minority, but I would definitely try it, especially if it could significantly reduce the time you need to sit still in that chair while someone is drilling in your mouth. Dental-robots are coming, and you can read more about it below. Also, Black Forest Labs Flux that I wrote about last week has been released, and wow the images it creates really are as good as they promised. You can see two generated photos below and try it out online for free. Finally, we now have a language model that can both listen and speak at the same time in real-time, solving one of the last puzzles of engaging discussions. Next year we will all be talking to AI models in our phones like we have been doing it for ages, and it will feel as natural to us as talking to a close friend.
WANT TO RECEIVE THIS NEWSLETTER AS A WEEKLY EMAIL?
If you prefer to receive this newsletter as a weekly email straight to your inbox, you can sign up at: https://techbyjohan.com/newsletter/ . You will receive one email per week, nothing else, and your contact details will never be shared with any third party.
THIS WEEK'S NEWS:
Robot Dentist Performs World's First Fully Automated Procedure
The News:
My take: Going to the dentist is never fun, but having the time cut from 2 hours into just 15 minutes is amazing, and we will probably see similar improvements to other dental procedures. I guess the main question is how it will relate to pain - everyone reacts differently and it's never completely painless when you have things to correct in your mouth. Still, it's an interesting concept and it will be interesting to see how long it takes before it gets regulatory approval.
Black Forest Labs Flux Realism + LoRA Creates Stunning AI Photos
The News:
My take: Well this is it. I can no longer identify AI generated photos, and the models will only get better from here. There are definitely mixed feelings about this - sure it's a great technological achievement, but also the feeling that you no longer for real can trust anything you see online, in print, or anywhere.
Google Creates Ping-Pong Playing Robot
The News:
My take: Table tennis is a fast-paced game that requires both high adaptability and skills to master. The fact that the robot beat 55% of all intermediate players is amazingly good, and demonstrates impressive progress in areas like real-time decision making, precise motor control, and adaptability.
领英推荐
Alibaba Claims Top Spot in AI Math Models with Qwen2-Math
The News:
My take: Another great release by Alibaba DAMO Academy, this time using the Apache 2.0 license! The use of Chain-of-Thought prompting is interesting as it mimics human problem-solving strategies, and other models will probably apply this concept shortly.
Meta Announces $2 Million in Llama 3.1 Impact Grants
The News:
My take: With it's current leadership Meta seems more committed than ever to open-source, and this Impact Grants program will help build a global community of developers and innovators that use their platforms. So far I have only good things to say about how Meta approaches AI and innovation, and this program only strengthens that view.
OpenAI Introduces Structured Outputs in API
The News:
My take: For anyone using ChatGPT to create applications that require consistent and structured data outputs this 100% reliability claim is a game-changer. Use cases include dynamically generating user interfaces and extracting structured data, and I can see this being used on a wide scale for research purposes and industrial applications.
AI Model Achieves Real-Time Listening and Speaking Capabilities
The News:
My take: While the new Voice Mode in ChatGPT allows you to interrupt the model while it is talking, being able to both listen and speak at the same time is critical in human dialog to allow engaging discussions. This could potentially revolutionize human-AI interactions - making conversations with machines feel truly natural and responsive.
Sign me up for toothbot! That’s Actually Useful(tm) I’ll give Flux a miss though. This is the same dev team that ensured an infinite supply of CSAM for the world under Emad Mostaque’s watch. A product which is being used to groom kids in encrypted Facebook chats and overwhelming FBI investigators, occupying them from policing actual crime. A highly toxic and damaging product — not to mention the blackbox data sourcing now being protested as illegal by a million organized creative professionals in the EU and UK as of last week, and the lack of watermarks.