Agents are here. Open Source model R1 outperforms.
The product team's dilemma: the pressure to keep pace with user expectations and new capabilities.
Agents that can perform tasks on your behalf are no longer just a promise. This week, OpenAI launched Operator, an agent that can perform tasks on the web for you, and Perplexity released Perplexity Assistant (Android only for now), which can handle tasks such as writing emails and booking dinner reservations.
It’s still early days: Operator, for instance, still needs human input to get past CAPTCHAs, and it’s unclear how these agents will handle sensitive information like payment details. But we’re well on our way to a world where agents can handle daily tasks on our behalf.
Peeking under the hood, I’ve been digging into VLMs (vision language models), multimodal AI that can understand images and video and take action on them, and into how they will start to shape the way humans interact with technology. A good example is UI-TARS, a paper outlining automated UI interactions.
Open source models got a firm leg up this week with R1, a reasoning model from DeepSeek whose performance is on par with OpenAI’s o1. Being open source means it’s free and adaptable, and R1 is also incredibly efficient.
What folks are saying:
Last thoughts: product teams are in a dilemma. Consumer expectations around new AI capabilities are accelerating, but shipping something subpar is costly. Formerly beloved sound system company Sonos lost billions of dollars (nearly 40% of its value) after a poorly managed app rollout that sent customers into an uproar.
Must-Know News
While autonomous AI agents show promise, security researcher Bruce Schneier warns that rushing to deploy them creates significant risks. Giving AI systems permission to act on our behalf could enable sophisticated fraud and security breaches if proper safeguards aren't in place first. The focus should be on developing robust security frameworks before widespread adoption.