Moondream AI

Software Development

Seattle, WA · 527 followers

Promptable Vision AI that runs everywhere.

About us

Moondream is a computer-vision model that can answer real-world questions about images. It's tiny by today's standards, with only 1.6B parameters. That enables it to run on a variety of devices, including mobile phones and edge devices.

Website
https://moondream.ai
Industry
Software Development
Company size
11-50 employees
Headquarters
Seattle, WA
Type
Privately held
Founded
2024

Updates

  • Moondream AI reposted

    Joe Heitzeberg

    Working to Expand AI Tinkerers Globally

    What’s top of mind for Vik from Moondream AI right now? Coding agents. In this clip from our latest One-Shot episode, Vik shares his experience testing tools like OpenDevin, why it feels like “having an intern on call 24/7,” and what editor he’s actually using (spoiler: it’s not Cursor). A fun, unfiltered glimpse into how one of today’s top open-source builders actually works. Link in comments below.

  • Moondream AI reposted

    Joe Heitzeberg

    Working to Expand AI Tinkerers Globally

    Everyone has that “spark” moment. For Vik from Moondream AI, it was Stable Diffusion. In this clip from our latest One-Shot episode, Vik shares how open-source image generation reignited his passion for machine learning, years after leaving the field behind. What followed? A crash course in diffusion models, deep dives into Karpathy and Jonathan Whitaker’s videos, and ultimately, the launch of one of the most compelling open-source vision models out there. Watch this snippet to hear how one pivotal project sparked the start of Moondream, and why it matters. Link in comments below.

  • Moondream AI reposted

    Maria Kevin

    Software Developer

    I built an automatic disclaimer-adding tool for smoking and drinking scenes to experiment with automating video editing. For video editors, manually adding disclaimers frame by frame in a movie is not only time-consuming but also boring. This tool uses Moondream.ai, a small vision-language model, to analyze each video frame, detect smoking scenes, and automatically insert disclaimers at the right moments. While Moondream is quite accurate and efficient, it is trained on a wide variety of images, making it less optimized for this specific task and not suitable for processing an entire movie efficiently. This tool is not yet 100% accurate, but it is an early step towards automating video editing, just like coding automation. Future improvements: train a custom CNN model specifically for smoking and drinking detection to make it faster and more accurate. Source (GitHub): https://lnkd.in/d4q_F_y9 Output shown in video: https://lnkd.in/dG_Nxk5p Moondream: https://lnkd.in/dFWy5QvM
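The post does not show its code, so the following is only a minimal sketch of the per-frame idea it describes: flag frames, then merge neighbouring flagged frames into time intervals where a disclaimer would be overlaid. The `is_flagged` callable is a hypothetical stand-in for whatever detector is used (for example, a yes/no question put to a vision-language model); `disclaimer_intervals` and `min_gap_s` are likewise illustrative names, not taken from the linked repository.

```python
from typing import Callable, Iterable, List, Tuple, TypeVar

Frame = TypeVar("Frame")


def disclaimer_intervals(
    frames: Iterable[Frame],
    is_flagged: Callable[[Frame], bool],  # hypothetical detector, assumed for illustration
    fps: float,
    min_gap_s: float = 1.0,
) -> List[Tuple[float, float]]:
    """Return (start_s, end_s) intervals during which a disclaimer should be shown.

    Flagged frames separated by at most `min_gap_s` seconds are merged so the
    overlay does not flicker when the detector misses a few frames.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")

    intervals: List[Tuple[float, float]] = []
    for index, frame in enumerate(frames):
        if not is_flagged(frame):
            continue
        start = index / fps        # time at which this frame begins
        end = (index + 1) / fps    # time at which this frame ends
        if intervals and start - intervals[-1][1] <= min_gap_s:
            # Close enough to the previous interval: extend it.
            intervals[-1] = (intervals[-1][0], end)
        else:
            intervals.append((start, end))
    return intervals
```

Because the detector is passed in as a callable, the merging logic can be tried with plain booleans before any model is involved, and a faster task-specific classifier could later be swapped in without changing it.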

  • Moondream AI reposted

    Aastha Singh

    Camera Modelling Engineer @Qualcomm | Ex - AI Engineer @ SparkCognition | Ex- Research Scholar @UCBerkeley | Edge AI, PyTorch, Deepstream

    Excited to showcase my AI-powered bot. Built on NVIDIA Jetson Orin NX + ROSMASTER X3, optimized for real-time speech & vision AI using OpenAI Whisper & Moondream AI. Runs fast, accurate, & efficient on edge! Check out the repository: https://lnkd.in/gTUXtxtc Please reach out for more information. Let's push the limits of AI-powered robotics! #GTC25goldenticket NVIDIA NVIDIA AI NVIDIA Robotics #GTC25

  • Moondream AI

    527 followers

    Try out Moondream's Gaze Detection on your machine. https://lnkd.in/gV-uPiNa View the code: https://lnkd.in/g3QXeFfb

    AI

    478,995 followers

    Moondream’s Gaze Detection AI is here, a cutting-edge tool that tracks your gaze with pinpoint, near-human accuracy. The applications are endless, from preventing distracted driving to enhancing retail customer experiences and usability research. Paired with Moondream’s existing tools like object detection and visual question answering, it’s the ultimate solution for all your visual AI needs! All rights reserved to respective owner. DM for credits. Follow @AI for all the latest updates. #ai #aitips #aitools #machinelearning #technology

  • Moondream AI

    527 followers

    Check out how Driveline Baseball Enterprises, Inc leverages Moondream AI in their quality testing pipeline. Founder & CTO Kyle Boddy explains: "Moondream helps us preprocess motion capture videos, ensuring high-quality biomechanics reports for our coaches and athletes"

    Kyle Boddy

    Founder/CTO, Driveline Baseball (Special Advisor, Boston Red Sox)

    Using MoondreamAI's small vision model for Visual Question/Answer (VQA) quality testing in the Driveline Launchpads! We can use Moondream to preprocess motion capture videos to ensure we will get good quality biomechanics reports for coaches and athletes! #moondream #AI #VQA #programming #launchpad #CV #VLM

  • Moondream AI

    527 followers

    Moondream now brings structured outputs, gaze detection, enhanced OCR and textual understanding, all packed into our latest 2B Moondream model... we've fit 5 capabilities into our model, all while keeping the model size lean. Our model has improved performance across all capabilities. How do we stack up against similarly sized models? Check it out:

  • Moondream AI

    527 followers

    With a new Moondream release about to drop, we're announcing a new feature: Gaze Detection. Moondream will soon understand what people are looking at. We're already reaching near-SOTA results and approaching human performance. Watch as it tracks attention in scenes from popular shows, action shots from climbing, and anime...

    The numbers tell the story:

    • Near SOTA and nearing human performance (0.103 Avg L2 on the GazeFollow benchmark)
    • Zero cloud dependency: runs 100% locally
    • Seamless integration with existing Moondream capabilities
    • Works on everything from CCTV footage to anime

    You can now track and analyze where people are looking in any visual content, all while maintaining Moondream's lightweight footprint. This opens up entirely new possibilities for applications in driver safety, retail and sports analytics, education, and UX research that you can run anywhere.

    Moondream's real power comes from our growing suite of capabilities. We are a vertically integrated vision AI company, and our capabilities continue to improve. You can combine Gaze Detection with our existing capabilities and create vision-enabled applications more easily than ever before...

    • Object detection
    • Visual pointing
    • Image captioning
    • Visual Q&A

    All in one efficient package. See it in action: https://lnkd.in/gqQw7Nz5
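For readers unfamiliar with the Avg L2 figure quoted in the post above, the sketch below shows one simplified reading of it: the mean Euclidean distance between predicted and annotated gaze points, with coordinates normalized to the range 0 to 1 so that lower is better. This is an assumption for illustration only. The post does not describe its evaluation protocol, the GazeFollow benchmark has details (such as multiple annotations per image) that are not reproduced here, and `average_l2` is a hypothetical helper name.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float]  # (x, y) in normalized image coordinates, assumed 0..1


def average_l2(predictions: Sequence[Point], targets: Sequence[Point]) -> float:
    """Mean Euclidean (L2) distance between paired predicted and target points."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not predictions:
        raise ValueError("at least one point pair is required")

    # math.dist computes the Euclidean distance between two points (Python 3.8+).
    total = sum(math.dist(p, t) for p, t in zip(predictions, targets))
    return total / len(predictions)


if __name__ == "__main__":
    predicted: List[Point] = [(0.0, 0.0), (0.5, 0.5)]
    annotated: List[Point] = [(0.3, 0.4), (0.5, 0.5)]
    print(f"average L2 distance: {average_l2(predicted, annotated):.3f}")
```

Under this reading, a score of 0.103 would mean predictions land, on average, about a tenth of the normalized image extent away from the annotated gaze target.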

  • Moondream AI

    527 followers

    Turn any text description into precise visual coordinates. Moondream's lightweight model points to exactly what you're looking for. Moondream's pointing capability unlocks a new era of human-AI interaction - enabling assistive technologies that can precisely guide users, automation tools that understand natural instructions, and vision systems that can pinpoint exactly what matters in complex scenes. No more rigid bounding boxes or approximate regions. Develop with pointing now: https://lnkd.in/g6vaX6GN
