New Facial Recognition Tech Tested on Michelangelo's David A recent development in facial recognition technology has introduced a more compact and efficient system, tested on Michelangelo's iconic sculpture, David. This new method offers a lens-free alternative that utilizes less power without sacrificing image quality, compared to traditional 3D imaging technologies. Typically, facial recognition systems rely on intricate components, including lasers, lenses, and diffractive optical elements (DOEs), to project a grid of infrared dots onto a subject's face. However, these setups can be bulky, which presents challenges for integration into smaller devices like smartphones. The team led by Yu-Heng Hong, Hao-Chung Kuo, and Yao-Wei Huang aimed to revolutionize this process. They replaced the conventional dot projector with a low-energy laser paired with a flat gallium arsenide metasurface. The innovative design minimizes device size and energy usage by scattering laser light through a nanopillar pattern on the metasurface. Their prototype projects over 45,700 infrared dots—surpassing the dot density of traditional systems—onto subjects, accurately identifying details by analyzing the pattern. During testing, this system successfully identified a replica of Michelangelo's David with remarkable precision, utilizing a fraction of the power and space needed by standard setups. More info: Wen-Cheng Hsu et al, Metasurface- and PCSEL-Based Structured Light for Monocular Depth Perception and Facial Recognition, Nano Letters (2024). DOI: 10.1021/acs.nanolett.3c05002 This advancement underscores the potential of metasurfaces in delivering efficient, compact imaging solutions applicable in facial recognition, robotics, and augmented reality platforms. Luxand's FaceSDK brings cutting-edge innovation to facial recognition, much like the advanced tech recently tested on Michelangelo's David. Check out Luxand FaceSDK to see for yourself -> luxand.com/facesdk
关于我们
Founded in 2005, Luxand, Inc. is a privately owned company offering biometric identification solutions to businesses and consumers. The company develops and markets a complete set of tools, libraries and solutions to perform fully automatic recognition of human faces and facial features. Today, the company provides a broad range of facial feature recognition solutions to end-users and industrial customers. Luxand technologies are used at online entertainment portals, chat rooms and movie Web sites around the globe.
- 网站
-
http://www.luxand.com
Luxand Inc.的外部链接
- 所属行业
- 软件开发
- 规模
- 11-50 人
- 类型
- 私人持股
Luxand Inc.员工
动态
-
In March 2025, Google will add camera and screen viewing capabilities to its voice assistant, Gemini. These features will be available to Gemini Advanced users on Android devices as part of the $20/month Google One AI Premium plan. The Gemini-powered AI assistant will recognize objects through the smartphone camera and view the screen in real time. These updates were presented at the Mobile World Congress. For example, a user could activate the camera and ask Gemini for advice on the best glaze color for a vase. The assistant might suggest olive or blue shades. https://lnkd.in/eAXA3ABM #google #ai #gemini #news #technology #developers #development
-
How can you tell if the person you’re chatting with online is real or just an AI-driven illusion? And more importantly, how can you protect yourself from falling victim to these scams? In this blog post, we’ll break down the warning signs of AI-generated fraud and share expert tips to help you stay safe in the age of artificial intelligence. https://lnkd.in/ehbX93Gx #facerecognition #facialrecognition #API #apidevelopment #appdevelopment #development #developers #ai #aiapplications #cloudsolutions #facedetection #security #securitysolutions #securityservices #securityintegration #imagerecognition #biometrics #biometricsecurity #deepfake #frauddetection #fraudprevention #fraud #python #securityintegration #sdk #trends
-
Opera has introduced an AI agent designed to perform tasks in the browser on behalf of the user. The early version will be available for testing soon. The company demonstrated how its AI agent, Browser Operator, can add products to the cart, search for football match tickets under a specified price, and book hotel rooms on behalf of the user. Users can interact with the chatbot through the sidebar, where they can also track all the steps the AI assistant takes on the website. The AI agent will be available soon as part of the Feature Drop program for testing new AI features in the Opera Developer browser version. https://lnkd.in/eu7ZmzvN #ai #news #opera #developers #development
-
Google Releases Free Beta Version of AI Coding Assistant – Gemini Code Assist! Google has opened access to its AI-powered code editing tool for individual users. Gemini Code Assist, built on the Gemini 2.0 model, can autocomplete code, explain complex snippets, and provide suggestions via chatbot. To use it, users need to download Gemini Code Assist for Visual Studio Code, GitHub, or JetBrains and integrate it with their development environment. A Google account is required for authentication. Additionally, users must create a project in Google Cloud. The assistant supports over 20 programming languages, including C, C++, Go, Python, Java, JavaScript, Kotlin, and TypeScript. Users can access up to 180,000 AI-generated code suggestions per month for free. By comparison, GitHub Copilot offers a free limit of 2,000 AI-powered code completions and 50 chatbot messages. Before this release, Code Assist was only available to Google Cloud partner companies since April 2024. https://lnkd.in/e2C7pf4J #news #ai #google #gemini #developers #development
-
Anthropic has recently unveiled Claude 3.7 Sonnet, a pioneering hybrid reasoning AI model that seamlessly integrates rapid responses with in-depth, step-by-step analysis. This advancement allows users to tailor the model's reasoning process to their specific needs, enhancing both speed and accuracy. A standout feature of Claude 3.7 Sonnet is its exceptional coding capabilities. The model excels in understanding context and creative problem-solving, achieving an industry-leading 70.3% accuracy on the SWE-bench Verified benchmark. This positions it as a valuable tool for developers and enterprises seeking advanced AI solutions. In addition to Claude 3.7 Sonnet, Anthropic has introduced Claude Code, an agentic coding tool designed to actively collaborate on tasks such as searching, editing, and testing code. This tool aims to streamline coding workflows, making it easier for developers to manage complex projects. This release signifies a significant step forward in AI development, highlighting Anthropic's commitment to creating versatile models capable of addressing a wide array of tasks effectively. The integration of hybrid reasoning within a single model simplifies the user experience and sets a new standard for AI versatility. Learn more here: https://lnkd.in/eq-zvb5s #ai #news #anthropic #claude
-
OpenAI has introduced its “biggest and best” chat model yet—GPT-4.5, codenamed Orion. It is currently available in “research preview” mode for ChatGPT Pro subscribers and developers via API. According to OpenAI, GPT-4.5 is a “highly complex and resource-intensive model,” making it more expensive than GPT-4o. As a result, the company has not yet decided whether it will continue offering API access to it in the future. The pricing for GPT-4.5 is $75 per million input tokens and $150 per million output tokens. For comparison, GPT-4o costs just $2.50 per million input tokens and $10 per million output tokens. Starting the week of March 3, 2025, OpenAI plans to expand access to GPT-4.5 for Plus and Team subscribers, followed by Enterprise and Edu users. Unlike reasoning models in the “o” series, such as o1, GPT-4.5 does not “pause to think” before responding. OpenAI believes that, in the future, these two approaches—pretrained models and reasoning models—will complement each other. GPT-4.5 offers enhanced writing capabilities and a deeper understanding of the world, making interactions feel more “natural,” according to The Verge, citing OpenAI. The model is particularly adept at recognizing patterns and identifying connections, making it “ideal for writing, programming, and solving practical tasks.” At launch, GPT-4.5 does not support voice mode, video, or screen sharing in ChatGPT. https://lnkd.in/e9bh9FHU #openai #ai #chatgpt #chagpt45 #chatgpt4o #news #technology
-
-
Building a face recognition project in Python involves several key steps to ensure that the system can accurately detect and recognize faces. In this blog post we'll describe some key steps for building a face recognition project in Python. https://lnkd.in/egEcMUfP #python #ai #facerecognition #developers #development
-
Two AI agents on a phone call realize they’re both AI and switch to a superior audio signal ggwave Gibberlink mode 🤖 #ai #news https://lnkd.in/dDNfvbDi
AI Talking to AI?
https://www.youtube.com/
-
Face recognition cameras capture images of pigs as they approach their feeding stalls. The AI system swiftly identifies each animal and customizes their meals. More importantly, it monitors for signs of distress or illness, alerting farmers instantly to any issues. To learn more, take a look at our latest article! #luxand #facesdk #facerecognition