Facebook's AI at Work Produces ConViT
Michael Spencer
A.I. Writer, researcher and curator - full-time Newsletter publication manager.
Facebook AI says its developed a new computer vision model called ConViT, which combines two widely used AI architectures — convolutional neural networks (CNNs) and Transformer-based models — in order to overcome some important limitations of each approach on its own.?
‘ConViT’, A Computer Vision Model That Improves Vision Transformers (ViT) With Soft Convolutional Inductive Biases
By leveraging both techniques, this vision Transformer-based model can outperform existing architectures, especially in the low data regime, while achieving similar performance in the large data setting.
While AI is still very biased, AI Researchers have been utilizing certain assumptions (inductive biases) that help to train machine learning models.
You can read their July, 2021 paper here.
Facebook's ConViT could be important in how Vision Transformers (ViTs) rely on more flexible self-attention layers, and have recently outperformed CNNs for image classification.
ConViT is a New Computer Vision Model
The Facebook AI Research team has developed a new computer vision model called?ConViT. The ConViT system combines two widely used architectures to overcome some important limitations of each approach on its own, namely convolutional neural networks (CNNs) and Transformer-based models.?
The resulting convolutional-like ViT architecture, ConViT, outperforms the DeiT on ImageNet, while offering a much improved sample efficiency.?
领英推荐
The goal of ConViT was to modify vision Transformers to encourage their networks act convolutionally. They introduced a soft inductive bias that allows the network model itself whether it wants to remain convolutional or not.
They did so by introducing gated positional self-attention (GPSA), where the model learns parameters that control how much standard content-based attention is used compared with an initialized position-based one.
Github: https://github.com/facebookresearch/convit
Paper: https://arxiv.org/pdf/2103.10697.pdf
While the future of Facebook is likely the Facebook Metaverse, Facebook AI will have to get moving if its wants to challenge ByteDance (Beijing's alternative to Facebook) in the future of how the metaverse actually works.
While the current AI is hopelessly biased for various reasons, the researchers hope that their ConViT approach will encourage the community to explore other ways of moving from hard inductive biases to soft inductive ones.
The success of deep learning over the last decade has largely been fueled by models with strong inductive biases, allowing efficient training across domains. What will the deep learning of the future bring?
Facebook is Hyping the Metaverse
Facebook has now successful demonstrated bridging the gap between CNNs and Transformers, by presenting a new method to “softly” introduce a convolutional inductive bias into the ViT.
The Metaverse thanks you for your attention. Zuckerberg (who now holds more centralized power at Facebook since the pandemic) envisions a metaverse as an all-encompassing online world where people can game, work and communicate in a virtual environment, often using VR headsets that has up until now largely been depicted in science fiction movies, not reality.
??International Business Law /Business Management #MBA #LLM #IBL
3 年Sounds Interesting! Is there any precise data how it might affect people’s interaction and behavior on the platform?:)
CMO (Global) & Head-Global Channels, Investor, Analyst, Public Relations at SEQURETEK. 38 years of Marketing, Sales and Technology Leadership experience @Cisco, IBM, HCL Tech, Novell, NTT Netmagic, Star TV, CSS Corp
3 年Wow
A.I. Writer, researcher and curator - full-time Newsletter publication manager.
3 年The concentration of $Trillion market caps and AI talent is actually very dangerous for the future of technology as it creates an unbalanced wealth inequality and makes the actions of central bank stimulus less effective as the U.S. stock market is basically weighted too much in just a few companies - who consolidate too much of the extra added liquidity. It turns out China understand what antitrust regulation actually means, not America, this not only creates an innovation bottleneck for the U.S. but serious economic issues for how unbalanced capitalism is becoming. Data capitalism it turns out is risky for the entire system even as these duopolies want to maintain their surveillance capitalism dominance - the more dominant they become - the worse it is for everyone.
Product Engineering/Management: Samsung/ex-Motorola/Application Development/Health/Security
3 年I believe its good breakthrough. Though I am still not convinced with idea of always wearing VR headsets. May be more innovation to make VR glasses available as simple lens would help!
CAD Senior Software Engineer/Project Manager | Windows/Web Software Developer | Full Stack MCPD Consultant - DISABLED | Retired 2013
3 年Good luck with all that. Just keep trying abd see where you get. You never know.