登录查看更多内容

AI News Bytes: Meta AI introduces SAM (Segment Anything Model); UC Berkeley Researchers Introduce Koala; Meet OpenFlamingo.....

Asif Razzaq

AI Research Editor | CEO @ Marktechpost | 1 Million Monthly Readers and 52k+ ML Subreddit

发布日期: 2023年4月15日

Sponsor??|??Join Discord?|??Join 18K+ ML SubReddit

Meta AI introduces SAM (Segment Anything Model): A Foundation model for image segmentation.?Meta AI team released both their general?Segment Anything Model (SAM)?and?Segment Anything 1-Billion mask dataset (SA-1B), the largest ever segmentation dataset, to enable a broad set of applications and foster further research into foundation models for computer vision. They are making the SA-1B dataset available for research purposes, and the Segment Anything Model is available under a permissive open license (Apache 2.0). Check out the?demo to try SAM?with your own images.

UC Berkeley Researchers Introduce Koala:?A Dialogue Model for Academic Research. Koala is a chatbot trained by fine-tuning Meta’s LLaMA on dialogue data gathered from the web. The researchers describe the dataset curation and training process of their model, and also present the results of a user study that compares our model to ChatGPT and Stanford’s Alpaca. The results show that Koala can effectively respond to a variety of user queries, generating responses that are often preferred over Alpaca, and at least tied with ChatGPT in over half of the cases.

What is AI Hallucination? What Goes Wrong with AI Chatbots? How to Spot a Hallucinating Artificial Intelligence?:?The phenomenon known as artificial intelligence hallucination happens when an AI model produces results that are not what was anticipated. Be aware that some AI models have been taught to purposefully make outputs without connection to real-world input (data). Hallucinations may occur in big language-based models like ChatGPT and its equivalents due to improper transformer decoding (machine learning model). Using an encoder-decoder (input-output) sequence, a transformer in AI is a deep learning model that employs self-attention (semantic connections between words in a sentence) to create text that resembles what a human would write.

Forget plugins. ChatGPT can solve general computer tasks using a keyboard and mouse!!-?The trick? Recursively criticizing and improving the output (RCI):?The RCI approach significantly outperforms existing LLM methods for automating computer tasks and surpasses supervised learning (SL) and reinforcement learning (RL) approaches on the MiniWoB++ benchmark. RCI is competitive with the state-of-the-art SL+RL method, using only a handful of demonstrations per task rather than tens of thousands, and without a task-specific reward function. Furthermore, we demonstrate RCI prompting's effectiveness in enhancing LLMs' reasoning abilities on a suite of natural language reasoning tasks, outperforming chain of thought (CoT) prompting. We find that RCI combined with CoT performs better than either separately.

领英推荐

What the future may look like with AI

Sudip Majumder 1 年前

Generative AI and the Role of Chat GPT

Vishal Singh 1 年前

What is Generative AI and How It Could Aid Your…

VARTEQ Inc. 1 年前

Meet SwissBERT:?Researchers from the University of Zurich propose a multilingual language model for Switzerland. SwissBERT is a masked language model created specifically for processing Switzerland-related text. SwissBERT is a pre-trained model that is adapted to news articles written in the national languages of Switzerland -- German, French, Italian, and Romansh. The research team evaluated SwissBERT on natural language understanding tasks related to Switzerland and find that it tends to outperform previous models on these tasks, especially when processing contemporary news and/or Romansh Grischun. Since SwissBERT uses language adapters, it may be extended to Swiss German dialects in future work. The model and our open-source code are publicly released.

Meet OpenFlamingo:?A framework for training and evaluating large multimodal models (LMMs) capable of processing images and text.?OpenFlamingo?is an open-source framework that aims to democratize access to state-of-the-art Large Multimodal Models (LMMs) by providing a system capable of handling various vision-language tasks. Developed as a reproduction of DeepMind’s Flamingo model, OpenFlamingo offers a Python framework to train Flamingo-style LMMs, a large-scale multimodal dataset, an in-context learning evaluation benchmark, and the first version of OpenFlamingo-9B model based on LLaMA.

Databricks Open-Sources Dolly:?A ChatGPT like Generative AI Model that is Easier and Faster to Train.?Dolly?is a low-cost large language model (LLM) that demonstrates surprisingly high levels of the instruction-following abilities seen in ChatGPT. This work indicates that anyone with access to high-quality training data and an out-of-date open-source large language model (LLM) can train it to perform like ChatGPT in under 30 minutes on a single machine. Dolly uses data from Alpaca to make minor adjustments to an existing, open-source 6 billion parameter model from EleutherAI to elicit instruction following capabilities such as brainstorming and text production.

LLMs Can Outperform Humans on Data Annotation:?The University of Zurich researchers used 2,382 tweets to compare the performance of ChatGPT with crowd-workers and trained annotators for various annotation tasks. ChatGPT was found to outperform crowd-workers in relevance, stance, topics, and frames detection, with its zero-shot accuracy being higher than that of crowd-workers for four out of five tasks. The intercoder agreement of ChatGPT was also higher than that of both crowd-workers and trained annotators for all tasks. Additionally, the cost per annotation with ChatGPT was less than $0.003, making it twenty times cheaper than using MTurk. These findings demonstrate the potential of large language models in significantly improving the efficiency of text classification.

Do You Know?Marktechpost?has a community of?1.5 Million+?AI Professionals and Engineers?

Sponsor??|??Join Discord?|??Join 18K+ ML SubReddit

AI News Bytes

10,475 位关注者

Angshool Deka

Student at Cotton University

1 年

Thanks for sharing this valuable information Asif

1 次回应

Dr. Martha Boeckenfeld

CEO & Founder, Top 100 Women of the Future | AI, Web3, Metaverse - Advisor & Deep Tech Impact Investor | Keynote Speaker | Masterclass | Leading Human-Centric Tech for Global Change

1 年

Thanks for sharing- very valuable insights into latest developments Asif Razzaq!

1 次回应

查看更多评论

要查看或添加评论，请登录

查看全部

AI News Bytes: Meta AI introduces SAM (Segment Anything Model); UC Berkeley Researchers Introduce Koala; Meet OpenFlamingo.....

Asif Razzaq

AI Research Editor | CEO @ Marktechpost | 1 Million Monthly Readers and 52k+ ML Subreddit

领英推荐

AI News Bytes

10,475 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

To Business Leaders of the Future,

4 charts that suggest there is no slowing down for AI progress

The Evolution of Generative AI and its applications

Upend Digital: 2023 Digital Transformation Trends for Enterprises

What is Chat GPT?

101 Best Chat GPT Prompts for Training and Development in 2024

Can AI solve every problem?

Artificial Intelligence: An introduction to the faculty of CGSS

Nurturing AI To Create Superhero's and Not Supervillains: How To Make The Most Of Our Infant Technology

Strawberry: A New AI Model with the Potential to Revolutionize Many Fields

领英推荐

AI News Bytes

10,475 位关注者

AI Research Updates: Q-GaLore Released + Lynx + NuminaMath 7B TIR Released + AgentInstruct + and many more...

2024年7月17日

Here are 15 Super ?? Cool AI Research Papers ALONG with SUMMARY from Microsoft (2024)

2024年3月11日

Here are 11 Super ?? Cool AI Research Papers ALONG with SUMMARY from CMU (2024)

2024年3月3日

Here are 9 Super ?? Cool AI Research Papers ALONG with SUMMARY from Apple (2024)

2024年2月22日

?? What is Trending in AI Research?:

2023年10月8日

?? What is Trending in AI Research?: InstaFlow + ReLU vs. Softmax in Vision Transformers + OmnimatteRF + DeciDiffusion 1.0...

2023年10月1日

?? What is Trending in AI Research?: PromptTTS 2 + CoALA + BigVSAN + Verba + Persimmon-8B + Falcon 180B + AskIt...

2023年9月13日

?? What is Trending in AI Research?: IP-Adapter + FineRecon + PUMA + DeciCoder + SeamlessM4T....

2023年8月28日

AI News Bytes: Tired of trying to get RL to work with Human Feedback? Try this new method - SLiC; LLMs Outperform Reinforcement Learning- Meet SPRING

2023年6月3日

AI News Bytes: The first Open-Source Text2video 1.7 billion parameter diffusion model; Meet Instruct-NeRF2NeRF; Memoji on Steroids.....

2023年3月29日

社区洞察

其他会员也浏览了

To Business Leaders of the Future,

4 charts that suggest there is no slowing down for AI progress

The Evolution of Generative AI and its applications

Upend Digital: 2023 Digital Transformation Trends for Enterprises

What is Chat GPT?

101 Best Chat GPT Prompts for Training and Development in 2024

Can AI solve every problem?

Artificial Intelligence: An introduction to the faculty of CGSS

Nurturing AI To Create Superhero's and Not Supervillains: How To Make The Most Of Our Infant Technology

Strawberry: A New AI Model with the Potential to Revolutionize Many Fields