Needle in a Haystack: A Framework for Seeking Small Objects in Big Datasets
An example of a meme-style image from the subreddit /r/photoshopbattles.

Needle in a Haystack: A Framework for Seeking Small Objects in Big Datasets

Here's a recent result of our research on Provenance Analysis. This is a collaboration of University of Notre Dame (U.S.) and University of Campinas (Brazil).

Needle in a Haystack: A Framework for Seeking Small Objects in Big Datasets

Joel BroganAparna BharatiDaniel MoreiraKevin BowyerPatrick FlynnAnderson RochaWalter Scheirer

Images from social media can reflect diverse viewpoints, heated arguments, and expressions of creativity --- adding new complexity to search tasks. Researchers working on Content-Based Image Retrieval (CBIR) have traditionally tuned their search algorithms to match filtered results with user search intent. However, we are now bombarded with composite images of unknown origin, authenticity, and even meaning. With such uncertainty, users may not have an initial idea of what the results of a search query should look like. For instance, hidden people, spliced objects, and subtly altered scenes can be difficult for a user to detect initially in a meme image, but may contribute significantly to its composition.

We propose a new framework for image retrieval that models object-level regions using image keypoints retrieved from an image index, which are then used to accurately weight small contributing objects within the results, without the need for costly object detection steps. We call this method Needle-Haystack (NH) scoring, and it is optimized for fast matrix operations on CPUs.

We show that this method not only performs comparably to state-of-the-art methods in classic CBIR problems, but also outperforms them in fine-grained object- and instance-level retrieval on the Oxford 5K, Paris 6K, Google-Landmarks, and NIST MFC2018 datasets, as well as meme-style imagery from Reddit.


要查看或添加评论,请登录

Anderson Rocha的更多文章

社区洞察

其他会员也浏览了