Tech for Wildlife: MegaDetector (Part 2 of 2) – Improving MegaDetector results (Theoretical/Hypothetical)
About half a year ago, we had a meeting with Serge Belongie, where I represented Really A Robot and spoke about our Animal Detect plans and ideas (which have changed a lot since then). Serge is one of the leading figures in AI, and we in Denmark got lucky to have “imported” him as the director of the Pioneer Centre for AI. He has been deeply involved in AI for animals and wildlife for a while, including his work with iNaturalist and Merlin.
Anyways, enough fanboying – one of the takeaways from the meeting that stuck with me was: “Use MegaDetector as a benchmark and see how well you compare.”
Now a bit wiser, I realized I wouldn't want to collect millions of images, then label, customize, optimize, and train a neural network just to create a model worthy of competing against MegaDetector. You can read Part 1 of 2 of my MegaDetector series here to learn a bit about what MegaDetector is, along with some use cases.
The idea of benchmarking with MegaDetector really stuck with me, and instead of making my own competing model, I thought about seeing whether I could get even better results out of MegaDetector itself.
Understanding Neural Networks
To understand a bit about how neural networks work (without diving into backpropagation formulas like the pain I went through during my master’s thesis):
We have an input image. This image passes through the “layers” of the neural network, where different operations look for shapes, lines, and patterns using various image-processing steps. At each layer, neurons are activated, leading to the final layers where the result is presented. If you really want to learn more about neural networks, I highly recommend the 3Blue1Brown YouTube channel, which also helped me through my exams. He also has a video on backpropagation!
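To make the “layers” idea a bit more concrete, here is a toy sketch of a forward pass in plain NumPy. This is not MegaDetector's actual architecture (which is a convolutional object detector), just the general mechanism of an image flowing through layers of weighted operations and activations – all sizes and weights here are made up for illustration:

```python
import numpy as np

def relu(x):
    # Activation: a "neuron fires" when its weighted input is positive
    return np.maximum(0, x)

rng = np.random.default_rng(0)

# A tiny 8x8 grayscale "image", flattened into a 64-value vector
image = rng.random(64)

# Two made-up layers: 64 inputs -> 16 hidden neurons -> 3 output scores
W1, b1 = rng.standard_normal((16, 64)), np.zeros(16)
W2, b2 = rng.standard_normal((3, 16)), np.zeros(3)

hidden = relu(W1 @ image + b1)  # early layers extract simple patterns
scores = W2 @ hidden + b2       # final layer presents the result
print(scores.shape)  # (3,)
```

The key point for this post: the output depends directly on the input pixels, so changing the pixels (sharpness, contrast, gamma) can change what comes out the other end.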
Alright Hugo, get to the point and stop yapping about neural networks!
The Core Idea
MegaDetector has been trained on millions of images across different resolutions, camera brands, etc., but maybe there's an invisible bias towards some types of images. What if we modify our images to better match that bias, to increase our chances of getting a correct detection?
Just look at the GIF of images – a slight modification can completely erase a detection or strengthen it! (Please scroll down a bit more to see it.)
So, my hypothesis is: even though the neural network performs numerous operations to minimize image variance, would modifying my input have a direct effect on the output? The short answer is yes – it looks like it! The long one is #IWishIHadMoreTimeToPlayAround.
Testing the Hypothesis
Obviously, I had to at least partially test this out, which I did a while ago – and, stupid me, I didn't save the folder with the code and images. But I found some images in an email, along with some notes on the test and results:
Test Setup:
Training Process:
Does the detector find an animal?
How close is the bounding box compared to the ground truth?
What confidence does the bounding box have?
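The bounding-box check above is usually measured with the standard intersection-over-union (IoU) metric. My original test code is lost, so this is a fresh minimal sketch, with boxes as `(x1, y1, x2, y2)` tuples and variable names of my own choosing:

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    # Overlap rectangle (empty if the boxes don't intersect)
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0

# A detection "counts" if it overlaps the ground truth closely enough
print(iou((0, 0, 10, 10), (0, 0, 10, 10)))    # 1.0 (perfect match)
print(iou((0, 0, 10, 10), (20, 20, 30, 30)))  # 0.0 (no overlap)
```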
Based on these three parameters, I asked ChatGPT (sorry for the laziness) to write code to find the optimal settings of sharpness, contrast, and gamma that could correctly identify the most animals while maintaining or improving confidence levels and avoiding false positives. With these settings, I ran the tests on all 200 images (I should maybe have used only the validation data – my bad) and achieved an 88.5% detection rate (177 out of 200 images correctly finding animals), with no false positives!
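For anyone wanting to reproduce the idea, here is a minimal sketch of the preprocessing and search step using Pillow. The grid values are placeholders, and `score_detections` stands in for whatever MegaDetector-based scoring you choose (detections found, IoU, confidence) – it is a hypothetical callback, not a real API:

```python
from itertools import product
from PIL import Image, ImageEnhance

def apply_settings(img, sharpness, contrast, gamma):
    """Apply sharpness, contrast, and gamma adjustments to a PIL image."""
    img = ImageEnhance.Sharpness(img).enhance(sharpness)
    img = ImageEnhance.Contrast(img).enhance(contrast)
    # Gamma correction via a per-channel lookup table: out = in^(1/gamma)
    lut = [round((i / 255) ** (1 / gamma) * 255) for i in range(256)]
    return img.point(lut * len(img.getbands()))

def best_settings(img, score_detections):
    """Grid-search over placeholder values; score_detections is hypothetical."""
    grid = product([0.5, 1.0, 2.0],    # sharpness factors
                   [0.8, 1.0, 1.2],    # contrast factors
                   [0.8, 1.0, 1.25])   # gamma values
    return max(grid, key=lambda s: score_detections(apply_settings(img, *s)))
```

Note that searching for one global best setting over the whole training set (as I did) is exactly the “static numbers” limitation I discuss below.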
Conclusion and Discussion
As mentioned, even though I managed to get improved results, the work is still theoretical/hypothetical. I can't see myself finding the time anytime soon to continue the idea of "improving MegaDetector results". Therefore, I would love to see others get inspired to explore what can be done.
My test limitations:
Future Aspirations: I hope to see ways to find the “ideal” settings for each individual image, modifying them dynamically rather than with static numbers, to boost detection performance.
This could also be combined with other AI technologies, such as Generative Adversarial Networks (GANs), which can be used to colorize grayscale images or upscale images to enhance detection results, as I illustrated in my first post Recoloring of grayscale wildlife image.
Please let me know if you enjoy these hypothetical posts or if you'd prefer I stick to existing and developed AI technologies!
#TechForWildlife #AI #MachineLearning #WildlifeConservation #MegaDetector #NeuralNetworks #AIInnovation #WildlifeTechnology #DeepLearning #TechForGood #ArtificialIntelligence #WildlifeResearch #ConservationTech #Robotics #GenerativeAI