9 Essential Features for a Bounding Box Annotation Tool

There are plenty of image annotation platforms out there, and a bounding box tool seems like a simple enough functionality.

But here’s the thing—

The accuracy and quality of your bounding boxes define your model's performance, and you may need millions of them to bring the most accurate model in your use case to market.

Have you taken the time to consider every feature that will help you achieve this?


Bounding box annotations

A bounding box is an imaginary rectangle drawn around an object that describes its spatial location. It’s the most basic tool used for object detection and localization tasks.

Bounding box annotations contain the coordinates with information about where the object is located in the image or the video. They are suitable for uniformly shaped objects, low-compute projects, and objects that don’t overlap.
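As a concrete sketch, a box boils down to four numbers plus a class and optional attributes. Conventions vary by tool; the two layouts below are the ones popularized by COCO and Pascal VOC:

```python
# Two common bounding-box coordinate conventions and converters between them.
# xywh: top-left corner plus width/height (COCO style).
# xyxy: top-left and bottom-right corners (Pascal VOC style).

def xywh_to_xyxy(box):
    x, y, w, h = box
    return (x, y, x + w, y + h)

def xyxy_to_xywh(box):
    x1, y1, x2, y2 = box
    return (x1, y1, x2 - x1, y2 - y1)

def area(box_xywh):
    _, _, w, h = box_xywh
    return w * h

# A single annotation record: class name, box, and free-form attributes.
annotation = {
    "class": "scratch",
    "bbox_xywh": (10.0, 20.0, 30.0, 40.0),
    "attributes": ["occluded"],
}
```

Whichever convention a tool uses internally, it should export to both without losing precision.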

V7 allows you to draw bounding boxes with pixel-perfect precision, add attributes, copy-paste boxes of similar size, interpolate them in the video, and easily convert polygon masks to bounding boxes.

Easy Class Creation

The first thing to look out for is your class structure.

Making a box is easy, but—

How is that data stored?

Classes are the names of objects in a dataset. If you’re building a service to detect dents and scratches, you will want to make sure these two entries can be reused in new projects or branched out hierarchically as your data grows.

Here are a few must-have functionalities:

  • Bounding box classes can be used across projects.
  • Class creation has a simple enough interface to be used by non-technical users.
  • You can add at least one thumbnail to represent the object.
  • You can add a description or instructions for the class.
  • You can assign it a hotkey.
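As an illustrative sketch (the names and fields here are hypothetical, not V7's actual schema), a reusable class entry might carry everything on the checklist above:

```python
from dataclasses import dataclass, field
from typing import List, Optional

@dataclass
class AnnotationClass:
    """A reusable bounding-box class shared across projects."""
    name: str
    hotkey: Optional[str] = None                          # single-key shortcut
    description: str = ""                                 # labeler instructions
    thumbnails: List[str] = field(default_factory=list)   # example image paths

# A global registry lets other projects reuse the same entries.
registry = {}

def register(cls):
    """Add a class to the registry, refusing duplicate names."""
    if cls.name in registry:
        raise ValueError(f"class {cls.name!r} already exists")
    registry[cls.name] = cls

register(AnnotationClass("dent", hotkey="d", description="Surface deformation"))
register(AnnotationClass("scratch", hotkey="s"))
```

The key design point is that classes live outside any single project, so "dent" created today is the same "dent" your next project picks up.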

Below is the class creation experience on V7.

We kept our design language consistent and added rich info tooltips to inform users of what each functionality does because we understand that not everyone is familiar with computer vision terminology.

Boxes that feel good—Responsive interactions

Prior to building V7, we tested several bounding box tools in the market and found that most didn’t prioritize interaction design.

Placing and editing millions of bounding boxes requires a very smooth user experience.

Here are the things to look out for:

  • Are the corners and edges of boxes easy to move and adjust? Does this smoothness persist when images are full of annotations?
  • Are their coordinates sub-pixel accurate? Is this reflected in the export files?
  • Are they easy to interact with on touch interfaces?
  • Was the cursor designed to “snap” to corners to facilitate speedy edits?
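Corner snapping, the last point above, can be sketched in a few lines. The 8-pixel radius is an arbitrary illustrative threshold, not any tool's actual default:

```python
import math

def snap_to_corner(cursor, box_xyxy, radius=8.0):
    """Snap a cursor position to the nearest box corner within `radius` px.

    Returns the snapped (x, y) and the corner index (0-3, clockwise from
    top-left), or the original cursor and None when no corner is close enough.
    """
    x1, y1, x2, y2 = box_xyxy
    corners = [(x1, y1), (x2, y1), (x2, y2), (x1, y2)]
    best, best_i, best_d = cursor, None, radius
    for i, (cx, cy) in enumerate(corners):
        d = math.hypot(cursor[0] - cx, cursor[1] - cy)
        if d <= best_d:
            best, best_i, best_d = (cx, cy), i, d
    return best, best_i
```

In a real editor the same check would run against every visible box, which is exactly why performance with crowded scenes matters.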

Video interpolation

V7 supports videos and other series-like data, such as volumetric MRI or CT scans and time-lapses.

All of these allow you to interpolate boxes smoothly throughout a sequence.

Annotators want an experience that requires minimal tweaks on the timeline, with keyframes generated automatically wherever you edit boxes manually or with models.

Position keyframes should also be kept separate from attribute keyframes, so a bounding box can gain or lose attributes and other sub-annotations throughout the video while remaining part of the same instance.
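Linear interpolation between two position keyframes can be sketched as follows. Real tools may also support easing curves or model-assisted adjustment, so treat this as a simplified model:

```python
def lerp_box(box_a, box_b, t):
    """Linearly interpolate two xyxy boxes; t=0 gives box_a, t=1 gives box_b."""
    return tuple(a + (b - a) * t for a, b in zip(box_a, box_b))

def box_at_frame(keyframes, frame):
    """Interpolate a box at `frame` from a sorted list of (frame_index, box_xyxy)
    keyframes, clamping to the first/last box outside the keyframed range."""
    if frame <= keyframes[0][0]:
        return keyframes[0][1]
    if frame >= keyframes[-1][0]:
        return keyframes[-1][1]
    for (f0, b0), (f1, b1) in zip(keyframes, keyframes[1:]):
        if f0 <= frame <= f1:
            t = (frame - f0) / (f1 - f0)
            return lerp_box(b0, b1, t)
```

Every manual correction simply becomes a new keyframe, and the frames between keyframes cost the annotator nothing.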

Here are a few things to look out for:

  • Do bounding boxes interpolate, or alternatively, do they track objects? Tracking in image annotation isn’t as good as it may sound at first. Trackers focus on individual features (usually at the center of an object), while bounding boxes rely on the edges of an object being pixel-perfect. Trackers can therefore create more work than necessary, and you might need to adjust box edges for each frame.
  • Do bounding boxes preserve their instance ID throughout the video?
  • Can you add attributes in the video, and can they have keyframes so they can be switched on and off temporarily?
  • Is the video system frame accurate? HTML5 video players aren’t: they can have errors of up to 200 ms. Most video labeling tools rely on browser-based video players, so the exported box timecode may not match the original file. This can happen at any point in a video and is most prevalent in CCTV footage.
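The frame-accuracy point is easy to quantify: mapping a player timestamp back to a frame index multiplies any timing error by the frame rate. A sketch, using integer milliseconds to avoid floating-point surprises:

```python
def ms_to_frame(t_ms, fps):
    """Map a timestamp in milliseconds to the frame on screen at that time.
    Frame n covers the interval [n/fps, (n+1)/fps)."""
    return (t_ms * fps) // 1000

# A 200 ms player error at 25 fps shifts the labeled frame by 5 frames:
true_ms, reported_ms = 10_000, 10_200
drift = ms_to_frame(reported_ms, 25) - ms_to_frame(true_ms, 25)
```

Five frames is easily the difference between a car being inside or outside a CCTV camera's region of interest.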

Copy-paste, and other power user shortcuts

Do you have a few similar objects to annotate using bounding boxes?

Copy-pasting your boxes can be very handy for speeding up your annotation process. It also ensures that your annotations are consistent for the same objects located in different areas of an image or a video.

Are hotkeys a priority in your annotation tool?

Your labeling team should aim to turn everyone into a power user. Keyboard shortcuts yield more training data and less fatigue, and fatigue causes some of the hardest training data errors to spot.

Shortcuts to consider include switching classes, cycling between boxes, and cycling between the points of a box.

Some projects might require you to copy all your annotations from one image to another. It often happens when your dataset images are sequential.

Here are things to look out for in power user shortcuts:

  • Can you copy/paste boxes?
  • Can you paste only the attributes from one box to another?
  • Can you switch between any class without needing to move from keyboard to mouse?
  • Can you cycle between bounding boxes using a single key and camera controls to quickly perform a quality review?
  • Can you edit the size and position of boxes using your keyboard?
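A minimal sketch of copy-paste semantics, assuming each annotation is a plain record with an instance ID: pasted copies must receive fresh IDs so that editing one never mutates the other.

```python
import copy
import itertools

# Monotonic ID source for newly pasted annotations (illustrative).
_next_id = itertools.count(1)

def paste_annotations(annotations, target_frame):
    """Deep-copy a list of box annotations onto another frame, assigning
    fresh instance IDs so the copies are independent of the originals."""
    pasted = []
    for ann in annotations:
        clone = copy.deepcopy(ann)
        clone["id"] = next(_next_id)
        clone["frame"] = target_frame
        pasted.append(clone)
    return pasted
```

This is the "copy all annotations to the next image" workflow for sequential datasets: same classes, same boxes, new independent instances.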


Bounding box attributes and other sub-annotations

Attributes are simply annotation tags that can define the specific features of a given object.

Many object detection projects require labelers to add label attributes on top of the bounding box annotation, which helps describe a given object in greater detail.

For example, it’s common to add label attributes such as occluded, truncated, and crowded, indicating that annotated objects are in close relationship with other objects in the image.
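A sketch of how such flag-style attributes might be layered on top of a box record. The attribute names mirror the examples above; validating against an allowed set is an illustrative design choice, not any particular tool's behavior:

```python
# Attributes permitted for this class (illustrative set).
ALLOWED_ATTRIBUTES = {"occluded", "truncated", "crowded"}

def set_attribute(annotation, name):
    """Attach a flag-style attribute to an annotation, rejecting typos."""
    if name not in ALLOWED_ATTRIBUTES:
        raise ValueError(f"unknown attribute: {name}")
    annotation.setdefault("attributes", set()).add(name)

ann = {"class": "car", "bbox": (12, 30, 64, 48)}
set_attribute(ann, "occluded")
set_attribute(ann, "truncated")
```

Validating at insert time is what makes attributes reusable entries rather than free-text tags that drift apart over a million annotations.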

Here are things to watch out for in sub-annotations:

  • Can you add attributes to bounding boxes?
  • Are attributes re-usable in other classes (in other words, do they become entries as part of that class right away)?
  • Can attributes be inserted easily, and is there keyboard shortcut support for them? (Attribute application accounts for >50% of most bounding box labeling projects.)
  • Are other sub-annotation types similarly designed to support shortcuts?
  • Can you easily remove attributes during the review?
  • Can you edit them once they are created, and do these changes propagate to the whole dataset? Few annotation tools have dataset management capabilities, which means that if you change an attribute or class name after creating it, you might have to propagate the change to every annotation file with a script to avoid breaking changes. This can be incredibly frustrating, so it's always best to invest in a dataset management solution before you start any labeling project.

Visibility options for bounding boxes

Every annotation should have an editable Z-value: you simply drag an annotation to reorder it.

The same can be done in the video timeline, with an option to automatically adjust this order to save vertical space.

This is especially useful when you have hundreds of annotations, such as in sports analytics.

Image Manipulation Options

You can also adjust the box opacity, border opacity, and visual features of the image. The tool must also have windowing and color map options, which allow you to see elements of the image not visible to the naked eye on regular RGB monitors.
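Windowing can be sketched as a linear remap of a raw intensity range (e.g. Hounsfield units in CT) onto the displayable 0-255 range. The lung-window preset below is a common radiology convention, not a tool-specific default:

```python
def apply_window(value, center, width):
    """Map a raw intensity (e.g. Hounsfield units) to the 0-255 display
    range using a window defined by its center and width; values outside
    the window clip to pure black or white."""
    low = center - width / 2
    high = center + width / 2
    if value <= low:
        return 0
    if value >= high:
        return 255
    return round((value - low) / (high - low) * 255)

# A typical "lung window": center -600 HU, width 1500 HU.
LUNG_CENTER, LUNG_WIDTH = -600, 1500
```

Narrowing the window stretches a small intensity band across the full display range, which is how faint structures become annotatable at all.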

How many annotations can it handle?

Most JavaScript libraries aren’t made to handle the scale that AI projects bring to the table.

Make sure that your tool is tested for performance when hundreds of bounding boxes enter the scene. This is especially important in videos, where annotations must be kept in memory to ensure smooth playback.

Here are things to consider:

  • How many boxes per image can the tool handle?
  • What happens if all the boxes contain attributes? Does the limit change?
  • Have you tested the platform on a video or a high-resolution image with hundreds of boxes in the scene?
  • What features start to perform poorly when images become very busy?

Converting other annotations to Bounding Boxes

Some annotation formats such as COCO expect a bounding box to be around each polygon. Models like Mask R-CNN also benefit from this detector/segmenter approach. Moreover, you won’t have to make a box “around” a polygon, you can simply draw a polygon and use its “free” surrounding box to train a detector.
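The "free" surrounding box is just the min/max of the polygon's vertices:

```python
def polygon_to_bbox(points):
    """Axis-aligned bounding box (xyxy) enclosing a polygon's vertices.

    `points` is a sequence of (x, y) pairs; the result is the tight box
    COCO-style formats expect alongside each segmentation mask.
    """
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    return (min(xs), min(ys), max(xs), max(ys))
```

Because this conversion is lossless in one direction and free to compute, drawing polygons first lets a single pass of labeling feed both a segmenter and a detector.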

API functionalities and common bugs to watch out for

Ultimately, nothing is more dangerous than committing to a tool and hitting breaking bugs in its API halfway through your project.

Here are the most common bugs or feature failures that we’ve encountered across image annotation tools, in order of frequency:

  • The coordinates of imported boxes don’t align perfectly with the source image.
  • The tool doesn’t support common computer vision annotation formats or requires them to be modified.
  • You cannot export past a certain number of images in one go.
  • When a class or attribute is edited, these no longer show up in exports or their changes don’t propagate throughout a dataset.
  • When datasets become really large, database failures cause images to lose their annotations randomly.
  • Annotation histories are not preserved, so old versions cannot be restored in case of bad bulk changes.
  • The API doesn’t have a set of easy CLI commands.
  • You cannot import annotations into videos.
  • The bounding box coordinates round to the wrong pixel.
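That last rounding bug usually comes down to truncation (rounding toward zero) being used where nearest-pixel rounding was intended:

```python
def to_pixel_truncated(coord):
    """Buggy conversion: int() truncates toward zero, so an edge at 99.9
    lands on pixel 99 instead of 100."""
    return int(coord)

def to_pixel(coord):
    """Correct conversion: snap a sub-pixel coordinate to the nearest pixel."""
    return int(round(coord))
```

A one-pixel shift per box sounds harmless until it systematically clips the same edge of every object in your training set.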

These are all issues that at least one in ten customers switching from internal tools or other labeling platforms has faced.


