Open Source library for detect image faults

Open Source library for detect image faults

In the field of Computer Vision, the most challenging and time-consuming task is image validation and detecting issues within the image dataset before training a deep learning model. Detecting issues within the image dataset typically requires human intervention or manual inspection. However, when dealing with a large dataset, this process can be time-consuming and demanding.

Now, with the assistance of the CleanVision library, we can automate this process and complete it in significantly less time.

The CleanVision library help to detect following issues in image dataset :

  • Dark colour or low light images
  • High or sharp light images
  • Blurry images
  • Odd Aspect ratio images
  • Low information images
  • Exact duplicate images
  • Nearly duplicate images
  • Gray Scale images

As we all know, if our training image dataset contains defects, it can adversely impact the performance of our deep learning model, leading to poor results.

By adding just 4-5 lines of code, we can automate this task and significantly reduce the manual effort required for data validation and defect check within the team.

from cleanvision.imagelab import Imagelab

# Specify path to folder containing the image files in your dataset
imagelab = Imagelab(data_path="FOLDER_WITH_IMAGES/")

# Automatically check for a predefined list of issues within your dataset
imagelab.find_issues()

# Produce a neat report of the issues found in your dataset
imagelab.report()        

I have also invested time in exploring this library for detecting defects in image datasets, specifically focusing on datasets from Hugging Face and TorchVision. I meticulously documented my experiments in a Jupyter notebook. The complete project, including the Jupyter notebook, can be found at the following link.

https://github.com/sankalpvarshney/cleanvision

The credit for this remarkable invention goes to CleanLab, which has developed an extraordinary solution.

sulaiman mahmoud

Machine Learning Engineer/Deep Learning/Computer Vision/Data Scientist

1 年

Thank you for sharing

要查看或添加评论,请登录

Sankalp Varshney的更多文章

社区洞察

其他会员也浏览了