Model Comparison and Selection for Computer Vision Project: Detecting Cassava Diseases and Classifying Symptoms
Casmir Anyaegbu
Data Scientist | Data Analyst |Sales Analyst | Python | Pandas |Seaborn | Machine Learning | R | SQL | Power BI | Tableau | Looker Studio| Excel | STATA | Eviews |Dashboard| Researcher
In this project, titled "Computer Vision: Detect Cassava Diseases, Identify & Classify Disease Symptoms," we aim to compare four models to determine the best fit for detecting and classifying cassava diseases through images. The models under evaluation are:
To determine which model best suits the project, we’ll compare them using the following metrics:
1. VGG-13 Fine-Tuned
Strengths: Fine-tuning VGG-13 leads to very high training accuracy, which suggests that the model has the capacity to learn intricate patterns from the training data.
Weaknesses: The high gap between training and test accuracy signals that the model is overfitting to the training data. The generalization to unseen test data is poor, making it unreliable in real-world applications.
2. VGG-13 Frozen
Strengths: VGG-13 Frozen performs slightly better than the fine-tuned version, with more stability in test accuracy and less pronounced overfitting. Freezing the pre-trained layers helps retain the learned general features.
Weaknesses: Despite better performance, the model still suffers from overfitting. Freezing layers limits its adaptability to specific features of cassava diseases, which might be causing the lower test accuracy compared to other models.
3. ResNet-18 Fine-Tuned
Strengths: Fine-tuning ResNet-18 provides the highest test accuracy (78.1%), outperforming both VGG models. The model's deeper architecture enables it to capture more complex features, which helps in better classification of cassava disease symptoms.
Weaknesses: Although ResNet-18 Fine-Tuned shows less overfitting, there is still a noticeable gap between training and test performance, suggesting that some generalization issues remain.
领英推荐
4. ResNet-18 Frozen
Strengths: ResNet-18 Frozen has stable performance, with no significant overfitting. This makes it a reliable option for tasks where stability is preferred over the highest possible accuracy.
Weaknesses: The test accuracy (72.2%) is lower compared to the fine-tuned version. Freezing the model limits its ability to adapt to the specific nuances of the cassava disease dataset.
Comparison and Conclusion
Based on the comparison:
Best Model: ResNet-18 Fine-Tuned
Among the four models, ResNet-18 Fine-Tuned stands out as the best fit for this project. It strikes the right balance between high test accuracy (78.1%) and manageable overfitting, making it the most effective model for detecting and classifying cassava disease symptoms.
Strengths and Weaknesses Recap:
In conclusion, ResNet-18 Fine-Tuned is the most balanced model for this computer vision project, offering high accuracy with manageable overfitting, making it a robust choice for detecting and classifying cassava diseases.
Data source: https://www.amdari.io/