Results from SAM Video Testing
Test 1: Prompting One Sheep in a Herd
| Prompted Image | Initial Mask |
|---|---|
![]() | ![]() |
Observations
-
Timeframe: Between 0.03 and 0.04 seconds, the prompted object (black sheep) disappears from the frame.
-
Issue: When the object reappears, the model detects two different objects with the same mask (both black sheep). It correctly distinguishes the black sheep from the white ones but fails to separate the two black sheep.
-
Model Used: SAM2_TINY
Using the SAM2 Small Model
Analysis
| Prompted Image | Segmented Image |
|---|---|
| |
Issues Identified
| Tracks the sheep at the beginning | Overlaps when similar objects appear | Ambiguity with different objects in the same area |
|---|---|---|
| | |
Fixes Implemented
- Improvement: Switching to the large model resolves the issue of ambiguity, especially when multiple objects of the same class are in the frame.
- Additional Enhancement: Applying a polynomial mask over the plain mask improved segmentation accuracy.

Notes:
- The large model is recommended for scenarios with multiple similar objects to enhance tracking accuracy and reduce ambiguity.
- Using a more complex mask, such as a polynomial mask, helps in better distinguishing objects within the frame.
Suggestions for Further Improvement
- Detailed Metrics: Include quantitative results such as segmentation accuracy, precision, recall, and F1 scores for each model.
- Comparative Analysis: Consider providing a side-by-side comparison of the performance of different models (SAM2_TINY vs. SAM2 Small vs. SAM2 Large) on various test cases.
- Visual Annotations: Add annotations to video frames and images to highlight issues like overlapping and ambiguity for easier understanding.
2nd Video
Unclear Mask and Improper Tracking

