Skip to main content

Results from SAM Video Testing

Test 1: Prompting One Sheep in a Herd

Prompted Image	Initial Mask

Observations

Timeframe: Between 0.03 and 0.04 seconds, the prompted object (black sheep) disappears from the frame.
Issue: When the object reappears, the model detects two different objects with the same mask (both black sheep). It correctly distinguishes the black sheep from the white ones but fails to separate the two black sheep.
Model Used: SAM2_TINY

Using the SAM2 Small Model

Analysis

Prompted Image	Segmented Image

Issues Identified

Tracks the sheep at the beginning	Overlaps when similar objects appear	Ambiguity with different objects in the same area

Fixes Implemented

Improvement: Switching to the large model resolves the issue of ambiguity, especially when multiple objects of the same class are in the frame.
Additional Enhancement: Applying a polynomial mask over the plain mask improved segmentation accuracy.

Improved Mask

Notes:

The large model is recommended for scenarios with multiple similar objects to enhance tracking accuracy and reduce ambiguity.
Using a more complex mask, such as a polynomial mask, helps in better distinguishing objects within the frame.

Suggestions for Further Improvement

Detailed Metrics: Include quantitative results such as segmentation accuracy, precision, recall, and F1 scores for each model.
Comparative Analysis: Consider providing a side-by-side comparison of the performance of different models (SAM2_TINY vs. SAM2 Small vs. SAM2 Large) on various test cases.
Visual Annotations: Add annotations to video frames and images to highlight issues like overlapping and ambiguity for easier understanding.

2nd Video

Unclear Mask and Improper Tracking

Test 1: Prompting One Sheep in a Herd
2nd Video