Single-file model formats for the SeC (Segment Concept) video object segmentation model, optimized for use with ComfyUI SeC Nodes.
| Format | Size | Description | GPU Requirements |
|---|---|---|---|
| SeC-4B-fp16.safetensors | 7.35 GB | Recommended - Best balance of quality and size | All CUDA GPUs |
| SeC-4B-fp8.safetensors | 3.97 GB | VRAM-constrained systems (saves 1.5-2GB VRAM) | RTX 30 series or newer |
| SeC-4B-bf16.safetensors | 7.35 GB | Alternative to FP16 | All CUDA GPUs |
| SeC-4B-fp32.safetensors | 14.14 GB | Full precision | All CUDA GPUs |
SeC (Segment Concept) uses Large Vision-Language Models for video object segmentation, achieving +11.8 points improvement over SAM 2.1 on complex semantic scenarios (SeCVOS benchmark).
Key features:
These models are designed for use with the ComfyUI SeC Nodes custom nodes.
Installation:
ComfyUI/models/sams/These are converted single-file versions of the original model:
Original Model: Developed by OpenIXCLab
Single-File Conversions: Created for ComfyUI SeC Nodes
Apache 2.0 (same as original SeC-4B model)
If you use this model in your research, please cite the original SeC paper:
@article{zhang2025sec, title = {SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction}, author = {Zhixiong Zhang and Shuangrui Ding and Xiaoyi Dong and Songxin He and Jianfan Lin and Junsong Tang and Yuhang Zang and Yuhang Cao and Dahua Lin and Jiaqi Wang}, journal = {arXiv preprint arXiv:2507.15852}, year = {2025} }