VLM
-
Brand Tagging with VLMs
2025-11-15
Fast logo retrieval with SigLIP-2 embeddings, then strict JSON verification with LLaVA-OneVision-1.5. Uses a single Creative-Commons video as the running example.
-
ClipTagger-12B VLM: Frame Captioning Tutorial
2025-11-01
Step-by-step GPU-accelerated inference for ClipTagger-12b, including prompts, environment setup, and a BF16 PyTorch quickstart.