Indic-CLIP 🖼️<->📝
Multimodal Vision-Language Model for Indic Languages (Hindi/Sanskrit)
Provide an image or text to retrieve corresponding matches, or perform zero-shot classification.
Note: This demo uses a small, fixed gallery for retrieval. Model checkpoint: best_valid_loss.pth