Indic-CLIP 🖼️<->📝

Multimodal Vision-Language Model for Indic Languages (Hindi/Sanskrit)

Provide an image or text to retrieve corresponding matches, or perform zero-shot classification.

Note: This demo uses a small, fixed gallery for retrieval. Model checkpoint: best_valid_loss.pth

Sample Images (Click to Load)