Indic-CLIP 🖼️<->📝
Multimodal Vision-Language Model for Indic Languages (Hindi/Sanskrit)
Provide an image or text to retrieve corresponding matches, or perform zero-shot classification.
Note: This demo uses a small, fixed gallery for retrieval. Model checkpoint: best_valid_loss.pth
Sample Images (Click to Load)
Sample Text Queries (Click to Load)
Sample Images and Labels (Click to Load)
| Input Image | Candidate Labels (Comma-separated) |
|---|