VisQ
(Visual Query)
On-device iOS application for composed image and video retrieval using a Qwen3-VL-2B runtime on iPhone. VisQ supports text search, reference-image + edit-prompt retrieval, and explainable "Why This Matched" reasoning for local media search.
- Searches indexed local photos and videos directly on iPhone.
- Runs composed retrieval with a reference image and natural-language edit instruction.
- Surfaces reasoning-backed explanation chips for retrieved matches.
- Adapts recent multimodal retrieval research into a product-facing mobile UX.
Based on our recent work CoVR-R: Reason-Aware Composed Video Retrieval, which brings reasoning into composed retrieval and inspired VisQ's local-first iPhone experience.
Available now on the Apple App Store for iPhone.








