Visual Discovery in Retail: Operationalizing AI-Powered Visual Search at Boyner
Abstract
In today's retail landscape, where millions of products and visual stimuli compete for customer attention, the integration of artificial intelligence into visual search has emerged as a crucial lever of operational efficiency. This paper presents Boyner Group's AI-powered visual discovery system, which enables customers to search using photos instead of keywords, making product discovery more intuitive and visually engaging. The architecture leverages a hybrid approach combining Large Language Models (LLMs), vision models such as GroundingDINO, and vector-based semantic similarity engines like SigLIP+Milvus to deliver scalable and high-accuracy image retrieval. The system, currently operational across the Boyner.com.tr ecosystem, supports enhanced filtering and storytelling capabilities, increasing customer satisfaction and conversion rates. The implementation process, system components, and operational results of this large-scale AI integration are explored, highlighting its transformative impact within omnichannel retail.
Keywords: Visual Search, Multimodal AI, GroundingDINO, SigLIP, Milvus, Retail Intelligence, Semantic Search, AI in E-Commerce, Omnichannel Retail, Customer Experience
References
- 1.Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). ImageNet classification with deep convolutional neural networks. Communications of the ACM, 60(6), 84–90.
- 2.Kannan, P. K., & Li, H. (2017). Digital marketing: A framework, review and research agenda. International Journal of Research in Marketing, 34(1), 22–45.
- 3.Gu, J., Wang, Z., Kuen, J., et al. (2018). Recent advances in convolutional neural networks. Pattern Recognition, 77, 354–377.
- 4.Radford, A., Kim, J. W., Hallacy, C., et al. (2021). Learning Transferable Visual Models From Natural Language Supervision. In Proceedings of the 38th International Conference on Machine Learning (ICML).
- 5.Vaswani, A., Shazeer, N., Parmar, N., et al. (2017). Attention is all you need. Advances in Neural Information Processing Systems, 30.
- 6.Liu, S., Qi, L., Qin, H., et al. (2023). Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection. arXiv:2303.05499.
- 7.Wang, J., Zhu, Y., & Wang, Y. (2022). Milvus: A Purpose-Built Vector Database to Power Embedding-Based Applications. In Proceedings of the VLDB Endowment, 15(12), 3596–3603.
- 8.Zhang, J., Zhang, Z., & Wang, Y. (2021). FashionBERT: Text and Image Matching with Adaptive Loss for Cross-Modal Retrieval. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval.
Alacan, M., Dursun, S., Önel, B., Işıkkent, T., Çelik, S. (2025). Visual Discovery in Retail: Operationalizing AI-Powered Visual Search at Boyner. *Orclever Proceedings of Research and Development*, 7(1), 126-137. https://doi.org/10.56038/oprd.v7i1.742
Bibliographic Info
More from Orclever Proceedings of Research and Development
Single-Bath Dyeing of Blends of Cotton Fibers with New Generation Polyacrylonitrile Fibers with Reactive Dye in Line with the Target of Sustainable Production
Yıldıray Fatih Dilsiz, Seda Keskin, Rıza Atav
2025 · Vol 7 · Issue 1
The Green Step Upper: A Novel Sustainable Bonding Method Replacing Solvent-Based Adhesives in Footwear Upper Assembly
Baris Bekiroglu, Mustafa Yener
2025 · Vol 7 · Issue 1
Innovative Technological Strategies to Enhance Bioavailability in Germinated Grains
Ebru Bozkurt Abdik
2025 · Vol 7 · Issue 1
Graph-Based Customer Segmentation with GraphSAGE on a Customer–Vehicle Bipartite Network
Abdullah Sezdi, Metin Bilgin
2025 · Vol 7 · Issue 1
Natural Language Processing-Based Layered Reconciliation System for Financial Transaction Analysis
Dilara Hazırlar, Özlem Avcı, Mesut Tekir
2025 · Vol 7 · Issue 1
An Integrated Deep Learning Framework for Automated Quality Control and Process Optimization in Slasher Indigo Dyeing
Mohammad Muttaqi, Gizem Daskaya, Kerem Cakir
2025 · Vol 7 · Issue 1