Large visual collections, such as paintings, photographs, drawings, and other forms of visual media, offer valuable insights into historical events, social life, and artistic expression. These collections are key to understanding how societies produce and use images to shape cultural meaning over time. Yet they remain difficult to study due to their sheer size, often consisting of hundreds of thousands of items, and their intrinsic complexity, including diverse visual features, contents, contexts, and metadata structures.