Making Art Historical Photo Archives Searchable - From the Physical Archive to the Digitized Archive
Expanded information will be added soon for each subproject. Complete information can always be found in the manuscript of my PhD thesis.
- Spearheaded the development of dhSegment, originally created to parse the documents of the Fondazione Cini (the main data source for my PhD). As a generic document processing pipeline, dhSegment is now widely used in the Venice Time Machine project and by many universities around the world.
- Carried out the complete semi-automatic parsing of the 340’000 documents in the photo archive of the Cini Foundation (separation of visual elements, OCR, metadata assignment, partial Linked Open Data linking, …)
- Performed a global visual analysis of the Cini photo archive, which automatically uncovered conflicting attributions for more than 1’600 artworks. These conflicts were previously unknown to archivists.
- Built, end to end, the first visual search engine for exploring the propagation of “form” in the Visual Arts, based on Convolutional Neural Networks. The system was continuously retrained on user feedback to improve its retrieval performance. Most of the interface and the exact same backend now power the visual search capabilities of the Diamond platform of the Time Machine project (section Images).
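
As an illustration of the attribution-conflict analysis mentioned above, here is a minimal sketch in Python. It is not the pipeline used in the thesis. It assumes that a visual matching step has already grouped photographs showing the same artwork, and it only flags groups whose catalogue cards disagree on the author. The function name, the record layout, and the sample values are all hypothetical.

```python
from collections import defaultdict


def find_attribution_conflicts(records):
    """Return the groups of matched photographs that carry more than one attribution.

    `records` is an iterable of (group_id, photo_id, attribution) tuples.
    `group_id` is assumed to come from an earlier visual matching step
    (hypothetical here): photos in the same group show the same artwork.
    """
    by_group = defaultdict(lambda: defaultdict(list))
    for group_id, photo_id, attribution in records:
        # Normalise case and surrounding whitespace so trivial spelling
        # differences on the cards are not reported as conflicts.
        key = attribution.strip().casefold()
        by_group[group_id][key].append(photo_id)

    # A group is a conflict when its photos disagree on the attribution.
    return {
        group_id: dict(attributions)
        for group_id, attributions in by_group.items()
        if len(attributions) > 1
    }


# Hypothetical sample data, for illustration only.
sample_records = [
    ("g1", "photo_001", "Artist A"),
    ("g1", "photo_002", "Artist B"),
    ("g2", "photo_003", "Artist C"),
    ("g2", "photo_004", " artist c "),
]
conflicts = find_attribution_conflicts(sample_records)
```

With this sample, only group `g1` is flagged, because the two cards in `g2` differ only in case and whitespace. A real archive would need more careful name normalisation (variant spellings, "workshop of", "after", and so on), which this sketch leaves out.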