Automatic Algorithms for Medieval Manuscript Analysis

Pintus, Ruggero; Yang, Ying; Rushmeier, Holly; Gobbetti, Enrico

DSpace CRIS

DSpace-CRIS consists of a data model describing objects of interest to Research and Development and a set of tools to manage the data. Standard DSpace is used to deal with publications and data sets, whereas DSpace-CRIS involves other CRIS entities: Researcher Pages, Projects, Organization Units and Second Level Dynamic Objects (single entities specialized by a profile, such as Journal, Prize, Event etc; because any profile can define its own set of properties and nested objects)

Please use this identifier to cite or link to this item: https://dspace.crs4.it/jspui/handle/1138/43

Title:	Automatic Algorithms for Medieval Manuscript Analysis
Authors:	Pintus, Ruggero Yang, Ying Rushmeier, Holly Gobbetti, Enrico
Keywords:	Semantic Feature Extraction;Medieval Manuscript;Document Layout Analysis;Word spotting;Multi-spectral analysis
Issue Date:	2017
Project:	info:eu-repo/grantAgreement/EC/H2020/665091/EU/Scan4Reco/Scan4Reco/
Abstract:	Massive digital acquisition and preservation of deteriorating historical and artistic documents is of particular importance due to their value and fragile condition. The study and browsing of such digital libraries is invaluable for scholars in the Cultural Heritage field, but requires automatic tools for analyzing and indexing these datasets. We will describe a set of completely automatic solutions to estimate per-page text leading, to extract text lines, blocks and other layout elements, and to perform query-by-example word-spotting on medieval manuscripts. Those techniques have been evaluated on a huge heterogeneous corpus of illuminated medieval manuscripts of different writing styles, languages, image resolutions, amount of illumination and ornamentation, and levels of conservation, with various problematic issues such as holes, spots, ink bleed-through, ornamentation, and background noise. We also present a quantitative analysis to better assess the quality of the proposed algorithms. By not requiring any human intervention to produce a large amount of annotated training data, the developed methods provide Computer Vision researchers and Cultural Heritage practitioners with a compact and efficient system for document analysis.
URI:	http://hdl.handle.net/1138/43 http://dspace.crs4.it/jspui/handle/1138/43
Rights:	info:eu-repo/semantics/openAccess
Appears in Collections:	CRS4 publications

Files in This Item:

File	Description	Size	Format
igs2017-manuscripts.pdf	Main article	168,88 kB	Adobe PDF	View/Open

Show full item record

Google Scholar^TM

Check

DSpace CRIS

Files in This Item:

Google ScholarTM

Google Scholar^TM