Digitize historic architectural plans with OCR and NER transformer models

Löffler, Kevin (2023) Digitize historic architectural plans with OCR and NER transformer models. Other thesis, OST Ostschweizer Fachhochschule.

[thumbnail of HS 2023 2024-SA-EP-Löffler-AI-assisted Digitalization of Landscaping plans.pdf] Text
HS 2023 2024-SA-EP-Löffler-AI-assisted Digitalization of Landscaping plans.pdf - Supplemental Material

Download (4MB)

Abstract

The Swiss Archive for Landscaping Architecture, located at OST in Rapperswil, administers more than 100’000 historic plans. These plans need to be digitzed to make them accessible. This paper proposes a three-model architecture consisting of a layout model to find text on the plans, an optical character recognition model to extract the found words, and finally, a named entity recognition model to label the relevant words like the client, location, or date. K-means clustering is used to group the text blocks from the layout model into related blocks for OCR.

Different deep-learning models are compared and evaluated. The most suitable models are then retrained on the NVIDIA DGX-2 system in a custom-built apptainer image with different training strategies to improve their accuracy. Different pre- and post-processing techniques are employed to improve the accuracy of the pipeline.

The final image pipeline achieves an F1 score of 48% with 35% precision and 77% recall. The chosen NER model ”German BERT” scored an F1-score of 86% after re-training and the OCR pipeline extracted 54% of words correctly and 18% close to correct.

The insights from this SA can be applied to future projects to build an application usable by the archive, enabling it to catalog its documents and make them accessible to the world.

Item Type: Thesis (Other)
Subjects: Topics > Other
Area of Application > Image/Video Processing
Technologies > Programming Languages > Python
Metatags > IFS (Institute for Software)
Divisions: Bachelor of Science FHO in Informatik > Student Research Project
Depositing User: OST Deposit User
Contributors:
Contribution
Name
Email
Thesis advisor
Purandare, Mitra
UNSPECIFIED
Date Deposited: 16 May 2024 11:39
Last Modified: 16 May 2024 11:39
URI: https://eprints.ost.ch/id/eprint/1189

Actions (login required)

View Item
View Item