Pho(SC)-CTC—a hybrid approach towards zero-shot word image recognition

Ravi Bhatt; Anuj Rai; Sukalpa Chanda; Narayanan Chatapuram  Krishnan

doi:10.1007/s10032-022-00407-6

Profiles Research Units Publications

Articles

Pho(SC)-CTC—a hybrid approach towards zero-shot word image recognition

Ravi Bhatt, Anuj Rai, Sukalpa Chanda,

Published in Springer Science and Business Media Deutschland GmbH

2023

DOI: 10.1007/s10032-022-00407-6

Volume: 26

Issue: 1

Abstract

Annotating words in a historical document image archive for word image recognition purpose demands time and skilled human resource (like historians, paleographers). In a real-life scenario, obtaining sample images for all possible words is also not feasible. However, zero-shot learning methods could aptly be used to recognize unseen/out-of-lexicon words in such historical document images. Based on previous state-of-the-art method for zero-shot word recognition “Pho(SC)Net”, we propose a hybrid model based on the CTC framework (Pho(SC)-CTC) that takes advantage of the rich features learned by Pho(SC)Net followed by a “connectionist temporal classification” (CTC) framework to perform the final classification. Encouraging results were obtained on two publicly available historical document datasets and one synthetic handwritten dataset, which justifies the efficacy of Pho(SC)-CTC and Pho(SC)Net. © 2022, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

About the journal

Journal	International Journal on Document Analysis and Recognition
Publisher	Springer Science and Business Media Deutschland GmbH
ISSN	14332833

Authors (1)

Narayanan Chatapuram Krishnan
- Department of Data Science

About IIT Palakkad

Research & Development

Academics

Quick Find

About IIT Palakkad

Research & Development

Academics

Quick Find