Pho(SC)Net: An Approach Towards Zero-Shot Word Image Recognition in Historical Documents

A. Rai; Narayanan Chatapuram  Krishnan; S. Chanda

doi:10.1007/978-3-030-86549-8_2

Profiles Research Units Publications

Conferences

Pho(SC)Net: An Approach Towards Zero-Shot Word Image Recognition in Historical Documents

A. Rai, , S. Chanda

Published in Springer Science and Business Media Deutschland GmbH

2021

DOI: 10.1007/978-3-030-86549-8_2

Volume: 12821 LNCS

Pages: 19 - 33

Abstract

Annotating words in a historical document image archive for word image recognition purpose demands time and skilled human resource (like historians, paleographers). In a real-life scenario, obtaining sample images for all possible words is also not feasible. However, Zero-shot learning methods could aptly be used to recognize unseen/out-of-lexicon words in such historical document images. Based on previous state-of-the-art methods for word spotting and recognition, we propose a hybrid representation that considers the character’s shape appearance to differentiate between two different words and has shown to be more effective in recognizing unseen words. This representation has been termed as Pyramidal Histogram of Shapes (PHOS), derived from PHOC, which embeds information about the occurrence and position of characters in the word. Later, the two representations are combined and experiments were conducted to examine the effectiveness of an embedding that has properties of both PHOS and PHOC. Encouraging results were obtained on two publicly available historical document datasets and one synthetic handwritten dataset, which justifies the efficacy of “Phos” and the combined “Pho(SC)” representation. © 2021, Springer Nature Switzerland AG.

Topics: Word recognition (58)% and Historical document (56)%

View more info for "Pho(SC)Net: An Approach Towards Zero-Shot Word Image Recognition in Historical Documents."

About the journal

Journal	Data powered by SciSpaceLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Publisher	Data powered by SciSpaceSpringer Science and Business Media Deutschland GmbH
ISSN	03029743

Authors (1)

Narayanan Chatapuram Krishnan
- Department of Data Science

About IIT Palakkad

Research & Development

Academics

Quick Find

About IIT Palakkad

Research & Development

Academics

Quick Find