SCE: Shared Concept Extractor to Explain a CNN's Classification Dynamics

Vidhya Kamakshi; Narayanan Chatapuram Krishnan

doi:10.1145/3632410.3632425

Published in Association for Computing Machinery

2024

Abstract

To better understand how accurate but opaque black-box models work, it is necessary to explain their internal workings in terms of human-interpretable image sub-regions known as concepts. Such an explanation provides insight into how these models perceive the sharing of concepts across related classes, as is frequently observed in the real world. With this objective in mind, the proposed work leverages an incremental Non-negative Matrix Factorization technique to extract shared concepts in a memory-efficient manner, thereby reflecting the sharedness of concepts across classes. The relevance of the extracted concepts to the prediction, as well as the extent to which each concept encodes primitive image aspects such as color, texture and shape, is estimated after training the concept extractor. This approach reduces training overhead and simplifies the explanation pipeline, enabling the elucidation of various concepts - some genuine, some spurious - by which different black-box architectures trained on the ImageNet dataset group and distinguish related classes. © 2024 ACM.
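The idea of extracting shared concepts incrementally can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a generic mini-batch NMF with multiplicative updates, where only small running statistics are kept instead of the full activation matrix. The class name `IncrementalNMF`, the helper `fit_coefficients`, and the synthetic "activations" are all hypothetical and stand in for non-negative CNN feature maps streamed class by class.

```python
import numpy as np

EPS = 1e-9  # guards against division by zero in the multiplicative updates


def fit_coefficients(batch, concepts, n_iter, rng):
    """Find non-negative coeffs with batch ~= coeffs @ concepts (concepts fixed)."""
    coeffs = rng.random((batch.shape[0], concepts.shape[0]))
    for _ in range(n_iter):
        coeffs *= (batch @ concepts.T) / (coeffs @ concepts @ concepts.T + EPS)
    return coeffs


class IncrementalNMF:
    """Hypothetical mini-batch NMF: one concept dictionary shared by all batches."""

    def __init__(self, n_concepts, n_features, seed=0):
        self.rng = np.random.default_rng(seed)
        self.concepts = self.rng.random((n_concepts, n_features))
        # Running sufficient statistics; their size does not grow with the data.
        self.wta = np.zeros((n_concepts, n_features))  # sum of coeffs.T @ batch
        self.wtw = np.zeros((n_concepts, n_concepts))  # sum of coeffs.T @ coeffs

    def partial_fit(self, batch, n_iter=50):
        """Update the shared concepts from one batch of non-negative activations."""
        coeffs = fit_coefficients(batch, self.concepts, n_iter, self.rng)
        self.wta += coeffs.T @ batch
        self.wtw += coeffs.T @ coeffs
        for _ in range(n_iter):
            self.concepts *= self.wta / (self.wtw @ self.concepts + EPS)
        return coeffs


# Synthetic stand-in for activations of two related classes that mix
# the same three underlying non-negative "concepts".
rng = np.random.default_rng(1)
true_concepts = rng.random((3, 16))
model = IncrementalNMF(n_concepts=3, n_features=16)
for class_id in range(2):
    activations = rng.random((40, 3)) @ true_concepts
    coeffs = model.partial_fit(activations)
```

Because every batch updates the same `concepts` matrix, a basis vector that is useful for several classes is reused rather than relearned per class, which is one plausible reading of "shared" in the abstract. The memory cost of the running statistics depends only on the number of concepts and features, not on how many images have been seen.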

About the journal

Journal	ACM International Conference Proceeding Series
Publisher	Association for Computing Machinery

Authors (1)

Narayanan Chatapuram Krishnan
- Department of Data Science
