TUdatalib : Fast Axiomatic Attribution for Neural Networks

Am Montag, 7.4.2025 wird TUdatalib wegen geplanten Wartungsarbeiten am Speichersystem von 9:00 bis voraussichtlich 9:30 nur eingeschränkt nutzbar sein (kein Datenupload und Download) | Due to scheduled maintenance on the storage system, using TUdatalib will be limited on Monday, April 7 2025 from 9:00 to approx. 9:30 (no data upload or download)

Description

Mitigating the dependence on spurious correlations present in the training dataset is a quickly emerging and important topic of deep learning. Recent approaches include priors on the feature attribution of a deep neural network (DNN) into the training process to reduce the dependence on unwanted features. However, until now one needed to trade off high-quality attributions, satisfying desirable axioms, against the time required to compute them. This in turn either led to long training times or ineffective attribution priors. In this work, we break this trade-off by considering a special class of efficiently axiomatically attributable DNNs for which an axiomatic feature attribution can be computed with only a single forward/backward pass. We formally prove that nonnegatively homogeneous DNNs, here termed X-DNNs, are efficiently axiomatically attributable and show that they can be effortlessly constructed from a wide range of regular DNNs by simply removing the bias term of each layer. Various experiments demonstrate the advantages of X-DNNs, beating state-of-the-art generic attribution methods on regular DNNs for training with attribution priors.

Collections

Explainable AI [2]

The following license files are associated with this item:

License description

Except where otherwise noted, this item's license is described as Apache License 2.0

Version	Item	Description version	Date	Summary
2	tudatalib/3389.2*		2023-08-04T09:26:36Z	Adding funding
1	tudatalib/3389		2022-01-19T09:36:04Z

Fast Axiomatic Attribution for Neural Networks

Count of file(s): 7

Date

Author

Type

Description

Subject

DFG subject classification

URI

Related third party funded projects

Related Resources

Collections

Explainable AI [2]

Version History

Fast Axiomatic Attribution for Neural Networks

Count of file(s): 7

Date

Author

Type

Metadata

Export

Description

Subject

DFG subject classification

URI

Related third party funded projects

Related Resources

Collections

Explainable AI [2]

Version History