Fault-tolerant deep networks by hardware-aware compression

Running

EPFL - École Polytechnique Fédérale de Lausanne

Organisational Unit

co-sponsored Research

02 December 2024

Duration: 24 months

Objective

Deep Neural Networks (DNNs) are becoming viable for diverse space applications such as Earth observation [1] and spacecraft pose estimation [2]. Deployment in space, however, raises the question of resilience to memory and input errors. Common steps toward resilience include fault-tolerant training [3-6] and ensemble models [7,8]. These techniques, however, do not consider the impact of the DNN architecture on fault tolerance. This is the overarching goal of this project. Considering the memory and computation constraints due to space deployment, we will investigate the influence of DNN compression strategies on error resilience. While a few works have studied fault tolerance for pruned or quantized DNNs, they have reached contradictory conclusions, showing either improved [9,10] or degraded [11,12] error resilience. We argue that these inconsistencies are due to the limited scope of these studies. In other words, they evidence that the impact of DNN compression is intertwined with the DNN architecture and the task at hand. In this project, we will investigate via 3 main axes: (i) A large-scale analysis of the effect of different quantization and knowledge distillation strategies, including our own [13-15], on fault tolerance, using a variety of DNN architectures and datasets; (ii) the incorporation of diverse faults occurring within the network and at the input, culminating in a space-environment fault injection pipeline; (iii) the study of the impact of hardware on fault tolerance, incorporating it in the compression strategy. In contrast to most studies focussed on image recognition [8-12,16-18], we will address spacecraft pose estimation, which has clear industry applications for on-orbit servicing and debris removal (e.g., Clearspace, Astroscale, Thales Alenia Space etc.). Thanks to access to ESA hardware platforms and testing facilities, validation will be carried out on diverse hardware (CPU, FPGA, GPU) using Klepsydra‘s software inference tools.

Contract number

4000146826

Programme

Discovery

OSIP Idea Id

I-2023-02144

Related OSIP Campaign

Open Discovery Ideas Channel

Main application area

Generic for multiple space applications

Budget

90000€

Fault-tolerant deep networks by hardware-aware compression

Fault-tolerant deep networks by hardware-aware compression

Overview

Events

Assessments

Switzerland

EPFL - École Polytechnique Fédérale de Lausanne

TEC-SFA

co-sponsored Research

Fault-tolerant deep networks by hardware-aware compression

Menu Overview Events Assessments

Switzerland

EPFL - École Polytechnique Fédérale de Lausanne

TEC-SFA

co-sponsored Research

Overview

Events

Assessments