Enabling the use of SoCs embedding a Machine Learning accelerator in satellites and institutional missions

Running

co-sponsored Research

10 April 2024

Duration: 24 months

Objective

Low-cost and low-power accelerators for Machine Learning (ML) execution have recently been introduced on the market. By optimizing the data transfer and relying on dedicated functional units design, ML accelerators efficiency outperforms traditional computing devices. The power efficiency of ML accelerators can potentially enable the use of ML even in power-constraint small or nano satellites, institutional missions, and NewSpace applications that go from image processing and debrits detention, to segmentation and cloud screening. The goal of this project is to characterize the reliability and improve the availability of the novel commercial Adaptive Compute Acceleration Platforms (ACAPs) embedding a ML accelerator. Because of the unprecedented complexity of the heterogeneous architecture and of the software application, such a study has never been done before. We target the Xilinx EdgeAI Versal VE2302, that embeds the AI Core AIE, since various companies are currently evaluating it for future space systems and it is expected that the device will be proposed for both institutional missions and commercial satellites; however, questions about SEE effect remain. We exploit the synergy between expertise from material sciences, radiation reliability, computing architecture, and robotics to provide a thoughtful radiation response of the component. We will proceed with a bottom up reliability and availability study. First, we will perform experiments to measure the latch-up response of the VE2302. Then, we will characterize the availability of the VE2302 components by measuring the single event data corruptions and functional interrupts probabilities under operative mission conditions and running OBPMark-ML payload. Finally, we will perform ML specific application-level tests to identify the errors that cause critical misbehaviours and propose innovative solutions to increase the device availability by adding dedicated (software) redundancy to mask or remove errors.

Contract number

4000144416

Programme

Discovery

OSIP Idea Id

I-2023-00183

Related OSIP Campaign

Open Discovery Ideas Channel

Main application area

Generic for multiple space applications

Budget

90000€

(Embodied) AI

Resilience, Crisis and Security

Enabling the use of SoCs embedding a Machine Learning accelerator in satellites and institutional missions

Enabling the use of SoCs embedding a Machine Learning accelerator in satellites and institutional missions

Overview

Events

Assessments

Italy

University of Trento

TEC-EDD

co-sponsored Research

Enabling the use of SoCs embedding a Machine Learning accelerator in satellites and institutional missions

Menu Overview Events Assessments

Italy

University of Trento

TEC-EDD

co-sponsored Research

Overview

Events

Assessments