ADVIS-DistriMuSe-SR: Real-Time Anomaly Detection for Safe Human–Robot Interaction

ADVIS-DistriMuSe-SR is a real-time anomaly detection and visual intelligence system developed for the DistriMuSe Use Case 3 — Safe Interaction with Robots.

GitHub Repo

Project Summary

The project focuses on monitoring industrial safety zones in collaborative robotics environments. It uses VAE/VAE-GAN models to learn normal visual patterns and detect unexpected or unsafe conditions through reconstruction-based anomaly scoring.

The system is designed for real-time deployment with:

ROS2 image streams
Safety-area-specific models
Threshold calibration
GUI-based inference
RuleX-compatible message publishing

Safety Areas

Safety Area	Description
`RoboArm`	Robot arm zone
`ConvBelt`	Conveyor belt zone
`PLeft`	Left personnel safety area
`PRight`	Right personnel safety area

Pipeline

Raw Video / ROS Stream
        ↓
Frame Extraction + Masking
        ↓
Safety Area Cropping / Resize
        ↓
VAE-GAN Training
        ↓
Threshold Calibration
        ↓
Live Inference / Alert Publishing

Safety Monitoring Areas

The industrial scene is decomposed into multiple monitored safety zones.

Safety Area	Description
RoboArm	Robotic arm operational zone
ConvBelt	Conveyor belt monitoring area
PLeft	Left personnel interaction area
PRight	Right personnel interaction area

Each zone is processed independently using dedicated anomaly detection models.

Deep Learning Models

VAE / VAE-GAN Framework

The system uses Variational Autoencoder-based models trained only on normal operating conditions.

During inference:

Normal patterns are reconstructed correctly
Unexpected conditions generate larger reconstruction errors
Reconstruction errors are converted into anomaly scores

Core Components

Encoder
Latent representation
Decoder
Reconstruction loss
Adversarial discriminator (VAE-GAN variant)

Threshold Calibration

The framework supports threshold calibration across multiple anomaly scoring strategies.

Features

Multiple reconstruction-error metrics
Quantile-based thresholding
Local neighborhood tolerance
Area-specific calibration

Supported Analysis

Threshold sweeping
Quantile optimization
Validation-based tuning

Real-Time ROS2 Inference

The system supports live industrial deployment through ROS2.

Supported Features

ROS2 image subscriptions
ROS bag replay
Real-time GUI visualization
Timeline inspection
Live anomaly scoring

Example ROS2 Topic

/camera/back_view/image_raw

Explainability and Visualization

The framework supports interpretable inspection of anomaly predictions.

Visual Outputs

Reconstruction maps
Difference maps
Heatmaps
Timeline visualization
Safety-area overlays

Goal

Enable engineers and operators to understand:

why the anomaly was triggered
where the anomaly occurred
whether the model decision is trustworthy

GUI-Based Monitoring

The GUI interface supports:

Real-time visualization
Detector outputs
Reconstruction inspection
Timeline analysis
Multi-zone monitoring

Performance

Experimental Results

Metric	Result
Accuracy	99.61%
Recall	95.1%
F1-score	90.9%
Real-time Speed	~12.5 FPS

Technologies

AI & Vision

PyTorch
OpenCV
NumPy
torchvision

Robotics & Streaming

ROS2
rclpy
cv_bridge
sensor_msgs
Zenoh

Environment & Deployment

Pixi
Python
Linux
Industrial ROS pipelines

Research Contributions

Main Contributions

Real-time industrial anomaly detection
Explainable AI integration
Area-specific safety monitoring
Threshold calibration framework
ROS2 industrial deployment
Human-centered trustworthy AI

Can I Trust My Anomaly Detection System? A Case Study Based on Explainable AI
ShapBPT in Perspective: A Consolidated Review and an eXplainable Anomaly Detection Case Study
ShapBPT: Image Feature Attributions using Data-Aware Binary Partition Trees

Open Source

The full project implementation is publicly available on GitHub.

View Repository

Future Directions

Potential future improvements include:

Vision-Language integration
Multimodal anomaly detection
Explainable robotics systems
Edge-AI deployment
Foundation models for industrial monitoring
Human feedback integration

Citation

@misc{rashid2026advis,
  title={ADVIS-DistriMuSe-SR: Real-Time Anomaly Detection for Safe Human-Robot Interaction},
  author={Rashid, Muhammad, Amparore, Elvio},
  year={2026},
  howpublished={GitHub Repository},
  url={https://github.com/rashidrao-pk/advis_distrimuse_unito_SR}
}

Muhammad Rashid