Towards Generalizable Safety in Crowd Navigation via Conformal Uncertainty Handling

Towards Generalizable Safety in Crowd Navigation via
Conformal Uncertainty Handling

9th Conference on Robot Learning (CoRL 2025) Oral & Best Paper Nominee at OODWorkshop@RSS2025

Abstract

Mobile robots navigating in crowds trained using reinforcement learning are known to suffer performance degradation when faced with out-of-distribution scenarios. We propose that by properly accounting for the uncertainties of pedestrians, a robot can learn safe navigation policies that are robust to distribution shifts. Our method augments agent observations with prediction uncertainty estimates generated by adaptive conformal inference, and it uses these estimates to guide the agent’s behavior through constrained reinforcement learning. The system helps regulate the agent’s actions and enables it to adapt to distribution shifts. In the in-distribution setting, our approach achieves a 96.93% success rate, which is over 8.80% higher than the previous state-of-the-art baselines with over 3.72 times fewer collisions and 2.43 times fewer intrusions into ground-truth human future trajectories. In three out-of-distribution scenarios, our method shows much stronger robustness when facing distribution shifts in velocity variations, policy changes, and transitions from individual to group dynamics. We deploy our method on a real robot, and experiments show that the robot makes safe and robust decisions when interacting with both sparse and dense crowds.

Key Ideas and Contributions

1) Uncertainty-Guided Framework: We propose a robust uncertainty-guided framework for safe crowd navigation, which can effectively handle prediction uncertainties to generate robust decision when facing distribution shifts.
2) Behavior-Level Constraint Mechanism: We introduce a behavior-level constraint mechanism that constrains cumulative intrusions into uncertainty areas rather than directly constraining collision rates, providing richer cost feedback and effectively addressing the sparse constraint feedback issue.
3) Performance and Robustness: Our method achieves the state-of-the-art performance with 96.93% success rate in dense crowd navigation and over 3.72× fewer collisions in in-distribution settings, while demonstrating superior robustness across three different OOD scenarios including velocity variations, policy changes, and group dynamics.

Test Results in In-Distribution and OOD Settings

In this paper, we validated our framework in both in-distribution and OOD settings. Please refer to the paper for more details.

Citation

@inproceedings{yao2025towards,
    title={Towards Generalizable Safety in Crowd Navigation via Conformal Uncertainty Handling},
    author={Yao, Jianpeng and Zhang, Xiaopan and Xia, Yu and Roy-Chowdhury, Amit K and Li, Jiachen},
    booktitle={Conference on Robot Learning (CoRL)},
    year={2025}
}