Efficient Deep Learning of Robust Policies from MPC via Imitation and Tube-Guided Data Augmentation

In this work, we propose an Imitation Learning strategy to efficiently compress a computationally expensive MPC into a deep neural network policy that is robust to previously unseen disturbances.
By using a robust variant of the MPC, called Robust Tube MPC, and leveraging properties from the controller, we introduce computationally-efficient data augmentation methods that enable a significant reduction of the number of MPC demonstrations and training efforts required to generate a robust policy.
Our approach opens the possibility of zero-shot transfer of a policy trained from a single MPC demonstration collected in a nominal domain, such as a simulation or a robot in a lab/controlled environment, to a new domain with previously unseen bounded model errors/perturbations.
Numerical evaluations performed using linear and nonlinear MPC for agile flight on a multirotor show that our method outperforms strategies commonly employed in IL (such as Dataset-Aggregation (DAgger) and Domain Randomization (DR)) in terms of demonstration-efficiency, training time, and robustness to perturbations unseen during training. Experimental evaluations validate the efficiency and real-world robustness.
Arxiv: arxiv.org/abs/...
Accepted to the IEEE Transactions on Robotics (T-RO) 2024.
Interested to know more? Check out our related work:
- Tube-NeRF: Efficient learning of vision-based policies: • Tube-NeRF: Efficient I...
- SAMA: Efficient learning of adaptive policies: • Experimental Results f...

Жүктеу

AI can't cross this line and we don't know why.

Why Does Diffusion Work Better than Auto-Regression?

Fortunately, Tang Wutong and her sister were smart and rescued the baby!#斗罗大陆#唐舞桐#唐老六#斗罗大陆

Don't Toss The Apple Worm🍏, A Tasty Treat For Birds!🐦 #catvideos #catmemes #trending

孩子太多！一碗水真的能端平吗？ #瞧这一家子 #带娃 #四小只吖 #日常 #搞笑 #搞笑家庭 #姐弟 #家庭生活

My daughter is creative when it comes to eating food #funny #comedy #cute #baby#smart girl

PhD Thesis Defense: Jesus Tordesillas

[ICRA24] PUMA: Decentr. Uncertainty-aware Multiagent Traj. Planner w/ Image Segmentation Frame Align

How are memories stored in neural networks? | The Hopfield Network #SoME2

Transformer Neural Networks Derived from Scratch

I Built The First LAMINAR FLOW ROCKET ENGINE

LoRA explained (and a bit about precision and quantization)

T-RO/IROS 2021 Presentation: Distributed Certifiably Correct Pose-Graph Optimization

Physics Informed Neural Networks (PINNs) [Physics Informed Machine Learning]

Watching Neural Networks Learn

Natasha Jaques PhD Thesis Defense

Fortunately, Tang Wutong and her sister were smart and rescued the baby!#斗罗大陆#唐舞桐#唐老六#斗罗大陆

Efficient Deep Learning of Robust Policies from MPC via Imitation and Tube-Guided Data Augmentation

Пікірлер