Publications

The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough

2024

TL;DR: In POMDPs, the pure exploration task over latent states can be addressed by looking at observations only, and the mismatch this induces is far from hopeless. We show when this is the case and how to overcome the possible (structural) limitations in a simple way, assuming that at least the observation model is known. https://arxiv.org/pdf/2406.12795

How to Explore with Belief: State Entropy Maximization in POMDPs

2024

TL;DR: We know that in POMDPs the pure exploration task over latent states can be addressed by looking at the observations only. Yet passing through beliefs and solving the task over states sampled from them shows promising properties compared to working with observations only, in particular when the observations are not well behaved or their model is not accessible. https://arxiv.org/pdf/2406.02295

Distributional Policy Evaluation: a Maximum Entropy approach to Representation Learning

2023

TL;DR: Distributional Policy Evaluation offers new tools to build distributed (a.k.a. disjoint) representations of the state space, if we look for representations that induce the maximum-entropy distribution of returns compatible with the returns of a policy. https://proceedings.neurips.cc/paper_files/paper/2023/file/2a98af4fea6a24b73af7b588ca95f755-Paper-Conference.pdf

Adaptive and Energy-efficient Optimal Control in CPGs through Tegotae-based Feedback

2021

TL;DR: Including sensory-motor feedback in the control policies results in the emergence of motion patterns. In particular, a feedback describing the extent to which a perceived reaction matches the intended action results in an energy-efficient policy as well. https://www.frontiersin.org/articles/10.3389/frobt.2021.632804/full

Energy Efficiency Analysis of the Tegotae Approach for Bio-inspired Hopping

2019

TL;DR: Including sensory-motor feedback in the control policies results in the emergence of motion patterns. In particular, a feedback describing the extent to which a perceived reaction matches the intended action results in an energy-efficient policy as well. https://infoscience.epfl.ch/record/272142?v=pdf

Riccardo Zamboni
