Title | Authors | Venue | Cited by | Year |
Dealing with non-stationary environments using context detection | BC da Silva, EW Basso, ALC Bazzan, PM Engel | International Conference on Machine Learning (ICML 2006), 217-224, 2006 | 233 | 2006 |
Learning parameterized skills | BC da Silva, G Konidaris, A Barto | International Conference on Machine Learning (ICML 2012), 2012 | 231 | 2012 |
Preventing undesirable behavior of intelligent machines | PS Thomas, B Castro da Silva, AG Barto, S Giguere, Y Brun, E Brunskill | Science 366 (6468), 999-1004, 2019 | 211 | 2019 |
Learning in groups of traffic signals | ALC Bazzan, D De Oliveira, BC da Silva | Engineering Applications of Artificial Intelligence 23 (4), 560-568, 2010 | 128 | 2010 |
Gaussian Processes for Learning and Control: A Tutorial with Examples | M Liu, G Chowdhary, BC Da Silva, SY Liu, JP How | IEEE Control Systems Magazine 38 (5), 53-86, 2018 | 123 | 2018 |
Reinforcement Learning based Control of Traffic Lights in Non-stationary Environments: A Case Study in a Microscopic Simulator | D de Oliveira, ALC Bazzan, BC da Silva, EW Basso, L Nunes, R Rossetti, ... | 4th European Workshop on Multi-Agent Systems (EUMAS 2006), 2006 | 91 | 2006 |
ITSUMO: an intelligent transportation system for urban mobility | BC Da Silva, R Junges, D de Oliveira, ALC Bazzan | [Demonstration Track] (AAMAS 2006) - Proceedings of the 5th International …, 2006 | 78 | 2006 |
A task-and-technique centered survey on visual analytics for deep learning model engineering | R Garcia, AC Telea, BC da Silva, J Tørresen, JLD Comba | Computers & Graphics 77, 30-49, 2018 | 61 | 2018 |
Learning parameterized motor skills on a humanoid robot | BC Da Silva, G Baldassarre, G Konidaris, A Barto | IEEE International Conference on Robotics and Automation (ICRA 2014), 5239-5244, 2014 | 59 | 2014 |
Universal off-policy evaluation | Y Chandak, S Niekum, B da Silva, E Learned-Miller, E Brunskill, ... | Advances in Neural Information Processing Systems (NeurIPS 2021) 34, 27475-27490, 2021 | 56 | 2021 |
Fairness Guarantees under Demographic Shift | S Giguere, B Metevier, BC da Silva, Y Brun, PS Thomas, S Niekum | International Conference on Learning Representations (ICLR 2022), 2022 | 54 | 2022 |
Analysing the impact of travel information for minimising the regret of route choice | GO Ramos, ALC Bazzan, BC da Silva | Transportation Research Part C: Emerging Technologies 88, 257-271, 2018 | 50 | 2018 |
Optimistic linear support and successor features as a basis for optimal policy transfer | LN Alegre, A Bazzan, BC Da Silva | International Conference on Machine Learning (ICML 2022), 394-413, 2022 | 42 | 2022 |
Adaptive traffic control with reinforcement learning | B da Silva, D Oliveira, AL Bazzan, EW Basso | 4th Workshop on Agents in Traffic and Transportation (ATT@AAMAS 2006), 80-86, 2006 | 38 | 2006 |
MO-Gym: A Library of Multi-Objective Reinforcement Learning Environments | LN Alegre, F Felten, EG Talbi, G Danoy, A Nowé, ALC Bazzan, ... | Proceedings of the 34th Benelux Conference on Artificial Intelligence BNAIC …, 2022 | 32 | 2022 |
Active learning of parameterized skills | B Da Silva, G Konidaris, A Barto | International Conference on Machine Learning (ICML 2014), 1737-1745, 2014 | 32 | 2014 |
Improving reinforcement learning with context detection | BC Da Silva, EW Basso, FS Perotto, ALC Bazzan, PM Engel | (AAMAS 2006) Intl. Joint Conference on Autonomous Agents and Multiagent …, 2006 | 32 | 2006 |
Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization | LN Alegre, ALC Bazzan, DM Roijers, A Nowé, BC da Silva | arXiv preprint arXiv:2301.07784, 2023 | 31 | 2023 |
Autonomous Reinforcement Learning of Multiple Interrelated Tasks | VG Santucci, E Cartoni, BC da Silva, G Baldassarre | International Conference on Development and Learning (ICDL 2019), 2019 | 30 | 2019 |
Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection | LN Alegre, ALC Bazzan, BC da Silva | (AAMAS 2021) Intl. Conference on Autonomous Agents and Multiagent Systems …, 2021 | 25 | 2021 |