Change search
Link to record
Permanent link

Direct link
Publications (6 of 6) Show all publications
Botteghi, N., Alaa, K., Poel, M., Sirmacek, B., Brune, C., Mersha, A. & Stramigioli, S. (2021). Low Dimensional State Representation Learning with Robotics Priors in Continuous Action Spaces. In: IEEE International Conference on Intelligent Robots and Systems: . Paper presented at 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2021, 27 September 2021 through 1 October 2021 (pp. 190-197). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Low Dimensional State Representation Learning with Robotics Priors in Continuous Action Spaces
Show others...
2021 (English)In: IEEE International Conference on Intelligent Robots and Systems, Institute of Electrical and Electronics Engineers (IEEE), 2021, p. 190-197Conference paper, Published paper (Refereed)
Abstract [en]

Reinforcement learning algorithms have proven to be capable of solving complicated robotics tasks in an end-to-end fashion without any need for hand-crafted features or policies. Especially in the context of robotics, in which the cost of real-world data is usually extremely high, Reinforcement Learning solutions achieving high sample efficiency are needed. In this paper, we propose a framework combining the learning of a low-dimensional state representation, from high-dimensional observations coming from the robot's raw sensory readings, with the learning of the optimal policy, given the learned state representation. We evaluate our framework in the context of mobile robot navigation in the case of continuous state and action spaces. Moreover, we study the problem of transferring what learned in the simulated virtual environment to the real robot without further retraining using real-world data in the presence of visual and depth distractors, such as lighting changes and moving obstacles. A video of our experiments can be found at: https://youtu.be/rUdGPKr2Wuo.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2021
Series
IEEE International Conference on Intelligent Robots and Systems, ISSN 2153-0858
Keywords
Mobile robots, Reinforcement learning, Robotics, Virtual reality, Action spaces, Continuous actions, End to end, High-dimensional, Low dimensional, Real-world, Reinforcement learning algorithms, Reinforcement learning solution, Robotic tasks, State representation, Learning algorithms
National Category
Robotics and automation
Identifiers
urn:nbn:se:hj:diva-55968 (URN)10.1109/IROS51168.2021.9635936 (DOI)2-s2.0-85124346914 (Scopus ID)9781665417150 (ISBN)
Conference
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2021, 27 September 2021 through 1 October 2021
Available from: 2022-03-02 Created: 2022-03-02 Last updated: 2025-02-09Bibliographically approved
Arvidsson, S., Gullstrand, M., Sirmacek, B. & Riveiro, M. (2021). Sensor fusion and convolutional neural networks for indoor occupancy prediction using multiple low-cost low-resolution heat sensor data. Sensors, 21(4), 1-21, Article ID 1036.
Open this publication in new window or tab >>Sensor fusion and convolutional neural networks for indoor occupancy prediction using multiple low-cost low-resolution heat sensor data
2021 (English)In: Sensors, E-ISSN 1424-8220, Vol. 21, no 4, p. 1-21, article id 1036Article in journal (Refereed) Published
Abstract [en]

Indoor occupancy prediction is a prerequisite for the management of energy consumption, security, health, and other systems in smart buildings. Previous studies have shown that buildings that automatize their heating, lighting, air conditioning, and ventilation systems through considering the occupancy and activity information might reduce energy consumption by more than 50%. However, it is difficult to use high-resolution sensors and cameras for occupancy prediction due to privacy concerns. In this paper, we propose a novel solution for predicting occupancy using multiple low-cost and low-resolution heat sensors. We suggest two different methods for fusing and processing the data captured from multiple heat sensors and we use a Convolutional Neural Network for predicting occupancy. We conduct experiments to assess both the performance of the proposed solutions and analyze the impact of sensor field view overlaps on the prediction results. In summary, our experimental results show that the implemented solutions show high occupancy prediction accuracy and real-time processing capabilities.

Place, publisher, year, edition, pages
MDPI, 2021
Keywords
Artificial intelligence (AI), Heat sensors, Machine learning, Multi-sensor, Neural networks, Occupancy prediction, Sensor fusion, Smart offices, Air conditioning, Convolution, Costs, Data handling, Energy conservation, Energy utilization, Forecasting, Intelligent buildings, Sensor data fusion, Ventilation, Activity informations, High resolution sensors, Low resolution, Occupancy predictions, Privacy concerns, Realtime processing, Reduce energy consumption, Ventilation systems, Convolutional neural networks
National Category
Computer Systems
Identifiers
urn:nbn:se:hj:diva-51845 (URN)10.3390/s21041036 (DOI)000624660900001 ()33546305 (PubMedID)2-s2.0-85100262267 (Scopus ID)GOA;;1526575 (Local ID)GOA;;1526575 (Archive number)GOA;;1526575 (OAI)
Funder
Knowledge Foundation
Available from: 2021-02-08 Created: 2021-02-08 Last updated: 2024-07-16Bibliographically approved
Botteghi, N., Obbink, R., Geijs, D., Poel, M., Sirmacek, B., Brune, C., . . . Stramigioli, S. (2020). Low dimensional state representation learning with reward-shaped priors. In: Proceedings - International Conference on Pattern Recognition: . Paper presented at 25th International Conference on Pattern Recognition, ICPR 2020, 10 January 2021 through 15 January 2021 (pp. 3736-3743). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Low dimensional state representation learning with reward-shaped priors
Show others...
2020 (English)In: Proceedings - International Conference on Pattern Recognition, Institute of Electrical and Electronics Engineers (IEEE), 2020, p. 3736-3743Conference paper, Published paper (Refereed)
Abstract [en]

Reinforcement Learning has been able to solve many complicated robotics tasks without any need for feature engineering in an end-to-end fashion. However, learning the optimal policy directly from the sensory inputs, i.e the observations, often requires processing and storage of a huge amount of data. In the context of robotics, the cost of data from real robotics hardware is usually very high, thus solutions that achieve high sample-efficiency are needed. We propose a method that aims at learning a mapping from the observations into a lower-dimensional state space. This mapping is learned with unsupervised learning using loss functions shaped to incorporate prior knowledge of the environment and the task. Using the samples from the state space, the optimal policy is quickly and efficiently learned. We test the method on several mobile robot navigation tasks in a simulation environment and also on a real robot. A video of our experiments can be found at: https://youtu.be/dgWxmfSv95U.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2020
Keywords
Reinforcement Learning, Robotics, State Representation Learning, Digital storage, Mapping, Mobile robots, Pattern recognition, Feature engineerings, Loss functions, Low dimensional, Mobile Robot Navigation, Optimal policies, Prior knowledge, Simulation environment, State representation
National Category
Robotics and automation
Identifiers
urn:nbn:se:hj:diva-54224 (URN)10.1109/ICPR48806.2021.9412421 (DOI)000678409203111 ()2-s2.0-85110530994 (Scopus ID)9781728188089 (ISBN)
Conference
25th International Conference on Pattern Recognition, ICPR 2020, 10 January 2021 through 15 January 2021
Available from: 2021-08-13 Created: 2021-08-13 Last updated: 2025-02-09Bibliographically approved
Botteghi, N., Kamilaris, A., Sinai, L. & Sirmacek, B. (2020). Multi-Agent Path Planning of Robotic Swarms in Agricultural Fields. In: N. Paparoditis, C. Mallet, F. Lafarge, S. Hinz, R. Feitosa, M. Weinmann, B. Jutzi (Ed.), ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences: . Paper presented at 2020 24th ISPRS Congress on Technical Commission I, 31 August 2020 through 2 September 2020 (pp. 361-368). Copernicus GmbH, 5(1)
Open this publication in new window or tab >>Multi-Agent Path Planning of Robotic Swarms in Agricultural Fields
2020 (English)In: ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences / [ed] N. Paparoditis, C. Mallet, F. Lafarge, S. Hinz, R. Feitosa, M. Weinmann, B. Jutzi, Copernicus GmbH , 2020, Vol. 5, no 1, p. 361-368Conference paper, Published paper (Refereed)
Abstract [en]

Collaborative swarms of robots/UAVs constitute a promising solution for precision agriculture and for automatizing agricultural processes. Since agricultural fields have complex topologies and different constraints, the problem of optimized path routing of these swarms is important to be tackled. Hence, this paper deals with the problem of optimizing path routing for a swarm of ground robots and UAVs in different popular topologies of agricultural fields. Four algorithms (Nearest Neighbour based on K-means clustering, Christofides, Ant Colony Optimisation and Bellman-Held-Karp) are applied on various farm types commonly found around Europe. The results indicate that the problem of path planning and the corresponding algorithm to use, are sensitive to the field topology and to the number of agents in the swarm.

Place, publisher, year, edition, pages
Copernicus GmbH, 2020
Keywords
Agriculture, Multiple agents, Optimization, Path Planning, Robotic swarms, Ant colony optimization, K-means clustering, Multi agent systems, Robot programming, Robotics, Robots, Topology, Agricultural fields, Agricultural process, Complex topology, Ground robot, Multi agent, Nearest neighbour, Path routing, Agricultural robots
National Category
Robotics and automation
Identifiers
urn:nbn:se:hj:diva-50715 (URN)10.5194/isprs-annals-V-1-2020-361-2020 (DOI)2-s2.0-85091101235 (Scopus ID)POA JTH 2020;JTHDatateknikIS (Local ID)POA JTH 2020;JTHDatateknikIS (Archive number)POA JTH 2020;JTHDatateknikIS (OAI)
Conference
2020 24th ISPRS Congress on Technical Commission I, 31 August 2020 through 2 September 2020
Available from: 2020-09-29 Created: 2020-09-29 Last updated: 2025-02-09Bibliographically approved
Sirmacek, B. & Riveiro, M. (2020). Occupancy prediction using low-cost and low-resolution heat sensors for smart offices. Sensors, 20(19), Article ID 5497.
Open this publication in new window or tab >>Occupancy prediction using low-cost and low-resolution heat sensors for smart offices
2020 (English)In: Sensors, E-ISSN 1424-8220, Vol. 20, no 19, article id 5497Article in journal (Refereed) Published
Abstract [en]

Solving the challenge of occupancy prediction is crucial in order to design efficient and sustainable office spaces and automate lighting, heating, and air circulation in these facilities. In office spaces where large areas need to be observed, multiple sensors must be used for full coverage. In these cases, it is normally important to keep the costs low, but also to make sure that the privacy of the people who use such environments are preserved. Low-cost and low-resolution heat (thermal) sensors can be very useful to build solutions that address these concerns. However, they are extremely sensitive to noise artifacts which might be caused by heat prints of the people who left the space or by other objects, which are either using electricity or exposed to sunlight. There are some earlier solutions for occupancy prediction that employ low-resolution heat sensors; however, they have not addressed nor compensated for such heat artifacts. Therefore, in this paper, we presented a low-cost and low-energy consuming smart space implementation to predict the number of people in the environment based on whether their activity is static or dynamic in time. We used a low-resolution (8×8) and non-intrusive heat sensor to collect data from an actual meeting room. We proposed two novel workflows to predict the occupancy; one that is based on computer vision and one based on machine learning. Besides comparing the advantages and disadvantages of these different workflows, we used several state-of-the-art explainability methods in order to provide a detailed analysis of the algorithm parameters and how the image properties influence the resulting performance. Furthermore, we analyzed noise resources that affect the heat sensor data. The experiments show that the feature classification based method gives high accuracy when the data are clean from noise artifacts. However, when there are noise artifacts, the computer vision based method can compensate for those artifacts providing robust results. Because the computer vision based method requires an empty room recording, the feature classification based method should be chosen either when there is no expectancy of seeing noise artifacts in the data or when there is no empty recording available. We hope that our analysis brings light into understanding how to handle very low-resolution heat images in these environments. The presented workflows could be used in various domains and applications other than smart offices, where occupancy prediction is essential, e.g., for elderly care.

Place, publisher, year, edition, pages
MDPI, 2020
Keywords
heat sensors, smart offices, occupancy prediction, machine learning, computer vision, feature engineering, explainability, explainable AI
National Category
Computer Systems
Identifiers
urn:nbn:se:hj:diva-50708 (URN)10.3390/s20195497 (DOI)000586529500001 ()32992789 (PubMedID)2-s2.0-85091635230 (Scopus ID)GOA JTH 2020 (Local ID)GOA JTH 2020 (Archive number)GOA JTH 2020 (OAI)
Funder
Knowledge Foundation
Available from: 2020-09-28 Created: 2020-09-28 Last updated: 2024-07-16Bibliographically approved
Botteghi, N., Sirmacek, B., Schulte, R., Poel, M. & Brune, C. (2020). Reinforcement learning helps slam: Learning to build maps. In: N. Paparoditis, C. Mallet, F. Lafarge, S. Zlatanova, S. Dragicevic, G. Sithole, G. Agugiaro, J. J. Arsanjani, P. Boguslawski, M. Breunig, M. A. Brovelli, S. Christophe, A. Coltekin, M. R. Delavar, M. Al Doori, E. Guilbert, C. C. Fonte, J. Haworth, U. Isikdag, I. Ivanova, Z. Kang, K. Khoshelham, M. Koeva, M. Kokla, Y. Liu, M. Madden, M. A. Mostafavi, G. Navratil, D. R. Paudyal, C. Pettit, A. Spanò, E. Stefanakis, W. Tu, G. Vacca, L. Díaz-Vilariño, S. Wise, H. Wu, and X. G. Zhou (Ed.), International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives: . Paper presented at 2020 24th ISPRS Congress - Technical Commission IV on Spatial Information Science, 31 August 2020 through 2 September 2020 (pp. 329-336). International Society for Photogrammetry and Remote Sensing, 43(B4)
Open this publication in new window or tab >>Reinforcement learning helps slam: Learning to build maps
Show others...
2020 (English)In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives / [ed] N. Paparoditis, C. Mallet, F. Lafarge, S. Zlatanova, S. Dragicevic, G. Sithole, G. Agugiaro, J. J. Arsanjani, P. Boguslawski, M. Breunig, M. A. Brovelli, S. Christophe, A. Coltekin, M. R. Delavar, M. Al Doori, E. Guilbert, C. C. Fonte, J. Haworth, U. Isikdag, I. Ivanova, Z. Kang, K. Khoshelham, M. Koeva, M. Kokla, Y. Liu, M. Madden, M. A. Mostafavi, G. Navratil, D. R. Paudyal, C. Pettit, A. Spanò, E. Stefanakis, W. Tu, G. Vacca, L. Díaz-Vilariño, S. Wise, H. Wu, and X. G. Zhou, International Society for Photogrammetry and Remote Sensing , 2020, Vol. 43, no B4, p. 329-336Conference paper, Published paper (Refereed)
Abstract [en]

In this research, we investigate the use of Reinforcement Learning (RL) for an effective and robust solution for exploring unknown and indoor environments and reconstructing their maps. We benefit from a Simultaneous Localization and Mapping (SLAM) algorithm for real-time robot localization and mapping. Three different reward functions are compared and tested in different environments with growing complexity. The performances of the three different RL-based path planners are assessed not only on the training environments, but also on an a priori unseen environment to test the generalization properties of the policies. The results indicate that RL-based planners trained to maximize the coverage of the map are able to consistently explore and construct the maps of different indoor environments.

Place, publisher, year, edition, pages
International Society for Photogrammetry and Remote Sensing, 2020
Series
International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, ISSN 1682-1750
Keywords
Autonomous Exploration, Indoor Environments, Reinforcement Learning, Simultaneous Localization and Mapping, Indoor positioning systems, Mapping, Planning, Robot applications, SLAM robotics, Generalization properties, Indoor environment, Path planners, Real-time robots, Reward function, Robust solutions, Simultaneous localization and mapping algorithms, robotics, artificial intelligence, reinforcement learning, navigation
National Category
Robotics and automation
Identifiers
urn:nbn:se:hj:diva-50791 (URN)10.5194/isprs-archives-XLIII-B4-2020-329-2020 (DOI)2-s2.0-85091581618 (Scopus ID)
Conference
2020 24th ISPRS Congress - Technical Commission IV on Spatial Information Science, 31 August 2020 through 2 September 2020
Available from: 2020-10-07 Created: 2020-10-07 Last updated: 2025-02-09Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-0343-5072

Search in DiVA

Show all publications