Please use this identifier to cite or link to this item:
https://hdl.handle.net/11499/46099
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ezercan Kayir, Hatice Hilal | - |
dc.date.accessioned | 2023-01-09T21:09:29Z | - |
dc.date.available | 2023-01-09T21:09:29Z | - |
dc.date.issued | 2021 | - |
dc.identifier.issn | 1300-7009 | - |
dc.identifier.issn | 2147-5881 | - |
dc.identifier.uri | https://doi.org/10.5505/pajes.2021.90490 | - |
dc.identifier.uri | https://search.trdizin.gov.tr/yayin/detay/488099 | - |
dc.identifier.uri | https://hdl.handle.net/11499/46099 | - |
dc.description.abstract | The use of Q-learning methods in multi-robot systems is a challenging area. Multi-robot systems have dynamic and partially observable nature because of robot's independent decision-making and acting mechanisms. Whereas, Q-learning is defined on Markovian environments theoretically. One way to apply Q-learning in multi robot systems is centralized learning. It learns optimal Q-values for state space of overall system and joint action spaces of all agents. In this case, the system can be considered as stationary and optimal solutions can be converged. But, centralized learning requires full knowledge of the environment, perfect inter-robot communication and good computational power. Especially for large systems, the computational cost becomes huge because of exponentially growing learning space size with the number of robots. The proposed approach in this study, subG-CQL, divides the overall system into small-sized sub-groups without adversely affecting the system's task performing abilities. Each sub-group consists of less number of robots performing less tasks and learns in centralized manner for its own team. So, the learning space dimension is reduced to a reasonable level and required communication remains limited to the robots in the same the sub-group. Due the centralized learning is used, it is expected that the successful results are achieved. Experimental studies show that the proposed algorithm provides increase in the task assignment performance of the system and efficient use of system resources. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Pamukkale Univ | en_US |
dc.relation.ispartof | Pamukkale University Journal Of Engineering Sciences-Pamukkale Universitesi Muhendislik Bilimleri Dergisi | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Multi-Robot systems | en_US |
dc.subject | Task allocation | en_US |
dc.subject | Q-Learning | en_US |
dc.subject | Centralized learning | en_US |
dc.subject | Coordination | en_US |
dc.title | An effective method to use centralized Q-learning in multi-robot task allocation | en_US |
dc.type | Article | en_US |
dc.identifier.volume | 27 | en_US |
dc.identifier.issue | 5 | en_US |
dc.identifier.startpage | 579 | en_US |
dc.identifier.endpage | 588 | en_US |
dc.identifier.doi | 10.5505/pajes.2021.90490 | - |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.authorscopusid | #N/A | - |
dc.identifier.trdizinid | 488099 | en_US |
dc.identifier.wos | WOS:000708158900001 | en_US |
item.fulltext | With Fulltext | - |
item.openairecristype | http://purl.org/coar/resource_type/c_18cf | - |
item.cerifentitytype | Publications | - |
item.languageiso639-1 | en | - |
item.openairetype | Article | - |
item.grantfulltext | open | - |
crisitem.author.dept | 10.04. Electrical-Electronics Engineering | - |
Appears in Collections: | Mühendislik Fakültesi Koleksiyonu TR Dizin İndeksli Yayınlar Koleksiyonu / TR Dizin Indexed Publications Collection WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection |
Files in This Item:
File | Size | Format | |
---|---|---|---|
PAJES-90490-RESEARCH_ARTICLE-EZERCAN_KAYIR.pdf | 873.79 kB | Adobe PDF | View/Open |
CORE Recommender
Page view(s)
24
checked on Aug 24, 2024
Download(s)
14
checked on Aug 24, 2024
Google ScholarTM
Check
Altmetric
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.