An effective method to use centralized Q-learning in multi-robot task allocation

Ezercan Kayir, Hatice Hilal

Please use this identifier to cite or link to this item: https://hdl.handle.net/11499/46099

Full metadata record

DC Field	Value	Language
dc.contributor.author	Ezercan Kayir, Hatice Hilal	-
dc.date.accessioned	2023-01-09T21:09:29Z	-
dc.date.available	2023-01-09T21:09:29Z	-
dc.date.issued	2021	-
dc.identifier.issn	1300-7009	-
dc.identifier.issn	2147-5881	-
dc.identifier.uri	https://doi.org/10.5505/pajes.2021.90490	-
dc.identifier.uri	https://search.trdizin.gov.tr/yayin/detay/488099	-
dc.identifier.uri	https://hdl.handle.net/11499/46099	-
dc.description.abstract	Applying Q-learning methods to multi-robot systems is challenging. Multi-robot systems are dynamic and partially observable because each robot decides and acts independently, whereas Q-learning is theoretically defined for Markovian environments. One way to apply Q-learning in multi-robot systems is centralized learning, which learns optimal Q-values over the state space of the overall system and the joint action space of all agents. In this case, the system can be considered stationary and learning can converge to optimal solutions. However, centralized learning requires full knowledge of the environment, perfect inter-robot communication, and substantial computational power. Especially for large systems, the computational cost becomes very high because the size of the learning space grows exponentially with the number of robots. The approach proposed in this study, subG-CQL, divides the overall system into small sub-groups without adversely affecting the system's ability to perform tasks. Each sub-group consists of fewer robots performing fewer tasks and learns in a centralized manner for its own team. The learning space dimension is thus reduced to a reasonable level, and the required communication remains limited to robots in the same sub-group. Because centralized learning is used, successful results are expected. Experimental studies show that the proposed algorithm improves the task assignment performance of the system and makes efficient use of system resources.	en_US
dc.language.iso	en	en_US
dc.publisher	Pamukkale Univ	en_US
dc.relation.ispartof	Pamukkale University Journal Of Engineering Sciences-Pamukkale Universitesi Muhendislik Bilimleri Dergisi	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Multi-Robot systems	en_US
dc.subject	Task allocation	en_US
dc.subject	Q-Learning	en_US
dc.subject	Centralized learning	en_US
dc.subject	Coordination	en_US
dc.title	An effective method to use centralized Q-learning in multi-robot task allocation	en_US
dc.type	Article	en_US
dc.identifier.volume	27	en_US
dc.identifier.issue	5	en_US
dc.identifier.startpage	579	en_US
dc.identifier.endpage	588	en_US
dc.identifier.doi	10.5505/pajes.2021.90490	-
dc.relation.publicationcategory	Article - International Refereed Journal - Institutional Faculty Member	en_US
dc.authorscopusid	#N/A	-
dc.identifier.trdizinid	488099	en_US
dc.identifier.wos	WOS:000708158900001	en_US
item.languageiso639-1	en	-
item.openairetype	Article	-
item.openairecristype	http://purl.org/coar/resource_type/c_18cf	-
item.fulltext	With Fulltext	-
item.grantfulltext	open	-
item.cerifentitytype	Publications	-
crisitem.author.dept	10.04. Electrical-Electronics Engineering	-
Appears in Collections:	Mühendislik Fakültesi Koleksiyonu; TR Dizin İndeksli Yayınlar Koleksiyonu / TR Dizin Indexed Publications Collection; WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection
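
As a rough illustration of the learning-space argument in the abstract, the sketch below compares the number of Q-table entries for a fully centralized learner with the total for several sub-groups that each learn centrally on their own. It is not taken from the paper: the function names, the assumption that every robot has the same number of local states and actions, and the assumption that joint spaces are plain Cartesian products are all hypothetical simplifications.

```python
def joint_q_table_size(n_robots, states_per_robot, actions_per_robot):
    """Entries in one centralized Q-table over n_robots robots.

    Assumption (not from the paper): the joint state space and the joint
    action space are Cartesian products of identical per-robot spaces.
    """
    joint_states = states_per_robot ** n_robots
    joint_actions = actions_per_robot ** n_robots
    return joint_states * joint_actions


def subgrouped_q_table_size(group_sizes, states_per_robot, actions_per_robot):
    """Total entries when each sub-group keeps its own centralized Q-table."""
    return sum(
        joint_q_table_size(k, states_per_robot, actions_per_robot)
        for k in group_sizes
    )


# Hypothetical example: 6 robots, each with 4 local states and 3 actions.
centralized = joint_q_table_size(6, 4, 3)
subgrouped = subgrouped_q_table_size([2, 2, 2], 4, 3)
print(centralized, subgrouped)
```

Under these assumed numbers, the single centralized table has 2,985,984 entries, while three sub-groups of two robots need 432 entries in total. The exponent depends only on the sub-group size, which is the effect the abstract attributes to dividing the system into sub-groups.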

Files in This Item:

File	Size	Format
PAJES-90490-RESEARCH_ARTICLE-EZERCAN_KAYIR.pdf	873.79 kB	Adobe PDF

Page view(s): 58 (checked on Feb 8, 2025)

Download(s): 32 (checked on Feb 8, 2025)
