بررسی عملکرد عامل باتجربه در تیم‌های اقتضایی پهپادی‌

حیدری, رقیه; افشارچی, محسن; خان‌محمدی, رضا

بررسی عملکرد عامل باتجربه در تیم‌های اقتضایی پهپادی‌

نویسندگان

رقیه حیدری ¹

محسن افشارچی ²

رضا خان‌محمدی ³

¹ شرکت توزیع نیروی برق استان زنجان، زنجان، ایران

² دانشکده مهندسی، دانشگاه زنجان, زنجان، ایران

³ شرکت برق منطقه ای زنجان، زنجان، ایران

چکیده

در تیم‌های رباتیک خودمختار در کاربردهای دنیای واقعی لازم است عامل‌ها برای رسیدن به بیشترین سود باهم همکاری کنند. تصمیم‌گیری و چگونگی عملکرد و همکاری آنان با هم به دلیل پویا بودن محیط، پیوستگی برخی از پارامترها، غیرقطعی بودن محیط و ناشناخته بودن هم‌تیمی‌ها فرآیندی پیچیده محسوب می‌شود. در این مقاله مأموریت نظارت مداوم پهپادها به عنوان یک سیستم چندعامله در دنیای واقعی معرفی شده است. در ادامه، مسئله تصمیم‌گیری برخط عامل‌ها در شرایطی که اعضای تیم و محیط به طور کامل شناخته شده نیستند، مطرح شده است . این مسئله با کمک فرآیند تصمیم‌گیری مارکوف مدل شده و درنهایت یک الگوریتم حریصانه برای تصمیم‌گیری برخط عامل‌ در تیم ارائه شده است. آزمایش‌های انجام شده نشان می‌دهند این روش تصمیم‌گیری که مبتنی بر دانش آموخته شده قبلی عامل‌ است، عملکرد تیم را در محیط ناشناخته بهبود می‌دهد.

کلیدواژه‌ها

همکاری

تیم اقتضایی

فرآیند تصمیم‌گیری مارکوف

پهپاد

[1] S. Barrett, A. Rosenfeld, S. Kraus and P. Stone, "Making friends on the fly: Cooperating with new teammates," Artificial Intelligence, vol. 242, pp. 132-171, 2017.
[2] D. V. Pynadath and M. Tambe, "The communicative multiagent team decision problem: Analyzing teamwork theories and models," Journal of artificial intelligence research, vol. 16, pp. 389-423, 2002.
[3] W. Ren, R. W. Beard and E. M. Atkins, "A survey of consensus problems in multi-agent coordination," In American Control Conference, 2005. Proceedings of the 2005, pp. 1859-1864, IEEE, 2005.
[4] O. Obst and J. Boedecker, "Flexible coordination of multiagent team behavior using HTN planning," In Robot Soccer World Cup, pp. 521-528. Springer, 2005.
[5] M. Tambe, "Towards flexible teamwork," Journal of artificial intelligence research, vol. 7, pp. 83-124, 1997.
[6] C. Boutilier, "Sequential optimality and coordination in multiagent systems," In IJCAI, vol. 99, pp. 478-485, 1999.
[7] C. Claus and C. Boutilier, "The dynamics of reinforcement learning in cooperative multiagent systems," AAAI/IAAI, vol. 1998, pp. 746-752, 1998.
[8] D. Virginia, “Handbook of Research on Multi-Agent Systems: Semantics and Dynamics of Organizational Models,” IGI Global, 2009.
[9] P. Stone, G. A. Kaminka and J. S. Rosenschein, "Leading a best-response teammate in an ad hoc team," In Agent-mediated electronic commerce, Designing trading strategies and mechanisms for electronic markets, pp. 132-146, Springer, 2010.
[10] P. Stone, G. A. Kaminka, S. Kraus and J. S. Rosenschein, "Ad Hoc Autonomous Agent Teams: Collaboration without Pre-Coordination," In AAAI, 2010.
[11] F. Wu, S. Zilberstein and X. Chen, "Online planning for ad hoc autonomous agent teams," In IJCAI, pp. 439-445, 2011.
[12] S. Barrett, P. Stone, S. Kraus and A. Rosenfeld, "Learning teammate models for ad hoc teamwork," In AAMAS Adaptive Learning Agents (ALA) Workshop, pp. 57-63, 2012.
[13] S. Barrett and P. Stone, "An analysis framework for ad hoc teamwork tasks," In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems-Volume 1, pp. 357-364, 2012.
[14] S. Barrett, P. Stone and S. Kraus, “Empirical evaluation of ad hoc teamwork in the pursuit domain,” In the 10th International Conference on Autonomous Agents and Multiagent Systems-Volume 2, pp. 567-574, International Foundation for Autonomous Agents and Multiagent Systems, 2011.
[15] J. D. Redding, "Approximate multi-agent planning in dynamic and uncertain environments," PhD diss., Massachusetts Institute of Technology, 2011.
[16] B. Bethke, J. How and J. Vian, "Multi-UAV Persistent Surveillance with Communication Constraints and Health Mangement," In AIAA Guidance, Navigation, and Control Conference, p. 5654, 2009.
[17] J. D. Redding, N. K. Ure, J. P. How, M. A. Vavrina and J. Vian, "Scalable, MDP-based planning for multiple, cooperating agents," In American Control Conference (ACC), pp. 6011-6016, IEEE, 2012.
[18] S. Barrett and P. Stone, "Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork," In AAAI, pp. 2010-2016, 2015.
[19] J. M. Vidal, “Fundamentals of Multi-Agent Systems with NetLogo,” Available: http://multiagent.com/, 2010.
[20] B. Dunin-Keplicz and R. Verbrugge, “Teamwork in multi-agent systems: A formal approach,” vol. 21, John Wiley & Sons, 2011.
[21] D. V. Pynadath and M. Tambe, "Multiagent teamwork: Analyzing the optimality and complexity of key theories and models," In Proceedings of the first international joint conference on Autonomous agents and multiagent systems: part 2, pp. 873-880, ACM, 2002
[22] K. L. Genter, N. Agmon and P. Stone, "Role-Based Ad Hoc Teamwork," In Plan, Activity, and Intent Recognition, 2011.
[23] R. A. Howard, "Dynamic programming and Markov processes," NEW YORK: JOHN-WILEY, 1964.

دوره 16، شماره 2
پاییز و زمستان
پاییز 1397

XML

اصل مقاله 942.47 K

تعداد مشاهده مقاله	339
تعداد دریافت فایل اصل مقاله	70

بررسی عملکرد عامل باتجربه در تیم‌های اقتضایی پهپادی‌

دوره 16، شماره 2پاییز و زمستانپاییز 1397

فایل ها

هم رسانی

ارجاع به این مقاله

آمار

دوره 16، شماره 2
پاییز و زمستان
پاییز 1397