Інтеграція адаптивних моделей навчання з підкріпленням у керування аграрними дронами

Родінков, Ю. М.; Савицький, А. Ю.; Rodinkov, Yu. M.; Savitsky, A. Yu.

dc.contributor.author	Родінков, Ю. М.	uk
dc.contributor.author	Савицький, А. Ю.	uk
dc.contributor.author	Rodinkov, Yu. M.	en
dc.contributor.author	Savitsky, A. Yu.	en
dc.date.accessioned	2026-04-14T07:44:53Z
dc.date.available	2026-04-14T07:44:53Z
dc.date.issued	2025
dc.identifier.citation	Родінков Ю. М., Савицький А. Ю. Інтеграція адаптивних моделей навчання з підкріпленням у керування аграрними дронами // Вісник Вінницького політехнічного інституту. 2025. № 5. С. 187-191. URI: https://visnyk.vntu.edu.ua/index.php/visnyk/article/view/3353.	uk
dc.identifier.issn	1997-9274
dc.identifier.uri	https://ir.lib.vntu.edu.ua//handle/123456789/51129
dc.description.abstract	Запропоновано інтегрований підхід до реалізації адаптивного оброблення сільськогосподарських культур за допомогою безпілотних літальних апаратів (БПЛА), що базується на методах навчання з підкріпленням (Reinforcement Learning, RL), зокрема на алгоритмі Proximal Policy Optimization (PPO). Основнууваг уприділленопрактичному впровадженню математичних моделей у симуляційні та onboard-середовища керування дронами. Показано, як моделі похибки покриття, втрат препарату та стохастичного вітру можуть бути формалізовані у reward-функцію та враховані під час симуля-ційного навчання агента. Алгоритм PPO реалізовано у середовищі AirSim з використанням бібліоте-ки Stable-Baselines3. Агента навчено на основі множини змінних вхідного стану, таких як положення, вітровий вплив, густота посівів та карта покриття. Навчання проводилося поетапно — від умов зі слабким вітром до поривчастого, з подальшою адаптацією політики для реального onboard-викона-ння. Стратегію експортовано у форматі ONNX та оптимізовано під NVIDIA Jetson Nanoчерез Ten-sorRT, що дозволило використовувати агент у реальному часі без перевантаження системи. Про-грамну реалізацію протестовано як у симуляційних середовищах (AirSim, Gazebo), так і на фізичній платформі PX4 SITL. Проведено серію експериментів, які включали моделювання обприскування за умов змінного вітру (2...14 м/с) та порівняння з класичними алгоритмами керування. Результати засвідчили зменшення середньої похибки покриття до 30% та зниження втрат препарату до 28% у порівнянні з фіксованими стратегіями. Особливістю підходу є його гнучкість і практичність: вперше представлено повний цикл розроблення — від формалізації математичної моделі до її навчання, реа-лізації в onboard-системі та валідації в реальних умовах. Такожпо дано скріншоти навчального проце-су, симуляційного середовища, кривих збіжності помилки та GUI Gazebo, що забезпечують прозо-рість методології та можуть бути використані як еталон для інших дослідників. Робота сприяє подальшому розвитку автономних рішень у точному землеробстві та закладає основу для застосування самонавчальних моделей у польових умовах.	uk
dc.description.abstract	This paper presents an integrated approach to adaptive agricultural spraying using unmanned aerial vehicles (UAVs), leveraging reinforcement learning (RL) techniques, particularly the Proximal Policy Optimization (PPO) algorithm. The study focuses on the practical implementation of mathematical models in simulation and onboard control systems. It demonstrates how spray coverage error, chemical loss, and stochastic wind models can be formalized into a reward function and incorporated during RL agent training. The PPO algorithm was implemented using the Stable-Baselines3 library in the AirSim simulator. The agent was trained based on a complex input state vector, including position, wind velocity, crop density, and coverage maps. The training was carried out in stages, starting with low wind conditions and gradually increasing to gusty wind scenarios. The resulting policy was exported in ONNX format and optimized for real-time execution via TensorRT on an NVIDIA Jetson Nano platform, enabling efficient inference onboard the drone. The developed solution was tested in both simulation environments (AirSim, Gazebo) and a physical PX4 SITL platform. A series of experiments were conducted with simulated wind intensities ranging 2 to 14 m/s. The proposed RL-based adaptive spraying strategy was compared with traditional fixed-parameter control methods. Results showed a reduction in average coverage error by up to 30 % and a decrease in chemical losses by 28 %, confirming the agent’s ability to adapt in real time. A key feature of this approach is its end-to-end practicality: for the first time, a complete development pipeline is presented — mathematical modeling and training to onboard deployment and real-world validation. The article includes screenshots of the training process, simulated environments, error convergence curves, and the Gazebo GUI, offering transparency and reproducibility for future researchers. This work contributes to the advancement of autonomous precision agriculture systems and lays the groundwork for deploying self-learning UAVs in dynamic field environments.	en
dc.language.iso	uk_UA	uk_UA
dc.publisher	ВНТУ	uk
dc.relation.ispartof	Вісник Вінницького політехнічного інституту. № 5 : 187-191.	uk
dc.relation.uri	https://visnyk.vntu.edu.ua/index.php/visnyk/article/view/3353
dc.subject	безпілотні літальні апарати	uk
dc.subject	обприскування	uk
dc.subject	симуляція	uk
dc.subject	адаптивне керування	uk
dc.subject	втрати препарату	uk
dc.subject	unmanned aerial vehicles	en
dc.subject	agricultural spraying	en
dc.subject	simulation	en
dc.subject	adaptive control	en
dc.subject	product loss	en
dc.title	Інтеграція адаптивних моделей навчання з підкріпленням у керування аграрними дронами	uk
dc.title.alternative	Integration of Adaptive Reinforcement Learning Models into the Control of Agricultural Drones	en
dc.type	Article, professional native edition
dc.type	Article
dc.identifier.udc	629.07:004.8
dc.relation.references	A. Pretto, A. Bevilacqua, E. Menegatti, and E. Pagello “Cooperative robotics and autonomous vehicles in precision agriculture”, MDPI Journal Robotics , no. 8 (4), 2019. https://doi.org/10.3390/robotics8040096 .	en
dc.relation.references	R. S. Sutton, and A. G. Barto, “Reinforcement Learning: An Introduction”, 2nd ed., MIT Press, 2018, 552 p.	en
dc.relation.references	C. Zhang, and J. M. Kovacs, “The application of small unmanned aerial systems for precision agriculture: a review,” International Journal on Advances in Precision Agriculture, no. 13, pp. 693-712, 2012. https://doi.org/10.1007/s11119-012-9274-5 .	en
dc.relation.references	J. Hu, T. Wang, J. C. Yang, Y. B. Lan, S. L. Lv, and Y. L. Zhang, “WSN-assisted UAV trajectory adjustment for pesticide drift control,” Sensors , MDPI Journal, no. 20 (19), pp. 5473, 2020. https://doi.org/10.3390/s20195473 .	en
dc.relation.references	Z. Y. Hao, X. Z. Li, C. Meng, W. Yang, and M. Z. Liб “Adaptive spraying decision system for UAV based on RL,” International Journal of Agricultural and Biological Engineering, no. 15 (4), pp. 16-26, 2022. https://doi.org/10.25165/j.ijabe.20221504.6929 .	en
dc.relation.references	C. Kang, B. Park, and J. Choiб “Scheduling PID attitude and position control for quadrotor UAVs under external disturbances,” Sensors, MDPI Journal, no. 22(1), 150, 2022. https:/doi.org/10.3390/s22010150 .	en
dc.identifier.doi	https://doi.org/10.31649/1997-9266-2025-182-5-187-191

Файли в цьому документі

Ім'я:: 191706.pdf
Розмір:: 627.0Kb
Формат:: PDF

Відкрити

Даний документ включений в наступну(і) колекцію(ї)

Вісник Вінницького політехнічного інституту. 2025. № 5 [24]

Показати скорочену інформацію