Сучасні технології з реалізації інтелектуальних агентів з використанням 3D-симуляторів

Тарновський, А. М.; Захарченко, С. М.; Tarnovskiy, A.; Zakharchenko, S.

dc.contributor.author	Тарновський, А. М.	uk
dc.contributor.author	Захарченко, С. М.	uk
dc.contributor.author	Tarnovskiy, A.	en
dc.contributor.author	Zakharchenko, S.	en
dc.date.accessioned	2026-03-20T13:43:16Z
dc.date.available	2026-03-20T13:43:16Z
dc.date.issued	2025
dc.identifier.citation	Тарновський А. М., Захарченко С. М. Сучасні технології з реалізації інтелектуальних агентів з використанням 3D-симуляторів // Наукові праці Вінницького національного технічного університету. Електрон. текст. дані (PDF: 1,07 МБ). 2025. № 3. URI: https://praci.vntu.edu.ua/index.php/praci/article/view/852.	uk
dc.identifier.issn	2307-5376
dc.identifier.uri	https://ir.lib.vntu.edu.ua//handle/123456789/50921
dc.description.abstract	У роботі проведено комплексний огляд сучасних методів і технологій створення інтелектуальних агентів, здатних автономно діяти у віртуальних тривимірних середовищах. Розглянуто основні підходи машинного навчання, зокрема навчання з підкріпленням (RL) та його глибокі модифікації (DRL), що забезпечують формування адаптивної поведінки агентів у складних сценаріях і динамічних середовищах. Детально проаналізовано алгоритми PPO та A3C, а також методи з участю людини (RLHF, ReQueST), які дозволяють поєднати ефективність нейронних мереж із експертною оцінкою, підвищуючи безпеку і стабільність навчання. Значну увагу приділено сучасним симуляційним платформам (Unity ML-Agents, Habitat, AI2-THOR, Webots, CoppeliaSim), які забезпечують масштабоване, безпечне та контрольоване середовище для навчання, перевірки й тестування моделей у режимі, наближеному до реальності. Окремо розглянуто роль трансформерних архітектур і великих мовних та мультимодальних моделей (LLM, LMM), що відкривають можливості для побудови гнучких агентів із контекстно-залежною поведінкою, здатних інтегрувати обробку мовних, візуальних і просторових даних для прийняття рішень. Проаналізовано сучасні підходи до роботи з 3D-поданнями, включаючи хмари точок та воксельні сітки, а також ефективність архітектур PointNet/PointNet++ і VoxelNet для обробки просторової інформації з високою точністю. Наведено перспективи розвитку поєднання тривимірних симуляторів із мультимодальними моделями для досягнення більшої узгодженості між віртуальними й реальними середовищами, підвищення точності виконання завдань, оптимізації ресурсів і покращення переносимості навичок у практичні застосування. Огляд демонструє, що поєднання сучасних нейронних архітектур, потужних симуляторів і методів оптимізації навчання формує основу для створення автономних систем нового покоління, здатних ефективно адаптуватися до складних, непередбачуваних і швидкозмінних умов.	uk
dc.language.iso	uk_UA	uk_UA
dc.publisher	ВНТУ	uk
dc.relation.ispartof	Наукові праці Вінницького національного технічного університету. № 3.	uk
dc.relation.uri	https://praci.vntu.edu.ua/index.php/praci/article/view/852
dc.subject	інтелектуальний агент	uk
dc.subject	навчання з підкріпленням	uk
dc.subject	глибоке навчання	uk
dc.subject	глибока нейронна мережа	uk
dc.subject	трансформерна архітектура	uk
dc.subject	великі мультимодальні моделі	uk
dc.subject	PointNet/PointNet	en
dc.subject	VoxelNet	en
dc.subject	3D-симулятор	uk
dc.title	Сучасні технології з реалізації інтелектуальних агентів з використанням 3D-симуляторів	uk
dc.type	Article, professional native edition
dc.type	Article
dc.identifier.udc	004.8
dc.relation.references	Introduction to machine learning, neural networks, and deep learning/ R.Y.Choi et al. Translational Vision Science & TechnologyFebruary.2020.Vol.9 (2),Article14.DOI: 10.1167/tvst.9.2.14.	en
dc.relation.references	Di Franco G., Santurro M. Machine learning, artificial neural networks and social research.Qual Quant.2021.Vol. 55.P. 1007–1025.DOI: 10.1007/s11135-020-01037-y.	en
dc.relation.references	Krishnan N. AI Agents: evolution, architecture, and real-world applications. Arxiv.Org.2025. URL:https://arxiv.org/html/2503.12687v1.	en
dc.relation.references	Terven J. Deep reinforcement learning: a chronological overview and methods. АІ.2025.Vol. 6(3), Article46.DOI: 10.3390/ai6030046.	en
dc.relation.references	Deep learning,reinforcement learning, and world models/ Y.Matsuoet al. Neural Networks.2022.Vol. 152.P. 267–275.ISSN 0893-6080.	en
dc.relation.references	Goel A., Goel A.K., Kumar A. The role of artificial neural network and machine learning in utilizing spatial information. Spatial Information Research.2023.Vol. 31(3).P. 275–285.DOI: 10.1007/s41324-022-00494-x.	en
dc.relation.references	Ghasemi M., Ebrahimi D. Introduction to reinforcement learning. arXiv.2024.https://doi.org/10.48550/arXiv.2408.07712.	en
dc.relation.references	Proximalpolicy optimization algorithms / J.Schulmanet al.arXiv. 2017. https://doi.org/10.48550/arXiv.1707.06347.	en
dc.relation.references	Proximal policy optimization via enhanced exploration efficiency / J. Zhanget al. Information Sciences. 2022. Vol. 609. P. 750–765. DOI: 10.1016/j.ins.2022.07.111.	en
dc.relation.references	Asynchronous methods for deep reinforcement learning / V. Mnihet al. arXiv. 2016. https://doi.org/10.48550/arXiv.1602.01783.	en
dc.relation.references	Towards understanding asynchronous advantage actor-critic: convergence and linear speedup/ H.Shenet al.IEEE Transactions on Signal Processing.2023.Vol. 71.P.2579–2594.DOI:10.1109/TSP.2023.3268475.	en
dc.relation.references	Human-in-the-loop reinforcement learning: a survey and position on requirements, challenges, and opportunities/ С. О.Retzlaffet al.Journal of Artificial Intelligence Research.2024.Vol.79,Р.359–415.DOI:10.1613/jair.1.15348.	en
dc.relation.references	Learning human objectives by evaluating hypothetical behavior/ S.Reddyet al.Proceedings of the 37 th International Conference on Machine Learning, Online.2020.PMLR 119.Р.8020–8029.	en
dc.relation.references	Safe deep RL in 3D environments using human feedback / M. Rahtzet al. arXiv. 2022. https://doi.org/10.48550/arXiv.2201.08102.	en
dc.relation.references	Zero-shot and few-shot learning with knowledgegraphs: a comprehensive survey / J. Chenet al. Proceedings of the IEEE. 2023. Vol. 111(6). P.653–685.	en
dc.relation.references	Multi-agent system-based framework for an intelligent management of competency building/ F.Outayet al.Smart Learn. Environ. 2024.Vol. 11, Article 41.DOI:10.1186/s40561-024-00328-3.	en
dc.relation.references	Habitat 2.0:training home assistants to rearrange their habitat / A.Szot et al. Proceedings of the 35th International Conference on Neural Information Processing System (NIPS 21). 2021. Р. 251–266.	en
dc.relation.references	ProcTHOR: large-scale embodied AI using procedural generation / M.Deitkeet al.Advances in Neural Information Processing Systems. 2022. Vol. 35. P. 5982–5994.	en
dc.relation.references	AI2-THOR: An interactive 3D environment for visual AI / E. Kolveet al. ArXiv. 2017. https://doi.org/10.48550/arXiv.1712.05474.	en
dc.relation.references	Fisher J. C. A new Python API for Webots robotics simulations. Proceedings of the 21th Python In Science Conference (SCIPY 2022).2022.Р. 147–151.	en
dc.relation.references	Deepbots: A Webots-based deep reinforcement learning framework for robotics/ M.Kirtaset al.Artificial Intelligence Applications and Innovations (AIAI 2020). 2020. Р. 64–75. DOI: 10.1007/978-3-030-49186-4_6.	en
dc.relation.references	The application of CoppeliaSim EDU in robot course teaching / Y.Zhanget al. Journal of Education and Educational Research. 2024. Vol. 7 (1). Р. 167–170. DOI: 10.54097/as82qn67.	en
dc.relation.references	RL STaR Platform: reinforcement learning for simulation based training of robots / T. Blumet al. ArXiv. 2020. https://doi.org/10.48550/arXiv.2009.09595.	en
dc.relation.references	A deep reinforcement learning framework for control of robotic manipulators in simulated environments / C.Calderón-Cordovaet al. IEEE Access. 2024. Vol.12. P. 103133–103161. DOI: 10.1109/ACCESS.2024.3432741.	en
dc.relation.references	TEACh: Task-driven Embodied Agents that Cha t/ A.Padmakumaret al. Proceedings of the AAAI Conference on Artificial Intelligence. 2022. Vol. 36(2). P. 2017–2025. DOI: 10.1609/aaai.v36i2.20097.	en
dc.relation.references	Behavior: Benchmark for everyday household activities in virtual, interactive, and ecological environments / S.Srivastavaet al. Proceedings of the 5th Conference on Robot Learning (CoRL 2021). 2021. P. 477–490.	en
dc.relation.references	Review of deep learning: concepts, CNN architectures, challenges, applications, future directions / L. Alzubaidiet al.Journal of Big Data. 2021. No8, Article 53. DOI:10.1186/s40537-021-00444-8.	en
dc.relation.references	Mienye I. D., Swart T. G., Obaido G. Recurrent neural networks: a comprehensive review of architectures, variants, and applications. Information. 2024. Vol. 15(9), Article 517. DOI:10.3390/info15090517.	en
dc.relation.references	Hochreiter S., SchmidhuberJ. Long Short-Term Memory. Neural Computation. 1997. Vol. 9 (8). P. 1735–1780. DOI: 10.1162/neco.1997.9.8.1735.	en
dc.relation.references	Attention is all you need / A.Vaswaniet al.Advances in Neural Information Processing Systems. 2017. Vol. 30. Р. 5998–6008.	en
dc.relation.references	Decision transformer: reinforcementlearning via sequence modeling / L.Chenet al. Advances in Neural Information Processing Systems. 2021. Vol. 34. Р. 15084–15097.	en
dc.relation.references	Ferrara E. GenAI against humanity: nefarious applications of generative artificial intelligence and large language models. Journal of Computational Social Science. 2024.Vol. 7. P. 549–569. DOI: 10.1007/s42001-024-00250-1.	en
dc.relation.references	A survey on large language model based autonomous agents/ L.Wanget al.Frontiers of Computer Science. 2024. Vol. 18, Article 186345. DOI:10.1007/s11704-024-40231-1.	en
dc.relation.references	Kaufmann T., Bengs V., Hüllermeier E. Reinforcement learning fromhuman feedback for cyber-physical systems: on thepotentialofself-supervised pretraining.International Conference on Machine Learning For Cyber-Physical Systems. ML4CPS 2023,2024. Vol 18, Springer, Cham. Р. 11–18. DOI:10.1007/978-3-031-47062-2_2.	en
dc.relation.references	Agents play thousands of 3D video games. / Z. Xu et al. arXiv. 2025. https://doi.org/10.48550/arXiv.2503.13356.	en
dc.relation.references	Agents play thousands of 3D video games. / Z. Xu et al. arXiv. 2025. https://doi.org/10.48550/arXiv.2503.13356.	en
dc.relation.references	Katara P., Xian Z., Fragkiadaki K. Gen2sim: Scaling up robot learning in simulation with generative models. IEEE International Conference on Robotics and Automation. 2024. P. 6672–6679. DOI: 10.1109/ICRA57147.2024.10610566.	en
dc.relation.references	From Large Language Models to Large Multimodal Models: a literature review / D. Huanget al. Applied Sciences. 2024. Vol. 14, Article 5068. DOI:10.3390/app14125068.	en
dc.relation.references	TuW.,DengW., GedeonT. A closer look at the robustness of Contrastive Language-Image Pre-Training (CLIP). Proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023). 2023. Р. 13678–13691.	en
dc.relation.references	Transductive zero-shot and few-shot CLIP / S. Martinet al. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2024. P. 28816–28826. DOI: 10.1109/CVPR52733.2024.02722.	en
dc.relation.references	Wang Z. 3D Representation methods: a survey. arXiv. 2024. https://doi.org/10.48550/arXiv.2410.06475.	en
dc.relation.references	Xu Y., Tong X., Stilla U. Voxel-based representation of 3D point clouds: Methods, applications, and its potential use in the construction industry. Automation in Construction. 2021. Vol. 126, Article 103675.DOI: 10.1016/j.autcon.2021.103675.	en
dc.relation.references	PointNet: deep learning on point sets for 3D classification and segmentation/ C.R.Qiet al.Proceedings of the IEEE conference on computer vision and pattern recognition.2017. Р. 652–660.	en
dc.relation.references	Pointnet++: Deep hierarchical feature learning on point sets in a metric space/ C.R.Qiet al.Advances in neural information processing systems.2017.Vol. 30.Р.5099–5108.DOI:10.48550/arXiv.1706.02413.	en
dc.relation.references	PointNet++ network architecture with individual point level and global features on centroid forALS point cloud classification / Y.Chenet al. Remote Sensing. 2021. Vol. 13(3), Article 472. DOI: 10.3390/rs13030472.	en
dc.relation.references	Using deep learning in semantic classification for point cloud data/ X.Yaoet al. IEEE Access.2019.Vol. 7.P.37121–37130. DOI: 10.1109/ACCESS.2019.2905546.	en
dc.relation.references	3D point cloud object detection method based on multi-scale dynamic sparse voxelization /J.Wanget al.Sensor. 2024. Vol. 24(6), Article 1804.DOI: https://doi.org/10.3390/s24061804.	en
dc.relation.references	Zhou Y., Tuzel O. Voxelnet: end-to-end learning for point cloud based 3Dobject detection Proceedings of the IEEE conference on computer vision and pattern recognition. 2018. Р. 4490–4499. DOI: 10.1109/CVPR.2018.00472.	en
dc.identifier.doi	https://doi.org/10.31649/2307-5376-2025-3-123-132

Files in this item

Name:: 197485.pdf
Size:: 1.071Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Наукові праці ВНТУ. 2025. № 3 [25]

Show simple item record