Методи тонкого налаштування штучного інтелекту

Кудрик, О. В.; Бісікало, О. В.; Здітовецький, Ю. С.; Kudryk, О.; Bisikalo, О.; Zditovetskiy, Yu.

Автор

Кудрик, О. В.

Бісікало, О. В.

Здітовецький, Ю. С.

Kudryk, О.

Bisikalo, О.

Zditovetskiy, Yu.

Дата

2024

Metadata

Показати повну інформацію

Collections

Анотації

Approaches to fine-tuning of artificial intelligence based on two proposed methods are considered. For this purpose , a review of the main methods of retraining artificial intelligence was carried out, their advantages and disadvantages were determined. The proposed methods have their own characteristics, each of them is suitable for different types of tasks. The "Step-by-Step" method is based on multiplying the information entered into the AI verification model and consolidating this information at each stage of training. Using this approach, it becomes possible to detect errors at the early stages and provide additional information to the model, which allows the artificial intelligence to learn the material better. In its turn, the "All at once" method simultaneously introduces a large amount of information to the AI model at the initial stage, after which the verification of the fixed information begins by placing questions. This approach can be effective for tasks that require rapid learning and need complex comprehension of large amounts of information. Each of the methods has its advantages and disadvantages, the effectiveness of each of them may vary depending on the specific context of application. The Step-by-Step method allows the AI to learn the details better, but may require more time and resources. The “All at once” method allows to achieve results faster, but it can lead to a superficial understanding of the material and increase the number of errors. In this work, a schematic block of each of the two methods was developed, a comparative analysis of the effectiveness of these methods was carried out using the example of fine-tuning the ChatGPT model. An experimental approbation of the AI training process was carried out based on each of the methods, due to which a comparative assessment of the effectiveness of the results of fine-tuning was performed and appropriate conclusions were drawn. The results of the study can be useful for developers and researchers working in the field of artificial intelligence and can help to use a better method of retraining for specific tasks. Therefore, modified methods of fine-tuning of artificial intelligence have been studied, which allow using a smaller amount of resources and obtaining high accuracy and efficiency of work for specific tasks.

Розглянуто підходи до тонкого налаштування штучного інтелекту на основі двох запропонованих методів. З цією метою проведено огляд основних методів донавчання штучного інтелекту, визначено їхні переваги та недоліки. Запропоновані методи мають свої особливості, кожний з них підходить для різних типів завдань. Метод «Крок за кроком» базується на поступовому внесенні інформації до моделі ШІ та перевірці засвоєння цієї інформації на кожному етапі навчання. Використовуючи цей підхід, стає можливим виявляти помилки на ранніх стадіях та надавати додаткову інформацію до моделі, що дозволяє штучному інтелекту краще засвоїти матеріал. У свою чергу, метод «Все одразу» одночасно вносить великий обсяг інформації до моделі ШІ на початковому етапі, після чого розпочинається перевірка засвоєння інформації шляхом задавання запитань. Цей підхід може бути ефективним для завдань, що потребують швидкого навчання та вимагають комплексного розуміння великих обсягів інформації. Кожен з методів має свої переваги та недоліки, ефективність кожного з них може варіюватися в залежності від конкретного контексту застосування. Метод «Крок за кроком» дозволяє штучному інтелекту краще засвоювати деталі, але може вимагати більше часу та ресурсів. Метод «Все одразу» дозволяє швидше досягати результатів, проте може спричиняти поверхневе розуміння матеріалу та збільшення кількості помилок. Розроблено блок-схеми кожного з двох методів, проведено порівняльний аналіз ефективності цих методів на прикладі тонкого налаштування моделі ChatGPT. Проведено експериментальну апробацію процесу донавчання ШІ на основі кожного з методів, завдяки чому отримано порівняльну оцінку ефективності результатів тонкого налаштування та дійшли висновків. Результати дослідження можуть бути корисними для розробників та дослідників, які працюють у сфері штучного інтелекту, та можуть допомогти обрати кращий метод донавчання для конкретних завдань. Отже, досліджено модифіковані методи тонкого налаштування штучного інтелекту, які забезпечують використання меншої кількості ресурсів та отримання високої точності та ефективності роботи для конкретних завдань.

URI:

https://ir.lib.vntu.edu.ua//handle/123456789/51093

Відкрити

197944.pdf (509.8Kb)