2024 ж. 4 сәуір

Introducing improvements to the fine-tuning API and expanding our custom models program

Жүктелуде…

8 мамыр 2026 жылдағы жаңарту: OpenAI жеке баптау платформасын біртіндеп тоқтатып жатыр. Платформа жаңа пайдаланушыларға енді қолжетімді емес, бірақ жеке баптау платформасының қолданыстағы пайдаланушылары алдағы айларда оқыту тапсырмаларын жасай алады. Барлық жеке бапталған модельдер өздерінің базалық модельдері қолданыстан шығарылғанға⁠(жаңа терезеде ашылады) дейін инференс үшін қолжетімді болып қалады. Толық уақыт кестесі мұнда⁠(жаңа терезеде ашылады).

Әзірлеушілер модель өнімділігін арттыру үшін кідірісті азайту, дәлдікті жақсарту және шығындарды төмендету мақсатында қолдана алатын әртүрлі әдістер⁠(жаңа терезеде ашылады) бар. Іздеу арқылы толықтырылған генерация (RAG) көмегімен модель білімін кеңейту, баптау арқылы модельдің әрекетін теңшеу немесе жаңа салаға тән біліммен арнайы оқытылған модель жасау сияқты бірқатар нұсқалар әзірледік, олар тұтынушыларымыздың жасанды интеллект енгізулерін қолдауға арналған. Бүгін біз әзірлеушілерге API арқылы дәл баптауды көбірек бақылау мүмкіндігін беретін жаңа функцияларды іске қосып, арнайы модельдер жасау үшін ЖИ сарапшылары мен зерттеушілерден тұратын командамызбен жұмыс істеудің қосымша жолдарын ұсынып отырмыз.

Fine-tuning API-інің жаңа мүмкіндіктері

We launched the self-serve fine-tuning API⁠(жаңа терезеде ашылады) for GPT‑3.5 in August 2023. Since then, thousands of organizations have trained hundreds of thousands of models using our API. Fine-tuning can help models deeply understand content and augment a model’s existing knowledge and capabilities for a specific task. Our fine-tuning API also supports a larger volume of examples than can fit in a single prompt to achieve higher quality results while reducing cost and latency. Some of the common use cases of fine-tuning include training a model to generate better code in a particular programming language, to summarize text in a specific format, or to craft personalized content based on user behavior.

For example, Indeed⁠(жаңа терезеде ашылады), a global job matching and hiring platform, wants to simplify the hiring process. As part of this, Indeed launched a feature that sends personalized recommendations to job seekers, highlighting relevant jobs based on their skills, experience, and preferences. They fine-tuned GPT‑3.5 Turbo to generate higher quality and more accurate explanations. As a result, Indeed was able to improve cost and latency by reducing the number of tokens in prompt by 80%. This let them scale from less than one million messages to job seekers per month to roughly 20 million.

Today, we’re introducing new features⁠(жаңа терезеде ашылады) to give developers even more control over their fine-tuning jobs, including:

Epoch-based Checkpoint Creation: Automatically produce one full fine-tuned model checkpoint during each training epoch, which reduces the need for subsequent retraining, especially in the cases of overfitting
Comparative Playground: A new side-by-side Playground UI for comparing model quality and performance, allowing human evaluation of the outputs of multiple models or fine-tune snapshots against a single prompt
Third-party Integration: Support for integrations with third-party platforms (starting with Weights and Biases⁠(жаңа терезеде ашылады) this week) to let developers share detailed fine-tuning data to the rest of their stack
Comprehensive Validation Metrics: The ability to compute metrics like loss and accuracy over the entire validation dataset instead of a sampled batch, providing better insight on model quality
Hyperparameter Configuration: The ability to configure available hyperparameters from the Dashboard⁠(жаңа терезеде ашылады) (rather than only through the API or SDK)
Fine-Tuning Dashboard Improvements: Including the ability to configure hyperparameters, view more detailed training metrics, and rerun jobs from previous configurations

Custom Models Program бағдарламасын кеңейту

Assisted Fine-Tuning

Өткен қарашадағы DevDay іс-шарасында біз жариялаған⁠ болатынбыз: OpenAI зерттеушілерінің арнайы тобымен серіктестікте нақты бір домен үшін модельдерді үйретіп, оңтайландыруға арналған Custom Model бағдарламасы. Содан бері біз ондаған клиентпен кездесіп, олардың custom model қажеттіліктерін бағаладық және өнімділікті барынша арттыру үшін бағдарламамызды одан әрі дамыттық.

Бүгін біз Assisted Fine-Tuning ұсынысымызды Custom Model бағдарламасының бір бөлігі ретінде ресми түрде жариялаймыз. Assisted fine-tuning — бұл fine-tuning API шеңберінен тыс тәсілдерді, мысалы қосымша гиперпараметрлерді және әртүрлі parameter efficient fine-tuning (PEFT) әдістерін кеңірек ауқымда пайдалануға бағытталған техникалық командаларымызбен бірлескен жұмыс. Бұл әсіресе өз қолдану жағдайы не тапсырмасы үшін модель өнімділігін барынша арттыру мақсатында тиімді үйрету деректері құбырларын, бағалау жүйелерін және арнайы параметрлер мен әдістерді орнатуға қолдау қажет ұйымдарға пайдалы.

Мысалы, Оңтүстік Кореяда 30 миллионнан астам абонентке қызмет көрсететін телекоммуникациялық оператор SK Telecom⁠(жаңа терезеде ашылады) бастапқы назарды клиенттерге қызмет көрсетуге аудара отырып, телекоммуникация саласының сарапшысы болатын модельді бейімдегісі келді. Олар OpenAI-пен бірге GPT‑4‑ті fine-tuning арқылы корей тіліндегі телеком тақырыбындағы әңгімелердегі өнімділігін жақсартты. Бірнеше апта ішінде SKT мен OpenAI телеком клиенттеріне қызмет көрсету тапсырмаларында елеулі өнімділік жақсаруына қол жеткізді — әңгімелерді түйіндеу сапасы 35% артты, ниетті тану дәлдігі 33% өсті, ал fine-tuned модельді GPT‑4‑пен салыстырғанда қанағаттану ұпайы 3,6-дан 4,5-ке дейін (5 балдың ішінен) көтерілді.

Custom-Trained Model

In some cases, organizations need to train a purpose-built model from scratch that understands their business, industry, or domain. Fully custom-trained models imbue new knowledge from a specific domain by modifying key steps of the model training process using novel mid-training and post-training techniques. Organizations that see success with a fully custom-trained model often have large quantities of proprietary data—millions of examples or billions of tokens—that they want to use to teach the model new knowledge or complex, unique behaviors for highly specific use cases.

For example, Harvey⁠(жаңа терезеде ашылады), an AI-native legal tool for attorneys, partnered with OpenAI to create a custom-trained large language model for case law⁠. While foundation models were strong at reasoning, they lacked the extensive knowledge of legal case history and other knowledge required for legal work. After testing out prompt engineering, RAG, and fine-tuning, Harvey worked with our team to add the depth of context needed to the model—the equivalent of 10 billion tokens worth of data. Our team modified every step of the model training process, from domain-specific mid-training to customizing post-training processes and incorporating expert attorney feedback. The resulting model achieved an 83% increase in factual responses and attorneys preferred the customized model’s outputs 97% of the time over GPT‑4.

Index > Introducing Improvements > Media Item > Gif 2

Модельді баптаудың келесі қадамдары

Болашақта ұйымдардың басым көпшілігі өз саласына, бизнесіне немесе қолдану жағдайына жекелендірілген бейімделген модельдер әзірлейді деп сенеміз. Custom model жасауға арналған түрлі тәсілдердің болуының арқасында кез келген көлемдегі ұйымдар AI енгізулерінен анағұрлым мәнді, нақты әсер алу үшін жекелендірілген модельдер жасай алады. Ең бастысы — қолдану жағдайын нақты айқындау, бағалау жүйелерін жобалап енгізу, дұрыс тәсілдерді таңдау және модель оңтайлы өнімділікке жетуі үшін уақыт өте келе қайталап жетілдіруге дайын болу.

OpenAI көмегімен ұйымдардың көпшілігі self-serve fine-tuning API арқылы жылдам елеулі нәтижелер көре алады. Ал модельдерін тереңірек fine-tuning арқылы баптауы немесе модельге жаңа, доменге тән білім сіңіруі қажет ұйымдарға біздің Custom Model бағдарламаларымыз көмектесе алады.

Модельдерімізді fine-tuning арқылы баптауды бастау үшін fine-tuning API⁠(жаңа терезеде ашылады) құжаттамасына кіріңіз.

Қатысты мақалалар

Барлығын қарау

Video generation models as world simulators

Жарияланым2024 ж. 15 ақп.

Building an early warning system for LLM-aided biological threat creation

Жарияланым2024 ж. 31 қаң.

Weak-to-strong generalization

Қауіпсіздік2023 ж. 14 жел.