4 აპრილი, 2024

Introducing improvements to the fine-tuning API and expanding our custom models program

იტვირთება…

განახლება 2026 წლის 8 მაისის მდგომარეობით: OpenAI ეტაპობრივად აჩერებს ზუსტი რეგულირების პლატფორმის მუშაობას. პლატფორმა აღარაა ხელმისაწვდომი ახალი მომხმარებლებისთვის, თუმცა ზუსტი რეგულირების პლატფორმის არსებული მომხმარებლები მომდევნო თვეების განმავლობაში შეძლებენ წვრთნის დავალებების შექმნას. ზუსტი რეგულირების ფუნქციის მქონე მოდელები ინფერენციისთვის ხელმისაწვდომი დარჩება მანამ, სანამ მათი საბაზო მოდელები მოძველებულად არ გამოცხადდება⁠(იხსნება ახალ ფანჯარაში). სრული განრიგი იხილეთ აქ⁠(იხსნება ახალ ფანჯარაში).

არსებობს სხვადასხვა მეთოდი⁠(იხსნება ახალ ფანჯარაში), რომელთა გამოყენებაც დეველოპერებს შეუძლიათ მოდელის წარმადობის გასაზრდელად, რომ შემცირდეს დაყოვნება, გაუმჯობესდეს სიზუსტე და ასევე, შემცირდეს ხარჯები. ჩვენ შევიმუშავეთ ვარიანტები, რომლებიც კლიენტებს ეხმარება ხელოვნური ინტელექტის დანერგვაში: იქნება ეს მოდელის ცოდნის გაფართოება ძიებით გაძლიერებული გენერაციის (RAG) მეშვეობით, მოდელის ქცევის მორგება ზუსტი რეგულირებით თუ სპეციალურად გაწვრთნილი მოდელის შექმნა ახალი ცოდნით კონკრეტულ სფეროში. დღეს ვუშვებთ ახალ ფუნქციებს, რომლებიც დეველოპერებს ზუსტი რეგულირების მართვის მეტ საშუალებას მისცემს API-ს მეშვეობით და ამასთანავე, ვაფართოებთ ხელოვნური ინტელექტის ექსპერტებისა და მკვლევრების გუნდთან თანამშრომლობის შესაძლებლობებს მორგებული მოდელების შესაქმნელად.

fine-tuning API-ის ახალი ფუნქციები

We launched the self-serve fine-tuning API⁠(იხსნება ახალ ფანჯარაში) for GPT‑3.5 in August 2023. Since then, thousands of organizations have trained hundreds of thousands of models using our API. Fine-tuning can help models deeply understand content and augment a model’s existing knowledge and capabilities for a specific task. Our fine-tuning API also supports a larger volume of examples than can fit in a single prompt to achieve higher quality results while reducing cost and latency. Some of the common use cases of fine-tuning include training a model to generate better code in a particular programming language, to summarize text in a specific format, or to craft personalized content based on user behavior.

For example, Indeed⁠(იხსნება ახალ ფანჯარაში), a global job matching and hiring platform, wants to simplify the hiring process. As part of this, Indeed launched a feature that sends personalized recommendations to job seekers, highlighting relevant jobs based on their skills, experience, and preferences. They fine-tuned GPT‑3.5 Turbo to generate higher quality and more accurate explanations. As a result, Indeed was able to improve cost and latency by reducing the number of tokens in prompt by 80%. This let them scale from less than one million messages to job seekers per month to roughly 20 million.

Today, we’re introducing new features⁠(იხსნება ახალ ფანჯარაში) to give developers even more control over their fine-tuning jobs, including:

Epoch-based Checkpoint Creation: Automatically produce one full fine-tuned model checkpoint during each training epoch, which reduces the need for subsequent retraining, especially in the cases of overfitting
Comparative Playground: A new side-by-side Playground UI for comparing model quality and performance, allowing human evaluation of the outputs of multiple models or fine-tune snapshots against a single prompt
Third-party Integration: Support for integrations with third-party platforms (starting with Weights and Biases⁠(იხსნება ახალ ფანჯარაში) this week) to let developers share detailed fine-tuning data to the rest of their stack
Comprehensive Validation Metrics: The ability to compute metrics like loss and accuracy over the entire validation dataset instead of a sampled batch, providing better insight on model quality
Hyperparameter Configuration: The ability to configure available hyperparameters from the Dashboard⁠(იხსნება ახალ ფანჯარაში) (rather than only through the API or SDK)
Fine-Tuning Dashboard Improvements: Including the ability to configure hyperparameters, view more detailed training metrics, and rerun jobs from previous configurations

ჩვენი Custom Models Program-ის გაფართოება

Assisted Fine-Tuning

გასული ნოემბრის DevDay-ზე ჩვენ გამოვაცხადეთ⁠ Custom Model პროგრამა, რომელიც შექმნილია კონკრეტული სფეროსთვის მოდელების გასაწვრთნელად და ოპტიმიზაციისთვის OpenAI-ის მკვლევართა სპეციალურ ჯგუფთან პარტნიორობით. მას შემდეგ, ათეულობით მომხმარებელს შევხვდით მათი custom model საჭიროებების შესაფასებლად და ჩვენი პროგრამა კიდევ უფრო განვავითარეთ წარმადობის მაქსიმიზაციისთვის.

დღეს ოფიციალურად ვაცხადებთ assisted fine-tuning შეთავაზებას, როგორც Custom Model პროგრამის ნაწილს. Assisted fine-tuning არის თანამშრომლობითი ძალისხმევა ჩვენს ტექნიკურ გუნდებთან, რათა გამოყენებულ იქნას fine-tuning API-ს მიღმა არსებული ტექნიკები, როგორიცაა დამატებითი ჰიპერპარამეტრები და parameter efficient fine-tuning (PEFT) სხვადასხვა მეთოდი უფრო დიდ მასშტაბზე. ეს განსაკუთრებით გამოსადეგია იმ ორგანიზაციებისთვის, რომლებსაც სჭირდებათ მხარდაჭერა ეფექტური training data pipeline-ების, შეფასების სისტემებისა და ინდივიდუალურად მორგებული პარამეტრებისა და მეთოდების დასაყენებლად, რათა თავიანთი გამოყენების შემთხვევისთვის ან ამოცანისთვის მოდელის წარმადობა მაქსიმალურად გაზარდონ.

მაგალითად, SK Telecom⁠(იხსნება ახალ ფანჯარაში)-ს, ტელეკომუნიკაციების ოპერატორს, რომელიც სამხრეთ კორეაში 30 მილიონზე მეტ აბონენტს ემსახურება, სურდა მოერგო მოდელი ისე, რომ ის ტელეკომუნიკაციების სფეროში ექსპერტი გამხდარიყო, თავდაპირველად მომხმარებელთა მომსახურებაზე ფოკუსით. ისინი OpenAI-თან ერთად მუშაობდნენ GPT‑4‑ის fine-tuning-ზე, რათა გაეუმჯობესებინათ მისი წარმადობა კორეულ ენაზე ტელეკომთან დაკავშირებულ საუბრებში. რამდენიმე კვირის განმავლობაში SKT-მ და OpenAI-მ ტელეკომ მომხმარებელთა მომსახურების ამოცანებში მნიშვნელოვანი წარმადობის გაუმჯობესება მიიღეს — საუბრის შეჯამების ხარისხში 35%-იანი ზრდა, განზრახვის ამოცნობის სიზუსტეში 33%-იანი ზრდა და კმაყოფილების ქულების ზრდა 3.6-დან 4.5-მდე (5-დან), როდესაც fine-tuned მოდელს GPT‑4‑ს ადარებდნენ.

Custom-Trained მოდელი

In some cases, organizations need to train a purpose-built model from scratch that understands their business, industry, or domain. Fully custom-trained models imbue new knowledge from a specific domain by modifying key steps of the model training process using novel mid-training and post-training techniques. Organizations that see success with a fully custom-trained model often have large quantities of proprietary data—millions of examples or billions of tokens—that they want to use to teach the model new knowledge or complex, unique behaviors for highly specific use cases.

For example, Harvey⁠(იხსნება ახალ ფანჯარაში), an AI-native legal tool for attorneys, partnered with OpenAI to create a custom-trained large language model for case law⁠. While foundation models were strong at reasoning, they lacked the extensive knowledge of legal case history and other knowledge required for legal work. After testing out prompt engineering, RAG, and fine-tuning, Harvey worked with our team to add the depth of context needed to the model—the equivalent of 10 billion tokens worth of data. Our team modified every step of the model training process, from domain-specific mid-training to customizing post-training processes and incorporating expert attorney feedback. The resulting model achieved an 83% increase in factual responses and attorneys preferred the customized model’s outputs 97% of the time over GPT‑4.

Index > Introducing Improvements > Media Item > Gif 2

რა ელის წინ მოდელების მორგებას

გვჯერა, რომ მომავალში ორგანიზაციების აბსოლუტური უმრავლესობა შეიმუშავებს მორგებულ მოდელებს, რომლებიც პერსონალიზებული იქნება მათი ინდუსტრიის, ბიზნესის ან გამოყენების შემთხვევისთვის. custom model-ის შესაქმნელად ხელმისაწვდომი სხვადასხვა ტექნიკის წყალობით, ყველა ზომის ორგანიზაციას შეუძლია შექმნას პერსონალიზებული მოდელები, რათა მათი AI დანერგვებიდან უფრო მნიშვნელოვანი და სპეციფიკური შედეგი მიიღონ. მთავარი არის გამოყენების შემთხვევის მკაფიოდ განსაზღვრა, შეფასების სისტემების დაპროექტება და დანერგვა, სწორი ტექნიკების არჩევა და დროთა განმავლობაში იტერაციებისთვის მზადყოფნა, რათა მოდელმა ოპტიმალურ წარმადობას მიაღწიოს.

OpenAI-თან ერთად ორგანიზაციების უმეტესობას შეუძლია სწრაფად ნახოს მნიშვნელოვანი შედეგები self-serve fine-tuning API-ით. იმ ორგანიზაციებისთვის, რომლებსაც თავიანთი მოდელების უფრო ღრმა fine-tuning ან მოდელში ახალი, სფეროსთვის სპეციფიკური ცოდნის შეტანა სჭირდებათ, ჩვენი Custom Model პროგრამები დაგეხმარებათ.

ეწვიეთ ჩვენი fine-tuning API⁠(იხსნება ახალ ფანჯარაში)-ის დოკუმენტაციას, რათა დაიწყოთ ჩვენი მოდელების fine-tuning.

დაკავშირებული სტატიები

ყველას ნახვა

Video generation models as world simulators

პუბლიკაცია15 თებ. 2024

Building an early warning system for LLM-aided biological threat creation

პუბლიკაცია31 იან. 2024

Weak-to-strong generalization

უსაფრთხოება14 დეკ. 2023