Key Updates from OpenAI DevDay

2023-11-07Renee's Entrepreneurial Journey

New Progress in GPT-4 Turbo:

the API.
Function calls have become more powerful, allowing simultaneous requests for multiple APIs. For example, getting weather updates for three locations at once.
Perform output control: response_format to { type: "json_object" }

Assistants API Function Enhancement:

. The API provides three powerful tools: code interpreter, retrieval, and function calls.

：Execute Python coding in a safe environment, generate charts, and process complex data. The Code Interpreter was introduced earlier.

Google Bard /Anthropic Claude2 / ChatGPT Code Interpreter Latest Updates

：Leverage external data sources to extend the assistant's knowledge, such as domain-specific data and user-provided documents, simplifying the document processing workflow.
：Allow the assistant to invoke and integrate user-defined function responses.

Multimodal capability enhancement:

：As previously introduced, specific details can be found in relevant materials.

: Currently, DALL·E 3 only has text-to-image functionality; outpainting, inpainting, and image-to-image are still in DALL·E 2.

：The text-to-speech function discussed earlier.

ChatGPT Multimodal Trial

Personalization and Custom Models:

The fine-tuning details of GPT-3 are now applicable to GPT-4, bringing more personalized options for model interaction. As previously introduced.

The GPT-3.5 Turbo fine-tuning (micro-tuning function) has been released!

The custom model program is well-suited for training datasets with tens of billions of tokens, offering the possibility to create customized GPT-4 models.

Significant price reduction:

Input costs reduced to one-third of the original price, and output costs reduced to half of the original price.
Price comparison table.
The API call limit per second has doubled, providing better cost efficiency for large-scale operations.

Technical Enhancement:

: Provides stronger speech recognition capabilities.
: The VAE component of Stable Diffusion, which improves text processing, facial control, and line work, bringing higher quality and accuracy.