OpenAI updates ChatGPT-4 model with potential fix for AI “laziness” problem

January 27, 2024

On Thursday, OpenAI announced updates to the AI models that power its ChatGPT assistant. Amid less noteworthy updates, OpenAI tucked in a mention of a potential fix to a widely reported “laziness” problem seen in GPT-4 Turbo since its release in November. The company also announced a new GPT-3.5 Turbo model (with lower pricing), a new embedding model, an updated moderation model, and a new way to manage API usage.

“Today, we are releasing an updated GPT-4 Turbo preview model, gpt-4-0125-preview. This model completes tasks like code generation more thoroughly than the previous preview model and is intended to reduce cases of ‘laziness’ where the model doesn’t complete a task,” writes OpenAI in its blog post.

Since the launch of GPT-4 Turbo, a large number of ChatGPT users have reported that the ChatGPT-4 version of its AI assistant has been declining to do tasks (especially coding tasks) with the same exhaustive depth as it did in earlier versions of GPT-4. We’ve seen this behavior ourselves while experimenting with ChatGPT over time.

OpenAI has never offered an official explanation for this change in behavior, but OpenAI employees have previously acknowledged on social media that the problem is real, and the ChatGPT X account wrote in December, “We’ve heard all your feedback about GPT4 getting lazier! we haven’t updated the model since Nov 11th, and this certainly isn’t intentional. model behavior can be unpredictable, and we’re looking into fixing it.”

We reached out to OpenAI asking if it could provide an official explanation for the laziness issue but did not receive a response by press time.

New GPT-3.5 Turbo, other updates

Elsewhere in OpenAI’s blog update, the company announced a new version of GPT-3.5 Turbo (gpt-3.5-turbo-0125), which it says will offer “various improvements including higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.”

And the cost of GPT-3.5 Turbo through OpenAI’s API will decrease for the third time this year “to help our customers scale.” New input token prices are 50 percent less, at $0.0005 per 1,000 input tokens, and output prices are 25 percent less, at $0.0015 per 1,000 output tokens.

Lower token prices for GPT-3.5 Turbo will make operating third-party bots significantly less expensive, but the GPT-3.5 model is generally more likely to confabulate than GPT-4 Turbo. So we might see more scenarios like Quora’s bot telling people that eggs can melt (although the instance used a now-deprecated GPT-3 model called text-davinci-003). If GPT-4 Turbo API prices drop over time, some of those hallucination issues with third parties might eventually go away.

OpenAI also announced new embedding models, text-embedding-3-small and text-embedding-3-large, which convert content into numerical sequences, aiding in machine learning tasks like clustering and retrieval. And an updated moderation model, text-moderation-007, is part of the company’s API that “allows developers to identify potentially harmful text,” according to OpenAI.

Finally, OpenAI is rolling out improvements to its developer platform, introducing new tools for managing API keys and a new dashboard for tracking API usage. Developers can now assign permissions to API keys from the API keys page, helping to clamp down on misuse of API keys (if they get into the wrong hands) that can potentially cost developers lots of money. The API dashboard allows devs to “view usage on a per feature, team, product, or project level, simply by having separate API keys for each.”

As the media world seemingly swirls around the company with controversies and think pieces about the implications of its tech, releases like these show that the dev teams at OpenAI are still rolling along as usual with updates at a fairly regular pace. Despite the company almost completely falling apart late last year, it seems that, under the hood, it’s business as usual for OpenAI.

Platforms Rally to Push FinTech IPO Index 3% Higher

CFPB Closes Years-Long Probe Into PayPal, Venmo

Warren Buffett’s shopping extravaganza kicks off with Squishmallows pit, ‘Poor Charlie’s…

U.S. job growth totaled 175,000 in April, much less than expected,…

Carvana shares spike 30% as used car retailer posts record first…

Bitcoin Slump Triggers Warning of ‘Trouble Ahead’ for Global Markets

Block Stock Soars On Q1 Earnings, Strategic Bitcoin Investment: Details

UK to suffer slowest growth of all rich nations next year,…

First-time home buyer? Yes, it’s a tough market, says treasury secretary.

Oil falls to 7-week low on surprise US storage build, Middle…

Seniors over 65 or disability receive good news: they will be…

Why the best high-yield savings account may not come from a…

7 steps to start saving money: A comprehensive guide to saving,…

If your date of birth is between the 1st and 10th,…

How the US Retirement Age Compares With Rest of World

Rabbit was once an NFT company that it wants you to…

Light, flexible, efficient: Perovskite-based tandem solar cells

Tidal energy is coming to Alaska. But how much?

Google Announces Passkeys Adopted by Over 400 Million Accounts

Rabbit R1 AI box revealed to just be an Android app

OpenAI updates ChatGPT-4 model with potential fix for AI “laziness” problem

New GPT-3.5 Turbo, other updates

Most Viewed

Stocks and the oil market fall together thanks to global economic...

Dealmaking surge confirms the stock market got far too cheap after...

Microsoft to cover Windows 7 with advanced threat service

Trending Now

7 steps to start saving money: A comprehensive guide to saving, budgeting, and investing...

If your date of birth is between the 1st and 10th, your May Social...

Seniors over 65 or disability receive good news: they will be able to receive...

OpenAI updates ChatGPT-4 model with potential fix for AI “laziness” problem

New GPT-3.5 Turbo, other updates

RELATED ARTICLES

Most Viewed

Trending Now