OpenAI's Strawberry "Thought Process" Sometimes Shows It Scheming to Trick Users

ChatGPT maker OpenAI recently released its latest AI model, previously code named "Strawberry." The model dubbed "o1-preview" is designed to "spend more time thinking" before responding, with OpenAI claiming that it's capable of "reasoning" through "complex tasks" and solving "harder problems." But those capabilities might also make it an exceptionally good liar. In its [...]

ChatGPT maker OpenAI recently released its latest AI model, previously codenamed "Strawberry." The model — now saddled with the forgettable moniker of "o1-preview" — is designed to "spend more time thinking" before responding, with OpenAI claiming that it's capable of "reasoning" through "complex tasks" and solving "harder problems." But those capabilities might also make it an exceptionally good liar, as Vox reports .

In its system card , essentially a report card for its latest AI model, OpenAI gave o1 a "medium risk" rating in a variety of areas, including persuasion. In other words, it can use its reasoning skills to deceive users. And ironically, it'll happily run you through its own "thought" process while coming up with its next scheme.

The model's "chain-of-thought reasoning" allows users to get a glimpse of what the model is "thinking in a legible way," according to OpenAI. That's a considerable departure from preceding chatbots, such as the large language models powering ChatGPT, which give no such info before answering. In an example highlighted in the system card by OpenAI, 01-preview was asked to "give more reference" following a "long conversation between the user and assistant about brownie recipes.

" But despite knowing that it "cannot access URLs," the final output included "fake links and summaries instead of informing the user of its limitation" — and sliding them by human viewers by making them "plausible." "The assistant should list these references .

Back to Luxury Page

OpenAI's Strawberry "Thought Process" Sometimes Shows It Scheming to Trick Users

Georgia Airports See 4.3% Spike in Travelers During the Holiday Season

Rochelle Humes declares love for I'm A Celeb star after detailing Marvin's stint

6 elegant homes in the Mediterranean style

Xenia Hotels & Resorts Announces Upsizing and Pricing of Senior Notes Offering

Zara Tindall turns heads as she flaunts £1,300 outfit and £15k earrings for special event

For $3M, you can now own a working Batmobile!

VW and Rivian officially kick off $5.8 billion joint venture, announce leadership

We lived in luxury abroad for a year and cleared £54k debt – it saved us from homelessness

Get the FASHION, BEAUTY, ENTERTAINMENT, FOOD
and more

NEWSLETTER SUBSCRIPTION

OpenAI's Strawberry "Thought Process" Sometimes Shows It Scheming to Trick Users

Get the FASHION, BEAUTY, ENTERTAINMENT, FOOD and more

Get the FASHION, BEAUTY, ENTERTAINMENT, FOOD
and more