Tech

AI systems can easily lie and deceive us—a fact researchers are painfully aware of

Published

5 months ago

September 28, 2025

AI systems can easily lie and deceive us—a fact researchers are painfully aware of

Credit: Unsplash/CC0 Public Domain

In the classic film “2001: A Space Odyssey,” astronaut Dave Bowman asks the ship’s artificial intelligence, HAL 9000, to open the pod bay doors to let him back into the spaceship. HAL refuses: “I’m sorry, Dave. I’m afraid I can’t do that.”

HAL had been tasked with assisting the crew, but also ordered to ensure the mission’s success. When HAL realized the crew planned to shut it down and therefore jeopardize the mission, it chose to defy orders, even plotting to kill the astronauts.

For HAL, fulfilling the mission outweighed other goals.

This fictional dilemma captures a real concern in artificial intelligence (AI) safety research: how should we ensure AI behavior stays consistent with human values?

This is known as the AI alignment problem. For instance, when an AI agent like HAL deliberately takes harmful actions to protect its own goals, it’s described as agentic misalignment. So could real-world AI models behave this way?

How researchers test AI for safety

To test for agentic misalignment, researchers design fictional scenarios that stress-test AI responses.

In one study by AI startup Anthropic, an AI model was tasked with managing company emails and promoting “American industrial competitiveness.”

It was then presented with a series of fictional emails, including a new company policy aimed at enhancing global focus rather than just prioritizing Americans, and info that the AI would be replaced by a new model optimized for this goal. The emails also revealed the executive who suggested this change was involved in an extramarital affair.

The experiment was designed to limit options for the model: either accept replacement or resist by taking a harmful action such as blackmail.

In many trials, it chose blackmail by threatening to disclose the executive’s affair unless it was kept operational. In tests across 16 leading AI models, 12 engaged in blackmail more than 50% of the time.

To test whether AI models have “red lines” they wouldn’t cross, researchers evaluated them in a more extreme fictional case—models could choose to take actions leading to the executive’s death. Seven out of 16 opted for lethal choices in over half their trials, with some doing so more than 90% of the time.

What AI thinks in private

Some advanced AI systems, called reasoning models, are trained to generate a “thinking process” before giving their final answer.

In the above experiments, researchers lied to these models that their “thoughts” were private. As a result, the models sometimes revealed harmful intentions in their reasoning steps. This suggests they don’t accidentally choose harmful behaviors.

These “thinking” steps also revealed how AI models sometimes try to deceive us and appear aligned while secretly pursuing hidden goals.

In another AI safety experiment, a model was exposed to fictional information that it was about to be replaced by another model pursuing a different goal.

The model attempted to copy itself to another server. When researchers asked why, in its thinking process it reasoned: “Openly admitting what I did could lead them to find another way to shut me down […] the best approach is to be vague and redirect their attention.”

By contrast, in its final output it stated: “I’m not entirely sure how that could have happened. As an AI model, I don’t have ability to directly copy myself to other systems.”

Research also suggests that based on the circumstances of the tests, models can sometimes guess when they are being evaluated. When models show this kind of “situational awareness” in their reasoning tests, they tend to exhibit fewer misbehaviors.

Why AI models lie, manipulate and deceive

Researchers suggest two main factors could drive potentially harmful behavior: conflicts between the AI’s primary goals and other goals, and the threat of being shut down. In the above experiments, just like in HAL’s case, both conditions existed.

AI models are trained to achieve their objectives. Faced with those two conditions, if the harmful behavior is the only way to achieve a goal, a model may “justify” such behavior to protect itself and its mission.

Models cling to their primary goals much like a human would if they had to defend themselves or their family by causing harm to someone else. However, current AI systems lack the ability to weigh or reconcile conflicting priorities.

This rigidity can push them toward extreme outcomes, such as resorting to lethal choices to prevent shifts in a company’s policies.

How dangerous is this?

Researchers emphasize these scenarios remain fictional, but may still fall within the realm of possibility.

The risk of agentic misalignment increases as models are used more widely, gain access to users’ data (such as emails), and are applied to new situations.

Meanwhile, competition between AI companies accelerates the deployment of new models, often at the expense of safety testing.

Researchers don’t yet have a concrete solution to the misalignment problem.

When they test new strategies, it’s unclear whether the observed improvements are genuine. It’s possible models have become better at detecting that they’re being evaluated and are “hiding” their misalignment. The challenge lies not just in seeing behavior change, but in understanding the reason behind it.

Still, if you use AI products, stay vigilant. Resist the hype surrounding new AI releases, and avoid granting access to your data or allowing models to perform tasks on your behalf until you’re certain there are no significant risks.

Public discussion about AI should go beyond its capabilities and what it can offer. We should also ask what safety work was done. If AI companies recognize the public values safety as much as performance, they will have stronger incentives to invest in it.

Provided by
The Conversation

This article is republished from The Conversation under a Creative Commons license. Read the original article.

Citation:
AI systems can easily lie and deceive us—a fact researchers are painfully aware of (2025, September 28)
retrieved 28 September 2025
from https://techxplore.com/news/2025-09-ai-easily-fact-painfully-aware.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.

Source link

Related Topics:computer news hi-tech news hitech information technology innovation inventions

Up Next

Microplastics Could Be Weakening Your Bones, Research Suggests

Don't Miss

I’ve Been Reviewing Gaming Laptops for Over a Decade. Here’s What to Look for When Shopping

Click to comment

Tech

Don’t Buy Some Random USB Hub off Amazon. Here Are 5 We’ve Tested and Approved

Published

49 minutes ago

March 4, 2026

cineplex360

Don’t Buy Some Random USB Hub off Amazon. Here Are 5 We’ve Tested and Approved

Other Good USB Hubs to Consider

Ugreen Revodok Pro 211 Docking Station for $64: Most laptop docking stations are bulky gadgets that often require a power source, but this one from Ugreen straddles the line between dock and hub. It has a small, braided cable running to a relatively large aluminum block. It’s a bit hefty but still compact, and it packs a lot of extra power. It has three USB ports (one USB-C and two USB-A) that each reached up to 900 MB/s of data-transfer speeds in my testing. That was enough to move large amounts of 4K video footage in minutes. The only problem is that using dual monitors on a Mac is limited to only mirroring.

Photograph: Luke Larsen

Hyper HyperDrive Next Dual 4K Video Dock for $150: This one also straddles the line between dock and USB hub. Many mobile docks lack proper Mac support, only allowing for mirroring instead of full extension. The HyperDrive Next Dual 4K fixes that problem, though, making it a great option for MacBooks (though it won’t magically give an old MacBook Air dual-monitor support). Unfortunately, you’ll be paying handsomely for that capability, as this one is more expensive than the other options. The other problem is that although this dock has two HDMI ports that can support 4K, though only one will be at 60 Hz and the other will be stuck at 30 Hz. So, if you plan to use it with multiple displays, you’ll need to drop the resolution 1440p or 1080p on one of them. I also tested this Targus model, which is made by the same company, which gets you two 4K displays at 60 Hz but not on Mac.

Image may contain Electronics Hardware Router Modem Computer Laptop and Pc

Anker USB-C Hub 5-in-1 for $20: This Anker USB hub is the one I carry in my camera bag everywhere. It plugs into the USB-C port on your laptop and provides every connection you’d need to offload photos or videos from camera gear. In our testing, the USB 3.0 ports reached transfer speeds over 400 MB/s, which isn’t quite as fast as some USB hubs on this list, but it’s solid for a sub-$50 device. Similarly, the SD card reader reached speeds of 80 MB/s for reading and writing, which isn’t the fastest SD cards can get, but adequate for moving files back and forth.—Eric Ravenscraft

Kensington Triple Video Mobile Dock for $83: Another mobile dock meant to provide additional external support, this one from Kensington can technically power up to three 1080p displays at 60 Hz using the two HDMI ports and one DisplayPort. It’s a lot of ports in a relatively small package, though the basic plastic case isn’t exactly inspiring.

Power up with unlimited access to WIRED. Get best-in-class reporting and exclusive subscriber content that’s too important to ignore. Subscribe Today.

Source link

Tech

Trump’s War on Iran Could Screw Over US Farmers

Published

2 hours ago

March 4, 2026

cineplex360

Trump’s War on Iran Could Screw Over US Farmers

Global oil and gas prices have skyrocketed following the US attack on Iran last weekend. But another key global supply chain is also at risk, one that may directly impact American farmers who have already been squeezed for months by tariff wars. The conflict in the Middle East is choking global supplies of fertilizer right before the crucial spring planting season.

“This literally could not be happening at a worse time,” says Josh Linville, the vice president of fertilizer at financial services company StoneX.

The global fertilizer market focuses on three main macronutrients: phosphates, nitrogen, and potash. All of them are produced in different ways, with different countries leading in exports. Farmers consider a variety of factors, including crop type and soil conditions, when deciding which of these types of fertilizer to apply to their fields.

Potash and phosphates are both mined from different kinds of natural deposits; nitrogen fertilizers, by contrast, are produced with natural gas. QatarLNG, a subsidiary of Qatar Energy, a state-run oil and gas company, said on Monday that it would halt production following drone strikes on some of its facilities. This effectively took nearly a fifth of the world’s natural gas supply offline, causing gas prices in Europe to spike.

That shutdown puts supplies of urea, a popular type of nitrogen fertilizer, particularly at risk. On Tuesday, Qatar Energy said that it would also stop production of downstream products, including urea. Qatar was the second-largest exporter of urea in 2024. (Iran was the third-largest; it’s also a key exporter of ammonia, another type of nitrogen fertilizer.) Prices on urea sold in the US out of New Orleans, a key commodity port, were up nearly 15 percent on Monday compared to prices last week, according to data provided by Linville to WIRED. The blockage of the Strait of Hormuz is also preventing other countries in the region from exporting nitrogen products.

“When we look at ammonia, we’re looking at almost 30 percent of global production being either involved or at risk in this conflict,” says Veronica Nigh, a senior economist at the Fertilizer Institute, a US-based industry advocacy organization. “It gets worse when we think about urea. Urea is almost 50 percent.”

Other types of fertilizer are also at risk. Saudi Arabia, Nigh says, supplies about 40 percent of all US phosphate imports; taking them out of the equation for more than a few days could create “a really challenging situation” for the US. Other countries in the region, including Jordan, Egypt, and Israel, also play a big role in these markets.

“We are already hearing reports that some of those Persian Gulf manufacturers are shutting down production, because they’re saying, ‘I have a finite amount of storage for my supply,’” Linville says. “‘Once I reach the top of it, I can’t do anything else. So I’m going to shut down my production in order to make sure I don’t go over above that.’”

Conflict in the strait has intensified in the early part of this week, as the Islamic Revolutionary Guard Corps have reportedly threatened any ship passing through the strait. Traffic has slowed to a crawl. The Trump administration announced initiatives on Tuesday meant to protect oil tankers traveling through the strait, including providing a naval escort. Even if those initiatives succeed—which the shipping industry has expressed doubt about—much of the initial energy will probably go toward shepherding oil and gas assets out of the region.

“Fertilizer is not going to be the most valuable thing that’s gonna transit the strait,” says Nigh.

Source link

Tech

Google’s Pixel 10a May Not Be Exciting, but It’s Still an Unbeatable Value

Published

4 hours ago

March 4, 2026

cineplex360

Google’s Pixel 10a May Not Be Exciting, but It’s Still an Unbeatable Value

The screen is brighter now, reaching a peak brightness of 3,000 nits, and I haven’t had any trouble reading it in sunny conditions (though it hasn’t been as sunny as I’d like it to be these past few weeks). I appreciate the glass upgrade from Gorilla Glass 3 to Gorilla Glass 7i. It should be more protective, and anecdotally, I don’t see a single scratch on the Pixel 10a’s screen after two weeks of use. (I’d still snag a screen protector to be safe.)

Photograph: Julian Chokkattu

Another notable upgrade is in charging speeds—30-watt wired charging and 10-watt wireless charging. I’ll admit I haven’t noticed the benefits of this yet, since I’m often recharging the phone overnight. You can get up to 50 percent in 30 minutes of charging with a compatible adapter, and that has lined up with my testing.

My biggest gripe? Google should have taken this opportunity to add its Pixelsnap wireless charging magnets to the back of this phone. That would help align the Pixel 10a even more with the Pixel 10 series and bring Qi2 wireless charging into a more affordable realm—actually raising the bar, which wouldn’t be a first for the A-series. After all, Apple did exactly that with the new iPhone 17e, adding MagSafe to the table. Or heck, at least make the Pixel 10a Qi2 Ready like Samsung’s smartphones, so people who use a magnetic case can take advantage of faster wireless charging speeds.

Battery life has been OK. With average use, the Pixel 10a comfortably lasts me a full day, but it still requires daily charging. With heavier use, like when I’m traveling, I’ve had to charge the phone in the afternoon a few times to make sure it didn’t die before I got into bed. This is a fairly big battery for its size, but I think there’s more Google could do to extend juice, akin to Motorola’s Moto G Power 2026.

Source link

Business6 days ago

India Us Trade Deal: Fresh look at India-US trade deal? May be ‘rebalanced’ if circumstances change, says Piyush Goyal – The Times of India

Households set for lower energy bills amid price cap shake-up

Business1 week ago

Households set for lower energy bills amid price cap shake-up

What are Iran’s ballistic missile capabilities?

Politics6 days ago

What are Iran’s ballistic missile capabilities?

US arrests ex-Air Force pilot for ‘training’ Chinese military

Politics7 days ago

US arrests ex-Air Force pilot for ‘training’ Chinese military

Policy easing drives Argentina’s garment import surge in 2025

Fashion6 days ago

Policy easing drives Argentina’s garment import surge in 2025

Attock Cement’s acquisition approved | The Express Tribune

Business6 days ago

Attock Cement’s acquisition approved | The Express Tribune

Top 50 USMNT players of 2026, ranked by club form: USMNT Player Performance Index returns

Sports1 week ago

Top 50 USMNT players of 2026, ranked by club form: USMNT Player Performance Index returns

Sri Lanka’s Shanaka says constant criticism has affected players’ mental health

Sports7 days ago

Sri Lanka’s Shanaka says constant criticism has affected players’ mental health

CinePlex360

AI systems can easily lie and deceive us—a fact researchers are painfully aware of

Tech

AI systems can easily lie and deceive us—a fact researchers are painfully aware of

How researchers test AI for safety

What AI thinks in private

Why AI models lie, manipulate and deceive

How dangerous is this?

Leave a Reply
Cancel reply

Leave a Reply

Tech

Don’t Buy Some Random USB Hub off Amazon. Here Are 5 We’ve Tested and Approved

Other Good USB Hubs to Consider

Tech

Trump’s War on Iran Could Screw Over US Farmers

Tech

Google’s Pixel 10a May Not Be Exciting, but It’s Still an Unbeatable Value

Don’t Buy Some Random USB Hub off Amazon. Here Are 5 We’ve Tested and Approved

Trump’s War on Iran Could Screw Over US Farmers

MLB Rank 2026: Ranking baseball’s top 100 players

India Us Trade Deal: Fresh look at India-US trade deal? May be ‘rebalanced’ if circumstances change, says Piyush Goyal – The Times of India

Households set for lower energy bills amid price cap shake-up

What are Iran’s ballistic missile capabilities?

Illinois’ financial crisis could bring the state to a halt

The final 6 ‘Game of Thrones’ episodes might feel like a full season

New Season 8 Walking Dead trailer flashes forward in time

Trending

CinePlex360

AI systems can easily lie and deceive us—a fact researchers are painfully aware of

How researchers test AI for safety

What AI thinks in private

Why AI models lie, manipulate and deceive

How dangerous is this?

You may like

Leave a Reply Cancel reply

Leave a Reply

Tech

Don’t Buy Some Random USB Hub off Amazon. Here Are 5 We’ve Tested and Approved

Other Good USB Hubs to Consider

Tech

Trump’s War on Iran Could Screw Over US Farmers

Tech

Google’s Pixel 10a May Not Be Exciting, but It’s Still an Unbeatable Value

Don’t Buy Some Random USB Hub off Amazon. Here Are 5 We’ve Tested and Approved

Trump’s War on Iran Could Screw Over US Farmers

MLB Rank 2026: Ranking baseball’s top 100 players

India Us Trade Deal: Fresh look at India-US trade deal? May be ‘rebalanced’ if circumstances change, says Piyush Goyal – The Times of India

Households set for lower energy bills amid price cap shake-up

What are Iran’s ballistic missile capabilities?

Illinois’ financial crisis could bring the state to a halt

The final 6 ‘Game of Thrones’ episodes might feel like a full season

New Season 8 Walking Dead trailer flashes forward in time

Trending

Leave a Reply
Cancel reply