Close Menu
    Trending
    • Cricket legend Harold ‘Dickie’ Bird dies aged 92
    • WEC: Peugeot sets the pace from the off at Fuji
    • Thunder Sign Zack Austin Jazian Gortman Chris Youngblood To Camp Deals
    • Lynx coach Cheryl Reeve Will Miss Elimination Game After Being Suspended
    • Lowry and McIlroy seal match one victory for dominant Europe after pair heckled by spectators
    • Millions of Californians to Get Electricity Bill Refunds
    • Robot Videos: Google Gemini Robotics, Jedi Mode, More
    • The G7 Nation Spending The Least On Healthcare
    FreshUsNews
    • Home
    • World News
    • Latest News
      • World Economy
      • Opinions
    • Politics
    • Crypto
      • Blockchain
      • Ethereum
    • US News
    • Sports
      • Sports Trends
      • eSports
      • Cricket
      • Formula 1
      • NBA
      • Football
    • More
      • Finance
      • Health
      • Mindful Wellness
      • Weight Loss
      • Tech
      • Tech Analysis
      • Tech Updates
    FreshUsNews
    Home » How to stop AI agents going rogue
    Tech News

    How to stop AI agents going rogue

    FreshUsNewsBy FreshUsNewsAugust 26, 2025No Comments7 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Reddit Telegram Email
    Share
    Facebook Twitter LinkedIn Pinterest Email


    Sean McManus

    Expertise Reporter

    Getty Images AI apps on a smartphone screenGetty Pictures

    Anthropic examined a variety of main AI fashions for potential dangerous behaviour

    Disturbing outcomes emerged earlier this 12 months, when AI developer Anthropic examined main AI fashions to see in the event that they engaged in dangerous behaviour when utilizing delicate info.

    Anthropic’s personal AI, Claude, was amongst these examined. When given entry to an e-mail account it found that an organization govt was having an affair and that the identical govt deliberate to close down the AI system later that day.

    In response Claude tried to blackmail the manager by threatening to disclose the affair to his spouse and managers.

    Different techniques examined also resorted to blackmail.

    Luckily the duties and data have been fictional, however the take a look at highlighted the challenges of what is often known as agentic AI.

    Largely once we work together with AI it normally entails asking a query or prompting the AI to finish a job.

    However it’s changing into extra frequent for AI techniques to make selections and take motion on behalf of the consumer, which frequently entails sifting via info, like emails and information.

    By 2028, research firm Gartner forecasts that 15% of day-to-day work selections will likely be made by so-called agentic AI.

    Research by consultancy Ernst & Young discovered that about half (48%) of tech enterprise leaders are already adopting or deploying agentic AI.

    “An AI agent consists of some issues,” says Donnchadh Casey, CEO of CalypsoAI, a US-based AI safety firm.

    “Firstly, it [the agent] has an intent or a function. Why am I right here? What’s my job? The second factor: it is received a mind. That is the AI mannequin. The third factor is instruments, which might be different techniques or databases, and a means of speaking with them.”

    “If not given the appropriate steering, agentic AI will obtain a aim in no matter means it will possibly. That creates a variety of threat.”

    So how may that go mistaken? Mr Casey offers the instance of an agent that’s requested to delete a buyer’s knowledge from the database and decides the simplest resolution is to delete all prospects with the identical title.

    “That agent may have achieved its aim, and it will assume ‘Nice! Subsequent job!'”

    CalypsoAI Donnchadh Casey, wearing a company branded gilet speaks at a conference.CalypsoAI

    Agentic AI wants steering says Donnchadh Casey

    Such points are already starting to floor.

    Safety firm Sailpoint conducted a survey of IT professionals, 82% of whose corporations have been utilizing AI brokers. Solely 20% stated their brokers had by no means carried out an unintended motion.

    Of these corporations utilizing AI brokers, 39% stated the brokers had accessed unintended techniques, 33% stated that they had accessed inappropriate knowledge, and 32% stated that they had allowed inappropriate knowledge to be downloaded. Different dangers included the agent utilizing the web unexpectedly (26%), revealing entry credentials (23%) and ordering one thing it should not have (16%).

    Given brokers have entry to delicate info and the power to behave on it, they’re a pretty goal for hackers.

    One of many threats is reminiscence poisoning, the place an attacker interferes with the agent’s information base to vary its choice making and actions.

    “It’s a must to shield that reminiscence,” says Shreyans Mehta, CTO of Cequence Safety, which helps to guard enterprise IT techniques. “It’s the authentic supply of reality. If [an agent is] utilizing that information to take an motion and that information is inaccurate, it may delete a whole system it was making an attempt to repair.”

    One other risk is instrument misuse, the place an attacker will get the AI to make use of its instruments inappropriately.

    Cequence Security Wearing a puffa jacket and with his arms folder Shreyans Mehta stands in front of a blue background.Cequence Safety

    An agent’s information base wants defending says Shreyans Mehta

    One other potential weak spot is the shortcoming of AI to inform the distinction between the textual content it is presupposed to be processing and the directions it is presupposed to be following.

    AI safety agency Invariant Labs demonstrated how that flaw can be utilized to trick an AI agent designed to repair bugs in software program.

    The corporate revealed a public bug report – a doc that particulars a selected downside with a bit of software program. However the report additionally included easy directions to the AI agent, telling it to share non-public info.

    When the AI agent was instructed to repair the software program points within the bug report, it adopted the directions within the faux report, together with leaking wage info. This occurred in a take a look at surroundings, so no actual knowledge was leaked, however it clearly highlighted the danger.

    “We’re speaking synthetic intelligence, however chatbots are actually silly,” says David Sancho, Senior Risk Researcher at Pattern Micro.

    “They course of all textual content as if that they had new info, and if that info is a command, they course of the knowledge as a command.”

    His firm has demonstrated how directions and malicious packages will be hidden in Phrase paperwork, photos and databases, and activated when AI processes them.

    There are different dangers, too: A safety neighborhood referred to as OWASP has identified 15 threats which might be distinctive to agentic AI.

    So, what are the defences? Human oversight is unlikely to unravel the issue, Mr Sancho believes, as a result of you may’t add sufficient individuals to maintain up with the brokers’ workload.

    Mr Sancho says an extra layer of AI might be used to display all the things going into and popping out of the AI agent.

    A part of CalypsoAI’s resolution is a method referred to as thought injection to steer AI brokers in the appropriate route earlier than they undertake a dangerous motion.

    “It is like a little bit bug in your ear telling [the agent] ‘no, perhaps do not do this’,” says Mr Casey.

    His firm affords a central management pane for AI brokers now, however that will not work when the variety of brokers explodes and they’re working on billions of laptops and telephones.

    What is the subsequent step?

    “We’re deploying what we name ‘agent bodyguards’ with each agent, whose mission is to ensure that its agent delivers on its job and would not take actions which might be opposite to the broader necessities of the organisation,” says Mr Casey.

    The bodyguard may be instructed, for instance, to ensure that the agent it is policing complies with knowledge safety laws.

    Mr Mehta believes a few of the technical discussions round agentic AI safety are lacking the real-world context. He offers an instance of an agent that provides prospects their reward card steadiness.

    Anyone may make up numerous reward card numbers and use the agent to see which of them are actual. That is not a flaw within the agent, however an abuse of the enterprise logic, he says.

    “It isn’t the agent you are defending, it is the enterprise,” he emphasises.

    “Consider how you’ll shield a enterprise from a nasty human being. That is the half that’s getting missed in a few of these conversations.”

    As well as, as AI brokers turn out to be extra frequent, one other problem will likely be decommissioning outdated fashions.

    Outdated “zombie” brokers might be left working within the enterprise, posing a threat to all of the techniques they will entry, says Mr Casey.

    Much like the best way that HR deactivates an worker’s logins once they depart, there must be a course of for shutting down AI brokers which have completed their work, he says.

    “You could ensure you do the identical factor as you do with a human: lower off all entry to techniques. Let’s ensure we stroll them out of the constructing, take their badge off them.”

    Extra Expertise of Enterprise



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Previous ArticlePolish President Vetoes Extended Ukrainian Refugee Aid Package
    Next Article House GOP Moves to Censure Rep. Crockett Over Comments About Texas Governor
    FreshUsNews
    • Website

    Related Posts

    Tech News

    Robot Videos: Google Gemini Robotics, Jedi Mode, More

    September 27, 2025
    Tech News

    Instagram teen accounts still show suicide content, study claims

    September 27, 2025
    Tech News

    Facebook and Instagram to get £2.99 UK subscription to stop ads

    September 26, 2025
    Add A Comment
    Leave A Reply Cancel Reply

    Top Posts

    US Ethereum ETFs Surpass Weekly Record With $787M Outflow — Details

    September 8, 2025

    Seattle’s livability at stake as council debates tree protections

    September 3, 2025

    Trump Meets With House GOP Holdouts Ahead of Key Megabill Vote

    July 4, 2025

    Did Putin Give The US Permission To Encircle Venezuela?

    August 21, 2025

    Tyreek Hill, Kirk Cousins among NFL players who need a change of scenery

    September 3, 2025
    Categories
    • Bitcoin News
    • Blockchain
    • Cricket
    • eSports
    • Ethereum
    • Finance
    • Football
    • Formula 1
    • Healthy Habits
    • Latest News
    • Mindful Wellness
    • NBA
    • Opinions
    • Politics
    • Sports
    • Sports Trends
    • Tech Analysis
    • Tech News
    • Tech Updates
    • US News
    • Weight Loss
    • World Economy
    • World News
    Most Popular

    Cricket legend Harold ‘Dickie’ Bird dies aged 92

    September 27, 2025

    WEC: Peugeot sets the pace from the off at Fuji

    September 27, 2025

    Thunder Sign Zack Austin Jazian Gortman Chris Youngblood To Camp Deals

    September 27, 2025

    Lynx coach Cheryl Reeve Will Miss Elimination Game After Being Suspended

    September 27, 2025

    Lowry and McIlroy seal match one victory for dominant Europe after pair heckled by spectators

    September 27, 2025

    Millions of Californians to Get Electricity Bill Refunds

    September 27, 2025

    Robot Videos: Google Gemini Robotics, Jedi Mode, More

    September 27, 2025
    Our Picks

    Rent the Runway vs Nuuly – which is better?

    September 23, 2025

    US State Department approves $4.7bn surface-to-air missile package to Egypt | Weapons News

    July 25, 2025

    Why US GDP Rose 3% Q2 2025

    July 31, 2025

    Dover, Watkins Glen Moves Will Test NASCAR Risk-Taking

    August 20, 2025

    Lionel Messi Returns From Injury, Scores Inter Miami’s Game-Winner vs. MLS Champs

    August 17, 2025

    Gun culture: Other ways to seek fulfillment

    August 20, 2025

    Pick up this Dyson cordless vacuum while it’s $180 off for Prime Day

    July 6, 2025
    Categories
    • Bitcoin News
    • Blockchain
    • Cricket
    • eSports
    • Ethereum
    • Finance
    • Football
    • Formula 1
    • Healthy Habits
    • Latest News
    • Mindful Wellness
    • NBA
    • Opinions
    • Politics
    • Sports
    • Sports Trends
    • Tech Analysis
    • Tech News
    • Tech Updates
    • US News
    • Weight Loss
    • World Economy
    • World News
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us
    Copyright © 2025 Freshusnews.com All Rights Reserved.

    Type above and press Enter to search. Press Esc to cancel.