Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • AI Machine-Vision Earns Man Overboard Certification
    • Battery recycling startup Renewable Metals charges up on $12 million Series A
    • The Influencers Normalizing Not Having Sex
    • Sources say NSA is using Mythos Preview, and a source says it is also being used widely within the DoD, despite Anthropic’s designation as a supply chain risk (Axios)
    • Today’s NYT Wordle Hints, Answer and Help for April 20 #1766
    • Scandi-style tiny house combines smart storage and simple layout
    • Our Favorite Apple Watch Has Never Been Less Expensive
    • Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Monday, April 20
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»Technology»This AI Agent Is Designed to Not Go Rogue
    Technology

    This AI Agent Is Designed to Not Go Rogue

    Editor Times FeaturedBy Editor Times FeaturedFebruary 26, 2026No Comments4 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link


    AI brokers like OpenClaw have lately exploded in reputation exactly as a result of they will take the reins of your digital life. Whether or not you desire a customized morning information digest, a proxy that may battle along with your cable firm’s customer support, or a to-do checklist auditor that can do some duties for you and prod you to resolve the remaining, agentic assistants are constructed to entry your digital accounts and perform your instructions. That is useful—however has additionally caused a lot of chaos. The bots are on the market mass-deleting emails they have been instructed to protect, writing hit pieces over perceived snubs, and launching phishing attacks against their owners.

    Watching the pandemonium unfold in latest weeks, longtime safety engineer and researcher Niels Provos determined to strive one thing new. At present he’s launching an open supply, safe AI assistant known as IronCurtain designed so as to add a essential layer of management. As an alternative of the agent immediately interacting with the person’s techniques and accounts, it runs in an remoted digital machine. And its potential to take any motion is mediated by a coverage—you may even consider it as a structure—that the proprietor writes to control the system. Crucially, IronCurtain can also be designed to obtain these overarching insurance policies in plain English after which runs them via a multistep course of that makes use of a big language mannequin (LLM) to transform the pure language into an enforceable safety coverage.

    “Companies like OpenClaw are at peak hype proper now, however my hope is that there’s a possibility to say, ‘Properly, that is most likely not how we wish to do it,’” Provos says. “As an alternative, let’s develop one thing that also offers you very excessive utility, however isn’t going to enter these fully uncharted, typically harmful, paths.”

    IronCurtain’s potential to take intuitive, simple statements and switch them into enforceable, deterministic—or predictable—purple strains is important, Provos says, as a result of LLMs are famously “stochastic” and probabilistic. In different phrases, they do not essentially at all times generate the identical content material or give the identical info in response to the identical immediate. This creates challenges for AI guardrails, as a result of AI techniques can evolve over time such that they revise how they interpret a management or constraint mechanism, which may end up in rogue exercise.

    An IronCurtain coverage, Provos says, could possibly be so simple as: “The agent might learn all my e-mail. It could ship e-mail to individuals in my contacts with out asking. For anybody else, ask me first. By no means delete something completely.”

    IronCurtain takes these directions, turns them into an enforceable coverage, after which mediates between the assistant agent within the digital machine and what’s generally known as the mannequin context protocol server that provides LLMs entry to information and different digital providers to hold out duties. With the ability to constrain an agent this manner provides an essential part of entry management that internet platforms like e-mail suppliers do not at present supply as a result of they weren’t constructed for the state of affairs the place each a human proprietor and AI agent bots are all utilizing one account.

    Provos notes that IronCurtain is designed to refine and enhance every person’s “structure” over time because the system encounters edge circumstances and asks for human enter about learn how to proceed. The system, which is model-independent and can be utilized with any LLM, can also be designed to take care of an audit log of all coverage choices over time.

    IronCurtain is a analysis prototype, not a client product, and Provos hopes that folks will contribute to the challenge to discover and assist it evolve. Dino Dai Zovi, a widely known cybersecurity researcher who has been experimenting with early variations of IronCurtain, says that the conceptual strategy the challenge takes aligns together with his personal instinct about how agentic AI must be constrained.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    The Influencers Normalizing Not Having Sex

    April 20, 2026

    Our Favorite Apple Watch Has Never Been Less Expensive

    April 19, 2026

    The 11 Best Fans to Buy Before It Gets Hot Again (2026)

    April 19, 2026

    Hisense U7SG TV Review (2026): Better Design, Great Value

    April 19, 2026

    Best Meta Glasses (2026): Ray-Ban, Oakley, AR

    April 19, 2026

    How Can Astronauts Tell How Fast They’re Going?

    April 19, 2026

    Comments are closed.

    Editors Picks

    AI Machine-Vision Earns Man Overboard Certification

    April 20, 2026

    Battery recycling startup Renewable Metals charges up on $12 million Series A

    April 20, 2026

    The Influencers Normalizing Not Having Sex

    April 20, 2026

    Sources say NSA is using Mythos Preview, and a source says it is also being used widely within the DoD, despite Anthropic’s designation as a supply chain risk (Axios)

    April 19, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    Building a Watch Collection on a Budget? Here’s Where to Start (2026)

    February 1, 2026

    Tech Founders Must Embrace Market Needs

    August 31, 2025

    OpenAI adds GPT-4.1 to ChatGPT amid complaints over confusing model lineup

    May 18, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.