Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • Scandi-style tiny house combines smart storage and simple layout
    • Our Favorite Apple Watch Has Never Been Less Expensive
    • Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)
    • Today’s NYT Strands Hints, Answer and Help for April 20 #778
    • KV Cache Is Eating Your VRAM. Here’s How Google Fixed It With TurboQuant.
    • OneOdio Focus A1 Pro review
    • The 11 Best Fans to Buy Before It Gets Hot Again (2026)
    • A look at Dylan Patel’s SemiAnalysis, an AI newsletter and research firm that expects $100M+ in 2026 revenue from subscriptions and AI supply chain research (Abram Brown/The Information)
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Sunday, April 19
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»Technology»OpenAI Is Asking Contractors to Upload Work From Past Jobs to Evaluate the Performance of AI Agents
    Technology

    OpenAI Is Asking Contractors to Upload Work From Past Jobs to Evaluate the Performance of AI Agents

    Editor Times FeaturedBy Editor Times FeaturedJanuary 10, 2026No Comments4 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link


    OpenAI is asking third-party contractors to add actual assignments and duties from their present or earlier workplaces in order that it may well use the info to judge the efficiency of its next-generation AI models, in keeping with information from OpenAI and the coaching knowledge firm Handshake AI obtained by WIRED.

    The undertaking seems to be a part of OpenAI’s efforts to ascertain a human baseline for various duties that may then be in contrast with AI fashions. In September, the corporate launched a brand new evaluation course of to measure the efficiency of its AI fashions in opposition to human professionals throughout quite a lot of industries. OpenAI says this can be a key indicator of its progress in the direction of attaining AGI, or an AI system that outperforms people at most economically beneficial duties.

    “We’ve employed people throughout occupations to assist gather real-world duties modeled off these you’ve performed in your full-time jobs, so we are able to measure how nicely AI fashions carry out on these duties,” reads one confidential doc from OpenAI. “Take present items of long-term or complicated work (hours or days+) that you simply’ve performed in your occupation and switch every right into a activity.”

    OpenAI is asking contractors to explain duties they’ve performed of their present job or previously and to add actual examples of labor they did, in keeping with an OpenAI presentation concerning the undertaking considered by WIRED. Every of the examples needs to be “a concrete output (not a abstract of the file, however the precise file), e.g., Phrase doc, PDF, Powerpoint, Excel, picture, repo,” the presentation notes. OpenAI says folks can even share fabricated work examples created to reveal how they might realistically reply in particular situations.

    OpenAI and Handshake AI declined to remark.

    Actual-world duties have two parts, in keeping with the OpenAI presentation. There’s the duty request (what an individual’s supervisor or colleague advised them to do) and the duty deliverable (the precise work they produced in response to that request). The corporate emphasizes a number of instances in directions that the examples contractors share ought to replicate “actual, on-the-job work” that the individual has “truly performed.”

    One instance within the OpenAI presentation outlines a activity from a “Senior Life-style Supervisor at a luxurious concierge firm for ultra-high-net-worth people.” The purpose is to “Put together a brief, 2-page PDF draft of a 7-day yacht journey overview to the Bahamas for a household who will probably be touring there for the primary time.” It consists of extra particulars relating to the household’s pursuits and what the itinerary ought to seem like. The “skilled human deliverable” then exhibits what the contractor on this case would add: an actual Bahamas itinerary created for a shopper.

    OpenAI instructs the contractors to delete company mental property and personally identifiable info from the work information they add. Beneath a bit labeled “Essential reminders,” OpenAI tells the employees to “Take away or anonymize any: private info, proprietary or confidential knowledge, materials nonpublic info (e.g., inner technique, unreleased product particulars).”

    One of many information considered by WIRED doc mentions an ChatGPT device known as “Superstar Scrubbing” that gives recommendation on tips on how to delete confidential info.

    Evan Brown, an mental property lawyer with Neal & McDevitt, tells WIRED that AI labs that obtain confidential info from contractors at this scale might be topic to commerce secret misappropriation claims. Contractors who provide paperwork from their earlier workplaces to an AI firm, even scrubbed, might be vulnerable to violating their earlier employers’ non-disclosure agreements, or exposing commerce secrets and techniques.

    “The AI lab is placing quite a lot of belief in its contractors to resolve what’s and isn’t confidential,” says Brown. “In the event that they do let one thing slip via, are the AI labs actually taking the time to find out what’s and isn’t a commerce secret? It appears to me that the AI lab is placing itself at nice danger.”



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    Our Favorite Apple Watch Has Never Been Less Expensive

    April 19, 2026

    The 11 Best Fans to Buy Before It Gets Hot Again (2026)

    April 19, 2026

    Hisense U7SG TV Review (2026): Better Design, Great Value

    April 19, 2026

    Best Meta Glasses (2026): Ray-Ban, Oakley, AR

    April 19, 2026

    How Can Astronauts Tell How Fast They’re Going?

    April 19, 2026

    The ‘Lonely Runner’ Problem Only Appears Simple

    April 19, 2026

    Comments are closed.

    Editors Picks

    Scandi-style tiny house combines smart storage and simple layout

    April 19, 2026

    Our Favorite Apple Watch Has Never Been Less Expensive

    April 19, 2026

    Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)

    April 19, 2026

    Today’s NYT Strands Hints, Answer and Help for April 20 #778

    April 19, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    Side Projects Ignite Engineering Passion

    November 8, 2025

    Africa’s AI researchers are ready for takeoff

    November 12, 2024

    With One Million Displaced, Lebanon Turns to Digital Wallets for Aid

    April 5, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.