Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • Scandi-style tiny house combines smart storage and simple layout
    • Our Favorite Apple Watch Has Never Been Less Expensive
    • Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)
    • Today’s NYT Strands Hints, Answer and Help for April 20 #778
    • KV Cache Is Eating Your VRAM. Here’s How Google Fixed It With TurboQuant.
    • OneOdio Focus A1 Pro review
    • The 11 Best Fans to Buy Before It Gets Hot Again (2026)
    • A look at Dylan Patel’s SemiAnalysis, an AI newsletter and research firm that expects $100M+ in 2026 revenue from subscriptions and AI supply chain research (Abram Brown/The Information)
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Sunday, April 19
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»Artificial Intelligence»Generalists Can Also Dig Deep
    Artificial Intelligence

    Generalists Can Also Dig Deep

    Editor Times FeaturedBy Editor Times FeaturedSeptember 13, 2025No Comments7 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link


    Within the Creator Highlight sequence, TDS Editors chat with members of our neighborhood about their profession path in information science and AI, their writing, and their sources of inspiration. Immediately, we’re thrilled to share our dialog with Ida Silfverskiöld.

    Ida is a generalist, educated as an economist and self-taught in software program engineering. She has an expert background in product and advertising administration, which suggests she has a uncommon mix of product, advertising and growth abilities. Over the previous few years, she’s been instructing and constructing within the LLM, NLP, and laptop imaginative and prescient area, digging into areas akin to agentic AI, chain‑of‑thought methods, and the economics of internet hosting fashions.


    You studied economics, then discovered to code and moved via product, development, and now hands-on AI constructing. What perspective does that generalist path offer you that specialists generally miss?

    I’m undecided. 

    Folks see generalists as having shallow data, however generalists can even dig deep. 

    I see generalists as individuals with a number of pursuits and a drive to grasp the entire, not only one half. As a generalist you have a look at the tech, the client, the information, the market, the price of the structure, and so forth. It offers you an edge to maneuver throughout matters and nonetheless do good work. 

    I’m not saying specialists can’t do that, however generalists are likely to adapt sooner as a result of they’re used to choosing issues up shortly.

    You’ve been writing rather a lot about agentic methods recently. When do “brokers” truly outperform easier LLM + RAG patterns, and when are we overcomplicating issues?

    It depends upon the use case, however normally we throw AI into a number of issues that in all probability don’t want it. Should you can management the system programmatically, it’s best to. LLMs are nice for translating human language into one thing a pc can perceive, however additionally they introduce unpredictability.

    As for RAG, including an agent means including prices, so doing it only for the sake of getting an agent isn’t an incredible thought. You possibly can work round it through the use of smaller fashions as routers (however this provides work). I’ve added an agent to a RAG system as soon as as a result of I knew there could be questions on constructing it out to additionally “act.” So once more, it depends upon the use case. 

    While you say Agentic AI wants “evaluations” what’s your checklist of go-to metrics? And the way do you resolve which one to make use of?

    I wouldn’t say you all the time want evals, however firms will ask for them, so it’s good to know what groups measure for product high quality. If a product will likely be utilized by lots of people, ensure you have some in place. I did numerous analysis right here to grasp the frameworks and metrics which have been outlined. 

    Generic metrics are in all probability not sufficient although. You want a couple of customized ones on your use case. So the evals differ by utility. 

    For a coding copilot, you can observe what p.c of completions a developer accepts (acceptance price) and whether or not the complete chat reached the objective (completeness).

    For commerce brokers, you would possibly  measure whether or not the agent picked the suitable merchandise and whether or not solutions are grounded within the retailer’s information.

    Safety and security associated metrics are essential too, akin to bias, toxicity, and the way straightforward it’s to interrupt the system (jailbreaks, information leaks).

    For RAG, see my article the place I break down the standard metrics. Personally, I’ve solely arrange metrics for RAG to this point.

    It might be attention-grabbing to map how completely different AI apps arrange evals in an article. For instance, Shopify Sidekick for commerce brokers and different instruments akin to authorized analysis assistants.

    In your Agentic RAG Applications article, you constructed a Slack agent that takes firm data into consideration (with LlamaIndex and Modal). What design selection ended up mattering greater than anticipated? 

    The retrieval half is the place you’ll get caught, particularly chunking. While you work with RAG purposes, you break up the method into two. The primary half is about fetching the proper data, and getting it proper is essential as a result of you may’t overload an agent with an excessive amount of irrelevant data. To make it exact the chunks should be fairly small and related to the search question.

    Nevertheless, when you make the chunks too small, you danger giving the LLM too little context. With chunks which might be too massive, the search system might grow to be imprecise.

    I arrange a system that chunked primarily based on the kind of doc, however proper now I’ve an thought for utilizing context growth after retrieval. 

    One other design selection you want to remember is that though retrieval usually advantages from hybrid search, it might not be sufficient. Semantic search can join issues that reply the query with out utilizing the precise wording, whereas sparse strategies can determine actual key phrases. However sparse strategies like BM25 are token-based by default, so plain BM25 received’t match substrings.

    So, when you additionally wish to seek for substrings (a part of product IDs, that sort of factor), it’s good to add a search layer that helps partial matches as effectively.

    There’s extra, however I danger this turning into a whole article if I hold going.

    Throughout your consulting initiatives over the previous two years, what issues have come up most frequently on your shoppers, and the way do you tackle them? 

    The problems I see are that almost all firms are in search of one thing customized, which is nice for consultants, however constructing in-house is riddled with complexities, particularly for individuals who haven’t carried out it earlier than. I noticed that 95% quantity from the MIT study about initiatives failing, and I’m not shocked. I believe consultants ought to get good at sure use instances the place they’ll shortly implement and tweak the product for shoppers, having already learnt how you can do it. However we’ll see what occurs.

    You’ve written on TDS about so many various matters. The place do your article concepts come from? Shopper work, instruments you wish to attempt, or your individual experiments? And what subject or drawback is high of thoughts for you proper now?

    A little bit of every little thing, frankly. The articles additionally assist me floor my very own data, filling in lacking items I could not have researched myself but. Proper now I’m researching a bit on how smaller fashions (mid-sized, round 3B–7B) can be utilized in agent methods, safety, and particularly how you can enhance RAG. 

    Zooming out: what’s one non-obvious functionality groups ought to domesticate within the subsequent 12–18 months (technical or cultural) to grow to be genuinely AI-productive fairly than simply AI-busy?

    In all probability be taught to construct within the area (particularly for enterprise individuals): simply getting an LLM to do one thing persistently is a strategy to perceive how unpredictable LLMs are. It makes you a bit extra humble. 

    To be taught extra about Ida‘s work and keep up-to-date together with her newest articles, you may observe her on TDS or LinkedIn.



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    KV Cache Is Eating Your VRAM. Here’s How Google Fixed It With TurboQuant.

    April 19, 2026

    Proxy-Pointer RAG: Structure Meets Scale at 100% Accuracy with Smarter Retrieval

    April 19, 2026

    Dreaming in Cubes | Towards Data Science

    April 19, 2026

    AI Agents Need Their Own Desk, and Git Worktrees Give Them One

    April 18, 2026

    Your RAG System Retrieves the Right Data — But Still Produces Wrong Answers. Here’s Why (and How to Fix It).

    April 18, 2026

    Europe Warns of a Next-Gen Cyber Threat

    April 18, 2026

    Comments are closed.

    Editors Picks

    Scandi-style tiny house combines smart storage and simple layout

    April 19, 2026

    Our Favorite Apple Watch Has Never Been Less Expensive

    April 19, 2026

    Vercel says it detected unauthorized access to its internal systems after a hacker using the ShinyHunters handle claimed a breach on BreachForums (Lawrence Abrams/BleepingComputer)

    April 19, 2026

    Today’s NYT Strands Hints, Answer and Help for April 20 #778

    April 19, 2026
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    The Benefits of Being Unpredictable

    April 15, 2026

    Uber is in talks with Travis Kalanick to help fund his acquisition of Pony.ai’s US arm; Pony.ai has permits to operate robo taxis in the US and China (Mike Isaac/New York Times)

    June 26, 2025

    Commercial spyware vendor exploits used by Kremlin-backed hackers, Google says

    August 30, 2024
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.