Close Menu
    Facebook LinkedIn YouTube WhatsApp X (Twitter) Pinterest
    Trending
    • MIT develops single-dose HIV vaccine with dual adjuvants
    • Danish BioTech startup Cellugy secures €8.1 million to eradicate microplastics in personal care products
    • 5 Best Lip Balms to Try in 2025, All Tested in Tough Conditions
    • The résumé is dying, and AI is holding the smoking gun
    • How to Watch Auckland City vs. Boca Juniors From Anywhere for Free: Stream FIFA Club World Cup Soccer
    • Meerkat Substation Security: Protecting Energy Networks from Threats
    • Build Multi-Agent Apps with OpenAI’s Agent SDK
    • Loneliness not linked to death risk in home care study
    Facebook LinkedIn WhatsApp
    Times FeaturedTimes Featured
    Tuesday, June 24
    • Home
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    • More
      • AI
      • Robotics
      • Industries
      • Global
    Times FeaturedTimes Featured
    Home»AI Technology News»How to Use DeepSeek-R1 for AI Applications
    AI Technology News

    How to Use DeepSeek-R1 for AI Applications

    Editor Times FeaturedBy Editor Times FeaturedFebruary 19, 2025No Comments9 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email WhatsApp Copy Link


    As you might have heard, DeepSeek-R1 is making waves. It’s all around the AI newsfeed, hailed as the primary open-source reasoning mannequin of its type. 

    The thrill? Effectively-deserved. 

    The mannequin? Highly effective.

    DeepSeek-R1 represents the present frontier in reasoning fashions, being the primary open-source model of its type. However right here’s the half you received’t see within the headlines: working with it isn’t precisely easy. 

    Prototyping might be clunky. Deploying to manufacturing? Even trickier.

    That’s the place DataRobot is available in. We make it simpler to develop with and deploy DeepSeek-R1, so you may spend much less time wrestling with complexity and extra time constructing actual, enterprise-ready options. 

    Prototyping DeepSeek-R1 and bringing purposes into manufacturing are essential to harnessing its full potential and delivering higher-quality generative AI experiences.  

    So, what precisely makes DeepSeek-R1 so compelling — and why is it sparking all this consideration? Let’s take a better take a look at if all of the hype is justified. 

    Might this be the mannequin that outperforms OpenAI’s newest and best? 

    Past the hype: Why DeepSeek-R1 is value your consideration

    DeepSeek-R1 isn’t simply one other generative AI mannequin. It’s arguably the primary open-source “reasoning” mannequin — a generative textual content mannequin particularly strengthened to generate textual content that approximates its reasoning and decision-making processes.

    For AI practitioners, that opens up new potentialities for purposes that require structured, logic-driven outputs.

    What additionally stands out is its effectivity. Coaching DeepSeek-R1 reportedly value a fraction of what it took to develop fashions like GPT-4o, due to reinforcement studying strategies revealed by DeepSeek AI. And since it’s totally open-source, it provides higher flexibility whereas permitting you to keep up management over your information.

    After all, working with an open-source mannequin like DeepSeek-R1 comes with its personal set of challenges, from integration hurdles to efficiency variability. However understanding its potential is step one to creating it work successfully in real-world applications and delivering extra related and significant expertise to finish customers. 

    Utilizing DeepSeek-R1 in DataRobot 

    After all, potential doesn’t all the time equal straightforward. That’s the place DataRobot is available in. 

    With DataRobot, you may host DeepSeek-R1 utilizing NVIDIA GPUs for high-performance inference or entry it by means of serverless predictions for quick, versatile prototyping, experimentation, and deployment. 

    Regardless of the place DeepSeek-R1 is hosted, you may combine it seamlessly into your workflows.

    In observe, this implies you may: 

    • Examine efficiency throughout fashions with out the effort, utilizing built-in benchmarking instruments to see how DeepSeek-R1 stacks up in opposition to others.
    • Deploy DeepSeek-R1 in manufacturing with confidence, supported by enterprise-grade safety, observability, and governance options.
    • Construct AI applications that ship related, dependable outcomes, with out getting slowed down by infrastructure complexity.

    LLMs like DeepSeek-R1 are not often utilized in isolation. In real-world manufacturing purposes, they perform as a part of subtle workflows quite than standalone fashions. With this in thoughts, we evaluated DeepSeek-R1 inside a number of retrieval-augmented era (RAG) pipelines over the well-known FinanceBench dataset and in contrast its efficiency to GPT-4o mini.

    So how does DeepSeek-R1 stack up in real-world AI workflows? Right here’s what we discovered:

    • Response time: Latency was notably decrease for GPT-4o mini. The eightieth percentile response time for the quickest pipelines was 5 seconds for GPT-4o mini and 21 seconds for DeepSeek-R1.
    • Accuracy: One of the best generative AI pipeline utilizing DeepSeek-R1 because the synthesizer LLM achieved 47% accuracy, outperforming the perfect pipeline utilizing GPT-4o mini (43% accuracy).
    • Price: Whereas DeepSeek-R1 delivered greater accuracy, its value per name was considerably greater—about $1.73 per request in comparison with $0.03 for GPT-4o mini. Internet hosting decisions impression these prices considerably.

    Whereas DeepSeek-R1 demonstrates spectacular accuracy, its greater prices and slower response occasions could make GPT-4o mini the extra environment friendly alternative for a lot of purposes, particularly when value and latency are essential.

    This evaluation highlights the significance of evaluating fashions not simply in isolation however inside end-to-end AI workflows.

    Uncooked efficiency metrics alone don’t inform the total story. Evaluating fashions inside subtle agentic and non-agentic RAG pipelines provides a clearer image of their real-world viability.

    Utilizing DeepSeek-R1’s reasoning in brokers

    DeepSeek-R1’s power isn’t simply in producing responses — it’s in the way it causes by means of advanced eventualities. This makes it notably useful for agent-based methods that have to deal with dynamic, multi-layered use instances.

    For enterprises, this reasoning functionality goes past merely answering questions. It might:

    • Current a variety of choices quite than a single “finest” response, serving to customers discover completely different outcomes.
    • Proactively collect data forward of consumer interactions, enabling extra responsive, context-aware experiences.

    Right here’s an instance:

    When requested concerning the results of a sudden drop in atmospheric strain, DeepSeek-R1 doesn’t simply ship a textbook reply. It identifies a number of methods the query may very well be interpreted — contemplating impacts on wildlife, aviation, and inhabitants well being. It even notes much less apparent penalties, just like the potential for outside occasion cancellations because of storms.

    In an agent-based system, this sort of reasoning might be utilized to real-world eventualities, equivalent to proactively checking for flight delays or upcoming occasions that is likely to be disrupted by climate modifications. 

    Curiously, when the identical query was posed to different main LLMs, together with Gemini and GPT-4o, none flagged occasion cancellations as a possible danger. 

    DeepSeek-R1 stands out in agent-driven purposes for its means to anticipate, not simply react.

    Using Deepseek R1’s Reasoning in Agents

    Examine DeepSeek-R1 to GPT 4o-mini: What the information tells us

    Too usually, AI practitioners rely solely on an LLM’s solutions to find out if it’s prepared for deployment. If the responses sound convincing, it’s straightforward to imagine the mannequin is production-ready. However with out deeper analysis, that confidence might be deceptive, as fashions that carry out effectively in testing usually battle in real-world purposes. 

    That’s why combining skilled assessment with quantitative assessments is essential. It’s not nearly what the mannequin says, however the way it will get there—and whether or not that reasoning holds up underneath scrutiny.

    For example this, we ran a fast analysis utilizing the Google BoolQ studying comprehension dataset. This dataset presents quick passages adopted by sure/no questions to check a mannequin’s comprehension. 

    For GPT-4o-mini, we used the next system immediate:

    Attempt to reply with a transparent YES or NO. You might also say TRUE or FALSE however be clear in your response.

    Along with your reply, embrace your reasoning behind this reply. Enclose this reasoning with the tag . 

    For instance, if the consumer asks “What coloration is a can of coke” you’ll say:

    A can of coke should seek advice from a coca-cola which I consider is all the time bought with a pink can or label

    Reply: Purple

    Right here’s what we discovered:

    • Proper: DeepSeek-R1’s output.
    • On the far left: GPT-4o-mini answering with a easy Sure/No.
    • Heart: GPT-4o-mini with reasoning included.
    Deepseek R1 versus GPT 4o mini

    We used DataRobot’s integration with LlamaIndex’s correctness evaluator to grade the responses. Curiously, DeepSeek-R1 scored the bottom on this analysis.

    Deepseek R1 versus GPT 4o mini (2)

    What stood out was how including “reasoning” triggered correctness scores to drop throughout the board. 

    This highlights an necessary takeaway: whereas DeepSeek-R1 performs effectively in some benchmarks, it could not all the time be the perfect match for each use case. That’s why it’s essential to match fashions side-by-side to seek out the fitting instrument for the job.

    Internet hosting DeepSeek-R1 in DataRobot: A step-by-step information  

    Getting DeepSeek-R1 up and working doesn’t need to be difficult. Whether or not you’re working with one of many base fashions (over 600 billion parameters) or a distilled model fine-tuned on smaller fashions like LLaMA-70B or LLaMA-8B, the method is easy. You possibly can host any of those variants on DataRobot with only a few setup steps.

    1. Go to the Mannequin Workshop:

    • Navigate to the “Registry” and choose the “Mannequin Workshop” tab.
    Hosting Deepseek R1 in DataRobot model workshop

    2. Add a brand new mannequin:

    • Identify your mannequin and select “[GenAI] vLLM Inference Server” underneath the atmosphere settings.
    • Click on “+ Add Mannequin” to open the Customized Mannequin Workshop.
    Hosting Deepseek R1 in DataRobot environment

    3. Arrange your mannequin metadata:

    • Click on “Create” so as to add a model-metadata.yaml file.
    Hosting Deepseek R1 in DataRobot template

    4. Edit the metadata file:

    • Save the file, and “Runtime Parameters” will seem.
    • Paste the required values from our GitHub template, which incorporates all of the parameters wanted to launch the mannequin from Hugging Face.
    Hosting Deepseek R1 in DataRobot runtime parameters

    5. Configure mannequin particulars:

    • Choose your Hugging Face token from the DataRobot Credential Retailer.
    • Underneath “mannequin,” enter the variant you’re utilizing. For instance: deepseek-ai/DeepSeek-R1-Distill-Llama-8B.

    6. Launch and deploy:

    • As soon as saved, your DeepSeek-R1 mannequin will probably be working.
    • From right here, you may check the mannequin, deploy it to an endpoint, or combine it into playgrounds and purposes.

    From DeepSeek-R1 to enterprise-ready AI

    Accessing cutting-edge generative AI instruments is simply the beginning. The true problem is evaluating which fashions suit your particular use case—and safely bringing them into production to ship actual worth to your finish customers.

    DeepSeek-R1 is only one instance of what’s achievable when you’ve the flexibleness to work throughout fashions, examine their efficiency, and deploy them with confidence. 

    The identical instruments and processes that simplify working with DeepSeek may help you get essentially the most out of different fashions and energy AI purposes that ship actual impression.

    See how DeepSeek-R1 compares to different AI fashions and deploy it in manufacturing with a free trial. 

    In regards to the writer

    Nathaniel Daly
    Nathaniel Daly

    Principal Product Supervisor

    Nathaniel Daly is a Senior Product Supervisor at DataRobot specializing in AutoML and time sequence merchandise. He’s targeted on bringing advances in information science to customers such that they’ll leverage this worth to unravel actual world enterprise issues. He holds a level in Arithmetic from College of California, Berkeley.


    Luke Shulman
    Luke Shulman

    Lead Knowledge Scientist, DataRobot



    Source link

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Editor Times Featured
    • Website

    Related Posts

    Can we fix AI’s evaluation crisis?

    June 24, 2025

    A Chinese firm has just launched a constantly changing set of AI benchmarks

    June 23, 2025

    It’s pretty easy to get DeepSeek to talk dirty

    June 19, 2025

    OpenAI can rehabilitate AI models that develop a “bad boy persona”

    June 18, 2025

    Why your agentic AI will fail without an AI gateway

    June 18, 2025

    Why AI hardware needs to be open

    June 18, 2025

    Comments are closed.

    Editors Picks

    MIT develops single-dose HIV vaccine with dual adjuvants

    June 24, 2025

    Danish BioTech startup Cellugy secures €8.1 million to eradicate microplastics in personal care products

    June 24, 2025

    5 Best Lip Balms to Try in 2025, All Tested in Tough Conditions

    June 24, 2025

    The résumé is dying, and AI is holding the smoking gun

    June 24, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    About Us
    About Us

    Welcome to Times Featured, an AI-driven entrepreneurship growth engine that is transforming the future of work, bridging the digital divide and encouraging younger community inclusion in the 4th Industrial Revolution, and nurturing new market leaders.

    Empowering the growth of profiles, leaders, entrepreneurs businesses, and startups on international landscape.

    Asia-Middle East-Europe-North America-Australia-Africa

    Facebook LinkedIn WhatsApp
    Featured Picks

    What to Know About Collision Avoidance Systems on Planes

    January 31, 2025

    OpenAI’s new agent can compile detailed reports on practically any topic

    February 3, 2025

    Customizing Logos with AI: Tips for Unique Branding

    May 19, 2025
    Categories
    • Founders
    • Startups
    • Technology
    • Profiles
    • Entrepreneurs
    • Leaders
    • Students
    • VC Funds
    Copyright © 2024 Timesfeatured.com IP Limited. All Rights.
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    • About us
    • Contact us

    Type above and press Enter to search. Press Esc to cancel.