
    AI bots strain Wikimedia as bandwidth surges 50%

    By Editor Times Featured, April 19, 2025

    Crawlers that evade detection

    Making the situation more difficult, many AI-focused crawlers do not play by established rules. Some ignore robots.txt directives. Others spoof browser user agents to disguise themselves as human visitors. Some even rotate through residential IP addresses to avoid blocking, tactics that have become common enough to force individual developers like Xe Iaso to adopt drastic protective measures for their code repositories.
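
    For contrast, here is a minimal sketch of what honoring robots.txt looks like on the crawler side, using Python's standard urllib.robotparser module. The bot name ExampleAIBot, the rules, and the example.org URLs are hypothetical and chosen only for illustration; a real crawler would fetch the file from the site it is visiting rather than use an inline string.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content (an assumption for this sketch).
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Disallow: /private/
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts the file as a list of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent, url in [
        ("ExampleAIBot", "https://example.org/wiki/Page"),
        ("SomeBrowser", "https://example.org/wiki/Page"),
        ("SomeBrowser", "https://example.org/private/notes"),
    ]:
        print(agent, url, is_allowed(agent, url))
```

    The check is purely voluntary: nothing in the protocol stops a crawler from skipping it, which is why ignored directives and spoofed user agents are hard to counter with robots.txt alone.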

    This leaves Wikimedia’s Site Reliability team in a perpetual state of defense. Every hour spent rate-limiting bots or mitigating traffic surges is time not spent supporting Wikimedia’s contributors, users, or technical improvements. And it’s not just content platforms under strain. Developer infrastructure, like Wikimedia’s code review tools and bug trackers, is also frequently hit by scrapers, further diverting attention and resources.
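
    To make "rate-limiting" concrete, the sketch below shows one common technique, a per-client token bucket. It is a generic illustration, not a description of Wikimedia's setup; the class names, the rate and capacity values, and the idea of keying buckets on a client identifier such as an IP address are all assumptions.

```python
import time
from typing import Dict, Optional


class TokenBucket:
    """Allows bursts up to `capacity` requests, refilled at `rate` tokens per second."""

    def __init__(self, rate: float, capacity: float, now: float = 0.0) -> None:
        self.rate = rate
        self.capacity = capacity
        self.tokens = capacity  # start with a full bucket
        self.updated = now

    def allow(self, now: float) -> bool:
        # Refill in proportion to the time elapsed, capped at capacity.
        elapsed = max(0.0, now - self.updated)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.updated = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0  # spend one token for this request
            return True
        return False


class RateLimiter:
    """Keeps one bucket per client identifier (hypothetical: an IP address)."""

    def __init__(self, rate: float, capacity: float) -> None:
        self.rate = rate
        self.capacity = capacity
        self.buckets: Dict[str, TokenBucket] = {}

    def allow(self, client_id: str, now: Optional[float] = None) -> bool:
        if now is None:
            now = time.monotonic()
        bucket = self.buckets.get(client_id)
        if bucket is None:
            bucket = TokenBucket(self.rate, self.capacity, now)
            self.buckets[client_id] = bucket
        return bucket.allow(now)
```

    A limiter keyed on IP address only slows a client that keeps the same address. Crawlers that rotate through residential IPs appear as many light clients, which is part of why this kind of defense consumes so much operator time.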

    These problems mirror others in the AI scraping ecosystem over time. Curl developer Daniel Stenberg has previously detailed how fake, AI-generated bug reports are wasting human time. On his blog, SourceHut’s Drew DeVault highlighted how bots hammer endpoints like git logs, far beyond what human developers would ever need.

    Across the Internet, open platforms are experimenting with technical solutions: proof-of-work challenges, slow-response tarpits (like Nepenthes), collaborative crawler blocklists (like “ai.robots.txt”), and commercial tools like Cloudflare’s AI Labyrinth. These approaches address the technical mismatch between infrastructure designed for human readers and the industrial-scale demands of AI training.
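
    As a rough illustration of the first item on that list, the sketch below implements a hashcash-style proof-of-work challenge: the server issues a random challenge, and the client must find a nonce whose SHA-256 hash has a given number of leading zero bits before its request is served. This is a generic sketch, not the scheme used by any particular tool named here; the function names and the 12-bit difficulty are assumptions.

```python
import hashlib
import os


def leading_zero_bits(digest: bytes) -> int:
    """Count the zero bits at the start of a hash digest."""
    bits = 0
    for byte in digest:
        if byte == 0:
            bits += 8
            continue
        bits += 8 - byte.bit_length()
        break
    return bits


def verify(challenge: bytes, nonce: int, difficulty: int) -> bool:
    """Server side: a single hash is enough to check a claimed solution."""
    digest = hashlib.sha256(challenge + str(nonce).encode()).digest()
    return leading_zero_bits(digest) >= difficulty


def solve(challenge: bytes, difficulty: int) -> int:
    """Client side: brute-force search, about 2**difficulty hashes on average."""
    nonce = 0
    while not verify(challenge, nonce, difficulty):
        nonce += 1
    return nonce


if __name__ == "__main__":
    challenge = os.urandom(16)  # fresh random challenge for each visitor
    difficulty = 12             # hypothetical setting, roughly 4,096 hashes
    nonce = solve(challenge, difficulty)
    print("solved with nonce", nonce, "valid:", verify(challenge, nonce, difficulty))
```

    The asymmetry is the point of the design: verification costs the server one hash, while solving costs the client thousands. That is negligible for a single human page view but adds up for a crawler requesting millions of pages.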

    Open commons at risk

    Wikimedia acknowledges the importance of providing “knowledge as a service,” and its content is indeed freely licensed. But as the Foundation states plainly, “Our content is free, our infrastructure is not.”

    The organization is now focusing on systemic approaches to this issue under a new initiative: WE5: Responsible Use of Infrastructure. It raises critical questions about guiding developers toward less resource-intensive access methods and establishing sustainable boundaries while preserving openness.

    The challenge lies in bridging two worlds: open knowledge repositories and commercial AI development. Many companies rely on open knowledge to train commercial models but don’t contribute to the infrastructure making that knowledge accessible. This creates a technical imbalance that threatens the sustainability of community-run platforms.

    Better coordination between AI developers and resource providers could potentially resolve these issues through dedicated APIs, shared infrastructure funding, or more efficient access patterns. Without such practical collaboration, the platforms that have enabled AI advancement might struggle to maintain reliable service. Wikimedia’s warning is clear: Freedom of access does not mean freedom from consequences.


