
Reddit blocks the Internet Archive from crawling its data – here’s why

by n70products
August 12, 2025



Andriy Onufriyenko/Getty Images

ZDNET’s key takeaways

  • The Internet Archive can now only crawl Reddit’s homepage.
  • Reddit’s goal is to block AI companies from scraping Reddit user data.
  • Publishers (and others) are suing AI firms for copyright infringement.

Reddit is protecting its privacy from AI companies that are taking roundabout approaches to scraping its content.

The social media platform, known as a resource where users can post anonymously and find information about virtually any subject, will block the Internet Archive’s Wayback Machine from indexing its online data, according to a Monday report from The Verge. The move is in response to the discovery that AI companies, unable to scrape data from Reddit directly because of the platform’s prohibitive policies, have instead been retrieving its data from indexed content on the Internet Archive and using it to train models.

The Wayback Machine will now only be able to scrape data from Reddit’s homepage, according to The Verge, while access to user profiles, comments, and post detail pages will be blocked.

Launched in 1996, the Internet Archive is a non-profit that operates a vast digital database of web content. The archive is maintained in part by the Wayback Machine, a piece of web-crawling software that gathers web pages and preserves them as they appeared when they were collected, like digital flies in amber. This serves as a resource for researchers studying the evolution of online culture and as digital forensic evidence for law enforcement, among other uses.

What Reddit’s move means

Reddit has previously flagged concerns related to the scraping of its content with the Internet Archive, according to The Verge. The non-profit was also reportedly notified before the web-crawling restrictions started going into effect yesterday.

The Internet Archive has yet to make an official statement about how it plans to respond to Reddit’s new restrictions, and at the time of writing, it has not responded to ZDNET’s request for comment. Wayback Machine director Mark Graham, however, has told several publications that the Internet Archive will “continue to have ongoing discussions about this matter” with Reddit.

Rising tension

Reddit’s reported decision to block the Wayback Machine from scraping the majority of its content arrives during a moment of mounting tension between AI companies and digital publishers, though Reddit is the first tech company to wade into the debate. The company sued Anthropic in June after discovering that the AI company was illegally scraping its data, but it has also previously signed licensing deals with both Google and OpenAI.

(Disclosure: Ziff Davis, ZDNET’s parent company, filed an April 2025 lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.)

AI developers require access to gargantuan troves of data to train generative AI models, which are designed to identify and replicate subtle mathematical patterns gleaned from those training datasets.

Many of those companies have scraped training data from publicly available websites, including social media sites and news outlets, claiming legal immunity under a concept known in copyright law as fair use. (The courts are still untangling the legitimacy of that argument, and will likely be doing so for some time.)

Many of the organizations whose content has been copiously scraped, along with a cohort of authors and other artists, have responded with lawsuits.

Others, meanwhile, have signed content licensing agreements with the likes of OpenAI, Anthropic, and Google, consenting to the use of their organizations’ data in exchange for increased visibility in the responses generated by chatbots, or other benefits.
