Google Is the Only Search Engine That Works on Reddit Now, Thanks to AI Deal


58 points

4 months ago

“We believe in an open internet… as long as you use these specific services.”

This really sucks. So we’re looking at a future where search engines are like streaming services now. “Hmmm now which search engine was <insert website here> on?”


Red@reddthat.com

4 points

4 months ago

That’s why I use a SearXNG instance. Why bother searching for something on 1 engine when you could search for it on 5 and then correlate the results?
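
To make the "search on 5 and then correlate" idea concrete, here is a minimal sketch of one way to merge result lists from several engines. It is not how SearXNG itself ranks results; the function name `merge_results`, the engine names, and the example URLs are all made up for illustration.

```python
from collections import defaultdict


def merge_results(results_by_engine: dict[str, list[str]]) -> list[tuple[str, int]]:
    """Rank URLs by how many engines returned them.

    Ties are broken by the best (lowest) position the URL reached in any
    engine's list, then alphabetically so the output is deterministic.
    """
    engines_seen = defaultdict(set)  # url -> set of engines that returned it
    best_rank = {}                   # url -> best position across engines
    for engine, urls in results_by_engine.items():
        for rank, url in enumerate(urls):
            engines_seen[url].add(engine)
            best_rank[url] = min(rank, best_rank.get(url, rank))
    ordered = sorted(
        engines_seen,
        key=lambda u: (-len(engines_seen[u]), best_rank[u], u),
    )
    return [(url, len(engines_seen[url])) for url in ordered]


if __name__ == "__main__":
    # Hypothetical result lists from three hypothetical engines.
    sample = {
        "engine_a": ["https://example.com/x", "https://example.com/y"],
        "engine_b": ["https://example.com/y", "https://example.com/z"],
        "engine_c": ["https://example.com/y", "https://example.com/x"],
    }
    for url, votes in merge_results(sample):
        print(votes, url)
```

A URL that several independent engines agree on floats to the top, which is the rough intuition behind correlating results rather than trusting a single index.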


termus@beehaw.org

28 points

4 months ago

Does this mean the Internet Archive will no longer be archiving reddit posts? That’s how I’ve tried viewing most since I deleted my accounts.


OfficerBribe@lemm.ee

6 points

4 months ago

I honestly don’t think the Internet Archive should even be archiving behemoths like Reddit or Twitter. The only thing it should keep is currently dead sites.

Even worse when people are accessing these posts through Archive even when there is a live copy. A lot of storage and bandwidth wasted.


Kissaki@beehaw.org

6 points

4 months ago

How do you keep a currently dead website you did not previously archive?


OfficerBribe@lemm.ee

1 point

4 months ago

True, although I think there are usually signs, or the site admins give a heads-up when a site is about to go under. I doubt Reddit or Twitter will be dead any time soon.


Onihikage@beehaw.org

16 points

4 months ago

*

Counterpoint: Scumbag companies ninja-editing their timestamped warranty page such that the only way you know they edited it after you bought the product is because it was archived previously.

Archives are ideal for identifying sneaky behavior like that. You never know when an admin might have the ability to delete or edit something without anyone noticing.


neutronst4r@beehaw.org

12 points

4 months ago

But imagine this… an immoral rich human being, whose family got rich by mining blood rubies in South Africa, buys reddit for $50B. This person fires half the people and refuses to pay the bills for servers and the servers shut down… how will you access your favorite GoneWild posts? This is all fictional of course.


Handles@leminal.space

8 points

4 months ago

…but at some point those giant sites may go offline. I see the point of archiving them now for posterity, but you’re right. The archive shouldn’t be used as a concurrent mirror of those sites for privacy reasons.

I have my browser set up to redirect Reddit links to libreddit instances for that purpose.


intensely_human@lemm.ee

4 points

4 months ago

We need to do something to protect Internet Archive and its access to scrape sites.


PotentiallyApricots@beehaw.org

15 points

4 months ago

Ummmmmmmm. This seems illegal. Is this not illegal?


Daniyyel@lemm.ee

9 points

4 months ago

*

It’s a bit of a dilemma reading their policy:

We believe in the open internet and in keeping Reddit publicly accessible to foster human learning (…) Unfortunately, we see more and more entities using unauthorized access (…) especially with the rise of use cases like generative AI. This sort of misuse of public data has become more prominent as more and more platforms close themselves off from the open internet.
We still believe in an open internet, but we do not believe that third parties have a right to misuse public content just because it’s public.

Being an open/public platform, but still wanting to protect users’ content from being used for AI could be a good thing, and I guess also what many fediverse users would want for this platform. Making a distinction between AI and search indexing could indeed be difficult. But then making content deals with Google for search indexing and AI training is a bit hypocritical.


intensely_human@lemm.ee

2 points

4 months ago

I still don’t buy that protecting people’s content from being read by AI is a good thing. I think the fear of AI stealing our thunder or whatever by reading what we’ve written is overblown.


solanaceous@beehaw.org

8 points

4 months ago

We still believe in an open internet, but we do not believe that third parties have a right to misuse public content just because it’s public.

You need to pay us for the right to misuse our site’s data!


dragynbob@beehaw.org

44 points

4 months ago

I don’t have a ton of knowledge in this area, but this seems like it should run afoul of antitrust regulations?


meowMix2525@lemm.ee

6 points

4 months ago

Given lawmakers who understand how the internet works, I think it would be. To me this isn’t any different than a handful of years back when ISPs were throttling websites to give an advantage to the ones that paid them to work faster.


Daniyyel@lemm.ee

11 points

4 months ago

Who should be regulated, Google or Reddit? Reddit updated their robots.txt to disallow everything. As it’s their site, I guess it’s also their right to determine that. They then made a deal with Google, which I guess is also not abusing a dominant position by Google, as Reddit could have made a deal with anyone.
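
For anyone wondering what "disallow everything" looks like to a crawler, here is a rough sketch using Python's standard `urllib.robotparser`. The robots.txt text below is an assumed blanket policy, not a copy of Reddit's actual file, and the bot names and URL are placeholders. Note that robots.txt is only a request: honoring it is up to each crawler.

```python
from urllib.robotparser import RobotFileParser

# Assumed contents: a blanket "disallow everything" rule for every crawler.
# This is NOT quoted from Reddit's real robots.txt.
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if a well-behaved crawler may fetch `url` under `robots_txt`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Placeholder crawler names and URL, for illustration only.
    for agent in ("ExampleSearchBot", "ExampleArchiveBot"):
        allowed = is_allowed(ROBOTS_TXT, agent, "https://www.example.com/r/some_thread")
        print(agent, "allowed" if allowed else "blocked")
```

Under a rule like this, every compliant crawler is told to stay out, so any crawler that still indexes the site is presumably doing so under a separate arrangement rather than through robots.txt.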


intensely_human@lemm.ee

4 points

4 months ago

Yeah but reddit made a deal with google because google’s the big player.

It’s hard to say, but I’d lean toward Google on this one. How does reddit benefit from only being indexed by one search engine? Google must have offered them something more, to make it in reddit’s best interests.

In other words, this deal naturally benefits only google, at the cost of value to reddit and to the public. So google must be doing something that makes it worth it to reddit. Could be threat of punishment: “You give us exclusive crawl access, or we don’t crawl you”.


jarfil@beehaw.org

3 points

4 months ago

In 2023, Reddit decided to start charging exorbitant amounts for API access, making it non-viable for free 3rd party apps to access its content, citing things like AI crawlers “stealing” their (users’) content.

In 2024, Google announced an agreement with Reddit to access the API, citing things like enhanced, up-to-date search results. I don’t recall seeing whether they pay for it, or how much, but possibly they do.

It stands to reason that if Reddit has managed to get a single dime for API access, and they keep thinking free access to their users’ content is “stealing”, then Reddit would be interested in making it as hard as possible to access the content without paying.

Could be threat of punishment: “You give us exclusive crawl access, or we don’t crawl you”.

That could’ve been part of the agreement: “You give us cheap/free API access, or we don’t crawl you”.

Reddit tightening things down while trying to sell API access just happens to benefit Google.


coyotino [he/him]@beehaw.org OP

26 points

4 months ago

That was my first thought too. Yet another reason to vote for Dems this November - only one party actually gives a shit about enforcing antitrust regulations!


AVincentInSpace@pawb.social

7 points

4 months ago

Are you absolutely positive the Democrats give a shit about antitrust regulations? Biden did actively break a strike.


Onihikage@beehaw.org

6 points

4 months ago

He did at the beginning, but he helped them get what they wanted in the end, and I think that counts for something.

“We’re thankful that the Biden administration played the long game on sick days and stuck with us for months after Congress imposed our updated national agreement,” Russo said. “Without making a big show of it, Joe Biden and members of his administration in the Transportation and Labor departments have been working continuously to get guaranteed paid sick days for all railroad workers.

“We know that many of our members weren’t happy with our original agreement,” Russo said, “but through it all, we had faith that our friends in the White House and Congress would keep up the pressure on our railroad employers to get us the sick day benefits we deserve. Until we negotiated these new individual agreements with these carriers, an IBEW member who called out sick was not compensated.”


sarsaparilyptus@beehaw.org

2 points

4 months ago

Oh yeah you’re right we should just not even bother voting and let the right wing win.
