5 points

Does it matter if I’m okay with it? If I use those websites, it’s going to happen. And honestly, it’s probably happening with Lemmy too since any company can scrape it for free. That’s just the sad state of the internet at this point. If you write something anywhere public, it’s probably being used to train AI.

1 point

They don’t even need to scrape it. Set up an ActivityPub endpoint and suck it all in nice and json’d.
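To make that concrete, here is a minimal sketch of what "nice and json'd" could look like on the receiving end. It assumes a federated server delivers a `Create` activity whose `object` is a `Note` (a comment) or a `Page` (a post) with HTML in `content`; the sample payload and the `note_text` helper are made up for illustration, and real payloads carry many more fields.

```python
import json
from html.parser import HTMLParser


class _TextOnly(HTMLParser):
    """Collects the text nodes of an HTML fragment and ignores the tags."""

    def __init__(self):
        super().__init__()
        self.parts = []

    def handle_data(self, data):
        self.parts.append(data)


def note_text(activity_json):
    """Return the plain text of a Create activity's object, or None.

    Hypothetical helper: assumes the object is embedded as a dict of
    type Note or Page, which is an assumption about the sender.
    """
    activity = json.loads(activity_json)
    if activity.get("type") != "Create":
        return None
    obj = activity.get("object")
    if not isinstance(obj, dict) or obj.get("type") not in ("Note", "Page"):
        return None
    parser = _TextOnly()
    parser.feed(obj.get("content", ""))
    parser.close()  # flush any buffered text
    return "".join(parser.parts).strip()


# Made-up sample activity, not captured from a real instance.
sample = json.dumps({
    "type": "Create",
    "actor": "https://example.org/u/alice",
    "object": {
        "type": "Note",
        "content": "<p>Hello &amp; welcome</p>",
    },
})

print(note_text(sample))
```

No HTML scraping is involved: the text arrives already structured, which is the point being made above.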

27 points

It is public… I have a bigger issue with Reddit thinking they own it.

5 points

According to the TOS you agree to when signing up for Reddit, they don’t outright own your content, but they do hold a license to do pretty much anything they want with it:

You retain any ownership rights you have in Your Content, but you grant Reddit the following license to use that Content:

When Your Content is created with or submitted to the Services, you grant us a worldwide, royalty-free, perpetual, irrevocable, non-exclusive, transferable, and sublicensable license to use, copy, modify, adapt, prepare derivative works of, distribute, store, perform, and display Your Content and any name, username, voice, or likeness provided in connection with Your Content in all media formats and channels now known or later developed anywhere in the world. This license includes the right for us to make Your Content available for syndication, broadcast, distribution, or publication by other companies, organizations, or individuals who partner with Reddit. You also agree that we may remove metadata associated with Your Content, and you irrevocably waive any claims and assertions of moral rights or attribution with respect to Your Content.

7 points

No, no. They own it up until the point it is illegal. Then it is your content and your fault.

What I am getting at: double standards.

2 points

exactly

2 points

In my opinion, Stack Overflow has vague answers 80% of the time. If AI helps filter out all the bullshit, I’m OK with that. And I don’t see how Reddit or Facebook bot comments are a good source of information.

3 points

Yep. I knew it was public when I posted it. The thing I have a problem with is when my non-public content is used by people I didn’t intend it for.

0 points

Exactly, but it’s far easier to spread panic with generalizations than it is to talk about the specifics.

2 points

According to the GDPR, people’s data may not be used for purposes other than the ones they agreed to. I had an e-mail discussion with them and filed a complaint with my country’s GDPR enforcement agency. They take a while to investigate and react, but hopefully, if enough people complain, they will take it seriously.

3 points

An AI using what I say to train it will only make AI more like me. As far as I’m concerned, that’s an improvement that may help others while not affecting my life one iota. It’s not like it remembers who said what; it all just tweaks the algorithm.

Advertisers have been recording and storing what you say word for word to build a profile specific to you, containing your deepest, darkest truths, so they can trick you into giving them more of your money. Who cares about AI training on publicly posted comments? It’s just taking energy away from the real privacy issues.

1 point

Exactly!!! All this fear about AI being trained, as if our data hasn’t been vacuumed up for years and years now. Tracking profiles, fingerprinting, etc. There are legitimate things to get energized and concerned over; no need to make up fear-based narratives.

1 point

In 2022 I was among the top 1% of users by karma on Reddit. I hadn’t been banned anywhere.

Then in 2024 Reddit had its IPO. A week later it was said that AI bots were now scraping the site to sell your data to Google.

Within 3 weeks I was permanently banned for 3 posts that had nothing wrong with them. For one of them, I understand how an AI would flag it if it’s just using keywords. In the context of the conversation, nothing was wrong.

The other 2 didn’t even have keywords an AI would pick up on.

So my assumption is that the AI scraped my data and started acting like me, which is to say like an absurdist, and Google didn’t like the results of my data. So Reddit banned me.
