- cross-posted to:
- technology@lemmy.world
- reddit@lemmy.world
- cross-posted to:
- technology@lemmy.world
- reddit@lemmy.world
Great. Now all the AI bots will just be saying “This” and “came here to say that.”
“ChatGPT, solve this problem for me.”
“As an AI language model, username checks out.”
“Are you me?”
“Reddit moment”
Google Reddit comedy
Thanks for the gold kind stranger
This
This
Came here to say that.
This guy 👆🏻
Username checks out
Chef’s kiss
Removed by mod
Remember Narwal bacons at midnight guys
That
“When does le narwhal bacon?”
Wow that’s an old one lol
Well… We all knew that was coming. If you still have an account haven’t done so, now’s a good time to purge your account!
Unless you live in the EU or California, odds are that just deletes the public data, I’m sure Reddit retains it and would sell it.
deleted by creator
by forcing them to actually be open about it (this will not happen)
Reddit account data has been training AI for over a decade. If you ever used it, you’re already in a training set
That will remove your account from public view, but will it remove it from the data they use for AI training?
If not, you’re just enhancing the value of their proprietary data.
Why wouldn’t they enhance it themselves, like Twitter has been doing for months? Once they make signing in mandatory and implement per-user rate limits the information will disappear from the internet and will only be available to people who are paying in some way.
Better yet, use an overwrite script to help turn their training models to jelly
I’d be very surprised if comments weren’t versioned in some way, so even if you delete or rewrite that data, it’s probably still there and a part of training data.
That’s what I just did with my account of 10 years. I had all comments overwritten with gibberish and purged them a few days later. I’ll send them a final DSGVO request and delete it afterwards.
Done it a few months ago but then again if I was working at reddit and in charge of preparing the dataset to feed to the llm, I’d give it access to both a recent one and a snapshot from before July 2023 (or whenever shit hit the fan and we all came to lemmy), most edits would have been made in protest. And AI can figure out which ones by itself
Why? How does it harm you in any meaningful way?
Even if it’s just another scheme to further concentrate wealth (and it is at least that), that harms everyone but the 0.1%.
I draw plenty of benefit from AI tools. There are open source models that anyone can run.
So they’re cashing in by selling other people’s conversations.
Yeah.
FYI: reddit orphans content. In other words your posts/comments are undeletable.
I found instances of such late last year by way of search results. I clicked a username to see more posts by that account. The only content on their profile page was a final deletion message about the API changes.
Their post history was discoverable by using “<username> site:reddit.com” on Google. All of their posts/comments still show up under their
username
instead of the normal[
. Clicking the username takes you to their empty profile page. ]So what we know from this now is that reddit has been saving original submissions. Whereas before their claim was that only the last edits are stored. Which is why the deletion scipts became a thing. People took it on good faith that we could delete our posts. At some point they stopped doing that. Or perhaps it was all a lie the whole time. Who knows.
What happens if you edit every other post with absolute nonsense sentences? I doubt they have a way to go through that?
Are you saying the original answers are not nonsense already?
Lol no. I just really want to invent some insane words, forget about them and then see them years later in some media publication that wasn’t properly reviewed and edited.
Chaotic evil! Where do I sign up?
Finally a good use for LLMs
I think in theory simple check such as edits to the majority of a profile would be enough to detect it.
They would probably notice and roll them back. Bulk edits raise some red flags.
No, that’s Google keeping the content of the pages they scraped at the time they scraped them.
This is how I cleaned (most) of my old posts: Searched them via Google. As they’re posted under my username I was able to change them into nonsense before deleting then. Even though they never appeared under my profile anymore.
Imagine AI mods trained by reddit mod models.
I would love a GPT model that just replies to every prompt with “Ya’ll can’t behave.”
Y’all need Jesus
“Trained by the sweaty armpit of humanity”
@cosmicrookie @ardi60 “Why does this bloody thing keep asking ‘a/s/l’ and ‘Do you want a NSFW roleplay?’ even when I tell it no?!?!”
You know how artists can poison their images for AI… We need a way to poison content on Reddit
I would say most of the content is already poison.
i think that’s just called posting on reddit
Shit posts would do it. That’ll turn the AIs into morons who spurt out “rizz” and “skibidi” instead of anything useful
There’s nothing stopping anyone doing the same with lemmy posts though is there?
NO! There isn’t, lemmy is one giant honeypot.
deleted by creator
It shows my password as hunter2 on my end how about yours
Could you post your social security number, please?
Nah, lemmy censors that, but only if it’s your Social Security number. Here, I’ll post mine as proof.
With dashes:
***********
Without:
*********
And here’s a different SSN that’s not mine:
420-69-8008
Pretty cool!
then call me winnie the pooh
I am waiting for an LLM that trained on 4chan it would be pure gold.
She got the sprit
Oh no.
Fuck 4chan
I don’t want STDs.
Alt title: Reddit looking to steal value from their millions of free users
Fuck /u/spez
It’s a power grab. They’ll justify control over the training data with intellectual property which keeps it out of the hands of everyday people but they stole the “intellectual property” from us in the first place. Then they’ll control the “means of generation”.
deleted by creator
Pretend your Spez. You know that, but will your investors? Investors now a days just toss money at anything with the word AI.
To have better bots so they can advertise via posts and make it seem like a human recommended it. That’s why a lot of people started using Reddit in the first place. Myself included. Only time I use it now it’s when a search result takes me there. Can’t remember the last time it was a useful result though.
This is why I deleted my posts. Also, 60 million is a jokingly lowball figure.
I wonder how reddit users feel about this. I wouldn’t know, I DNS blocked the site months ago.
I left my posts up because they will actually cause the website to be worth less.
Same, let him cook
That’s cute that you think your posts are actually deleted.
This is going to produce the saltiest AI the world has ever seen.
“Hey reddit ai , give me an idea how to balance my budget and pay my student debt, mate”. “here ya go, I got a noose for you. Also I’m not your mate, dude”.
" Hey reddit ai, draw for me a house with a genz family". Here ya go. " Hey reddit ai, why did you show me a pic of a highway ramp with homeless people?"
what could go wrong with training your ai based on the posts of the most racist and misogynistic people on the internet?
My god they’ll create a super redditor
Damn Facebook is owned by Reddit now?
It’s not 4chan… but someone did train one of those once.
That varies by subreddit, which might actually help in training LLMs to recognize the difference.
deleted by creator
I stopped posting there the moment they pulled the trigger on the API change. I used to like cruising LinuxQuestions and answering people, too.
Yup, I provided a bit of good content, but I left as soon as the API change was announced.