There is no such thing as an effective "AI detector", nor will there ever be one.

Excel@lemmy.megumin.org · edit-2 3 years ago

There is no such thing as an effective "AI detector", nor will there ever be one.

jochem@lemmy.ml · 3 years ago

Looks like it does: https://chat.openai.com/share/1b487711-c1be-468a-877b-98091449b55e

I asked it to translate ‘meeting agreements’ to Dutch and it came up with the word ‘bijeenkomstafspraken’, which is a valid but very uncommon Dutch word (I’m native Dutch and don’t think I’ve seen it before). If I throw it into google with quotes around it, the first page is results with ‘bijeenkomst afspraken’, where ‘afspraken’ is used as the past tense of ‘afspreken’ (to agree) instead of as its noun (agreements).

It btw also suggested ‘vergaderafspraken’ as a translation, which is a way more common word.

MBM@lemmings.world · 3 years ago

That’s nice, thanks for checking. I thought ChatGPT only worked at the level of whole words but it seems it chops them up internally.

jochem@lemmy.ml · 3 years ago

Correct, it’s not just regurgitating words, it’s predicting which token comes next. A token is sometimes a whole word, but for longer ones it’s part of a word (and some other rules that define how tokenization works).

How it knows which token comes next is why the current generation of LLMs is so impressive. It seems to have learned the rules the underpin our languages, to the point that it seems to even understand the content. It doesn’t just know the grammer rules (without anyone telling it, it just learned the patterns), it also knows which words belong to each other in which context.

It’s your prompt + some preset other context (e.g. that it is an OpenAI LLM) that creates that context. So being able to predict a token correctly is one part, the other is having a good context. This is why prompt engineering quickly became a thing. This is also why supporting bigger contexts is another thing (but a larger context requires way more processing power, so there’s a trade-off there).

It’s btw not just the trained model + context that gives you the output of ChatGPT. I’m pretty sure there are layers before and after, possibly using other ML models, that filter content or make it more fit for processing. This is why you can’t ask it how to make bombs, even though those recipes are in its training set and it very likely can create a recipe based on that.

There is no such thing as an effective "AI detector", nor will there ever be one.

There is no such thing as an effective "AI detector", nor will there ever be one.

What’s an “AI detector”?

What does “effective” mean?

Why should the accuracy bar be so high? Isn’t anything better than a coin flip good enough?

Why can’t a good AI detector be built?

Why do these “AI detectors” keep getting advertised if they don’t work?