The past 18 months have seen the most rapid change in human written communication ever

fossilesque@mander.xyz · 1 day ago

T156@lemmy.world · 1 day ago

How did they estimate whether an LLM was used to write the text or not? Did they do it by hand, or using a detector?

I ask because detectors are notorious for flagging text by ESL writers, or professionally written text, as AI-generated.

sober_monk@lemmy.world · edit-2 23 hours ago

They developed their own detector, described in another paper. Basically, it reverse-engineers texts based on their vocabulary to estimate how much of each one was written by ChatGPT.
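
For intuition, here is a rough sketch of how a vocabulary-based estimate could work. This is not the detector from the paper, just a toy under one assumption: the observed frequency of each marker word is a mix of its frequency in human writing and in LLM writing. The word list and every number below are made up for illustration.

```python
def estimate_llm_share(observed, human, llm):
    """Least-squares estimate of alpha in: observed ~ (1 - alpha) * human + alpha * llm.

    Each argument maps a marker word to the fraction of texts containing it.
    """
    num = 0.0
    den = 0.0
    for word, obs in observed.items():
        gap = llm[word] - human[word]        # how much more often LLMs use the word
        num += (obs - human[word]) * gap     # excess usage, weighted by that gap
        den += gap * gap
    if den == 0.0:
        raise ValueError("LLM and human frequencies are identical; nothing to estimate")
    # Clamp to [0, 1] because noise can push the raw estimate out of range.
    return min(1.0, max(0.0, num / den))


# Hypothetical frequencies, not taken from any paper.
human = {"delve": 0.01, "crucial": 0.05}      # pre-LLM baseline
llm = {"delve": 0.21, "crucial": 0.25}        # known LLM-written text
observed = {"delve": 0.03, "crucial": 0.07}   # corpus being measured

share = estimate_llm_share(observed, human, llm)
print(round(share, 3))
```

With these invented numbers the estimate comes out to roughly a tenth of the corpus. A real detector would need far more words and a carefully chosen baseline.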

brucethemoose@lemmy.world · edit-2 7 hours ago

This sounds plausible to me, as specific models (or even specific families) do tend to have the same vocabulary/phrase biases and “quirks.” There are even some community “slop filters” used for sampling specific models, filled with phrases they’re known to overuse through experience, with “shivers down her spine” being a meme for Anthropic IIRC.

It’s defeatable. But the “good” thing is most LLM writing is incredibly lazy, not meticulously crafted to avoid detection.

This is yet another advantage of self-hosted LLMs, as they:

Tend to have different quirks than closed models.
Have finetunes with the explicit purpose of removing their slop.
Can use exotic sampling that “bans” whatever list of phrases you specify (i.e. the LLM backtracks and regenerates when it runs into one, which is not normally a feature you get over APIs), or penalizes repeated phrases (i.e. DRY sampling, again not a standard feature).
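
To make the phrase-banning idea concrete, here is a minimal sketch of backtracking on a banned phrase. It is not the code of any particular sampler: `generate_with_bans`, the `next_candidates` callback, and the toy lookup table are all hypothetical stand-ins for a real model, which would return ranked tokens from its logits.

```python
def generate_with_bans(next_candidates, banned_phrases, max_tokens):
    """Greedy generation that rewinds whenever the output ends in a banned phrase.

    next_candidates(tokens) returns candidate next tokens, best first.
    banned_phrases is a list of token sequences.
    """
    banned = [tuple(p) for p in banned_phrases if p]
    tokens = []
    blocked = {}  # position -> tokens not allowed there for the current prefix
    while len(tokens) < max_tokens:
        pos = len(tokens)
        options = [t for t in next_candidates(tokens)
                   if t not in blocked.get(pos, set())]
        if not options:
            break  # nothing left to say (or everything is blocked)
        tokens.append(options[0])
        for phrase in banned:
            n = len(phrase)
            if tuple(tokens[-n:]) == phrase:
                start = len(tokens) - n
                # Blocks recorded after the phrase start belong to a prefix
                # we are about to discard, so forget them.
                for i in range(start + 1, len(tokens) + 1):
                    blocked.pop(i, None)
                del tokens[start:]  # rewind to where the phrase began
                blocked.setdefault(start, set()).add(phrase[0])
                break
    return tokens


# Toy "model": the next token depends only on the previous one.
table = {
    None: ["a"], "a": ["shiver"], "shiver": ["ran"],
    "ran": ["down", "through"], "down": ["her"], "through": ["her"],
    "her": ["spine", "back"], "spine": [], "back": [],
}


def toy_model(tokens):
    return table[tokens[-1] if tokens else None]


result = generate_with_bans(toy_model, [["down", "her", "spine"]], max_tokens=10)
print(" ".join(result))
```

The toy model first writes “down her spine”, hits the ban, rewinds to the start of the phrase, and takes the next-best token from there. Blocking the phrase's first token is a blunt choice that keeps the sketch short; a real sampler could lower that token's probability instead of forbidding it outright.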

Bob Robertson IX@lemmy.world · 1 day ago

They just asked a few people if they thought it was written by an LLM. /s

I mean, you can tell when something is written by ChatGPT, especially if the person isn’t using it for editing, but is just asking it to write a complaint or request. It is likely they are only counting the most obvious cases, so the actual count is probably higher.

hypna@lemmy.world · 1 day ago

I don’t know of any reason that the proportion of ESL writers would have started trending up in 2022.