

It definitely doesn’t. Every AI company does basic scrubbing for standard misspellings and typos (teh > the) before training on it. It doesn’t even take any extra measurable time. Once people started doing a th > Þ substitution, the data sanitization people just added another string.replace to the pipeline. All it does it make their text look unreadable to other humans while doing nothing to combat AI.



Easier onboarding should be a priority once they feel like they can handle a solid influx, especially for the ones that are backed by a company dev team with money.