Last week, Slack’s users realized that under the company’s terms of service,
their private data could be used to train artificial-intelligence models.
— Slack’s privacy principles,
May 17, 2024
This came as a shock: chat messages convey sensitive company data, and LLMs
(large language models, the category …
Still tricky if you run a public facing site, there are constant crawlers and bots and any of those could be feeding AI.
Don’t put private data on your public facing website?
Is such a thing even possible? 👽
Well of course, that’s true of any and all publicly accessible data. At least with self-hosting, your private channels still don’t get mined against your wishes