A few weeks ago the internet was again asking how much of our content Google is now using to train its machine learning models, including this article that has been making the rounds. The main question being, if we set a Google Doc/Sheet/Presentation to be available to “anyone with the link”, will Google use it for AI training?
According to this Business Insider article these should be safe. Google claims to only use publicly available content for its machine learning training. So only if, eg. you embedded a Google Doc (or posted a link to it) on a public page.
However, it’s would still recommend that you avoid having any documents on “anyone with the link” sharing settings, for both privacy and security.
To see a list of all docs that are shared with the public, click on this link which will take you to a Google Drive advanced search: https://drive.google.com/
