When can I train AI with public content?
David Rosenthal will answer this question in the next 17:15 Colloquium by the ETH Library and gives an insight into the legal framework that should be considered when training AI.
- Read
- Number of comments

17:15 Colloquium by the ETH Library
Thursday, 31 October 2024, 17.15
The (copyright) legal framework for the use of public content for AI training
David Rosenthal, partner at VISCHER, lecturer at the Department of Humanities, Social and Political Sciences, ETH Zurich, lecturer at the Faculty of Law, University of Basel
ETH Zurich, Campus Zentrum, Rämistrasse 101, 8092 Zurich, E 5 or external page live online.
external page Register now!
The training of AI models is based on the use of data that will improve the output of the respective models. For this reason, it is important to think about the origin of the data used right from the start. This includes knowing how it was collected, where it was obtained from, and which licence terms are they subject to. Data protection aspects should also be taken into account before the training programme begins: Has the data been sufficiently anonymised, and can the results no longer be traced back to individuals? Of course, you should also be aware of the legal conditions of the data set before feeding it into the AI model.
At the upcoming 17:15 Colloquium by the ETH Library, David Rosenthal will explain on which legal basis published content can be used for the training of AI models, especially large language models, and what needs to be considered. He will place a special focus on questions of copyright and licence agreements.
Comments
No comments yet