A comprehensive and secure solution in three steps
Our goal was to advise the client and, as a next step, to ensure that the Yarmill WWTP project was built on an unshakeable legal foundation.
Representatives of the COC and Yarmill agreed on the need to anchor the legal-safe cooperation and find a long-term safe solution. The primary responsibility lays with our client, so it was necessary to legally handle the copyright in several steps.
1. Setting up transparent cooperation with the COC and others
In the first step we started to negotiate the terms between the client and the Czech Olympic Committee. We set up a clear cooperation and the result is a comprehensive contractual documentation (e.g. AI assistant development contract and Yarmill standard application use agreement), which treats both the relationship with the COC and Yarmill customers (sports associations), as well as relationships with authors of publications and users who upload documents to the application for AI analysis.
2. Extensive analysis of the possibilities of using industry data
In the second step, we performed a comprehensive analysis of the possibilities of using the professional publications that users will upload to the app. The key question was to assess whether the such use of publications is legal at all. Do users have the right to upload publications to the app? And how can Yarmill dispose of them? We had to consider the so-called statutory licenses under the Copyright Act (these are exceptions where the author's consent is not required for certain uses of copyright works).
In a complex analysis, we also looked at whether the RAG (Retrieval Augmented Generation) method that Yarmill needs to use to analyse peer-reviewed publications fulfils the criteria for any of the statutory licences. We have examined all the technical processes, applied the effective legislation and considered the possible impact of recent court decisions in some EU Member States.
We also had to minimize the risk of our client's liability for copyright infringement in case the user provides the application with pirated data.
„The crucial assessment was whether the user gets a legal copy of the publication (buys an e-book) and whether he can provide it to train the AI assistant. Because the fact that I have an e-book, does not necessarily automatically mean that I can upload it to any AI tool and extract data from it for my own commercial product,“ says Pavel Čech, an attorney and expert in IT and intellectual property law.
Although the Copyright Act allows data mining for AI training, it sets specific conditions for it.
We have found that we cannot use the general statutory licence for automated analysis in the private sector because the processing of publications by the client's AI tool does not meet all the statutory conditions. Another possible legal licence for scientific research, which could be used by the Czech Olympic Academy as part of the COC, was again limited by the prohibition on commercial use. And because Yarmill could not rule out future commercial intention, we have prepared the concept of obtaining rights directly from authors or beneficiaries (e.g. heirs, employers of authors, publishers). At the same time, Yarmill can use the works under selected Creative Commons licenses that allow the use of publications to the extent necessary for the AI tool.