The European Commission has announced long-awaited plans to make it easier for researchers to harvest facts and data from research papers — by freeing the computer-aided activity from the shackles of copyright law.
Software can rapidly analyse millions of online articles and data sets at speeds humans can’t match, an activity known as text and data mining (TDM). Scientists hope that this could reveal patterns in scientific knowledge and generate new hypotheses.
But the field has been hampered by uncertainties about the legality of sifting through science publishers’ content to crunch the data. In the European Union, this sort of activity requires the permission of a paper’s copyright holder. To crawl across paywalled content, would-be miners have had to go through the laborious process of asking various publishers for approval. And publishers have sometimes refused to allow TDM (apparently out of fear that paywalled content might be freely redistributed), or have only permitted it with restrictions, controlled licenses or fees. A 2014 report for the European Commission suggested that Europe’s researchers were doing less computer crawling than those in the United States and Asia.
As part of copyright-reform proposals announced on 14 September, the Commission suggests exempting TDM from copyright — but only for research organizations “acting in the public interest”, such as universities and research centres, and only for content that they already have legal access to read. It would cover both commercial and non-commercial research. But the exception will not apply to commercial firms, which would still need to negotiate rights with publishers and other content providers.
“We must remove barriers that prevent scientists from digging deeper into the existing knowledge base. This proposed copyright exception will give researchers the freedom to pursue their work without fear of legal repercussions,” said Carlos Moedas, head of research at …