The rise of ChatGPT, Bard, and other chatbots is built on huge datasets. These datasets, in turn, contain a large amount of copyrighted content from major publishers. In response, a number of publishers have joined forces to push for relevant laws and regulations and to demand that their legal rights be upheld.

According to a report in the Wall Street Journal, several publishers have been looking for evidence of copyrighted content in the datasets used to train AI. The report says these publishers have formed a coalition through the News Media Alliance, a publishing trade group, to pressure companies developing generative AI to pay adequate compensation.
News Media Alliance executive Danielle Coffey said, “We have to be compensated for the valuable content that is protected by appropriate copyright and that is constantly used to generate revenue for others.”