Thousands of writers including Nora Roberts, Viet Thanh Nguyen, Michael Chabon and Margaret Atwood have signed a letter asking artificial intelligence companies like OpenAI and Meta to stop using their work without permission or compensation. "It says it's not fair to use our stuff in your AI without permission or payment," said Mary Rasenberger, CEO of The Authors Guild. But some literary figures are willing to fight the tech companies in court. Authors including Sarah Silverman, Paul Tremblay and Mona Awad recently signed on as plaintiffs in class action lawsuits alleging Meta and/or OpenAI trained their AI programs on pirated copies of their works.
AG Recommends Clause in Publishing and Distribution Agreements Prohibiting AI Training Uses

The Authors Guild has drafted a new model clause to prohibit the use of an author’s work for training artificial intelligence technologies without the author’s express permission. Authors and agents can request this clause be added to their contracts.
I think currently the AI companies are passing the buck onto the creator of the AI-generated content. So the AI companies are saying, we're not infringing on copyright when we feed this book or that book into our machines, but if you use the AI output, then you should seek permission from us. https://nolongerset.com/chatgpt-copyright/
Would it bother you if your published book was used as text data in an AI language model without your permission?
It should be fraud, really. When you buy a book, you own the copy, but copyright law forbids you from doing certain things, like making copies or using it for any sort of commercial purpose. AI training is a commercial purpose because the models make these companies money. So I'm not sure why it's legal. And even if it does become illegal, how do you prove that they did it? The AI scientists don't care. They're here to make history and a bunch of money in the process. Our wishes as authors don't matter to them. We're nothing in their eyes. Contrary to what OpenAI's name suggests, their models are closed-source, so there's no way to tell what data they trained ChatGPT on.
How did these authors find out their works were used to train AI? How do we, as writers, find out if our own work has been used like this? I think this AI thing is so wrong when it comes to writing. It seems like if you use AI, you might not even know whose work you're stealing, but I think you should care. I've also heard that any prompt or piece of writing you upload to these AI "writing" programs is retained and used to train the model further. I won't touch this AI stuff as a writer. I've read about it, but I have zero interest in trying it out, personally.
You can ask the AI to quote from the books. If it can produce accurate or close-enough sentences, it was trained on that data. But it isn't a reliable indicator, as the creators can re-train the model to dodge regurgitating the original material. OpenAI has demonstrated, over and over, that they can influence what their models will and won't say.
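The quotation test described above can be sketched as a rough heuristic: prompt the model for a passage, then measure how closely its answer matches the real text. This is only an illustration, not proof of training-data membership; the function name, the 0.8 threshold, and the example strings are all made up for the demo.

```python
from difflib import SequenceMatcher

def quotation_probe(model_output: str, known_passage: str,
                    threshold: float = 0.8) -> bool:
    """Return True if the model's attempted quotation is close enough to
    the real passage to suggest the text may have been in its training data.
    The 0.8 threshold is an arbitrary illustrative choice."""
    ratio = SequenceMatcher(None, model_output.lower(),
                            known_passage.lower()).ratio()
    return ratio >= threshold

# Hypothetical comparison: an exact reproduction vs. a loose paraphrase.
real = "It was the best of times, it was the worst of times."
verbatim = "It was the best of times, it was the worst of times."
paraphrase = "Those days were simultaneously wonderful and terrible."

print(quotation_probe(verbatim, real))    # -> True (identical text)
print(quotation_probe(paraphrase, real))  # -> False (low similarity)
```

As the post notes, a negative result means little: a model can be tuned to refuse or garble quotations from material it was nonetheless trained on.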
A lot of us probably have some of our writing online that could easily have been absorbed by AI. And, again, if you upload anything to an AI, it will use it to keep training itself, along with whatever it produces as a result. This video was pretty interesting. Wanted to share it.
It's not a database. A database is a file structure that can be queried by an accompanying piece of software to process data (fetch it, show it, remove it, etc.). AI uses a "dataset". It's something like a database, but it doesn't work the same way. What the AI keeps after training isn't the dataset itself but a convoluted structure of data distilled from sources like novels, articles, blogs, and any other piece of digital text they feed it. It's not easily readable and accessible like a traditional database. But the AI "understands" it, of course, so the obvious thing is to use the AI itself as an interface to that data by simply asking it, as I said.
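The database-vs-dataset distinction the post draws can be made concrete with a toy sketch. Everything here is illustrative (the dictionary, the key names, and the stand-in weights bear no resemblance to how any real system stores data):

```python
# A database stores records verbatim and returns them on an exact query.
library_db = {
    "moby-dick:ch1": "Call me Ishmael. Some years ago...",
}
print(library_db["moby-dick:ch1"])  # the stored text comes back unchanged

# A trained model keeps no such records. After training, the source text
# itself is gone; what remains is a pile of numeric weights distilled
# from it.
weights = [0.12, -0.98, 0.33]  # stand-in for billions of learned parameters

# There is no weights["moby-dick:ch1"] lookup. The only interface to what
# the model "knows" is prompting it and reading what it generates, which
# is why the post suggests asking the AI itself.
```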
This question of AI training and copyright is one of the core issues in the Writers Guild strike. I support their position.
(I accidentally posted this in the other thread too but meant this one.) Some of you may already know this or have tried it, but I asked ChatGPT about some of these topics:

Source of Training Data (Default, GPT-3.5)

User: Do you remember the source of text you were trained on?

ChatGPT: I don't have access to my training data, but I was trained on a mixture of licensed data, data created by human trainers, and publicly available data. My training also involved large-scale datasets obtained from books, websites, and other texts to develop a wide-ranging understanding of human language.

User: What do you mean by you don't have access to your training data? Do you not search your training data to provide humanlike responses to the questions that people ask you?

ChatGPT: You're correct, I use the knowledge and patterns learned during my training to generate responses. However, I don't have the ability to search or directly access specific details about the individual sources that were used in my training. My responses are based on a mixture of general knowledge and information from various sources, but I can't pinpoint the exact origin of any particular piece of information.
This whole lawsuit business is stupid. They don't have the first clue as to how LLMs work. Suing the AI companies is akin to suing John Q. Public, professor in Bumbfuck, Idaho, for using Stephen King's On Writing to teach his class full of twenty-something-year-old brains. LLMs are just really large brains being taught. Right now, feeding these brains *insert random books here* is akin to teaching a two-year-old to talk. No one has stolen anything.
Honestly, no. Simply because my husband has a Master's in Machine Learning and has explained in detail what LLMs are and how they learn.
You know, that's a very common comparison. Often in the form of "well, what about human artists that read books?" I think the important differences are lost when that simplification of terminology is used, when machine learning is explained as if the machine is a human. No two-year-old is fed thousands of books line by line, nor is one digitally omnipotent, for one. Another fact is that machine learning will never have the same in/out functionality as a human being. It's human beings that have the right to consume literature and be inspired by it, not anything else.

At the same time, when the author of Tardigrade Boyfriend 14: Small Butts in Venice complains that AI-generated works aren't real art and are destined to hurt the industry, I'm hardly sympathetic. AI's literary deficiencies mean it's not a threat to anything other than the most predictable of formats and language, and who needs those anyway?

Edit: to bolster my point about the human/AI distinction rather than similarity, consider how much energy a child would need to consume to compete with ChatGPT: https://www.washington.edu/news/2023/07/27/how-much-energy-does-chatgpt-use/#:~:text=Overall%2C%20this%20can%20lead%20to,of%20over%201%2C000%20U.S.%20households.
In your example, unlike with machine learning, Mr. King will be paid for his work writing the book, through royalties on the sale of the textbooks. Using an author's work without compensating them is theft. At a minimum, these companies should be required to purchase the book, so the author receives something for their work. And that's completely ignoring the infringement of the work's copyright; the copyright should entitle the author to a fee. What these companies are doing is comparable to buying a DVD and then playing it publicly while charging an entrance fee.
Surely, the holder of a copyright has the right to determine how their book will be used. The lawsuit doesn't say "you can never use our books" - it says "you cannot use our books without permission or compensation."
That’s the thing… these authors have been compensated. 98% of these datasets were BOUGHT in bulk. Also, once a book is bought, we relinquish any control.
No, the professor isn’t having the students buy the book; he’s simply teaching from it, and yet no one is rushing to sue him… because it would be frivolous.
What’s being consumed/used isn’t the same as the output being generated. My analogy is accurate: LLMs are akin to that two-year-old.
Thing is, they weren’t bought in bulk at all. What has authors annoyed is that the content was scraped without permission or compensation. If someone wants an AI to read one of my books after purchasing it, that's not a problem, so long as it doesn’t regurgitate it verbatim for a user claiming it to be original work. But that’s not what’s happening here.

The fact that your husband has a degree in machine learning doesn't make him or you an unquestioned authority on everything to do with AI, nor does it trump the very many experts included in the preparation of the lawsuit in question.