Your Chatbot May Be Using Illegally Pirated Books to Answer Your Questions
- by Michael Stillman
A battle is brewing between an ancient source of information, the book and its authors, and a new invention, the chatbot and its developers. The chatbot is a program that can answer whatever questions you throw at it. The granddaddy (all of three years old) and most famous chatbot is ChatGPT. It uses artificial intelligence (AI) to quickly sort through reams of information to answer your every question. But where does it get that information? One of the major sources is books, copyrighted books. When the chatbot uses that information to answer your questions, the authors and publishers of those books get nothing. That makes them sad (perhaps a better word is “angry” or “POed”).
Some authors are angry enough to go to court. There are various cases floating around out there but a notable one pits comedian and writer Sarah Silverman against Meta, operator of Facebook, headed by Mark Zuckerberg. Meta's chatbot, Llama, is the culprit here.
It is alleged that Meta used the LibGen (Library Genesis) dataset to train its Llama chatbot. LibGen is a notorious, shadowy entity, possibly operating out of Russia. Its dataset contains over 196,000 pirated books. LibGen has been in the news before for “lending” its pirated books free of charge without compensating the authors. LibGen infringes on authors' copyrights and operates illegally, but it doesn't matter. They can't be shut down or forced to pay because they can't be found. They regularly change their URLs to avoid being shut down. LibGen is no small operation, receiving an estimated 9 million visits per month from the U.S. to “borrow” books. It is supported by donations (accepted in untraceable bitcoin only).
What Meta has been accused of doing is using this large pirated database of books to supply Llama with much of the information it needs to answer users' questions. The plaintiffs have alleged that approval to do so came from the top, Mr. Zuckerberg himself. This claim has focused on the use of pirated (illegally obtained) books, but that perhaps is not the biggest issue here. What if the books were legally obtained: purchased, borrowed from a physical library, or received as gifts? Would that be any better from a copyright standpoint? Probably not.
In Meta's opinion, this use of the authors' work fits under the “Fair Use” exception to copyrights. “Fair Use” is what lets you quote from a book, write a review or book report, or use information you found therein to write something of your own, without violating its copyright. Generally speaking, if you change what you read, add your own twist, copy only a small portion, and such, you are not guilty of copyright infringement. What Meta is doing, leaving aside the issue of using LibGen's pirated texts, is copying the entire book but then sharing only a small, rewritten portion, such as might be expected to pass the Fair Use test.
This will have to play out in court, but the judge seems less than impressed with the arguments made by the authors. The reality is that chatbots provide very useful information. You probably use one to answer your questions. It's sort of like speaking to a very learned individual. Practically speaking, paying 196,000 authors some small pittance each would be an absolute nightmare, and they might not agree to such an arrangement anyway. It's not that they don't deserve anything, but it probably isn't a lot, and making such demands might force the shutting down of this very new and useful technology altogether. Progress is hard to stop, even if some people feel hurt by it, and my guess is the courts will not do so here.