Feeding the Beast: Why Pirated Books and AI Misuse Are Part of the Same Problem

Original Photo (no logo) by Norma Mortenson: https://www.pexels.com/photo/a-man-in-a-jacket-loading-boxes-in-a-van-4391486/
The digital equivalent of someone selling stolen electronics or meat out of the back of a van, appearing on one corner one day and another the next.

 Recent revelations about Meta’s use of copyrighted material for AI training have reignited concerns about how our creative works are being exploited. While Meta’s actions are troubling, the deeper issue isn’t new. It’s one I’ve spoken about before, and one I’ll continue to emphasize.

The problem lies not just in what corporations like Meta are doing, but in how easily this exploitation is enabled. For decades, authors, artists, content creators, and consumers of our work have been unknowingly, or carelessly, feeding the beast. The right hand (AI) has our attention right now, but the left hand is still stealing our wallet.


The Problem We Helped Create

I’ve long cautioned about the risks of uploading books, documents, and images to websites and software platforms without fully understanding the terms of service. Many creators have willingly, or inadvertently, handed over their intellectual property without considering how it might be used.

From cloud storage platforms to file-sharing services, we've collectively built an environment where content is freely available, easily scraped, and frequently exploited. Even well-intentioned sharing, like uploading a manuscript draft to a collaboration tool, can leave content vulnerable to misuse.

And yet, this practice continues. For decades, users raced to get everything online in the spirit of democratizing information, often without considering the long-term consequences. Creators attempting to respect copyright frequently rely on images sourced from websites that make no effort to verify whether the uploader had the right or authority to share them. 

Meanwhile, AI hunters, perhaps with good intentions, often upload images and documents into verification software without permission from the original copyright holder. Some verification platforms have recently updated their policies to discourage this practice, but these changes are far from universal. More importantly, if users don’t read or fully understand those terms (and most don’t), these improved policies become little more than window dressing. The problem isn’t just what the policies say; it’s whether users recognize what they’re agreeing to in the first place.

Creators must take responsibility for understanding where their work is stored and who has access to it. This isn’t victim-blaming; it’s recognizing that digital complacency has fueled the very systems now threatening our intellectual rights.


LibGen and the Rise of Digital Bootleggers

The recent Atlantic investigation into LibGen shed light on how sites like this fuel AI data scraping. But let’s be clear: LibGen was never a Meta project. It was (and still is) a pirate site that has long existed in legal grey zones.

LibGen, which started as an academic resource, expanded to host copyrighted fiction and non-fiction without permission. Legal actions have targeted LibGen multiple times, with U.S. courts ordering its shutdown in 2015 and issuing a $30 million judgment in 2024, yet the site continues to resurface.

However, LibGen’s presence is far from stable. Its primary domains have been seized or disabled repeatedly, with some remaining offline for extended periods. Yet mirrors and alternative access points persist, creating an ongoing challenge for authorities and rights holders alike. Many LibGen mirrors now rely on encrypted networks, like Tor, further complicating enforcement efforts.

And LibGen isn’t alone. Other pirate sites use similar tactics, shifting domains, hosting offshore, or leveraging encrypted networks to stay ahead of authorities. It’s the digital equivalent of someone selling stolen electronics or meat out of the back of a van, appearing on one corner one day and another the next. The goods may be accessible to buyers, but the sellers stay just out of reach of the law.


The Consequences for Creators

The combination of piracy and AI scraping creates a perfect storm for exploitation. As long as pirated books remain easy to find, large-scale scraping operations will continue to harvest these works for unauthorized use, whether by AI developers, content aggregators, or unscrupulous publishers.

Meta’s actions deserve scrutiny, but if we focus solely on AI ethics without addressing the rampant accessibility of pirated books, we’re only fighting half the battle.


What Can We Do?

  1. Be Mindful of What You Upload: Before uploading your work to a platform, read the terms of service. Understand what permissions you’re granting and whether your content may be scraped or shared.
  2. Report Pirate Sites: If you discover your books on pirate platforms, take action. Reporting them to search engines, ISPs, and web hosts can help limit their reach.
  3. Educate Others: Encourage fellow creators to be vigilant about their content. Awareness is key to slowing the cycle of exploitation.
  4. Support Ethical Platforms: Advocate for services that protect creators’ rights and refuse to scrape or exploit copyrighted content.
  5. Lobby for Forward-Thinking Laws: Push for legislation that holds search engines and ISPs accountable for enabling access to pirate sites. Harsh penalties for companies that knowingly facilitate piracy could significantly reduce the ease with which these sites operate and thrive. 
    • While governments in the U.S. have introduced over 800 new AI-related laws, many of these focus on broader issues like data privacy, algorithmic bias, ethics, transparency, and security. Far fewer address the urgent need to protect creative works from unauthorized AI scraping. Worse, by the time many of these laws take effect, they are already behind the pace of technological advancement. Future laws must be proactive rather than reactive, addressing both the misuse of AI and the ease with which pirated content is exploited for AI training. Without this, new legislation risks being little more than a bandage on an ever-growing wound.
  6. Advocate for Automatic Penalties for Piracy Downloads: Support the development of a system that automatically penalizes individuals downloading pirated content. Unlike industries with centralized resources and capital to pursue piracy cases, publishing, graphic arts, and research are far more fragmented. Without a scalable deterrent, there’s little consequence for users who access and exploit stolen material. An automated system would create accountability where traditional legal action falls short.
That said, implementing such a system comes with legal and technical challenges, particularly in distinguishing intentional piracy from accidental downloads. It would also need to address privacy concerns and keep users properly informed, creating transparency without unfair surveillance. An automated system could be a powerful deterrent, but only if it targets true violations without penalizing innocent users, and striking that balance would take a thoughtful approach.

By combining personal responsibility, community awareness, and stronger legal frameworks, we can better protect creative works from exploitation.

The Fight Isn’t Over

AI developers may bear responsibility for misusing pirated content, but the underlying problem is far more complex. Until we address the digital black market for creative works, and recognize our own role in feeding it, the exploitation will continue.

The right hand may be drawing our focus with AI developments, but the left hand has been quietly stealing from us for years. It’s time to keep our eyes on both.
