Vivold Consulting

NYT escalates legal battle over AI training data by targeting Perplexity's use of copyrighted material

Key Insights

The New York Times filed a lawsuit against Perplexity alleging unauthorized use of copyrighted reporting in training and generation. The case intensifies the legal fight over publisher rights, AI model training, and acceptable use boundaries.

Stay Updated

Get the latest insights delivered to your inbox

Copyright law hits its AI inflection point


The New York Times' lawsuit against Perplexity raises key questions about how AI companies source, store, and transform content. While numerous publishers have negotiated licensing agreements with AI labs, others are choosing to litigatein hopes of shaping the rules of the ecosystem.

What the lawsuit claims


Though details will evolve in court, the allegations reflect broader industry tensions:
- Perplexity allegedly used Times content without a license for model training and outputs.
- The Times argues that AI-generated paraphrasing still represents derivative use.
- The suit seeks to clarify whether AI companies must pay for both training and downstream generation rights.

Why this matters for model builders


Legal exposure is rising:
- Companies will need strong documentation of training data sources.
- Licenses may expand to include synthetic variants of copyrighted text.
- Publishers may push for per-query monetization in addition to flat training fees.

A market moving toward frameworks


This caseand others like itmay catalyze industry-wide standards for ethically sourced and legally defensible datasets. It could also accelerate a shift toward publisher consortiums selling structured, AI-ready archives.