News

Auditing unauthorized training data from AI generated content using information isotopes

  • Tao Qi, Jinhua Yin, Dongqi Cai, Yueqi Xie, Huili Wang, Zhiyang Hu, Peiru Yang, Guoshun Nan, Zhili Zhou, Chuhan Wu, Lingjuan Lyu, Shangguang Wang, Yongfeng Huang, Nicholas D. Lane--Nature.com
  • published date: 2026-02-21 00:00:00 UTC

Here, the authors introduce information isotopes, a novel framework for tracing unauthorized training data in black-box AI systems, offering a practical tool for auditing data provenance and protecting data rights.

<li>Dathathri, S. et al. Scalable watermarking for identifying large language model outputs. Nature634, 818823 (2024). Google Scholar  </li><li>Heimberg, G. et al. A cell atlas foundation model for… [+14919 chars]