Watermarking Language Models through Language Models
IEEE Transactions on Artificial Intelligence 2025
Prompt-based LLM watermarking framework that embeds detectable signals in model responses without modifying weights or data. Evaluated watermark generation and detection using instruction-tuned LLMs. The figure above is an overview of our prompting strategy.