Text Watermarking
   HOME

TheInfoList



OR:

Text watermarking is a technique for embedding hidden information within textual content to verify its authenticity, origin, or ownership. With the rise of
generative AI Generative artificial intelligence (Generative AI, GenAI, or GAI) is a subfield of artificial intelligence that uses generative models to produce text, images, videos, or other forms of data. These models learn the underlying patterns and str ...
systems using
large language models A large language model (LLM) is a language model trained with Self-supervised learning, self-supervised machine learning on a vast amount of text, designed for natural language processing tasks, especially Natural language generation, language g ...
(LLM), there has been significant development focused on watermarking AI-generated text. Potential applications include detecting
fake news Fake news or information disorder is false or misleading information (misinformation, disinformation, propaganda, and hoaxes) claiming the aesthetics and legitimacy of news. Fake news often has the aim of damaging the reputation of a person ...
and academic cheating, and excluding AI-generated material from LLM
training data In machine learning, a common task is the study and construction of algorithms that can learn from and make predictions on data. Such algorithms function by making data-driven predictions or decisions, through building a mathematical model from ...
. With LLMs the focus is on linguistic approaches that involve selecting words to form patterns within the text that can later be identified. The results of the first reported large-scale public deployment, a trial using Google's Gemini
chatbot A chatbot (originally chatterbot) is a software application or web interface designed to have textual or spoken conversations. Modern chatbots are typically online and use generative artificial intelligence systems that are capable of main ...
, appeared in October 2024: users across 20 million responses found watermarked and unwatermarked text to be of equal quality. Research on text watermarking began in 1997.


See also

*
Digital watermarking A digital watermark is a kind of marker covertly embedded in a noise-tolerant signal such as audio, video or image data.H.T. Sencar, M. Ramkumar and A.N. Akansu: ''Data Hiding Fundamentals and Applications: Content Security in Digital Multimedia'' ...


References

{{Reflist Watermarking Generative artificial intelligence