Fine-grain watermarking for intellectual property protection

The current online digital world, consisting of thousands of newspapers, blogs, social media, and cloud file sharing services, is providing easy and unlimited access to a large treasure of text contents. Making copies of these text contents is simple and virtually costle...

Bibliographic Details
Main Author: Stefano Giovanni Rizzo (18615112) (author)
Other Authors: Flavio Bertini (5881226) (author), Danilo Montesi (8177640) (author)
Published: 2019
Subjects:
_version_ 1864513514061692928
author Stefano Giovanni Rizzo (18615112)
author2 Flavio Bertini (5881226)
Danilo Montesi (8177640)
author2_role author
author
author_facet Stefano Giovanni Rizzo (18615112)
Flavio Bertini (5881226)
Danilo Montesi (8177640)
author_role author
dc.creator.none.fl_str_mv Stefano Giovanni Rizzo (18615112)
Flavio Bertini (5881226)
Danilo Montesi (8177640)
dc.date.none.fl_str_mv 2019-07-12T03:00:00Z
dc.identifier.none.fl_str_mv 10.1186/s13635-019-0094-2
dc.relation.none.fl_str_mv https://figshare.com/articles/journal_contribution/Fine-grain_watermarking_for_intellectual_property_protection/25904404
dc.rights.none.fl_str_mv CC BY 4.0
info:eu-repo/semantics/openAccess
dc.subject.none.fl_str_mv Information and computing sciences
Information systems
Digital text watermarking
Unicode characters
Copyright protection
Copyright enforcement
Tampering detection
dc.title.none.fl_str_mv Fine-grain watermarking for intellectual property protection
dc.type.none.fl_str_mv Text
Journal contribution
info:eu-repo/semantics/publishedVersion
text
contribution to journal
description The online digital world, made up of thousands of newspapers, blogs, social media platforms, and cloud file-sharing services, provides easy and unlimited access to a vast body of text content. Copying this content is simple and virtually costless. As a result, producers and owners of text content are interested in protecting their intellectual property (IP) rights. Digital watermarking has become crucially important in the protection of digital content. Among all types of digital content, text poses particular challenges for watermarking, since it has a low capacity to embed a watermark and allows only a restricted number of alternative syntactic and semantic permutations. The problem becomes even harder when authors want to protect not just a whole book or article but each single sentence or paragraph, a problem well known in copyright law. In this paper, we present a fine-grain text watermarking method that protects even small portions of the digital content. The core method is based on homoglyph character substitution for Latin symbols and whitespace. It produces a watermarked version of the original text while preserving the anonymity of users, in accordance with the right to privacy. In particular, the embedding and extraction algorithms protect the watermark continuously throughout the whole document in a fine-grain fashion. The method ensures visual indistinguishability and length preservation, meaning that it adds no overhead to the original document, and it is robust to the copy and paste of small excerpts of the text. We use a real dataset of 1.8 million New York articles to evaluate our method. We evaluate and compare its robustness against common attacks, and we propose a new measure for partial copy-and-paste robustness. The results show the effectiveness of our approach: on average, 101 characters are needed to embed the watermark, and paragraph-long or smaller excerpts are protected 94.5% of the time.
Other Information
Published in: EURASIP Journal on Information Security
License: https://creativecommons.org/licenses/by/4.0
See article on publisher's website: https://dx.doi.org/10.1186/s13635-019-0094-2
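The description above names homoglyph substitution as the core idea. The following is a minimal sketch of that general technique, not the authors' algorithm: the homoglyph table, the function names (`embed`, `extract`, `strip_watermark`), and the choice to repeat a raw bit string along the text are all assumptions made for illustration. The record does not say how the paper derives or synchronizes its watermark. The sketch also assumes the input text contains none of the replacement code points to begin with.

```python
# Illustrative homoglyph table (an assumption, not the paper's table):
# each ASCII carrier character maps to a visually similar Unicode code point.
HOMOGLYPHS = {
    "a": "\u0430",  # Cyrillic small letter a
    "c": "\u0441",  # Cyrillic small letter es
    "e": "\u0435",  # Cyrillic small letter ie
    "o": "\u043e",  # Cyrillic small letter o
    "p": "\u0440",  # Cyrillic small letter er
    " ": "\u2004",  # three-per-em space
}
REVERSE = {v: k for k, v in HOMOGLYPHS.items()}


def embed(text: str, bits: str) -> str:
    """Hide `bits` in `text`, one bit per carrier character.

    A '1' swaps the carrier for its homoglyph, a '0' leaves it unchanged.
    The payload is repeated until the text ends, so any sufficiently long
    excerpt still carries a full (possibly rotated) copy of it.
    """
    out = []
    i = 0  # index of the next payload bit
    for ch in text:
        if ch in HOMOGLYPHS and bits:
            bit = bits[i % len(bits)]
            out.append(HOMOGLYPHS[ch] if bit == "1" else ch)
            i += 1
        else:
            out.append(ch)
    return "".join(out)


def extract(text: str) -> str:
    """Read one bit per carrier: homoglyph -> '1', original character -> '0'."""
    bits = []
    for ch in text:
        if ch in REVERSE:
            bits.append("1")
        elif ch in HOMOGLYPHS:
            bits.append("0")
    return "".join(bits)


def strip_watermark(text: str) -> str:
    """Map every homoglyph back to its original character."""
    return "".join(REVERSE.get(ch, ch) for ch in text)


if __name__ == "__main__":
    original = "a peace of cake on a plate"
    marked = embed(original, "1011")
    print(len(original) == len(marked))  # substitution is one-for-one
    print(extract(marked))
    print(strip_watermark(marked) == original)
```

Because each substitution replaces one code point with another, the marked text has the same number of characters as the original, which is the length-preservation property the description mentions. Visual indistinguishability depends on the font and on which homoglyph pairs are chosen, so a real table would need to be validated against actual rendering.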
eu_rights_str_mv openAccess
id Manara2_004c4514df7c36f355d14f69f0ba5996
identifier_str_mv 10.1186/s13635-019-0094-2
network_acronym_str Manara2
network_name_str Manara2
oai_identifier_str oai:figshare.com:article/25904404
publishDate 2019
repository.mail.fl_str_mv
repository.name.fl_str_mv
repository_id_str
rights_invalid_str_mv CC BY 4.0
spellingShingle Fine-grain watermarking for intellectual property protection
Stefano Giovanni Rizzo (18615112)
Information and computing sciences
Information systems
Digital text watermarking
Unicode characters
Copyright protection
Copyright enforcement
Tampering detection
status_str publishedVersion
title Fine-grain watermarking for intellectual property protection
topic Information and computing sciences
Information systems
Digital text watermarking
Unicode characters
Copyright protection
Copyright enforcement
Tampering detection