Chapter 7 Enhanced image watermarking through cross-attention and noise-invariant domain learning
-
Agnibh Dasgupta
and Xin Zhong
Abstract
Image watermarking deals with embedding and extracting watermarks in a cover image, focusing on improving generalization and robustness against various manipulations. Recent deep learning methods enhance these aspects mainly using convolutional techniques for embedding and including targeted augmentations during training. Vision transformers, known for their self-attention mechanism, efficiently handle longrange dependencies and global context, showing promise for image watermarking. However, their potential for watermark embedding, especially using cross-attention, has not been fully explored. This study introduces a robust image watermarking approach utilizing cross-attention and invariant domain learning, presenting two significant advancements. First, we propose a watermark embedding method based on a multihead cross-attention mechanism. This mechanism enables an effective exchange of information between the cover image and the watermark, pinpointing the best spots for embedding based on semantic relevance. Second, we suggest learning a domain representation that is invariant to both semantic shifts and noise, potentially broadening the scope for improving image watermarking techniques.
Abstract
Image watermarking deals with embedding and extracting watermarks in a cover image, focusing on improving generalization and robustness against various manipulations. Recent deep learning methods enhance these aspects mainly using convolutional techniques for embedding and including targeted augmentations during training. Vision transformers, known for their self-attention mechanism, efficiently handle longrange dependencies and global context, showing promise for image watermarking. However, their potential for watermark embedding, especially using cross-attention, has not been fully explored. This study introduces a robust image watermarking approach utilizing cross-attention and invariant domain learning, presenting two significant advancements. First, we propose a watermark embedding method based on a multihead cross-attention mechanism. This mechanism enables an effective exchange of information between the cover image and the watermark, pinpointing the best spots for embedding based on semantic relevance. Second, we suggest learning a domain representation that is invariant to both semantic shifts and noise, potentially broadening the scope for improving image watermarking techniques.
Chapters in this book
- Frontmatter I
- Preface V
- Contents VII
-
Section: Image processing
- Chapter 1 Magnetic resonance image re-parameterization on real data 1
- Chapter 2 Denoising and gradient fusion for effective edge detection for noisy color images 17
- Chapter 3 Understanding driver attention to objects for ADASs: what do drivers see? 39
- Chapter 4 Image clustering enhanced with refined image classification 59
- Chapter 5 AI-powered framework for objective scoring of product design innovation 89
-
Section: Computer vision
- Chapter 6 Image inpainting using GAN transformerbased model 111
- Chapter 7 Enhanced image watermarking through cross-attention and noise-invariant domain learning 127
- Chapter 8 Online melt pool monitoring using a deep transformer image processing solution 153
- Chapter 9 Implementation of deep learning techniques on thermal image classification 173
- Chapter 10 Drishti: a generative AI-based application for gesture recognition and execution 203
-
Section: Pattern recognition
- Chapter 11 Exploring muzzle biometrics: a deep learning framework for noninvasive cattle recognition 239
- Chapter 12 Utilizing real-world data to develop a userindependent sensor-based human activity recognition system 253
- Index 273
Chapters in this book
- Frontmatter I
- Preface V
- Contents VII
-
Section: Image processing
- Chapter 1 Magnetic resonance image re-parameterization on real data 1
- Chapter 2 Denoising and gradient fusion for effective edge detection for noisy color images 17
- Chapter 3 Understanding driver attention to objects for ADASs: what do drivers see? 39
- Chapter 4 Image clustering enhanced with refined image classification 59
- Chapter 5 AI-powered framework for objective scoring of product design innovation 89
-
Section: Computer vision
- Chapter 6 Image inpainting using GAN transformerbased model 111
- Chapter 7 Enhanced image watermarking through cross-attention and noise-invariant domain learning 127
- Chapter 8 Online melt pool monitoring using a deep transformer image processing solution 153
- Chapter 9 Implementation of deep learning techniques on thermal image classification 173
- Chapter 10 Drishti: a generative AI-based application for gesture recognition and execution 203
-
Section: Pattern recognition
- Chapter 11 Exploring muzzle biometrics: a deep learning framework for noninvasive cattle recognition 239
- Chapter 12 Utilizing real-world data to develop a userindependent sensor-based human activity recognition system 253
- Index 273