Unsupervised Domain Adaptation using Feature-Whitening and Consensus Loss

Summary
In this paper, the authors propose a domain adaptation method to train a deep neural network on a labeled source dataset while still obtaining good results on a target dataset whose visual domain is shifted with respect to that of the source dataset. The main novelties are a so-called Domain-specific Whitening Transform (DWT) and a Min-Entropy Consensus (MEC) loss, which together account for both the labeled source data and the unlabeled target data. The method is illustrated in Fig. 1.
Proposed method
Domain-specific Whitening Transform (DWT)
For the DWT, they replace the usual Batch Norm (BN) operation, which in its standard form normalizes each feature dimension \(k\) independently with mini-batch statistics:
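\[
BN(x_{i,k}) = \gamma_k \, \frac{x_{i,k} - \mu_k}{\sqrt{\sigma_k^2 + \epsilon}} + \beta_k ,
\]
where \(\mu_k\) and \(\sigma_k^2\) are the mean and variance of the \(k\)-th feature over the mini-batch and \(\gamma_k, \beta_k\) are learned affine parameters,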

by a new Batch Whitening (BW) operation that jointly whitens all feature dimensions using domain-specific batch statistics (the formula below is my reconstruction from the paper's description, so the exact notation may differ):
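\[
BW(x_i; B) = W_B \, (x_i - \mu_B), \qquad \text{with} \quad W_B \, \Sigma_B \, W_B^\top = I ,
\]
where \(\mu_B\) and \(\Sigma_B\) are the mean vector and covariance matrix of the features in the mini-batch \(B\) (so one can take \(W_B = \Sigma_B^{-1/2}\)); in the DWT layers these statistics are estimated separately for the source batch \(B^s\) and the target batch \(B^t\).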

The goal of BW is to project the source and target feature distributions onto a common zero-centered, spherical distribution.
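To make the BW step more concrete, here is a minimal PyTorch sketch of ZCA-style whitening applied to one batch of features. It is not the authors' code: the function name batch_whitening, the eigendecomposition-based computation of \(\Sigma_B^{-1/2}\) and the eps value are my own choices, and the sketch omits the running statistics and learned affine parameters that a full DWT layer would also need.

```python
import torch

def batch_whitening(x, eps=1e-5):
    """ZCA-style whitening of a mini-batch of features x with shape (N, C).

    Sketch only: in a DWT-like layer this would be applied separately to the
    source batch and to the target batch, so that each domain is whitened
    with its own mean and covariance.
    """
    n, c = x.shape
    mu = x.mean(dim=0, keepdim=True)                 # batch mean (1, C)
    xc = x - mu                                      # centered features
    cov = xc.t() @ xc / (n - 1)                      # batch covariance (C, C)
    cov = cov + eps * torch.eye(c, device=x.device, dtype=x.dtype)
    s, u = torch.linalg.eigh(cov)                    # eigen-decomposition
    w = u @ torch.diag(s.clamp_min(eps).rsqrt()) @ u.t()  # W = Sigma^{-1/2}
    return xc @ w                                    # W (x - mu), W symmetric

# Example usage with hypothetical source/target feature batches:
# feats_s = batch_whitening(features_source)   # (N, C) source features
# feats_t = batch_whitening(features_target)   # (N, C) target features
```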
Min-Entropy Consensus Loss
The second contribution of this paper is the overall loss that they minimize (the equations below are written from the paper's definitions as I recall them, so the exact notation should be taken as a sketch):
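\[
L(\theta) = L^s(\theta) + \lambda \, L^t(\theta) ,
\]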

where the first term is the usual cross-entropy loss over the labeled source training data,
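\[
L^s(\theta) = -\frac{1}{m} \sum_{(x_i^s,\, y_i^s) \in B^s} \log p_\theta\!\left(y_i^s \mid x_i^s\right) ,
\]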

whereas the second term is the Min-Entropy Consensus (MEC) loss, computed on the unlabeled target data:
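\[
L^t(\theta) = \frac{1}{m} \sum_{i=1}^{m} \ell^t\!\left(x_{1,i}^t,\, x_{2,i}^t\right),
\qquad
\ell^t\!\left(x_1^t, x_2^t\right) = -\frac{1}{2} \max_{y \in \mathcal{Y}} \left[ \log p_\theta\!\left(y \mid x_1^t\right) + \log p_\theta\!\left(y \mid x_2^t\right) \right] ,
\]
where \(x_{1,i}^t\) and \(x_{2,i}^t\) are the two perturbed copies of the \(i\)-th target image in the batches \(B_1^t\) and \(B_2^t\), and \(m\) is the batch size.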

Since the target data are unlabeled, the goal of this loss is to encourage the network to produce consistent predictions for two batches \(B_1^t\) and \(B_2^t\) that contain the same target images up to some noise (i.e., two differently perturbed copies of each image).
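To illustrate the consensus idea, here is a minimal PyTorch sketch of the MEC term, assuming logits1 and logits2 are the network outputs for the two perturbed copies of the same target images; the function name mec_loss and the final averaging over the batch are my own choices, not the authors' implementation.

```python
import torch
import torch.nn.functional as F

def mec_loss(logits1, logits2):
    """Min-Entropy Consensus term for two perturbed views of the same target batch.

    logits1, logits2: (N, num_classes) outputs for the two copies of the images.
    For each sample, pick the class y that maximizes the summed log-probability
    of the two views, penalize its (negative, halved) value, then average.
    Sketch following the paper's description, not the official implementation.
    """
    logp1 = F.log_softmax(logits1, dim=1)
    logp2 = F.log_softmax(logits2, dim=1)
    joint = logp1 + logp2                          # log p(y|x1) + log p(y|x2)
    per_sample = -0.5 * joint.max(dim=1).values    # -1/2 * max_y [...]
    return per_sample.mean()                       # average over the batch

# total_loss = cross_entropy_on_source + lam * mec_loss(logits_t1, logits_t2)
```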
Results
The method achieves state-of-the-art results on several source-target dataset pairs:
