Deep Extreme Cut: From Extreme Points to Object Segmentation
Description
data:image/s3,"s3://crabby-images/01902/01902de4653a669dab0e04e24739b44cefced1b3" alt=""
This paper presents a semi-automatic deep learning segmentation method. The idea is quite simple but the results beat state-of-the-art solutions.
To segment an object, the user is asked to select 4 keypoints : the topmost, the bottommost, the leftmost and the rightmost points. Each point is then associated to a Gaussian kernel printed in a 2D image. This 2D image is concatenated to the RBG input image thus leading to a 4-modality input image. To improve results, the feed to the network a dilated cropped window around the 4 points.
The proposed network is a modified ResNet101 without the last layers, without max poolings and with some dilated convolutions to make sure the output has the same size than the input.
Results
They report a series of ablation results, but at the end of the day, they report state-of-the-art results.
data:image/s3,"s3://crabby-images/8ffa1/8ffa1ba3b909f9c1a9e539b49e360d492a9c25f4" alt=""
data:image/s3,"s3://crabby-images/e1ab7/e1ab74cfb41745ffdd7405b637794a36e8202ba9" alt=""