[14]
Mask R-CNN. He K M, Gkioxari G, Dollár P, Girshick R. Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV). 2017
[15]
MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer. Mehta S, Rastegari M. 2021
[16]
Exploring the limits of weakly supervised pretraining. Mahajan D, Girshick R, Ramanathan V, He K M, Paluri M, Li Y X, et al. Proceedings of the 15th European Conference on Computer Vision (ECCV). 2018
[17]
Convolutional xformers for vision. Jeevan P, Sethi A. 2022
[18]
Masked-attention mask transformer for universal image segmentation. Cheng B W, Misra I, Schwing A G, Kirillov A, Girdhar R. 2021
[19]
Demystifying local vision transformer: Sparse connectivity, weight sharing and dynamic weight. Han Q, Fan Z J, Dai Q, Sun L, Cheng M M, Liu J Y, et al. 2021
[20]
Unified perceptual parsing for scene understanding. Xiao T T, Liu Y C, Zhou B L, Jiang Y N, Sun J. Proceedings of the 15th European Conference on Computer Vision (ECCV). 2018