We introduce a novel dual branch network called Swin Vision Transformer Net (SVTNet), where the Swin Transformer and Vision Transformer are combined to learn features with global and local information ...
This paper proposed a 3D swin transformer with multi-task joint learning framework, to simultaneously learn multiple tasks for hyperspectral tongue images. Based on the 3D swin transformer model, the ...
Ultralytics YOLOv8 is a cutting-edge, state-of-the-art (SOTA) model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and ...
The Traffic Light Detection and Classification project aims to enhance autonomous driving systems by accurately detecting and classifying traffic lights. The model is designed to generate appropriate ...