You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
based on yolo-high-level project (detect\pose\classify\segment\):include yolov5\yolov7\yolov8\ core ,improvement research ,SwintransformV2 and Attention Series. training skills, business customization, engineering deployment C
This study presents a novel multimodal fusion technique for disaster identification in Bangla, combining text and image data using the "BanglaCalamityMMD" dataset. Employing DisasterTextNet, DisasterImageNet, and DisasterMultFusionNet, the approach addresses a key gap in Bangla disaster research.
This repository contains code used to perform image retrieval using transformers. It is a demonstration of how using vision transformers, metric learning, and a novel loss based on differential cross-entropy can lead to better retrieval than classical CNN-based methods.
A deep learning project that classifies seven types of skin lesions using the HAM10000 dataset. Among four tested models, Swin Transformer achieved the best accuracy of 88.9%, showing AI’s potential in early skin cancer detection.
This study presents a hybrid multimodal fusion technique for disaster identification in Bangla, combining text and image data using the "BanglaCalamityMMD" dataset. Employing DisasterTextNet, DisasterImageNet, and DisasterMultFusionNet, the approach addresses a key gap in Bangla disaster research.