Object Detection, Semantic and Instance Segmentation -
By Team CodeDevs for Competition Manthan21
Code will be released prior Idea Submission Selection.
- Vehicles (Truck, Bus, Boat, Airplane)
- Roads Signage - (Zebra Crossing, Traffic Light)
- Man-Made Architectures - (Buildings, Bridges)
COCO- TheMS COCO (Microsoft Common Objects in Context)dataset is alarge-scale object detection,segmentation,key-point detection, andcaptioning dataset. The dataset consists of 328K images.CityScapes- Cityscapes is a large-scale database which focuses on semantic understanding of urban street scenes. It providessemantic,instance-wise, anddense pixel annotationsfor 30 classes grouped into 8 categories (flat surfaces, humans, vehicles, constructions, objects, nature, sky, and void). Data was captured in 50 cities during several months, daytimes, and good weather conditions with over 25k images.
- Fully Working Web application : Allowing the user to Upload Image.
- Perform
Instance SegmentationplusObject Detection- CreatingAnnotationsover the Uploaded Image withBounding BoxesandClass Names, andPixel Labelling. - Displaying the Output Image with the Annotated Object on it.
- Our web app takes an input image from the user using
JavaScript - The respective image gets saved in the locally hosted centralised
SQL database. - The model will fetch the object and will detect the same, using the libraries
tensorflow,pytorch&pixellibwithDeep Learning Modelssuch asPointRendandMobileNetV3. - The input image gets annotated using
cv2libraries. - The
annotatedobjects that has been detected, gets displayed along with the original uploaded image via theDjango Backend. - The model gives results in the form of a
JSON (JavaScript Object Notation)format and the output is displayed withCSSandHTMLwebsite on the local web server.
Object Detection is a computer vision technique for locating instances of objects in images or videos. Object detection algorithms typically leverage machine learning or deep learning to produce meaningful results. When humans look at images or video, we can recognize and locate objects of interest within a matter of moments. The goal of object detection is to replicate this intelligence using a computer.
Instance Segmentation is identifying each object instance for every known object within an image. Instance segmentation assigns a label to each pixel of the image. It is used for tasks such as counting the number of objects in an image along with object localization.
For performing segmentation of the objects in images and videos, PixelLib library is used, and so we have invoked the same in our respective project.
PixelLib provides support for Pytorch and it uses PointRend for performing more accurate and real time instance segmentation of objects in images and videos. Hence, annotations over the image takes place once the work is done.
The implementation of the MobileNetV3 architecture follows closely the original paper and it is customizable and offers different configurations for building Classification, Object Detection and Semantic Segmentation backbones.
Furthermore, it was designed to follow a similar structure to MobileNetV2 and the two share common building blocks. The MobileNetV3 class is responsible for building a network out of the provided configuration. The models are then adapted and applied to the tasks of object detection and semantic segmentation.
For the task of semantic segmentation (or any dense pixel prediction), we propose a new efficient segmentation decoder to achieve new state of the art results for mobile classification, detection and segmentation. Finally, the project tries to faithfully implement MobileNetV3 for real-time semantic segmentation, with the aims of being efficient, easy to use, and extensible.
Object Detection
Instance Segmentation
~By CodeDevs



