πŸ•ΈοΈ
Deep Learning
  • πŸ’«Deep Learning Notes
  • πŸ’ΌPractical Tools
  • πŸ’ŽConcepts of Neural Networks
    • 🌱Introduction
    • πŸ”ŽThe Problem in General
    • πŸ‘·β€β™€οΈ Implementation Notes
    • πŸ“šCommon Concepts
    • πŸ’₯Activation Functions
    • 🎈Practical Aspects
    • πŸ‘©β€πŸ”§ NN Regularization
    • ✨Optimization Algorithms
    • 🎨Softmax Regression
    • πŸƒβ€β™€οΈ Introduction to Tensorflow
    • πŸ‘©β€πŸ’» Python Code Snippets
  • πŸ™‹β€β™€οΈ Hello World of Deep Learning with Neural Networks
    • 🌱Introduction
    • 🌐CNNs In Browser
  • πŸšͺIntroduction to Computer Vision
    • 🌱Introduction
  • 🚩Concepts of Convolutional Neural Networks
    • 🌱Introduction
    • πŸ“ŒCommon Concepts
    • 🌟Advanced Concepts
    • πŸ‘€Visualization
    • πŸ‘΅Classic Networks
    • ✨Other Approaches
    • πŸ•ΈοΈCommon Applications
  • πŸ‘©β€πŸ’» Works and Notes on CNNs
    • 🌱Introduction
  • πŸ’„Popular Strategies of Deep Learning
    • 🌱Introduction
    • πŸš™Transfer Learning
    • πŸ“šOther Strategies
  • 🀑Image Augmentation
    • 🌱Introduction
  • πŸ€Έβ€β™€οΈ Notes on Applied Machine Learning
    • 🌱Introduction
    • πŸ‘©β€πŸ”§ Notes on Structuring Machine Learning Projects
    • πŸ‘©β€πŸ« Implementation Guidelines
  • πŸ•΅οΈβ€β™€οΈ Basics of Object Detection
    • 🌱Introduction
    • β­•Region-Based CNNs
    • 🀳SSD and YOLO
    • πŸ€–TensorFlow Object Detection API
    • 🐞Model Debugging
  • ➰Sequence Models In Deep Learning
    • 🌱Introduction
    • πŸ“šGeneral Concepts
    • πŸ”„Recurrent Neural Networks
    • 🌌Vanishing Gradients with RNNs
    • 🌚Word Representation
    • πŸ’¬Mixed Info On NLP
  • πŸ’¬NLP
    • 🌱Introduction
  • πŸ’¬Applied NLP
    • πŸ™ŒπŸ» Handling texts
    • 🧩Regex
  • πŸ‘€Quick Visual Info
  • πŸ“šPDFs that I found and recommend
Powered by GitBook
On this page
  • πŸ˜‰ You Only Look Once
  • πŸŽ€ Advantages
  • πŸ™„ Disadvantages
  • πŸ€Έβ€β™€οΈ SSD
  • πŸ‘©β€πŸ« Details
  • πŸ€“ Long Story Short
  • πŸ•΅οΈβ€β™€οΈ Evaluation
  • πŸ‘€ Visualization
  • 🚫 SSD vs YOLO
  • 🧐 References

Was this helpful?

Export as PDF
  1. πŸ•΅οΈβ€β™€οΈ Basics of Object Detection

SSD and YOLO

Single Shot Detectors and You Only Look Once

πŸ˜‰ You Only Look Once

  • πŸ’₯ The approach involves a single neural network trained end to end

    • It takes an image as input and predicts bounding boxes and class labels for each bounding box directly.

  • πŸ˜• The technique offers lower predictive accuracy (e.g. more localization errors) Compared with region based models

  • βž— YOLO divides the input image into an SΓ—S grid. Each grid cell predicts only one object

πŸ‘·β€β™€οΈ Long Story Short: The system divides the input image into an S Γ— S grid. If the center of an object falls into a grid cell, that grid cell is responsible for detecting that object.

πŸŽ€ Advantages

  • πŸš€ Speed

  • πŸ€Έβ€β™€οΈ Feasible for real time applications

πŸ™„ Disadvantages

  • πŸ˜• Poor performance on small-sized objects

    • It tends to give imprecise object locations.

TODO: Compare versions of YOLO

πŸ€Έβ€β™€οΈ SSD

  • πŸ’₯ Predicts objects in images using a single deep neural network.

  • πŸ€“ The network generates scores for the presence of each object category using small convolutional filters applied to feature maps.

  • ✌ This approach uses a feed-forward CNN that produces a collection of bounding boxes and scores for the presence of certain objects.

  • ❗ In this model, each feature map cell is linked to a set of default bounding boxes

πŸ‘©β€πŸ« Details

  • πŸ–ΌοΈ After going through a certain of convolutions for feature extraction, we obtain a feature layer of size mΓ—n (number of locations) with p channels, such as 8Γ—8 or 4Γ—4 above.

    • And a 3Γ—3 conv is applied on this mΓ—nΓ—p feature layer.

  • πŸ“ For each location, we got k bounding boxes. These k bounding boxes have different sizes and aspect ratios.

    • The concept is, maybe a vertical rectangle is more fit for human, and a horizontal rectangle is more fit for car.

  • πŸ’« For each of the bounding boxes, we will compute c class scores and 4 offsets relative to the original default bounding box shape.

πŸ€“ Long Story Short

The SSD object detection algorithm is composed of 2 parts:

  • Extract feature maps

  • Apply convolution filters to detect objects.

πŸ•΅οΈβ€β™€οΈ Evaluation

  • Better accuracy compared to YOLO

  • Better speed compared to Region based algorithms

πŸ‘€ Visualization

🚫 SSD vs YOLO

🧐 References

PreviousRegion-Based CNNsNextTensorFlow Object Detection API

Last updated 4 years ago

Was this helpful?

🀳
A 2019 Guide to Object Detection
Review: SSD β€” Single Shot Detector (Object Detection) ✨
More on YOLO