jennypng

Object Detection

Jun 04, 2025 · 1 min read

extends Recognition: not just identifying objects (what is it?), but also localizing them (where is it?)

Involves

localization (positions of objects - bounding boxes)
recognition - classification of each detected object

Challenging task. Key benchmarks include:

PASCAL VOC (20 categories).
ImageNet (ILSVRC detection task, 200 categories).
COCO (80 categories, highly diverse).

evaluation

intersection over union (IoU)

measures localization accuracy: $\mathrm{IoU} = \frac{\text{Area of Overlap}}{\text{Area of Union}}$
IoU = 1: perfect detection (predicted box matches the ground truth exactly)
IoU = 0: no overlap (incorrect)
IoU >= 0.5 is commonly treated as acceptable
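
A minimal sketch of the IoU formula for two axis-aligned boxes. The `(x1, y1, x2, y2)` corner format and the function name `iou` are assumptions made for illustration, not something the notes specify.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Overlap rectangle; width and height clamp to 0 when the boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter  # overlap is counted once, not twice

    return inter / union if union > 0 else 0.0


print(iou((0, 0, 2, 2), (0, 0, 2, 2)))  # identical boxes
print(iou((0, 0, 1, 1), (2, 2, 3, 3)))  # disjoint boxes
print(iou((0, 0, 2, 2), (1, 0, 3, 2)))  # half of each box overlaps
```

The third call has overlap area 2 and union area 6, so it falls below the 0.5 cutoff even though half of each box is covered.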

metrics: precision and recall

True Positive: correct detection (IoU >= threshold)
False Positive: incorrect detection (IoU < threshold)
False Negative: missed detection (ground truth not detected)

Example: if a model predicts 3 boxes (1 correct, 2 incorrect) and there are 2 ground truths
- precision = TP / (TP + FP) = 1/3
- recall = TP / (TP + FN) = 1/2
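
The counting above can be sketched as follows. This is a simplified illustration, assuming boxes are `(x1, y1, x2, y2)` tuples and that each ground truth may be matched at most once; the greedy matching and the helper name `precision_recall` are choices made here. Benchmark evaluation code usually also sorts predictions by confidence, which this sketch leaves out.

```python
def iou(a, b):
    """IoU of two (x1, y1, x2, y2) boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def precision_recall(predictions, ground_truths, threshold=0.5):
    """Greedily match each prediction to its best unmatched ground truth."""
    matched = set()  # indices of ground truths already claimed by a prediction
    tp = 0
    for pred in predictions:
        best_iou, best_idx = 0.0, None
        for idx, gt in enumerate(ground_truths):
            if idx in matched:
                continue
            score = iou(pred, gt)
            if score > best_iou:
                best_iou, best_idx = score, idx
        if best_idx is not None and best_iou >= threshold:
            matched.add(best_idx)
            tp += 1
    fp = len(predictions) - tp    # predictions with no valid match
    fn = len(ground_truths) - tp  # ground truths nobody detected
    precision = tp / (tp + fp) if predictions else 0.0
    recall = tp / (tp + fn) if ground_truths else 0.0
    return precision, recall


# Mirrors the example: 3 predicted boxes (1 correct, 2 incorrect), 2 ground truths.
ground_truths = [(0, 0, 10, 10), (20, 20, 30, 30)]
predictions = [(1, 1, 10, 10), (50, 50, 60, 60), (12, 12, 18, 18)]
print(precision_recall(predictions, ground_truths))
```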

traditional methods

sliding window

Enumerate possible locations using a sliding window.
Run a recognition model on each window.
Aggregate detections across scales.

Challenges:

Objects vary in size → Need multiple window sizes.
Computationally expensive.
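
A rough sketch of the sliding-window loop, under stated assumptions: the image is described only by its width and height, and `score_fn` is a hypothetical stand-in for the recognition model (it takes a box and returns a score). The names `sliding_windows` and `detect` are illustrative.

```python
def sliding_windows(image_w, image_h, win_w, win_h, stride):
    """Yield (x1, y1, x2, y2) for every window that fits fully inside the image."""
    for y in range(0, image_h - win_h + 1, stride):
        for x in range(0, image_w - win_w + 1, stride):
            yield (x, y, x + win_w, y + win_h)


def detect(image_w, image_h, window_sizes, stride, score_fn, threshold):
    """Score every window at every size and keep those above the threshold."""
    detections = []
    for win_w, win_h in window_sizes:  # multiple sizes to cover objects of varying size
        for box in sliding_windows(image_w, image_h, win_w, win_h, stride):
            score = score_fn(box)      # hypothetical recognition model
            if score >= threshold:
                detections.append((box, score))
    return detections


# One window size already produces thousands of model evaluations.
print(len(list(sliding_windows(640, 480, 64, 64, 8))))
```

With a 640x480 image, a 64x64 window, and a stride of 8, this enumerates 73 x 53 = 3869 windows for a single size, which illustrates why the approach is computationally expensive.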

image pyramid

to handle different object sizes

fix the window size but resize the image (creating a pyramid)
this is equivalent to varying the window size on the original image
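
A small sketch of why resizing the image is equivalent to varying the window size. It only computes the pyramid level sizes and maps boxes back to original coordinates; actual image resampling is omitted. The `scale` and `min_size` parameters and both function names are assumptions for illustration.

```python
def pyramid_sizes(image_w, image_h, scale=0.5, min_size=32):
    """Return [(factor, width, height), ...] for successively downscaled copies."""
    if not 0 < scale < 1:
        raise ValueError("scale must be between 0 and 1 (exclusive)")
    if min_size < 1:
        raise ValueError("min_size must be at least 1")
    levels = []
    factor = 1.0
    while True:
        w, h = int(image_w * factor), int(image_h * factor)
        if w < min_size or h < min_size:  # stop once a fixed window no longer fits
            break
        levels.append((factor, w, h))
        factor *= scale
    return levels


def to_original_coords(box, factor):
    """Map a box found on a downscaled level back to the original image."""
    return tuple(v / factor for v in box)


print(pyramid_sizes(256, 128, scale=0.5, min_size=32))
# A fixed 32x32 window on the quarter-size level covers 128x128 original pixels.
print(to_original_coords((0, 0, 32, 32), 0.25))
```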

deformable parts (DPM)

part-based detection: an object is modeled as a set of parts whose relative positions are allowed to deform

linear models for classification

perceptron
softmax
loss function: cross-entropy
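
A minimal sketch of a linear classifier with softmax and cross-entropy loss, using only the standard library. The weights, bias, and input below are made-up numbers, and the function names are illustrative.

```python
import math


def linear_scores(weights, bias, x):
    """One score per class: the dot product of each weight row with x, plus a bias."""
    return [sum(w_i * x_i for w_i, x_i in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, target_index):
    """Negative log-probability assigned to the correct class."""
    return -math.log(probs[target_index])


# Made-up 2-class model on a 3-dimensional input.
weights = [[0.5, -0.2, 0.1], [-0.3, 0.8, 0.0]]
bias = [0.0, 0.1]
x = [1.0, 2.0, 3.0]

probs = softmax(linear_scores(weights, bias, x))
print(probs, cross_entropy(probs, target_index=1))
```

The loss is small when the model puts high probability on the correct class and grows without bound as that probability approaches 0.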

optimization: gradient descent and backpropagation

neural networks
