jennypng

Recent Notes

Computer Vision
Jun 04, 2025
Linear Discriminant Analysis (LDA)
Jun 04, 2025
Object Detection
Jun 04, 2025
backpropagation
Jun 04, 2025
classifier
Jun 04, 2025

See 311 more →

❯

Recognition

Jun 04, 20251 min read

assigning semantic labels to pixels, regions, or entire images.

For example:

Image-level: Does this image contain a building? (Answer: Yes/No)
Object-level: Where is the car? What are the people doing?
Attribute-level: What material is the building made of? What time does the clock show? Recognition tasks vary in granularity:
Category-level: Detecting any cereal box.
Instance-level: Identifying a specific cereal box (e.g., Kellogg’s).

Challenges

scale of categories
- defining categories is also subjective
variations
- viewpoint
- illumination
- scale
- deformation (non-rigid objects)
- occlusion
- clutter

recognition pipeline

feature extraction
train a classifier - learn function f that maps features to labels
testing: apply f to new, unseen images

feature representation

color histograms - simple, not robust to scale
SIFT - local detector - scale-invariant, good for keypoints
deep learning - highly flexible but data heavy
k-nearest neighbors (KNN)

how to choose features?

(i think rgb is scale invariant tho)

Graph View

Backlinks

Computer Vision
Object Detection

Created with Quartz v4.5.0 © 2025

GitHub
Discord Community