멀티미디어 - Computer Vision 4 컴퓨터 비전 (recognition, skin detection classifier, bayes rule estimation, visual recognition, generative discriminative model, bag of word)

728x90

• Recognition

• Skin detection

• Skin classifier

• Probabilistic skin classification

• Bayes rule

• Visual Recognition

• ★ Generative vs. Discriminative Models

• Bag‐of‐words models

• Weakness of BoW

Recognition

Skin detection

피부 픽셀은 RGB혹은 HSV로 표현하는 색상 영역에 해당한다

픽셀 x가 해당 영역에 속하면 피부로 분류한다

example로 피부영역을 학습하고 training image에서 수동으로 피부/비피부 픽셀을 라벨링한다

RGB 공간에 훈련 데이터를 띄운다

피부 픽셀은 주황색 비피부는 회색으로 표시한다

일부 피부 픽셀이 영역 밖에 있을수도, 비피부 픽셀이 영역 안에 있을 수도 있다

Skin classifier

새로운 값 x가 주어지면 피부인지 결정하는 방법

x와 가장 가까운 라벨링된 픽셀을 찾는 nearest neighbor 기법이 있다

SVM을 이용해서 두 클래스를 구분하는 평면/곡선을 찾는다

각 클래스에 확률 밀도/분포 모델을 맞추는 데이터 모델링을 한다

Probabilistic skin classification

확률 분포를 histograming해서 확률밀도함수 PDF와 불확실성 함수를 모델링한다

skin classifier

새로운 pixel x가 주어지면 skin인지를 확률로 해석한다

Bayes rule

skin 확률분포를 구하기 위해서 bayes rule을 사용한다

확률 P(skin)는 domain knowledge로 train set에서 피부 픽셀의 비율이다

posterior 확률을 최대화하여서 라벨을 선택하여 잘못 분류할 확률을 최소화하기 위해

Maximum A Posteriori(MAP) 추정을 사용한다

Visual Recognition

★ Generative vs. Discriminative Models

generative method

클래스 조건부 확률 밀도함수(PDF)와 prior 확률을 모델링한다

generative한 이유는 샘플링을 통해 합성 데이터를 생성할 수 있기 때문이다

gaussian, bayes, mixture of gaussian, HMM 모델 등이 해당한다

discriminative method

posterior 확률을 직접 추정한다

기본 확률 분포를 모델링할 필요가 없다

주어진 작업에 계산 자원을 집중해서 성능을 향상시킨다

SVM, Nearest neighbor, CRF 모델 등이 해당한다

만약 음성의 언어를 판단하는 상황이라고 가정하면

genrative method는 각 언어를 학습하고 말이 어떤 언어에 속하는지 결정한다

discriminative method는 언어를 학습하지 않고 언어적인 차이를 결정하여 더 쉽다

Bag‐of‐words models

Weakness of BoW

- 객체를 구성하는 요소의 기하학적 정보가 부족하다

- view point invariance와 scale invariance에 대해서 광범위한 테스트가 없다

- segmentation과 localization이 불분명하다

이러한 단점 때문에

기하학적 제약 하에 N개 부분 집합으로 객체 클래스를 표현하는 constellation model(별자리 모델)이 나왔다

728x90

저작자표시 비영리 변경금지

'개발 · 컴퓨터공학' 카테고리의 다른 글

OOTDiffusion을 이용한 PyQT 애플리케이션 만들기 (파이썬 GUI 프로그램, QT 애플리케이션, virtual try on, 가상피팅, diffusion model) (0)	2024.06.14
멀티미디어 - Motion 모션 (optical flow, estimating optical flow, brightness constancy contraint, lucas-kanade flow) (0)	2024.06.13
멀티미디어 - Computer Vision 3 컴퓨터 비전 (segmentation, gestalt theory, clustering, k-means, probabilistic, mixture of gaussian, expectation maximization, mean shift, histogram-based segmentation, images as graphs) (1)	2024.06.11
멀티미디어 - Computer Vision 2 컴퓨터 비전 (stereo vision, epipolar constriant, model fitting, hough transform, RANSAC) (2)	2024.06.10
멀티미디어 - Computer Vision 1 컴퓨터 비전 (local feature, edge and corner, gradient, canny edge detection, harris corner detector, eigenvector, invariance, MOPS, SIFT) (1)	2024.06.09