+ All Categories
Home > Documents > Probabilistic Person Identification in TV Seriesmtapaswi/... · Person identification in multimedia...

Probabilistic Person Identification in TV Seriesmtapaswi/... · Person identification in multimedia...

Date post: 11-Aug-2020
Category:
Upload: others
View: 1 times
Download: 0 times
Share this document with a friend
1
KIT University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association “Knock! Knock! Who is it?” Probabilistic Person Identification in TV Series Makarand Tapaswi, Martin Bäuml, Rainer Stiefelhagen Computer Vision for Human Computer Interaction, Karlsruhe Institute of Technology, Germany Motivation Person identification in multimedia data (movies / TV series) has many applications ranging from smart video browsing, video summarization, retrieval of favorite actor clips, etc. to building person-specific models for action recognition or character profiling. Face Recognition Block DCT features 1-vs-All SVM Output as confidence Probabilistic Identification Model Divide the episode into scenes and shots In each shot, optimize node ID for each person track Associate clothing and face information Incorporate speaker through concept of presence Ensure identities of co-occurring people are unique Inference by energy minimization in the MRF Results The Big Bang Theory Season 01, Episodes 01 06 Video Analysis Acknowledgment: This work was supported by the Quaero Programme, funded by OSEO; and BMBF contract no. 01ISO9052E. Clothing Clustering and Identification Person detection and tracking Color histogram, agglomerative clustering Assign face id to cluster when face majority exists Compare clothing for others to obtain identity Clustering Assignment Leonard Face Match Assigned Cluster Sheldon Sheldon Unassigned Cluster Clothing Match Cluster Types Tracking 1 2 3 Contact {makarand.tapaswi, baeuml}@kit.edu Computer Vision for Human Computer Interaction http://cvhci.anthropomatik.kit.edu Project page (tracks, ground truth, etc.) http://cvhci.anthropomatik.kit.edu/~mtapaswi/projects/personid.html Major Contributions Shift focus from face tracks to person tracks, leverage the temporal structure of TV series episodes Automatically learn clothing models using face recognition results Model the person identification task using a Markov Random Field Clothing wrong Face corrects Face wrong Clothing corrects Uniqueness Constraint No Face Characters identified! F C ID F C ID F C ID Presence S Scene 1 Scene 2 F C ID F C ID Presence S E C E F E S E U Shot Boundaries Special Sequences Alternating Shots Person Recognition Acc Max Prior (Sheldon) 27.5 Face Only 63.1 Clothing Only 76.2 Clothing + Face 79.8 Full Model 82.6 Face Recognition Acc Face Only 71.8 Clothing + Face 79.8 Full Model 83.2 Block DCT 1vsAll SVM Face Tracking MCT-based multi-pose face detector Particle filter tracker
Transcript
Page 1: Probabilistic Person Identification in TV Seriesmtapaswi/... · Person identification in multimedia data (movies / TV series) has many applications ranging from smart video browsing,

KIT – University of the State of Baden-Wuerttemberg and

National Research Center of the Helmholtz Association

“Knock! Knock! Who is it?”

Probabilistic Person Identification in TV Series Makarand Tapaswi, Martin Bäuml, Rainer Stiefelhagen

Computer Vision for Human Computer Interaction, Karlsruhe Institute of Technology, Germany

Motivation Person identification in multimedia data (movies / TV

series) has many applications ranging from smart video

browsing, video summarization, retrieval of favorite actor

clips, etc. to building person-specific models for action

recognition or character profiling.

Face Recognition

• Block DCT features

• 1-vs-All SVM

• Output as confidence

Probabilistic Identification Model

• Divide the episode into scenes and shots

• In each shot, optimize node ID for each person track

• Associate clothing and face information

• Incorporate speaker through concept of presence

• Ensure identities of co-occurring people are unique

• Inference by energy minimization in the MRF

Results The Big Bang Theory Season 01, Episodes 01 – 06

Video Analysis

Acknowledgment: This work was supported by the Quaero Programme, funded by OSEO; and BMBF contract no. 01ISO9052E.

Clothing Clustering and Identification

• Person detection and tracking

• Color histogram, agglomerative clustering

• Assign face id to cluster when face majority exists

• Compare clothing for others to obtain identity

Clustering Assignment

Leonard

Face

Match Assigned Cluster

Sheldon Sheldon

Unassigned Cluster

Clothing

Match

Cluster Types Tracking

1 2

3

Contact

{makarand.tapaswi, baeuml}@kit.edu

Computer Vision for Human Computer Interaction

http://cvhci.anthropomatik.kit.edu

Project page (tracks, ground truth, etc.) http://cvhci.anthropomatik.kit.edu/~mtapaswi/projects/personid.html

Major Contributions

Shift focus from face tracks to person tracks, leverage the temporal structure of TV series episodes

Automatically learn clothing models using face recognition results

Model the person identification task using a Markov Random Field

Clothing wrong

Face corrects

Face wrong

Clothing corrects

Uniqueness

Constraint

No Face

Characters identified!

F C

ID

F C

ID

F C

ID

Presence

S

Scene 1 Scene 2

F C

ID

F C

ID

Presence

S

EC EF

ES

EU

… …

Shot Boundaries Special Sequences Alternating Shots

Person Recognition Acc

Max Prior (Sheldon) 27.5

Face Only 63.1

Clothing Only 76.2

Clothing + Face 79.8

Full Model 82.6

Face Recognition Acc

Face Only 71.8

Clothing + Face 79.8

Full Model 83.2

Block

DCT

1vsAll

SVM

Face Tracking

• MCT-based multi-pose

face detector

• Particle filter tracker

Recommended