Abstract

The recognition of objects and classes of objects is of importance in the field of computer vision due to its applicability in areas such as video surveillance, medical imaging and retrieval of images and videos from large databases on the Internet. Effective recognition of object classes is still a challenge in vision; hence, there is much interest to improve the rate of recognition in order to keep up with the rising demands of the fields where these techniques are being applied. This thesis investigates the recognition of activities and expressions in video sequences using a new descriptor called the spatiotemporal shape context. The shape context is a well-known algorithm that describes the shape of an object based upon the mutual distribution of points in the contour of the object; however, it falls short when the distinctive property of an object is not just its shape but also its movement across frames in a video sequence. Since actions and expressions tend to have a motion component that enhances the capability of distinguishing them, the shape based information from the shape context proves insufficient. This thesis proposes new 3D and 4D spatiotemporal shape context descriptors that incorporate into the original shape context changes in motion across frames. Results of classification of actions and expressions demonstrate that the spatiotemporal shape context is better than the original shape context at enhancing recognition of classes in the activity and expression domains.

Library of Congress Subject Headings

Computer vision; Video recordings--Data processing; Pattern recognition systems; Motion perception (Vision)--Computer simulation; Classification--Data processing

Publication Date

7-1-2009

Document Type

Thesis

Department, Program, or Center

Computer Engineering (KGCOE)

Advisor

Savakis, Andreas

Advisor/Committee Member

Yang, Shanchieh

Advisor/Committee Member

Gaborski, Roger

Comments

Note: imported from RIT’s Digital Media Library running on DSpace to RIT Scholar Works. Physical copy available through RIT's The Wallace Library at: TA1634 .K46 2009

Recommended Citation

Kholgade, Natasha Prashant, "Recognition of human activities and expressions in video sequences using shape context descriptor" (2009). Thesis. Rochester Institute of Technology. Accessed from
https://repository.rit.edu/theses/3222

Campus

RIT – Main Campus

Download

COinS

Theses

Recognition of human activities and expressions in video sequences using shape context descriptor

Abstract

Library of Congress Subject Headings

Publication Date

Document Type

Department, Program, or Center

Advisor

Advisor/Committee Member

Advisor/Committee Member

Comments

Recommended Citation

Campus

Search

Browse

Author Corner

RIT Links

Theses

Recognition of human activities and expressions in video sequences using shape context descriptor

Author

Abstract

Library of Congress Subject Headings

Publication Date

Document Type

Department, Program, or Center

Advisor

Advisor/Committee Member

Advisor/Committee Member

Comments

Recommended Citation

Campus

Share

Search

Browse

Author Corner

RIT Links