Video Content Analysis Using Multimodal Information
For Movie Content Extraction, Indexing and Representation
(Sprache: Englisch)
With the fast growth ofmultimedia information, content-based video anal ysis, indexing and representation have attracted increasing attention in re cent years. Many applications have emerged in these areas such as video on-demand, distributed multimedia...
Voraussichtlich lieferbar in 3 Tag(en)
versandkostenfrei
Buch (Kartoniert)
Fr. 118.00
inkl. MwSt.
- Kreditkarte, Paypal, Rechnungskauf
- 30 Tage Widerrufsrecht
Produktdetails
Produktinformationen zu „Video Content Analysis Using Multimodal Information “
Klappentext zu „Video Content Analysis Using Multimodal Information “
With the fast growth ofmultimedia information, content-based video anal ysis, indexing and representation have attracted increasing attention in re cent years. Many applications have emerged in these areas such as video on-demand, distributed multimedia systems, digital video libraries, distance learning/education, entertainment, surveillance and geographical information systems. The need for content-based video indexing and retrieval was also rec ognized by ISOIMPEG, and a new international standard called "Multimedia Content Description Interface" (or in short, MPEG-7)was initialized in 1998 and finalized in September 2001. In this context, a systematic and thorough review ofexisting approaches as well as the state-of-the-art techniques in video content analysis, indexing and representation areas are investigated and studied in this book. In addition, we will specifically elaborate on a system which analyzes, indexes and abstracts movie contents based on the integration ofmultiple media modalities. Content ofeach part ofthis book is briefly previewed below. In the first part, we segment a video sequence into a set ofcascaded shots, where a shot consistsofone or more continuouslyrecorded image frames. Both raw and compressedvideo data will beinvestigated. Moreover, consideringthat there are always non-story units in real TV programs such as commercials, a novel commercial break detection/extraction scheme is developed which ex ploits both audio and visual cues to achieve robust results. Specifically, we first employ visual cues such as the video data statistics, the camera cut fre quency, and the existenceofdelimiting black frames between commercials and programs, to obtain coarse-level detection results.
Inhaltsverzeichnis zu „Video Content Analysis Using Multimodal Information “
Dedication. List of Figures. List of Tables. Preface. Acknowledgments.1: Introduction.
1. Audiovisual Content Analysis.
1.1. Audio Content Analysis.
1.2. Visual Content Analysis.
1.3. Audiovisual Content Analysis.
2. Video Indexing, Browsing and Abstraction.
3. MPEG-7 Standard.
4. Roadmap of The Book.
4.1. Video Segmentation.
4.2. Movie Content Analysis.
4.3. Movie Content Abstraction.
2: Background And Previous Work.
1. Visual Content Analysis.
1.1. Video Shot Detection.
1.2. Video Scene and Event Detection.
2. Audio Content Analysis.
2.1. Audio Segmentation and Classification.
2.2. Audio Analysis for Video Indexing.
3. Speaker Identification.
4. Video Abstraction.
4.1. Video Skimming.
4.2. Video Summarization.
5. Video Indexing and Retrieval.
3: Video Content Pre-Processing.
1. Shot Detection in Raw Data Domain.
1.1. YUV Color Space.
1.2. Metrics for Frame Differencing.
1.3. Camera Break Detection.
1.4. Gradual Transition Detection.
1.5. Camera Motion Detection.
1.6. Illumination Change Detection.
1.7. A Review of the Proposed System.
2. Shot Detection in Compressed Domain.
2.1. DC-image and DC-sequence.
3. Audio Feature Analysis.
4. Commercial Break Detection.
4.1. Features of A Commercial Break.
4.2. Feature Extraction.
4.3. The Proposed Detection Scheme.
5. Experimental Results.
5.1. Shot Detection Results.
5.2. Commercial Break Detection Results.
4: Content-Based Movie Scene And Event Extraction.
1. Movie Scene Extraction.
1.1. Sink-based Scene Construction.
1.2. Audiovisual-based Scene Refinement.
1.3. User Interaction.
2. Movie Event Extraction.
2.1. Sink Clustering and Categorization.
2.2. Event Extraction and Classification.
2.3. Integrating Speech and Face Information.
3. Experimental Results.
3.1. Scene Extraction Results.
3.2. Event Extraction Results.
5: Speaker Identification For Movies.
1. Supervised Speaker Identification for Movie Dialogs.
1.1. Feature Selection and
... mehr
Extraction.
1.2. Gaussian Mixture Model.
1.3. Likelihood Calculation and Score Normalization.
1.4. Speech Segment Isolation.
2. Adaptive Speaker Identification.
2.1. Face Detection, Recognition and Mouth Tracking.
2.2. Speech Segmentation and Clustering.
2.3. Initial Speaker Modeling.
2.4. Likelihood-based Speaker Identification.
2.5. Audiovisual Integration for Speaker Identification.
2.6. Unsupervised Speaker Model Adaptation.
3. Experimental Results.
3.1. Supervised Speaker Identification Results.
3.2. Adaptive Speaker Identification Results.
3.3. An Example of Movie Content Annotation.
6: Scene-Based Movie Summarization.
1. An Overview of the Proposed System.
2. Hierarchical Keyframe Extraction.
2.1. Scene Importance Computation.
2.2. Sink Importance Computation.
2.3. Shot Importance Computation.
2.4. Frame Importance Computation.
2.5. Keyframe Selection.
3. Scalable Movie Summarization and Navigation.
4. Experimental Results.
4.1. Keyframe Extraction Results.
4.2. User Study.
4.3. System Interface Design.
4.4. Applications.
7: Event-Based Movie Skimming.
1. Introduction.
2. An Overview of the Proposed System.
3. Extended Event Set Construction.
4. Extended Event Feature Extraction.
5. Video Skim Generation.
6. More Thoughts on the Video Skim.
6.1. When More Judging Rules Are Needed.
6.2. Sub-sampling the Video Skim.
6.3. Discovering the Story and Visual Structure.
7. Experimental Results.
8: Conclusion And Future Work.
1. Conclusion.
2. Future Work.
2.1. System Refinement.
2.2. New Research Topics. References. Index.
1.2. Gaussian Mixture Model.
1.3. Likelihood Calculation and Score Normalization.
1.4. Speech Segment Isolation.
2. Adaptive Speaker Identification.
2.1. Face Detection, Recognition and Mouth Tracking.
2.2. Speech Segmentation and Clustering.
2.3. Initial Speaker Modeling.
2.4. Likelihood-based Speaker Identification.
2.5. Audiovisual Integration for Speaker Identification.
2.6. Unsupervised Speaker Model Adaptation.
3. Experimental Results.
3.1. Supervised Speaker Identification Results.
3.2. Adaptive Speaker Identification Results.
3.3. An Example of Movie Content Annotation.
6: Scene-Based Movie Summarization.
1. An Overview of the Proposed System.
2. Hierarchical Keyframe Extraction.
2.1. Scene Importance Computation.
2.2. Sink Importance Computation.
2.3. Shot Importance Computation.
2.4. Frame Importance Computation.
2.5. Keyframe Selection.
3. Scalable Movie Summarization and Navigation.
4. Experimental Results.
4.1. Keyframe Extraction Results.
4.2. User Study.
4.3. System Interface Design.
4.4. Applications.
7: Event-Based Movie Skimming.
1. Introduction.
2. An Overview of the Proposed System.
3. Extended Event Set Construction.
4. Extended Event Feature Extraction.
5. Video Skim Generation.
6. More Thoughts on the Video Skim.
6.1. When More Judging Rules Are Needed.
6.2. Sub-sampling the Video Skim.
6.3. Discovering the Story and Visual Structure.
7. Experimental Results.
8: Conclusion And Future Work.
1. Conclusion.
2. Future Work.
2.1. System Refinement.
2.2. New Research Topics. References. Index.
... weniger
Bibliographische Angaben
- Autoren: Ying Li , C.C. Jay Kuo
- 2010, XXIII, 194 Seiten, Masse: 1,5 x 2,4 cm, Kartoniert (TB), Englisch
- Verlag: Springer, Berlin
- ISBN-10: 1441953655
- ISBN-13: 9781441953650
Sprache:
Englisch
Kommentar zu "Video Content Analysis Using Multimodal Information"
0 Gebrauchte Artikel zu „Video Content Analysis Using Multimodal Information“
Zustand | Preis | Porto | Zahlung | Verkäufer | Rating |
---|
Schreiben Sie einen Kommentar zu "Video Content Analysis Using Multimodal Information".
Kommentar verfassen