Article

Automatic generation of personalized music sports video

Authors:

Qi Tian

MULTIMEDIA '05: Proceedings of the 13th annual ACM international conference on Multimedia

Pages 735 - 744

https://doi.org/10.1145/1101149.1101309

Published: 06 November 2005

Abstract

In this paper, we propose a novel automatic approach to personalized music sports video generation. We address two research challenges: semantic sports video content selection and automatic video composition. For the first challenge, we propose multi-modal (audio, video, and text) feature analysis and alignment to detect the semantics of events in sports video. For the second challenge, we propose video-centric and music-centric composition schemes that automatically generate personalized music sports videos based on the user's preferences. The experimental results and user evaluations are promising and show that the music sports videos generated by our system are comparable to manually generated ones. The proposed approach greatly facilitates music sports video generation for both professionals and amateurs.
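
As an illustration only, a music-centric scheme can be pictured as letting the music set the timeline and fitting detected event clips to it. The abstract does not describe the actual algorithm, so the Python sketch below is an assumption throughout: the `Clip` fields, the preference `score`, the segment durations, and the greedy selection rule are all hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Clip:
    event: str       # hypothetical event label, e.g. "goal"
    duration: float  # clip length in seconds
    score: float     # hypothetical relevance to the user's preferences


def music_centric_compose(
    clips: List[Clip], segment_durations: List[float]
) -> List[Optional[Tuple[str, float]]]:
    """Greedy sketch (an assumption, not the paper's method).

    The music segments fix the timeline. Each segment is filled with the
    highest-scoring unused clip that is at least as long as the segment,
    trimmed to the segment length. A segment with no fitting clip is None.
    """
    remaining = sorted(clips, key=lambda c: c.score, reverse=True)
    timeline: List[Optional[Tuple[str, float]]] = []
    for seg in segment_durations:
        choice = next((c for c in remaining if c.duration >= seg), None)
        if choice is None:
            timeline.append(None)
            continue
        remaining.remove(choice)
        timeline.append((choice.event, seg))  # clip trimmed to the segment
    return timeline


# Toy inputs, invented for this example.
clips = [Clip("goal", 12.0, 0.9), Clip("foul", 5.0, 0.4), Clip("shot", 8.0, 0.7)]
timeline = music_centric_compose(clips, [6.0, 10.0, 4.0])
print(timeline)
```

With these toy inputs the second music segment stays unfilled, because no remaining clip is long enough to cover it. A video-centric variant would presumably reverse the roles, keeping the selected clips fixed and adapting the music to them, but that too is a guess about a scheme the abstract only names.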

Published In

MULTIMEDIA '05: Proceedings of the 13th annual ACM international conference on Multimedia

November 2005

1110 pages

ISBN:1595930442

DOI:10.1145/1101149

General Chairs: Hongjiang Zhang (Microsoft Research Asia, China); Tat-Seng Chua (National University of Singapore, Singapore)

Program Chairs: Ralf Steinmetz (Technische Universität Darmstadt, Germany); Mohan Kankanhalli (National University of Singapore, Singapore); Lynn Wilcox (FXPAL)

Copyright © 2005 ACM.

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Article

Conference

MM05

MM05: 2005 13th Annual ACM International Conference on Multimedia

November 6 - 11, 2005

Hilton, Singapore

Acceptance Rates

MULTIMEDIA '05 Paper Acceptance Rate: 49 of 312 submissions, 16%

Overall Acceptance Rate: 2,145 of 8,556 submissions, 25%
