skip to main content
10.1145/1101149.1101309acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Automatic generation of personalized music sports video

Published: 06 November 2005 Publication History

Abstract

In this paper, we propose a novel automatic approach for personalized music sports video generation. Two research challenges, semantic sports video content selection and automatic video composition, are addressed. For the first challenge, we propose to use multi-modal (audio, video and text) feature analysis and alignment to detect the semantic of events in sports video. For the second challenge, we propose video-centric and music-centric music video composition schemes to automatically generate personalized music sports video based on user's preference. The experimental results and user evaluations are promising and show that our system's generated music sports video is comparable to manually generated ones. The proposed approach greatly facilitates the automatic music sports video generation for both professionals and amateurs.

References

[1]
MuVee Technologies Pte. Ltd, "Muvee TM," 2000.
[2]
N. Adami, R. Leonardi, and P. Migliorati, "An overview of multi-modal techniques for the characterization of sport programmes," Proc. of SPIE-VCIP'03, pp. 1296--1306, July, 2003.
[3]
J. Wang, E. Chng, and C. Xu, "Soccer replay detection using scene transition structure analysis," Proc. of IEEE ICASSP'05, March 2005.
[4]
J. Wang, et al, "Event detection based on non-broadcast sports video," Proc. of IEEE ICIP'04, Nov. 2004.
[5]
A. Ekin, A. Tekalp, and R. Mehrotra, "Automatic soccer video analysis and summarization," IEEE Trans. on Image Processing, vol. 12:7, no. 5, pp. 796--807, 2003.
[6]
J. Assfalg, et al, "Semantic annotation of soccer videos: automatic highlights identification," Computer Vision and Image Understanding (CVIU), vol. 92, pp. 285--305, Nov. 2003.
[7]
N. Babaguchi and N. Nitta, "Intermodal collaboration: A strategy for semantic content analysis for broadcasted sports video," Proc. of IEEE ICIP'03, vol. 1, pp. 13--16, Sept. 2003.
[8]
X. Hua, L. Lu, and H. Zhang, "Automatic music video generation based on temporal pattern analysis," Proc. of ACM MultiMedia'04, pp. 472--475, Oct. 2004.
[9]
J. Foote, M. Cooper, and A. Girgensohn, "Creating music videos using automatic media analysis," Proc. of ACM MultiMedia'02, pp. 553--560, Dec. 2002.
[10]
H. Xu and T. Chua, "The fusion of audio-visual features and external knowledge for event detection in team sports video," Workshop on Multimedia Information Retrieval (MIR'04), Oct. 2004.
[11]
MediaWare Solutions Pte. Ltd (USA), "M2-edit pro TM," 2002.
[12]
H. Pan, B. Li, and M. Sezan, "Automatic detection of replay segments in broadcast sports programs by detection of logos in scene transitions," Proc. of IEEE ICASSP'02, May 2002.
[13]
V. Kobla, D. DeMenthon, and D. Doermann, "Detection of slow-motion replay sequences for identifying sports videos," Proc. IEEE Workshop on Multimedia Signal Processing, 1999.
[14]
H. Pan, B. Li, and M. Sezan, "Detection of slow-motion replay segments in sports video for highlights generation," Proc. of IEEE ICASSP'01, May 2001.
[15]
J. Wang, et al, "Automatic replay generation for soccer video broadcasting," Proc of ACM MultiMedia'04, pp. 31--38, Oct. 2004.
[16]
N. Babaguchi, Y. Kawai, and T. Kitahashi, "Event based indexing of broadcasted sports video by intermodal collaboration," IEEE Trans. on Multimedia, vol. 4, pp. 68--75, March 2002.
[17]
N. Nitta, N. Babaguchi, and T. Kitahashi, "Generating semantic descriptions of broadcasted sports video based on structure of sports game," Multimedia Tools and Applications, vol. 25, pp. 59--83, Jan. 2005.
[18]
N. Nitta and N. Babaguchi, "Automatic story segmentation of closed-caption text for semantic content analysis of broadcasted sports video," Proc. of 8th International Workshop on MIS'02, pp. 110--116, 2002.
[19]
"http://news.bbc.co.uk/sport1/hi/football/teams/,"
[20]
C. Manning and H. Schutze, "Foundations of statistical natural language processing," The MIT Press, Cambridge, Massachusetts, May 1999.
[21]
dtSearch Corp, "dtsearch 6.50 (6608)," 1991-2005.
[22]
"Hidden markov model toolkit," http://htk.eng.cam.ac.uk/.
[23]
J. Pylkkonen and M. Kurimo, "Duration modeling techniques for continuous speech recognition," Proc. of IEEE ICASSP'04, pp. 385--388, May 2004.
[24]
N. Maddage, et al, "Content-based music structure analysis with applications to music semantics understanding," Proc. of ACM MultiMedia'04, pp. 112--119, Oct. 2004.
[25]
J. Chin, V. Diehl, and K. Norman, "Development of an instrument measuring user satisfaction of the human-computer interface," Proc. of SIGCHI on Human Factors in CS, pp. 213--218, 1998.

Cited By

View all
  • (2024)Enhancing Video Music Recommendation with Transformer-Driven Audio-Visual Embeddings2024 IEEE 5th International Symposium on the Internet of Sounds (IS2)10.1109/IS262782.2024.10704086(1-6)Online publication date: 30-Sep-2024
  • (2023)Exploring the Role of Mathematical Modelling in Automatic Scene Generation amidst Rapid Technological Advances2023 4th International Conference on Data Analytics for Business and Industry (ICDABI)10.1109/ICDABI60145.2023.10629356(391-397)Online publication date: 25-Oct-2023
  • (2022)SmartShots: An Optimization Approach for Generating Videos with Data Visualizations EmbeddedACM Transactions on Interactive Intelligent Systems10.1145/348450612:1(1-21)Online publication date: 4-Mar-2022
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MULTIMEDIA '05: Proceedings of the 13th annual ACM international conference on Multimedia
November 2005
1110 pages
ISBN:1595930442
DOI:10.1145/1101149
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 November 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. automatic video editing
  2. event detection
  3. personalized music sports video
  4. sports video analysis
  5. video content selection

Qualifiers

  • Article

Conference

MM05

Acceptance Rates

MULTIMEDIA '05 Paper Acceptance Rate 49 of 312 submissions, 16%;
Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)10
  • Downloads (Last 6 weeks)2
Reflects downloads up to 27 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Enhancing Video Music Recommendation with Transformer-Driven Audio-Visual Embeddings2024 IEEE 5th International Symposium on the Internet of Sounds (IS2)10.1109/IS262782.2024.10704086(1-6)Online publication date: 30-Sep-2024
  • (2023)Exploring the Role of Mathematical Modelling in Automatic Scene Generation amidst Rapid Technological Advances2023 4th International Conference on Data Analytics for Business and Industry (ICDABI)10.1109/ICDABI60145.2023.10629356(391-397)Online publication date: 25-Oct-2023
  • (2022)SmartShots: An Optimization Approach for Generating Videos with Data Visualizations EmbeddedACM Transactions on Interactive Intelligent Systems10.1145/348450612:1(1-21)Online publication date: 4-Mar-2022
  • (2021)Ajalon: Simplifying the authoring of wearable cognitive assistantsSoftware: Practice and Experience10.1002/spe.298751:8(1773-1797)Online publication date: 18-May-2021
  • (2018)Computational Creative AdvertisementsCompanion Proceedings of the The Web Conference 201810.1145/3184558.3191549(1155-1162)Online publication date: 23-Apr-2018
  • (2018)On Semantic Annotation for Sports Video Highlights by Mining User Comments from Live Broadcast Social NetworkAdvances on Broadband and Wireless Computing, Communication and Applications10.1007/978-3-030-02613-4_33(367-380)Online publication date: 19-Oct-2018
  • (2014)Resource Allocation for Personalized Video SummarizationIEEE Transactions on Multimedia10.1109/TMM.2013.229196716:2(455-469)Online publication date: 1-Feb-2014
  • (2012)Efficient Generation of Dancing Animation Synchronizing with Music Based on Meta Motion GraphsIEICE Transactions on Information and Systems10.1587/transinf.E95.D.1646E95.D:6(1646-1655)Online publication date: 2012
  • (2012)A Generic Framework for Video Annotation via Semi-Supervised LearningIEEE Transactions on Multimedia10.1109/TMM.2012.219194414:4(1206-1219)Online publication date: 1-Aug-2012
  • (2012)Rhythm of Motion Extraction and Rhythm-Based Cross-Media Alignment for Dance VideosIEEE Transactions on Multimedia10.1109/TMM.2011.217240114:1(129-141)Online publication date: 1-Feb-2012
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media