Application of a Novel Audio Compression Scheme in Automatic Music Recommendation, Digital Rights Management and Audio Fingerprinting

Rapid progress in audio compression technology has contributed to the explosive growth of music available in digital form today. In a reversal of ideas, this work makes use of a recently proposed efficient audio compression scheme to develop three important applications in the context of Music Information Retrieval (MIR) for the effective manipulation of large music databases, namely automatic music recommendation (AMR), digital rights management (DRM) and audio finger-printing for song identification. The performance of these three applications has been evaluated with respect to a database of songs collected from a diverse set of genres.





References:
[1] Rainer Typke, Frans Wiering, Remco C. Veltkamp, "A Survey of Music
Information Retrieval Systems," International Symposium on Music
Information Retrieval, 2006.
[2] A. Roy, G. Saha, "Compression using Joint Optimization based on
Signal Statistics and Quantization Noise", Elesevier Computers and
Electrical Engineering. Submitted for publication.
[3] Shankar Vembu and Stephan Baumann, "A Self-Organizing Map Based
Knowledge Discovery for Music Recommendation Systems", Lecture
notes in Computer Science - Computer Music Modeling and Retrieval,
Springer Berlin /Heidelberg, vol. 3310/2005, pp. 119 -129.
[4] François Pachet and Pierre Roy, "Automatic Generation of Music
Programs", Lecture notes in Computer Science - Principles and Practice
of Constraint Programming CP99, Springer Berline / Heidelberg, vol.
1713/2004, pp. 331 - 345.
[5] François Pachet and J.-J.Aucouturier, "Scaling up music playlist
generation", Proceedings of IEEE International Conference on
Multimedia and Expo, 2002, vol. 1, pp. 105- 108.
[6] Qiong Liu, Reihaneh Safavi-Naini and Nicholas Paul Sheppard, "Digital
rights management for content distribution" Proceedings of the
Australasian information security workshop conference on ACSW
frontiers 2003, vol. 21, pp. 49-58..
[7] Frank Hartung and Friedhelm Ramme, "Digital rights management and
watermarking of multimedia content for M-commerce applications",
IEEE Communication Magazine, vol. 38, no. 11, Nov. 2000, pp. 78-84.
[8] Jong Won Seok and Jin Woo Hong, "Audio watermarking for copyright
protection of digital audio data", IEEE Electronics Letters, vol. 37, no.
1, Jan. 2001, pp. 60-61.
[9] P. Cano et al., "Audio Fingerprinting: Concepts And Applications",
Studies in Computational Intelligence (SCI) 2, Springer-Verlag 2005,
pp.233-245.
[10] Christopher J.C. Burges, Daniel Plastina, John C. Platt, Erin Renshaw,
and Henrique S. Malvar, "Using Audio Fingerprinting for duplicate
detection and thumbnail generation", International Conference on
Audio, Speech and Signal Processing, 2005.
[11] Jason Freeman, "Fast Generation of Audio Signatures to Describe
iTunes Libraries", Journal of New Music Research 2006, vol. 35, no. 1,
pp. 51-61.
[12] Michail K. Tsatsanis and Georgios B. Giannakis, "Principal Component
Filter Banks for Optimal Multiresolution Analysis", IEEE Trans. on
Signal Proc., vol. 43, no.8, Aug.1995.
[13] P.Nasiopoulos, M.Yedlin and R.K.Ward, "A high performance fixedlength
compression method using the Karhunen-Loeve transform", IEEE
Trans. on Consumer Elec., vol. 41, no. 4, Nov. 1995, pp. 1189-1196.
[14] Anil K. Jain, "A Fast Karhunen Loeve Transform for a class of Random
Processes", IEEE Trans. on Communications, vol. 24, no. 9, Sept. 1976,
pp. 1023-1029.
[15] M. Loève, "Probability Theory - I", New York, Springer-Verlag, 1963.
[16] W. Kinsner, "Compression and Its Metrics for Multimedia",
Proceedings of the First IEEE International Conference on Cognitive
Informatics (ICCI02), pp. 1-15.
[17] Ahmed H. Tewfik, Deepen Sinha, and Paul Jorgensen, "On the Optimal
Choice of a Wavelet for Signal Representation", IEEE Trans. on Info.
Theory, vol. 38, no. 2, March 1992.
[18] Deepen Sinha and Ahmed H. Tewfik, "Low Bit Rate Transparent Audio
Compression using Adapted Wavelets", IEEE Trans. on Signal Proc.,
vol. 41, no. 12, Dec. 1993.
[19] Jeff B. Burl, "Estimating the Basis Functions of the Karhunen-Loève
Transform", IEEE Trans. on ASSP, vol. 37, no. 1, Jan. 1989.
[20] D. Pan, "A tutorial on MPEG/audio compression", Multimedia, IEEE,
vol. 2, no. 2, Summer 1995.
[21] David Salomon, "Data Compression - The Complete Reference",
Springer-Verlag, Third Edition, 2004.
[22] Masahiro Nakagawa and Makoto Miyahara, "Generalized Karhunen-
Loeve Transformation I (Theoretical Consideration)", IEEE Trans. on
Communications, vol. COM-35, no. 2, Feb. 1987.
[23] Yingbo Hua and Wanquan Liu, "Generalized Karhunen - Loeve
Transform", IEEE Signal Proc. Letters, vol. 5, no. 6, June 1998.
[24] Gregory W. Wornell, "A Karhunen-Loève like Expansion for 1/f
Proceses via Wavelets", IEEE Trans. on Information Theory, vol.36,
no.4, July 1990, pp.859-861.
[25] N.J.Jayant and P.Noll, "Digital Coding of Waveforms", Prentice Hall,
Engleood Clffs, NJ, 1984.
[26] Bryan E. Usevitch, "A Tutorial on Modern Lossy Wavelet Image
Compression: Foundations of JPEG 2000", IEEE Signal Proc.
Magazine, Sept. 2001, pp.22 - 35.
[27] Yukihiko Yamashita and Hidemitsu Ogawa, "Relative Karhunen-Lohe
Transform", IEEE Trans. on Signal Proc., vol. 44, no. 2, Feb.1996.
[28] Louis L.Scharf and John K. Thomas, "Wiener Filters in Canonical
Coordinates for Transform Coding, Filtering and Quantizing", IEEE
Trans. on Signal Proc., vol. 46, no. 3, March 1998.