Motion Prediction and Motion Vector Cost Reduction during Fast Block Motion Estimation in MCTF

In 3D-wavelet video coding framework temporal filtering is done along the trajectory of motion using Motion Compensated Temporal Filtering (MCTF). Hence computationally efficient motion estimation technique is the need of MCTF. In this paper a predictive technique is proposed in order to reduce the computational complexity of the MCTF framework, by exploiting the high correlation among the frames in a Group Of Picture (GOP). The proposed technique applies coarse and fine searches of any fast block based motion estimation, only to the first pair of frames in a GOP. The generated motion vectors are supplied to the next consecutive frames, even to subsequent temporal levels and only fine search is carried out around those predicted motion vectors. Hence coarse search is skipped for all the motion estimation in a GOP except for the first pair of frames. The technique has been tested for different fast block based motion estimation algorithms over different standard test sequences using MC-EZBC, a state-of-the-art scalable video coder. The simulation result reveals substantial reduction (i.e. 20.75% to 38.24%) in the number of search points during motion estimation, without compromising the quality of the reconstructed video compared to non-predictive techniques. Since the motion vectors of all the pair of frames in a GOP except the first pair will have value ±1 around the motion vectors of the previous pair of frames, the number of bits required for motion vectors is also reduced by 50%.




References:
[1] Taubman, D.: Successive refinement of video: fundamental issues, past
efforts and new directions, International Symposium on Visual
Communications and Image Processing (VCIP2003), SPIE volume
5150, pp. 791-805 July 2003.
[2] ISO/IEC JTC/SC29/WG11, n6025: Applications and requirements for
scalable video coding, October 2003.
[3] Taubman, D. and Marcellin: JPEG 2000: Image Compression
Fundamentals, Standards and Practice, Boston: Kulwer Academic
Publishers, 2002.
[4] Jens-Rainer Ohm: Three-Dimension subband coding with motion
compensation, IEEE Trans. On Image Processing, Vol. 3. No. 5,
September 1994.
[5] A. Secker and D. Taubman, "Motion-compensated highly scalable video
compression using an adaptive 3D wavelettransform based on lifting," in
Proceedings of IEEE InternationalConference on Image Processing
(ICIP -01), vol. 2, pp. 1029-1032, Thessaloniki, Greece, October 2001.
[6] Secker and D. Taubman: Lefting-based invertible motion adaptive
transform (LIMAT) frame work for highly scalable video compression,
IEEE Trans. Image Procesing. 2004.
[7] R. Xiong, et al: Exploiting temporal correlation with adaptive block size
motion alignment for 3D wavelet coding, SPIE/IEEE Visual
Communication and Image Processing (VCIP2004), San Jose,
Califronia, USA, Jan 2004.
[8] J. Barbarien, Y. Andreopoulos, A. Munteanu, P.Schelkens and J.
Cornelis: Coding of motion vectors produced by wavelet domain
motion estimation.
[9] J. R. Jain and A. K. Jain: Displacement measurement and its application
in inter-frame image coding, IEEE Trans. Commun., vol COM-29, pp-
1799-1808, Dec. 1981.
[10] R. Li, B. Zeng and M L Liou: A new three-step search algorithm for
block motion estimation, IEEE Trans. Circuits. Syst. Video Technology,
Vol. 4, pp 438-442, Aug. 1994.
[11] L.M. Po and W. C. Ma: A novel four step search algorithm for fast block
motion estimation, IEEE Trans. Circuits System Video Technology, Vol.
6, pp 313-317, June 1996.
[12] L. K. Liu and E. Feig: A block based gradient decent search algorithm
for block motion estimation in video coding, IEEE Trans. Circuits
System Video Technology, Vol. 6 pp 419-423 Aug. 1996.
[13] J. Lu and M. L. Liou: A simple and efficient search algorithm for block
matching motion estimation, IEEE Trans. Circuits System Video
Technology, Vol.7, pp 429-433, Apr. 1997.
[14] S. Zhu and K. K. Ma: A new diamond search algorithm for fast block
matching motion estimation, IEEE Trans. Image Processing, Vol.. 9, pp
287-290, Feb 2000.
[15] Jong Chul Ye and Mihaela van der Schaar: Fully Scalable 3_D
Overcomplete Wavelet Video Coding using Adaptive Motion
Compensated Temporal Filetering, VCIP 2003.
[16] P. Chen and J. W. Woods, "Bidirectional MC-EZBC with lifting
implementation," IEEE Transactions on Circuits and Systems for Video
Technology, vol. 14, no. 10, pp. 1183-1194, 2004.
[17] MC-EZBC software available at
http://www.cipr.rpi.edu/research/mcezbc/.
[18] A. Golwelkar and J. W. Woods, "Scalable video compression using
longer motion compensated temporal filters," in Visual Communications
and Image Processing, T. Ebrahimi and T. Sikora, Eds., vol. 5150 of
Proceedings of SPIE, pp. 1406-1416, Lugano, Switzerland, July 2003.
[19] M. Flierl and B. Girod, "Video coding with motion compensated lifted
wavelet transforms," Signal Processing: Image Communication, vol. 19,
no. 7, pp. 561-575, 2004.
[20] P. Chen and J.W.Woods, "Bidirectional MC-EZBC with lifting
implementation," IEEE Transactions on Circuits and Systems for Video
Technology, vol. 14, no. 10, pp. 1183-1194, 2004.
[21] V. Bottreau, M. B'eneti`ere, B. Felts, and B. Pesquet-Popescu, "A fully
scalable 3D subband video codec," in Proceedings of IEEE International
Conference on Image Processing (ICIP -01), vol. 2, pp. 1017-1020,
Thessaloniki, Greece, October 2001.
[22] D. S. Turaga and M. van der Schaar, "Wavelet coding for video
streaming using new unconstrained motion compensated temporal
filtering," in Proceedings of International Thyrrhenian Workshop on
Digital Communications. (IWDC -02), pp. 41-48, Capri, Italy,
September 2002, Advanced Methods for Multimedia Signal Processing.
[23] D. S. Turaga, M. van der Schaar, Y. Andreopoulos, A. Munteanu, and P.
Schelkens, "Unconstrained motion compensated temporal filtering
(UMCTF) for efficient and flexible interframe wavelet video coding,"
Signal Processing: ImageCommunication, vol. 20, no. 1, pp. 1-19, 2005.
[24] Y. Wang, S. Cui, and J. E. Fowler, "3D video coding with redundantwavelet
multihypothesis," IEEE Transactions on Circuits and Systems
for Video Technology, vol. 16, no. 2, pp. 166-177, 2006.
[25] M. Trocan, C. Tillier, B. Pesquet-Popescu, and M. van der Schaar, "A 5-
band temporal lifting scheme for video surveillance," in Proceedings of
the 8th IEEE Workshop on Multimedia Signal Processing (MMSP -06),
pp. 278-281, Victoria, BC, Canada, October 2006.
[26] James E. Fowler1 and B'eatrice Pesquet-Popescu2, "An Overview on
Wavelets in Source Coding, Communications, and Networks" EURASIP
Journal on Image and Video Processing, pp. 1-27, April 2007.