Adaptive Group of Pictures Structure Based On the Positions of Video Cuts

In this paper we propose a method which improves the efficiency of video coding. Our method combines an adaptive GOP (group of pictures) structure and the shot cut detection. We have analyzed different approaches for shot cut detection with aim to choose the most appropriate one. The next step is to situate N frames to the positions of detected cuts during the process of video encoding. Finally the efficiency of the proposed method is confirmed by simulations and the obtained results are compared with fixed GOP structures of sizes 4, 8, 12, 16, 32, 64, 128 and GOP structure with length of entire video. Proposed method achieved the gain in bit rate from 0.37% to 50.59%, while providing PSNR (Peak Signal-to-Noise Ratio) gain from 1.33% to 0.26% in comparison to simulated fixed GOP structures.





References:
<p>[1] T. Wiegand, G. J. Sullivan, G. Bjontegard, and A. Luthra, "Overview of
the H.264/AVC video coding standard," IEEE Trans. on CSVT, vol. 13,
no. 7, pp. 560-576, July 2003.
[2] JVT Editors, Draft ITUT Recommendation and final draft international
standard of joint video specification (ITU-T Rec.H.264 |ISO/IEC 14496-
10 AVC), JVT-G050r1, Geneva, May 2003.
[3] D. Zhang and G. Lu, “Review of shape representation and description
techniques,” Pattern Recognition., vol. 37, no. 1, pp. 1–19, 2004.
[4] S.-C. Hsia, S.-C. Cheng and C.-L. Chen, “A real-time chip
implementation for adaptive video coding control,” IEEE Transactions
on Circuits and Systems for Video Technology, vol. 14, no. 8, pp. 1098-
1104, August 2004.
[5] S. Paschalakis and D. Simmons, (2008, April 24), “Detection of gradual
transitions in video sequences” (Online). Available:
http://www.wipo.int/pctdb/en/wo.jsp?WO=2008046748&IA=EP200706
0594&DISPLAY=STATUS
[6] A. Hanjalic, “Shot-boundary detection: unraveled and resolved?,” IEEE
Transactions on Circuits and Systems for Video Technology, vol. 12, no.
2, pp. 90–105, 2002.
[7] L. Krulikovska and J. Polec, “A novel method of comparison frames for
video cuts detection” Proceeding of Redžúr 2011 5th International
Workshop on Speech and Signal Processing, in press.
[8] B. Zatt, M. Porto, J. Scharcanski and S. Bampi, “Gop structure adaptive
to the video content for efficient H.264/AVC encoding,” Proceedings of
2010 IEEE 17th International Conference on Image Processing, pp.
3053-3056, September 2010.</p>