RAW PLENOPTIC VIDEO CODING UNDER HEXAGONAL LATTICE RESOLUTION OF MOTION VECTORS Thuc Nguyen Huu, Vinh Duong Van, Jonghoon Yim, and Byeungwoo Jeon Department of Electrical and Computer Engineering, Sungkyunkwan University, Korea ABSTRACT In raw plenoptic video, the optimal motion searching points mostly follow the hexagonal structure of micro-images. Based on this understanding, we propose a new motion vector resolution, namely the hexagonal lattice (HL) resolution which reflects micro-image structure. The HL resolution can be efficiently represented by HL basis. A study in this paper shows that motion vectors are highly concentrated at hexagonal lattice points, leading to use of the proposed resolution in the context of video compression. In this regard, we demonstrate the compression benefit brought by estimating motion vectors at HL resolution in the VVC codec. Index Termsplenoptic video, lenslet video, motion vector resolution, hexagonal lattice resolution, video coding. 1. INTRODUCTION 1 Plenoptic camera [1] [2] records the 4D plenoptic function [3] (or 4D light field [4]) by inserting an additional microlens array (MLA) between main lens and image sensor as in Fig. 1. The sensor output of plenoptic camera is known as the raw plenoptic image since it yet requires additional pre- processing steps to generate final plenoptic function. The plenoptic image consists of many micro-images (drawn as circles in Fig. 1) which are distributed following a particular structure. Here, the structure of micro-images depends on MLA layout which can be either in rectangular or hexagonal. To increase the fill-factor of a sensor, the MLA often follows the hexagonal layout, leading to hexagonal structure of micro-images. The raw plenoptic data, especially for raw plenoptic video, calls for compression due to its extremely high spatial resolution (it could be up to 7728 x 5368 for sensor output of Lytro camera). In this regard, motion estimation and compensation of plenoptic video are essential enabling techniques to effectively reduce the temporal redundancy. Unlike conventional video, plenoptic video has a unique characteristic to utilize in motion estimation, i.e., its optimum This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Science and ICT (NRF-2020R1A2C2007673). searching points mostly (around 80%) follow the micro- images structure [5][6]. This interesting property is illustrated in Fig. 2(b) which plots the motion vectors (MVs) of raw plenoptic video estimated by full search. The observation that motion vectors are, in general, sparsely located in plenoptic video motivated development of fast search methods [5][6] which can speed up the motion search while maintaining the compression performance. Noting that previous works [5][6] only paid attention to fast motion search algorithms, we suggest that the special motion property is not fully utilized yet, and propose a new motion vector resolution, namely the hexagonal lattice (HL) resolution. In this paper, we study using the HL resolution of the plenoptic video since it may work efficiently with the conventional MV resolutions in video coding standards (e.g., H.264/AVC, H.265/HEVC, H.266/VVC, etc.) considering its better rate distortion performance than the conventional MV resolutions for most cases. We focus on the sensor output of Lytro camera [1] (standard plenoptic camera) and the research for Raytrix camera [2] is left for future work. The paper is organized as follows. In the next section, we present fundamental background of motion vector and motion vector resolution in video coding standards. Following, in Section 3, we propose the HL resolution which is a new MV resolution for raw plenoptic video. A comprehensive study of HL resolution and its usage are also presented. In Section 4, we give experimental results and next we conclude the paper. Fig. 1. Optical structure of plenoptic camera and its sensor output Image sensor Optical axis MLA Main lens micro- image Raw plenoptic data (Lenslet image) object h D v D 1621 978-1-6654-0540-9/22/$31.00 ©2022 IEEE ICASSP 2022 ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | 978-1-6654-0540-9/22/$31.00 ©2022 IEEE | DOI: 10.1109/ICASSP43922.2022.9747504