THE SJTU 4K VIDEO SEQUENCE DATASET

Date post: 11-Jun-2015
Upload: shanghai-jiaotong-university
Description:
This is our poster presentation about the SJTU 4K video sequence dataset.

Li Song*, Xun Tang†, Wei Zhang*, Xiaokang Yang*, Pingjian Xia†

* Institute of Image Communication and Network Engineering, Shanghai Jiao Tong University

† National Engineering Research Center of Digital Television, Shanghai, China

Please contact [email protected] for further information.

Introduction

Last year, the International Telecommunication Union (ITU) officially approved ultra-high definition (UHD) TV as a standard, jointly standardizing both 4K and 8K resolutions in ITU-R Recommendation BT.2020. A new coding standard, High Efficiency Video Coding (HEVC), developed by ISO MPEG and ITU-T VCEG, was also approved by the ITU-T on 25 January 2013. Ultra-high resolution 4K video, generally 3840 x 2160, is on the immediate horizon.

Purpose

Before 4K video applications come into service, a great deal of research needs to be conducted, and video sequences play an important role in that work. Our dataset presents a set of 15 new 4K-resolution UHD video sequences to meet the needs of active research on UHD video quality assessment algorithms in the coming years, and to help fully evaluate the coding efficiency of the latest High Efficiency Video Coding (HEVC) standard.


Link

The 15 sequences total about 270 GB. They are provided in two formats: YUV 4:4:4 color sampling at 10 bits per sample, and YUV 4:2:0 color sampling at 8 bits per sample. All sequences can be downloaded from our public server through the following link:

http://medialab.sjtu.edu.cn/web4k/index.html
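As a rough aid for planning downloads and storage, the size of one raw frame in each of the two formats can be estimated from the 3840x2160 resolution stated on this poster. The sketch below is only illustrative: the function name `frame_bytes` is hypothetical, and it assumes planar YUV storage in which samples deeper than 8 bits are padded to 16-bit words, which the poster does not state.

```python
# Chroma samples per luma sample, summed over both chroma planes,
# expressed as (numerator, denominator) to keep the arithmetic exact.
CHROMA_RATIO = {"4:4:4": (2, 1), "4:2:2": (1, 1), "4:2:0": (1, 2)}


def frame_bytes(width, height, chroma, bit_depth):
    """Estimate the size in bytes of one raw planar YUV frame."""
    num, den = CHROMA_RATIO[chroma]
    luma_samples = width * height
    chroma_samples = luma_samples * num // den
    # Assumption: samples above 8 bits are stored in 16-bit words.
    bytes_per_sample = 1 if bit_depth <= 8 else 2
    return (luma_samples + chroma_samples) * bytes_per_sample


if __name__ == "__main__":
    print(frame_bytes(3840, 2160, "4:2:0", 8))   # 12441600
    print(frame_bytes(3840, 2160, "4:4:4", 10))  # 49766400
```

Under these assumptions an 8-bit 4:2:0 frame is about 12.4 MB and a 10-bit 4:4:4 frame about 49.8 MB, so the same frame count costs four times as much storage in the higher-fidelity format.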

Thanks for your attention.

Shooting and processing

All of the UHD video sequences included in our dataset were shot using a Sony F65 camera. The photoelectric signals received from its 8K CMOS sensor were stored in a disk array. The signal was then converted to digital format with the utmost care, ensuring that the video sequences are free of distortion. The raw image data, quantized at 10 bits, were exported in DPX format at a resolution of 3840x2160 and a frame rate of 30 fps. Finally, the DPX image files were combined and converted into YUV files.

[Figure: Sony F65 4K camera]

All scenes were shot in Shanghai, China. Factors such as image texture, image detail, the speed of object motion in the image, light intensity, and camera zooming and panning were taken into account. Some sequences were shot from an overlooking angle using a tripod. The first frame of each video sequence is shown below:

[Figure: first frames of the video sequences]

[Figure: Spatial Information (SI) and Temporal Information (TI) indexes of the 4K video sequences]

Sequence content analysis

Spatial and temporal information are generally used to characterize video content. Content classification was analyzed by computing the SI and TI indexes on the luminance component of each video sequence according to ITU-T P.910.

$$SI = \max_{time}\{\operatorname{std}_{space}[\mathrm{Sobel}(F_n)]\}$$

$$TI = \max_{time}\{\operatorname{std}_{space}[F_n(i,j) - F_{n-1}(i,j)]\}$$
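The two indexes can be sketched in a few lines of Python. This is a minimal pure-Python illustration rather than the authors' implementation: the function names are hypothetical, frames are assumed to be luminance planes given as lists of rows, the Sobel magnitude is evaluated on interior pixels only, and the population standard deviation is used. For real 4K frames an array library would be far faster.

```python
import math
from statistics import pstdev


def sobel_magnitude(frame):
    """Sobel gradient magnitudes of a luminance frame, interior pixels only."""
    h, w = len(frame), len(frame[0])
    out = []
    for i in range(1, h - 1):
        for j in range(1, w - 1):
            # Horizontal gradient: right column minus left column.
            gx = (frame[i - 1][j + 1] + 2 * frame[i][j + 1] + frame[i + 1][j + 1]
                  - frame[i - 1][j - 1] - 2 * frame[i][j - 1] - frame[i + 1][j - 1])
            # Vertical gradient: bottom row minus top row.
            gy = (frame[i + 1][j - 1] + 2 * frame[i + 1][j] + frame[i + 1][j + 1]
                  - frame[i - 1][j - 1] - 2 * frame[i - 1][j] - frame[i - 1][j + 1])
            out.append(math.hypot(gx, gy))
    return out


def spatial_information(frames):
    """SI: maximum over time of the spatial std of the Sobel-filtered frame."""
    return max(pstdev(sobel_magnitude(f)) for f in frames)


def temporal_information(frames):
    """TI: maximum over time of the spatial std of successive frame differences."""
    stds = []
    for prev, cur in zip(frames, frames[1:]):
        diff = [c - p
                for row_c, row_p in zip(cur, prev)
                for c, p in zip(row_c, row_p)]
        stds.append(pstdev(diff))
    return max(stds)


if __name__ == "__main__":
    flat = [[0] * 5 for _ in range(4)]
    edge = [[0, 0, 0, 10, 10] for _ in range(4)]
    print(spatial_information([flat, edge]))
    print(temporal_information([flat, edge]))
```

A flat frame yields SI = 0 because its gradient is zero everywhere, and a sequence whose frames differ only by a uniform brightness offset yields TI = 0 because the difference image has no spatial variation.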
