Purpose: Invalidity Analysis


Patent: EP2347591B1
Filed: 2008-10-03
Issued: 2020-04-08
Patent Holder: (Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd
Inventor(s): Peisong Chen, Yan Ye, Marta Karczewicz

Title: Video coding with large macroblocks




Disclaimer: The promise of Apex Standards Pseudo Claim Charting (PCC) is not to replace expert opinion but to provide due diligence and transparency prior to high precision charting. PCC conducts aggressive mapping (based on Broadest Reasonable, Ordinary or Customary Interpretation and Multilingual Translation) between a target patent's claim elements and other documents (potential technical standard specification or prior arts in the same or across different jurisdictions), therefore allowing for a top-down, apriori evaluation, with which, stakeholders can assess standard essentiality (potential strengths) or invalidity (potential weaknesses) quickly and effectively before making complex, high-value decisions. PCC is designed to relieve initial burden of proof via an exhaustive listing of contextual semantic mapping as potential building blocks towards a litigation-ready work product. Stakeholders may then use the mapping to modify upon shortlisted PCC or identify other relevant materials in order to formulate strategy and achieve further purposes.

Click on references to view corresponding claim charts.


GroundReferencesOwner of the ReferenceTitleSemantic MappingChallenged Claims
12457
1KR100809298B1

(이교혁, 2008)
(Original Assignee) 삼성전자주식회사     플래그 인코딩 방법, 플래그 디코딩 방법, 및 상기 방법을이용한 장치 block pattern block pattern
length code 워드를
XX
2US20080002770A1

(Kemal Ugur, 2008)
(Original Assignee) Nokia Oyj     

(Current Assignee)
Nokia Oyj
Methods, apparatus, and a computer program product for providing a fast inter mode decision for video encoding in resource constrained devices video blocks first encoding mode
square partitions, four square partitions second partition, first partition
XXXX
3US20060203905A1

(Shih-Chang Hsia, 2006)
(Original Assignee) National Kaohsiung First University of Science and Technology     

(Current Assignee)
National Kaohsiung First University of Science and Technology
Video coding system minimum size, minimum size value horizontal resolution
maximum size processing speed
second syntax element video encoder
length code partial data
block pattern two blocks, same time
XX
4US7362804B2

(Pavel Novotny, 2008)
(Original Assignee) LSI Corp     

(Current Assignee)
Avago Technologies International Sales Pte Ltd
Graphical symbols for H.264 bitstream syntax elements first syntax, syntax information syntax elements
maximum size value maximum values
XXXX
5US7289674B2

(Marta Karczewicz, 2007)
(Original Assignee) Nokia Oyj     

(Current Assignee)
Nokia Technologies Oy
Spatial prediction based intra coding square partitions ascending order
block pattern second blocks
XX
6US6980596B2

(Limin Wang, 2005)
(Original Assignee) Arris Technology Inc     

(Current Assignee)
Google Technology Holdings LLC
Macroblock level adaptive frame/field coding for digital video content four square partitions processing two
readable medium said blocks
rectangular partitions frame mode
XX
7US6795584B2

(Marta Karczewicz, 2004)
(Original Assignee) Nokia Oyj     

(Current Assignee)
Nokia Technologies Oy
Context-based adaptive variable length coding for adaptive block transforms readable medium comprising instructions software product
readable medium readable medium
square partitions entropy coding
XX
8JP2003319394A

(Kuhn Peter, 2003)
(Original Assignee) Sony Corp; ソニー株式会社     符号化装置および方法、復号装置および方法、記録媒体、並びにプログラム generating syntax information 生成ステップ
rectangular partitions プログラム, 前記圧縮
four square partitions field flag
second syntax 手段と
length code 逆符号
XX
9US6968012B1

(Jens Meggers, 2005)
(Original Assignee) Firepad Inc     

(Current Assignee)
Nordic Interactive Technologies LLC
Methods for encoding digital video for decoding on low performance devices block pattern corresponding blocks
generate syntax information identifying means
second syntax element video encoder
video blocks memory block
XXXX
10US6959116B2

(Ugur Sezer, 2005)
(Original Assignee) EMC Corp     

(Current Assignee)
EMC Corp
Largest magnitude indices selection for (run, level) encoding of a block coded picture square partitions transform coefficients
second syntax entropy encoding
length code length code
XX
11US6529634B1

(Kadayam Thyagarajan, 2003)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Contrast sensitive variance based adaptive block size DCT image compression block pattern minimum block size
video blocks, four square partitions transforming means, main data
readable medium readable medium, said blocks
generate syntax information, generating syntax information Huffman code
integer multiple th column
XXXX
12US6539124B2

(Sriram Sethuraman, 2003)
(Original Assignee) Sarnoff Corp     

(Current Assignee)
MediaTek Inc
Quantizer selection based on region complexities derived using a rate distortion model minimum size value absolute differences
readable medium readable medium
XX
13US6215910B1

(Navin Chaddha, 2001)
(Original Assignee) Microsoft Corp     

(Current Assignee)
V-XSTREAM Inc ; Microsoft Technology Licensing LLC
Table-based compression with embedded coding square partitions transform coefficients
readable medium readable medium
generate syntax information embedded code
readable medium comprising instructions first stage
XX
14JPH08205140A

(Koji Hirabayashi, 1996)
(Original Assignee) Canon Inc; キヤノン株式会社     画像圧縮装置 maximum size, maximum size value ブロックサイズ, 小さいサイズ
second syntax 手段と
XX
15US5107345A

(Chong U. Lee, 1992)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Adaptive block size image compression method and system video blocks inverse discrete cosine transform, compressed image
generate syntax information, generating syntax information Huffman code
rectangular partitions count values
length code length code
XXXX
16JP2008219870A

(Teo Han Boon, 2008)
(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     動画像符号化方法及び動画像符号化装置 maximum size, minimum size 小さいサイズ, 大きいサイズ
rectangular partitions プログラム
second syntax 手段と
XX
17WO2008086316A2

(Marta Karczewicz, 2008)
(Original Assignee) Qualcomm Incorporated     Variable length coding techniques for coded block patterns readable medium comprising instructions readable medium comprising instructions
block pattern block pattern
generating syntax information video coding
XX
18WO2008054688A1

(Debargha Mukherjee, 2008)
(Original Assignee) Hewlett-Packard Development Company, L.P.     Method for decomposing a video sequence frame second syntax different motion vectors
square partitions, four square partitions th partition
XX
19WO2008027192A2

(Oscar Divorra Escoda, 2008)
(Original Assignee) Thomson Licensing     Methods and apparatus for reduced resolution partitioning syntax information syntax information
second syntax video encoding
square partitions, four square partitions n partitions
generating syntax information video coding
minimum size value slice level
XXXX
20US20080049834A1

(Thomas Holcomb, 2008)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Sub-block transform coding of prediction residuals video blocks motion prediction
readable medium readable medium
second syntax video encoding
second syntax element video encoder
XXXX
21WO2008016605A2

(Oscar Divorra Escoda, 2008)
(Original Assignee) Thomson Licensing     Adaptive geometric partitioning for video decoding square partitions, four square partitions th partition
length code based coding
generating syntax information video coding
XX
22WO2008002881A2

(Harinath Garudadri, 2008)
(Original Assignee) Qualcomm Incorporated     Reduction of errors during computation of inverse discrete cosine transform readable medium comprising instructions readable medium comprising instructions
video blocks inverse discrete cosine transform
programmable processor programmable processor
XXXXX
23WO2007117711A2

(Cheng Chang, 2007)
(Original Assignee) Microsoft Corporation     Dynamic selection of motion estimation search ranges and extended motion vector ranges square partitions transform coefficients
first syntax prediction residuals
integer multiple frequency transform
second syntax element vertical components
second syntax entropy encoding
maximum size, maximum size value current picture
minimum size value absolute values
XX
24WO2007115325A2

(Fang Shi, 2007)
(Original Assignee) Qualcomm Incorporated     Apparatus and method of enhanced frame interpolation in video compression readable medium comprising instructions readable medium comprising instructions
integer multiple receiving encoded data
XX
25WO2007115126A1

(Tao Tian, 2007)
(Original Assignee) Qualcomm Incorporated     Adaptive encoder-assisted frame rate up conversion readable medium readable medium
second syntax video encoding
block pattern block pattern
XX
26WO2007112417A2

(Yiliang Bao, 2007)
(Original Assignee) Qualcomm Incorporated     Method and system for coding and decoding information associated with video compression square partitions transform coefficients
readable medium readable medium
block pattern block pattern
XX
27US20070297518A1

(Woo-jin Han, 2007)
(Original Assignee) Samsung Electronics Co Ltd     

(Current Assignee)
Samsung Electronics Co Ltd
Flag encoding method, flag decoding method, and apparatus thereof block pattern block pattern
minimum size minimum size
video blocks flag value
XXXX
28US20070160303A1

(Onur Guleryuz, 2007)
(Original Assignee) Docomo Communications Labs USA Inc     

(Current Assignee)
NTT Docomo Inc
Geometrical image representation and compression rectangular partitions image processing operation
square partitions entropy coding
XX
29US20070230563A1

(Tao Tian, 2007)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Adaptive encoder-assisted frame rate up conversion readable medium readable medium
second syntax video encoding
block pattern block pattern
XX
30JP2008092150A

(Tadashi Kaneko, 2008)
(Original Assignee) Canon Inc; キヤノン株式会社     画像符号化装置、画像符号化方法 rectangular partitions プログラム
second syntax 手段と
XX
31CN1972449A

(金壹求, 2007)
(Original Assignee) 首尔国立大学校产学协力财团; 三星电子株式会社     用于在单一描述和多描述之间切换的设备和方法 minimum size value 的频率特性
size NxN 余弦变换
generating syntax information 从而产生
XX
32US20070171982A1

(Jens Meggers, 2007)
(Original Assignee) WIMAXIMUM LLC     

(Current Assignee)
Nordic Interactive Technologies LLC ; WIMAXIMUM LLC
Methods for encoding digital video for decoding on low performance devices block pattern corresponding blocks
video blocks memory block
XXXX
33EP1771008A2

(Kasman E. Thomas, 2007)
(Original Assignee) Micro Focus Software Inc     

(Current Assignee)
Oracle International Corp
Adaptive method and system for encoding digital images for the internet minimum size value predetermined threshold value
rectangular partitions color values
XX
34WO2007027402A2

(Ankur Shah, 2007)
(Original Assignee) Micronas Usa, Inc.     Multi-stage cabac decoding pipeline first syntax, syntax information syntax elements
length code clock cycle
integer multiple n bits
XXXX
35JP2008011431A

(Hideji Michinaka, 2008)
(Original Assignee) Toshiba Corp; 株式会社東芝     画像符号化装置 generating syntax information 発生符号量, 発生確率
second syntax 手段と
XX
36WO2006128076A2

(Shu Xiao, 2006)
(Original Assignee) Qualcomm Incorporated     Fixed point integer division techniques for ac/dc prediction in video coding devices readable medium comprising instructions readable medium comprising instructions
generating syntax information video coding
X
37CN101189641A

(I·戈里, 2008)
(Original Assignee) 布雷克成像有限公司     编码数字图像的像素或体素的方法及处理数字图像的方法 minimum size, minimum size value 数值数据
generate syntax information, generating syntax information 处理设
XX
38US20060233254A1

(Bae-keun Lee, 2006)
(Original Assignee) Samsung Electronics Co Ltd     

(Current Assignee)
Samsung Electronics Co Ltd
Method and apparatus for adaptively selecting context model for entropy coding square partitions entropy coding
video blocks generates data
syntax information, generating syntax information layer decoder
block pattern block pattern
second syntax element video encoder
XXXX
39CN101204092A

(纪尧姆·布瓦松, 2008)
(Original Assignee) 汤姆森许可贸易公司     从低分辨率图像导出高分辨率图像的编码信息的方法以及实施所述方法的编码和解码设备 length code 第二编码装置
minimum size, minimum size value 的尺寸
XX
40US20060120450A1

(Woo-jin Han, 2006)
(Original Assignee) Samsung Electronics Co Ltd     

(Current Assignee)
Samsung Electronics Co Ltd
Method and apparatus for multi-layered video encoding and decoding rectangular partitions intra-prediction modes
square partitions transform coefficients
first syntax prediction residuals
second syntax element video encoder
XX
41US20080025391A1

(Peter Amon, 2008)
(Original Assignee) Siemens AG     

(Current Assignee)
Siemens AG
Transcoding Method And Device first syntax, syntax information syntax elements, network element
second syntax video encoding
XXXX
42US20070206679A1

(Chong Lim, 2007)
(Original Assignee) Panasonic Corp     

(Current Assignee)
Dolby International AB
Motion Image Encoding Method and Motion Image Decoding Method minimum size value predetermined threshold value
generating syntax information video coding
XX
43US20060126740A1

(Teng Lin, 2006)
(Original Assignee) Micronas USA Inc     

(Current Assignee)
TDK Micronas GmbH
Shared pipeline architecture for motion vector prediction and residual decoding first syntax element, second syntax element motion vector differences
first syntax data state
length code bit data
video blocks d log
XXXX
44US20050025246A1

(Thomas Holcomb, 2005)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Decoding jointly coded transform type and subblock pattern information first syntax, syntax information syntax elements
readable medium readable medium
video blocks first VLC
XXXX
45US20050053292A1

(Kunal Mukerjee, 2005)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Advanced bi-directional predictive coding of interlaced video block pattern second motion vector predictor
readable medium readable medium
second syntax element video encoder
XX
46CN1810037A

(尹鹏, 2006)
(Original Assignee) 汤姆森许可贸易公司     帧间的快速模式确定编码 video blocks 一种视频编码方法, 针对帧
minimum size 尺寸的
XXXX
47US20050135484A1

(Yung-Lyul Lee, 2005)
(Original Assignee) Samsung Electronics Co Ltd; Sejong University Industry-Academy Cooperation Group     

(Current Assignee)
Samsung Electronics Co Ltd ; Sejong University Industry-Academy Cooperation Group
Method of encoding mode determination, method of motion estimation and encoding apparatus maximum size, maximum size value current picture
minimum size other ones
XX
48CN1857001A

(P·托思, 2006)
(Original Assignee) Amt先进多媒体科技公司     混合视频压缩方法 video blocks 块进行编码
size NxN 余弦变换
generating syntax information 从而产生
minimum size, minimum size value 的尺寸, 尺寸的
XXXX
49WO2004104930A2

(Peter Toth, 2004)
(Original Assignee) Amt Advanced Multimedia Technology Ab     Hybrid video compression method first syntax, programmable processor processing element, data elements
block pattern subsequent frame
square partitions entropy coding
length code quality values
video blocks given block
XXXXX
50US20070053436A1

(Lambertus Van Eggelen, 2007)
(Original Assignee) Koninklijke Philips NV     

(Current Assignee)
Koninklijke Philips NV
Encoding video information using block based adaptive scan order maximum size corresponding data blocks
video blocks preceding image
XXXX
51US20040234144A1

(Kazuo Sugimoto, 2004)
(Original Assignee) NTT Docomo Inc     

(Current Assignee)
NTT Docomo Inc
Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program maximum size predetermined shape
minimum size value absolute values
XX
52US20050276323A1

(Alexey Martemyanov, 2005)
(Original Assignee) Vanguard Video LLC     

(Current Assignee)
Beamr Imaging Ltd
Real-time video coding/decoding length code high compression efficiency, height h
second syntax, first syntax motion vector refinement, performing texture
square partitions transform coefficients, entropy coding
block pattern block pattern
XX
53CN101087424A

(孙式军, 2007)
(Original Assignee) 夏普株式会社     图像内预测模式估计、通信和组织的方法和系统 rectangular partitions 的左侧
minimum size value 平均值
XX
54CN1457196A

(薛金柱, 2003)
(Original Assignee) 北京工业大学     基于时空域相关性运动矢量预测的视频编码方法 minimum size value 一个视频
generating syntax information 基于运动
XX
55EP1478190A1

(Satoru c/o NTT DoCoMo Inc. ADACHI, 2004)
(Original Assignee) NTT Docomo Inc     

(Current Assignee)
NTT Docomo Inc
Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program maximum size predetermined shape
minimum size value absolute values
XX
56CN1647541A

(J·莱恩马, 2005)
(Original Assignee) 诺基亚有限公司     用于在视频编码系统中指示量化器参数的方法与设备 minimum size value 一个视频
first syntax 多个分
XX
57US20030179940A1

(Chih-Lung Lin, 2003)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Efficient macroblock header coding for video compression block pattern length coding table, block pattern
four square partitions, minimum size value chrominance values
readable medium readable medium
rectangular partitions color values
length code length code
XX
58US20030156648A1

(Thomas Holcomb, 2003)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Sub-block transform coding of prediction residuals video blocks, four square partitions motion prediction, main data
second syntax video encoding
second syntax element video encoder
integer multiple n bits
XXXX
59US7200275B2

(Sridhar Srinivasan, 2007)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Skip macroblock coding readable medium readable medium
second syntax video encoding
XX
60US7206456B2

(Miska Hannuksela, 2007)
(Original Assignee) Nokia Oyj     

(Current Assignee)
Nokia Technologies Oy
Video coding and decoding syntax information coding elements
second syntax video encoding
XXXX
61JP2002359852A

(Sen Bun Chun, 2002)
(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     画像予測復号化装置及び方法 minimum size 量子化ステップサイズ
second syntax 手段と
length code 長復号
XX
62JPH11164305A

(Kotaro Asai, 1999)
(Original Assignee) Mitsubishi Electric Corp; 三菱電機株式会社     動画像符号化方法、動画像符号化装置および動画像復号装置 rectangular partitions のクラス
minimum size value 示す複数
XX
63US5748789A

(Ming-Chieh Lee, 1998)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Transparent block skipping in object-based video coding systems readable medium readable medium
block pattern block pattern
generating syntax information video coding
length code length code
XX
64US6125143A

(Teruhiko Suzuki, 2000)
(Original Assignee) Sony Corp     

(Current Assignee)
Ironworks Patents LLC
Picture encoding device and method thereof, picture decoding device and method thereof, and recording medium square partitions transform coefficients
maximum size, minimum size mean value
XX
65US5691770A

(Gerrit J. Keesman, 1997)
(Original Assignee) US Philips Corp     

(Current Assignee)
US Philips Corp
Device and method for coding video pictures integer multiple spatial frequency
rectangular partitions same amplitude
XX
66EP0727910A2

(Max Chien, 1996)
(Original Assignee) Thomson Multimedia SA     

(Current Assignee)
Technicolor SA
Temporal-spatial error concealment apparatus and method for video signal processors readable medium said blocks
block pattern bad block
XX
67US5576767A

(Chong U. Lee, 1996)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Interframe video encoding and decoding system block pattern additional reference
readable medium said blocks
XX
68US5768434A

(Xiaonong Ran, 1998)
(Original Assignee) National Semiconductor Corp     

(Current Assignee)
National Semiconductor Corp
Quadtree-structured walsh transform coding minimum size value predetermined threshold value, absolute values
square partitions second partition, first partition
block pattern two blocks
XX
69US5666461A

(Katsuji Igarashi, 1997)
(Original Assignee) Sony Corp     

(Current Assignee)
Sony Corp
High efficiency encoding and decoding of picture signals and recording medium containing same readable medium said recording medium
video blocks picture portion
XXXX
70US5793897A

(Jae-moon Jo, 1998)
(Original Assignee) Samsung Electronics Co Ltd     

(Current Assignee)
Samsung Electronics Co Ltd
Adaptive variable-length coding and decoding methods for image data square partitions transform coefficients
block pattern length coding table
length code length code
XX
71US5623311A

(Larry Phillips, 1997)
(Original Assignee) Panasonic Corp of North America     

(Current Assignee)
Panasonic Corp of North America
MPEG video decoder having a high bandwidth memory video blocks inverse discrete cosine transform
second syntax element luminance component
four square partitions, minimum size value chrominance values
XXXX
72US5594504A

(Touradj Ebrahimi, 1997)
(Original Assignee) Nokia of America Corp     

(Current Assignee)
Nokia of America Corp
Predictive video coding using a motion vector updating routine video blocks motion estimation unit
maximum size predetermined shape
square partitions, four square partitions value z
XXXX
73US5422676A

(Carsten Herpel, 1995)
(Original Assignee) Deutsche Thomson Brandt GmbH     System for coding an image representative signal block pattern second blocks
readable medium said blocks
XX
74US5091782A

(Edward A. Krause, 1992)
(Original Assignee) Arris Technology Inc     

(Current Assignee)
CIF LICENSING LLC
Apparatus and method for adaptively compressing successive blocks of digital video square partitions transform coefficients
video blocks transforming means
readable medium said blocks
XXXX
75EP1623577A1

(Lambertus A. Van Eggelen, 2006)
(Original Assignee) Koninklijke Philips NV     

(Current Assignee)
Koninklijke Philips NV
Encoding of video information using block based adaptive scan order maximum size corresponding data blocks
video blocks preceding image
XXXX
76JP3910594B2

(芳美 井須, 2007)
(Original Assignee) 三菱電機株式会社     画像符号化装置 syntax information ビットストリーム
second syntax 手段と
XXXX
77GB2382940A

(Miska Hannuksela, 2003)
(Original Assignee) Nokia Oyj     

(Current Assignee)
Nokia Oyj
Encoding objects and background blocks syntax information coding elements
second syntax video encoding
second syntax element video encoder
generating syntax information video coding
XXXX
78JPH082106B2

(洋一 加藤, 1996)
(Original Assignee) 国際電信電話株式会社; 富士通株式会社; 日本電信電話株式会社; 日本電気株式会社     動画像信号のハイブリツド符号化方式 maximum size, maximum size value ブロックサイズ
generate syntax information 変換部
XX




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
KR100809298B1

Filed: 2006-08-16     Issued: 2008-03-04

플래그 인코딩 방법, 플래그 디코딩 방법, 및 상기 방법을이용한 장치

(Original Assignee) 삼성전자주식회사     

이교혁, 이배근, 이태미, 한우진
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
KR100809298B1
CLAIM 7
제1항에 있어서 , 상기 플래그는 Cbp(coded block pattern (block pattern) ) 또는 잔차 예측 플래그인 플래그 인코딩 장치 .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
KR100809298B1
CLAIM 7
제1항에 있어서 , 상기 플래그는 Cbp(coded block pattern (block pattern) ) 또는 잔차 예측 플래그인 플래그 인코딩 장치 .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20080002770A1

Filed: 2006-06-30     Issued: 2008-01-03

Methods, apparatus, and a computer program product for providing a fast inter mode decision for video encoding in resource constrained devices

(Original Assignee) Nokia Oyj     (Current Assignee) Nokia Oyj

Kemal Ugur, Jani Lainema
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (first encoding mode) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (second partition, first partition, th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20080002770A1
CLAIM 1
. A method of selecting a mode for encoding a macroblock using motion compensated prediction , the method comprising : extracting at least one motion vector from at least one macroblock of a video frame , the at least one macroblock comprising a first plurality of inter modes having a plurality of block sizes ;
generating at least one prediction for the macroblock based on the at least one motion vector by analyzing a reference frame ;
and comparing a distortion value to a first predetermined threshold and selecting a first encoding mode (video blocks) among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold .

US20080002770A1
CLAIM 7
. A method according to claim 4 , further comprising : determining whether the summation of a first distortion value and a second distortion value exceeds a fourth predetermined threshold , wherein the first distortion value and the second distortion value correspond to a first partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of a second block size among the plurality of block sizes ;
estimating the motion corresponding to the first partition when the summation of the first distortion value and the second distortion value exceeds the fourth predetermined threshold ;
and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the first partition when the summation of the first distortion value and the second distortion value is less than the fourth predetermined threshold .

US20080002770A1
CLAIM 8
. A method according to claim 7 , further comprising : determining whether the summation of a third distortion value and a fourth distortion value exceeds the fourth predetermined threshold , wherein the third distortion value and the fourth distortion value correspond to a second partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of the second block size among the plurality of block sizes ;
estimating the motion corresponding to the second partition when the summation of the third distortion value and the fourth distortion value exceeds the fourth predetermined threshold ;
and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the second partition when the summation of the third distortion value and the fourth distortion value is less than the fourth predetermined threshold .

US20080002770A1
CLAIM 10
. A method according to claim 9 , further comprising : determining whether the summation of a sixth distortion value and a seventh distortion value exceeds the fourth predetermined threshold , wherein the sixth distortion value and the seventh distortion value corresponds to a fourth partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of the third block size among the plurality of block sizes ;
estimating the motion corresponding to the fourth partition when the summation of the sixth distortion value and the seventh distortion value exceeds the fourth predetermined threshold ;
and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the fourth partition when the summation of the sixth distortion value and the seventh distortion value is less than the fourth predetermined threshold .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (first encoding mode) .
US20080002770A1
CLAIM 1
. A method of selecting a mode for encoding a macroblock using motion compensated prediction , the method comprising : extracting at least one motion vector from at least one macroblock of a video frame , the at least one macroblock comprising a first plurality of inter modes having a plurality of block sizes ;
generating at least one prediction for the macroblock based on the at least one motion vector by analyzing a reference frame ;
and comparing a distortion value to a first predetermined threshold and selecting a first encoding mode (video blocks) among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (first encoding mode) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (second partition, first partition, th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20080002770A1
CLAIM 1
. A method of selecting a mode for encoding a macroblock using motion compensated prediction , the method comprising : extracting at least one motion vector from at least one macroblock of a video frame , the at least one macroblock comprising a first plurality of inter modes having a plurality of block sizes ;
generating at least one prediction for the macroblock based on the at least one motion vector by analyzing a reference frame ;
and comparing a distortion value to a first predetermined threshold and selecting a first encoding mode (video blocks) among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold .

US20080002770A1
CLAIM 7
. A method according to claim 4 , further comprising : determining whether the summation of a first distortion value and a second distortion value exceeds a fourth predetermined threshold , wherein the first distortion value and the second distortion value correspond to a first partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of a second block size among the plurality of block sizes ;
estimating the motion corresponding to the first partition when the summation of the first distortion value and the second distortion value exceeds the fourth predetermined threshold ;
and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the first partition when the summation of the first distortion value and the second distortion value is less than the fourth predetermined threshold .

US20080002770A1
CLAIM 8
. A method according to claim 7 , further comprising : determining whether the summation of a third distortion value and a fourth distortion value exceeds the fourth predetermined threshold , wherein the third distortion value and the fourth distortion value correspond to a second partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of the second block size among the plurality of block sizes ;
estimating the motion corresponding to the second partition when the summation of the third distortion value and the fourth distortion value exceeds the fourth predetermined threshold ;
and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the second partition when the summation of the third distortion value and the fourth distortion value is less than the fourth predetermined threshold .

US20080002770A1
CLAIM 10
. A method according to claim 9 , further comprising : determining whether the summation of a sixth distortion value and a seventh distortion value exceeds the fourth predetermined threshold , wherein the sixth distortion value and the seventh distortion value corresponds to a fourth partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of the third block size among the plurality of block sizes ;
estimating the motion corresponding to the fourth partition when the summation of the sixth distortion value and the seventh distortion value exceeds the fourth predetermined threshold ;
and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the fourth partition when the summation of the sixth distortion value and the seventh distortion value is less than the fourth predetermined threshold .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (first encoding mode) .
US20080002770A1
CLAIM 1
. A method of selecting a mode for encoding a macroblock using motion compensated prediction , the method comprising : extracting at least one motion vector from at least one macroblock of a video frame , the at least one macroblock comprising a first plurality of inter modes having a plurality of block sizes ;
generating at least one prediction for the macroblock based on the at least one motion vector by analyzing a reference frame ;
and comparing a distortion value to a first predetermined threshold and selecting a first encoding mode (video blocks) among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20060203905A1

Filed: 2005-03-14     Issued: 2006-09-14

Video coding system

(Original Assignee) National Kaohsiung First University of Science and Technology     (Current Assignee) National Kaohsiung First University of Science and Technology

Shih-Chang Hsia
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (two blocks, same time) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (processing speed) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size (horizontal resolution) value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20060203905A1
CLAIM 6
. A video coding system with the apparatus for interface to apply the new memory addressing , comprising The memory addressing control , address decoder and internal storage cell capable of being merged into one body as a memory core to implement full video encoder (second syntax element) ;
the system includes six address generators (AG 1 ˜AG 6) ;
the internal storage cell being consisted with input memory M 1 and frame memory M 2 .

US20060203905A1
CLAIM 7
. The video coding system as claimed in claim 6 , further comprising the timing schedule . MB 1 , MB 2 . . . are continuous macro-blocks . For real-time processing , a pipelined schedule could be employed . In the first time , the motion estimation for MB 1 block is performed , where other MBs are idle . As motion vector of MB 1 is found , the DCT processor can transform the differential values of input pixel (from M 1 memory) and the reference frame (from M 2 memory) in the second time . At the same time (block pattern) , MB 2 is processed in the motion estimation engine . In the third time , the DCT coefficients of MB 1 could be performed by quantization and de-quantization procedures . Then the pixels are reconstructed from IDCT , and written into the frame memory for motion compensation . Simultaneously , the motion estimation for MB 3 and DCT transformation for MB 2 are fulfilled .

US20060203905A1
CLAIM 8
. The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, minimum size value) . Two banks are executed with interlaced operations for real-time data access .

US20060203905A1
CLAIM 12
. The video coding system as claimed in claim 8 , wherein there are two blocks (block pattern) delaying between input and output in the frame memory . The write address (WA) could easily find from the read address (RA) added the offset value .

US20060203905A1
CLAIM 20
. The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (two blocks, same time) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (processing speed) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size (horizontal resolution) value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20060203905A1
CLAIM 6
. A video coding system with the apparatus for interface to apply the new memory addressing , comprising The memory addressing control , address decoder and internal storage cell capable of being merged into one body as a memory core to implement full video encoder (second syntax element) ;
the system includes six address generators (AG 1 ˜AG 6) ;
the internal storage cell being consisted with input memory M 1 and frame memory M 2 .

US20060203905A1
CLAIM 7
. The video coding system as claimed in claim 6 , further comprising the timing schedule . MB 1 , MB 2 . . . are continuous macro-blocks . For real-time processing , a pipelined schedule could be employed . In the first time , the motion estimation for MB 1 block is performed , where other MBs are idle . As motion vector of MB 1 is found , the DCT processor can transform the differential values of input pixel (from M 1 memory) and the reference frame (from M 2 memory) in the second time . At the same time (block pattern) , MB 2 is processed in the motion estimation engine . In the third time , the DCT coefficients of MB 1 could be performed by quantization and de-quantization procedures . Then the pixels are reconstructed from IDCT , and written into the frame memory for motion compensation . Simultaneously , the motion estimation for MB 3 and DCT transformation for MB 2 are fulfilled .

US20060203905A1
CLAIM 8
. The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, minimum size value) . Two banks are executed with interlaced operations for real-time data access .

US20060203905A1
CLAIM 12
. The video coding system as claimed in claim 8 , wherein there are two blocks (block pattern) delaying between input and output in the frame memory . The write address (WA) could easily find from the read address (RA) added the offset value .

US20060203905A1
CLAIM 20
. The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US7362804B2

Filed: 2003-11-24     Issued: 2008-04-22

Graphical symbols for H.264 bitstream syntax elements

(Original Assignee) LSI Corp     (Current Assignee) Avago Technologies International Sales Pte Ltd

Pavel Novotny, Guy Cote, Lowell L. Winger, Simon Booth
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (syntax elements) for the coded unit wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value (maximum values) , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US7362804B2
CLAIM 1
. An apparatus comprising : a first circuit configured to generate a decoded video signal and syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) in response to an encoded bitstream ;
and a second circuit configured to generate one or more overlay images in response to said syntax elements , wherein (a) said one or more overlay images are displayed overlaying a corresponding decoded video image , (b) said one or more overlay images comprise one or more graphic symbols representing said syntax elements of said encoded bitstream and (c) said one or more overlay images comprise at least one image selected from the group consisting of (i) an image displaying statistics about said decoded video signal including resolution , frame rate , bit rate , a bit rate graph , bitstream errors , peak signal to noise ratio (PSNR) , average PSNR and a PSNR graph , wherein one or more scales of said bit rate graph and said PSNR graph are dynamically adjusted to display minimum and maximum values (maximum size value) , (ii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing partitioning and encoding information for the respective macroblock is shown , (iii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock size parameter for the respective macroblock and a number representing a quantization parameter for the respective macroblock are shown , and (iv) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock level adaptive frame/field (MBAFF) coding mode of the respective macroblock is shown .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
US7362804B2
CLAIM 1
. An apparatus comprising : a first circuit configured to generate a decoded video signal and syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) in response to an encoded bitstream ;
and a second circuit configured to generate one or more overlay images in response to said syntax elements , wherein (a) said one or more overlay images are displayed overlaying a corresponding decoded video image , (b) said one or more overlay images comprise one or more graphic symbols representing said syntax elements of said encoded bitstream and (c) said one or more overlay images comprise at least one image selected from the group consisting of (i) an image displaying statistics about said decoded video signal including resolution , frame rate , bit rate , a bit rate graph , bitstream errors , peak signal to noise ratio (PSNR) , average PSNR and a PSNR graph , wherein one or more scales of said bit rate graph and said PSNR graph are dynamically adjusted to display minimum and maximum values , (ii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing partitioning and encoding information for the respective macroblock is shown , (iii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock size parameter for the respective macroblock and a number representing a quantization parameter for the respective macroblock are shown , and (iv) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock level adaptive frame/field (MBAFF) coding mode of the respective macroblock is shown .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (syntax elements) for the coded unit , wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value (maximum values) , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US7362804B2
CLAIM 1
. An apparatus comprising : a first circuit configured to generate a decoded video signal and syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) in response to an encoded bitstream ;
and a second circuit configured to generate one or more overlay images in response to said syntax elements , wherein (a) said one or more overlay images are displayed overlaying a corresponding decoded video image , (b) said one or more overlay images comprise one or more graphic symbols representing said syntax elements of said encoded bitstream and (c) said one or more overlay images comprise at least one image selected from the group consisting of (i) an image displaying statistics about said decoded video signal including resolution , frame rate , bit rate , a bit rate graph , bitstream errors , peak signal to noise ratio (PSNR) , average PSNR and a PSNR graph , wherein one or more scales of said bit rate graph and said PSNR graph are dynamically adjusted to display minimum and maximum values (maximum size value) , (ii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing partitioning and encoding information for the respective macroblock is shown , (iii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock size parameter for the respective macroblock and a number representing a quantization parameter for the respective macroblock are shown , and (iv) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock level adaptive frame/field (MBAFF) coding mode of the respective macroblock is shown .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
US7362804B2
CLAIM 1
. An apparatus comprising : a first circuit configured to generate a decoded video signal and syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) in response to an encoded bitstream ;
and a second circuit configured to generate one or more overlay images in response to said syntax elements , wherein (a) said one or more overlay images are displayed overlaying a corresponding decoded video image , (b) said one or more overlay images comprise one or more graphic symbols representing said syntax elements of said encoded bitstream and (c) said one or more overlay images comprise at least one image selected from the group consisting of (i) an image displaying statistics about said decoded video signal including resolution , frame rate , bit rate , a bit rate graph , bitstream errors , peak signal to noise ratio (PSNR) , average PSNR and a PSNR graph , wherein one or more scales of said bit rate graph and said PSNR graph are dynamically adjusted to display minimum and maximum values , (ii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing partitioning and encoding information for the respective macroblock is shown , (iii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock size parameter for the respective macroblock and a number representing a quantization parameter for the respective macroblock are shown , and (iv) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock level adaptive frame/field (MBAFF) coding mode of the respective macroblock is shown .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US7289674B2

Filed: 2003-06-10     Issued: 2007-10-30

Spatial prediction based intra coding

(Original Assignee) Nokia Oyj     (Current Assignee) Nokia Technologies Oy

Marta Karczewicz
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (ascending order) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second blocks) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US7289674B2
CLAIM 8
. The method of claim 1 , wherein the derived prediction modes are representable by a plurality of different integers , each integer corresponding to a derived prediction mode , and wherein the integers are arranged in an ascending order (square partitions) of the integers for providing an ordered set , said method further comprising removing the integers corresponding to the first prediction modes from the ordered set for providing a modified ordered set , so as to allow the decoding device to determine the order of the second group based on the modified ordered set .

US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (block pattern) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (ascending order) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second blocks) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US7289674B2
CLAIM 8
. The method of claim 1 , wherein the derived prediction modes are representable by a plurality of different integers , each integer corresponding to a derived prediction mode , and wherein the integers are arranged in an ascending order (square partitions) of the integers for providing an ordered set , said method further comprising removing the integers corresponding to the first prediction modes from the ordered set for providing a modified ordered set , so as to allow the decoding device to determine the order of the second group based on the modified ordered set .

US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (block pattern) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US6980596B2

Filed: 2002-11-20     Issued: 2005-12-27

Macroblock level adaptive frame/field coding for digital video content

(Original Assignee) Arris Technology Inc     (Current Assignee) Google Technology Holdings LLC

Limin Wang, Rajeev Gandhi, Krit Panusopone, Ajay Luthra
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (frame mode) or into four square partitions (processing two) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US6980596B2
CLAIM 1
. A method of encoding or decoding digital video content , said digital video content comprising a stream of pictures which can each be intra , predicted , or bi-predicted pictures , each of said pictures comprising macroblocks , said method comprising the steps of : selecting to encode or decode a group of neighboring macroblocks in a frame mode (rectangular partitions) or a field mode ;
arranging said group of neighboring macroblocks for processing into frame macroblocks or field macroblocks according to the selected one of said frame mode or said field mode ;
and encoding or decoding said frame macroblocks or said field macroblocks , wherein , the step of arranging said group of neighboring macroblocks for processing into frame macroblocks or field macroblocks includes the step of : in frame mode , jointly processing two (four square partitions) fields in said group of neighboring macroblocks , said group of neighboring macroblocks being divided into frame macroblocks , and each of said frame macroblocks including both top and bottom field pixels ;
in said field mode , separately processing two fields of said group of neighboring macroblocks , said group of neighboring macroblocks being split into field macroblocks , and each of said field macroblocks including either top or bottom field pixels , and wherein said frame macroblocks and said field macroblocks are the same size .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (frame mode) or into four square partitions (processing two) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US6980596B2
CLAIM 1
. A method of encoding or decoding digital video content , said digital video content comprising a stream of pictures which can each be intra , predicted , or bi-predicted pictures , each of said pictures comprising macroblocks , said method comprising the steps of : selecting to encode or decode a group of neighboring macroblocks in a frame mode (rectangular partitions) or a field mode ;
arranging said group of neighboring macroblocks for processing into frame macroblocks or field macroblocks according to the selected one of said frame mode or said field mode ;
and encoding or decoding said frame macroblocks or said field macroblocks , wherein , the step of arranging said group of neighboring macroblocks for processing into frame macroblocks or field macroblocks includes the step of : in frame mode , jointly processing two (four square partitions) fields in said group of neighboring macroblocks , said group of neighboring macroblocks being divided into frame macroblocks , and each of said frame macroblocks including both top and bottom field pixels ;
in said field mode , separately processing two fields of said group of neighboring macroblocks , said group of neighboring macroblocks being split into field macroblocks , and each of said field macroblocks including either top or bottom field pixels , and wherein said frame macroblocks and said field macroblocks are the same size .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US6795584B2

Filed: 2002-10-03     Issued: 2004-09-21

Context-based adaptive variable length coding for adaptive block transforms

(Original Assignee) Nokia Oyj     (Current Assignee) Nokia Technologies Oy

Marta Karczewicz, Justin Ridge
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US6795584B2
CLAIM 12
. An image encoder for receiving image data and providing a bitstream indicative of the image data , characterized by : means for forming at least a block of transform coefficients from the image data , by means for scanning the block of transform coefficients for forming an ordered sequence of transform coefficients from the block , by a software program stored on a computer readable medium for sub-sampling the ordered sequence of transform coefficients in order to form a plurality of sub-sampled sequences of transform coefficients , by means for entropy coding (square partitions) the sub-sampled sequences of transform coefficients for provided signals indicative of the encoded transform coefficients , and by means , for providing the bitstream based on the signals .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US6795584B2
CLAIM 12
. An image encoder for receiving image data and providing a bitstream indicative of the image data , characterized by : means for forming at least a block of transform coefficients from the image data , by means for scanning the block of transform coefficients for forming an ordered sequence of transform coefficients from the block , by a software program stored on a computer readable medium for sub-sampling the ordered sequence of transform coefficients in order to form a plurality of sub-sampled sequences of transform coefficients , by means for entropy coding (square partitions) the sub-sampled sequences of transform coefficients for provided signals indicative of the encoded transform coefficients , and by means , for providing the bitstream based on the signals .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
JP2003319394A

Filed: 2002-04-26     Issued: 2003-11-07

符号化装置および方法、復号装置および方法、記録媒体、並びにプログラム

(Original Assignee) Sony Corp; ソニー株式会社     

Kuhn Peter, Kazufumi Sato, Teruhiko Suzuki, Yoichi Yagasaki, クーン ピーター, 数史 佐藤, 陽一 矢ヶ崎, 輝彦 鈴木
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム, 前記圧縮) or into four square partitions (field flag) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
JP2003319394A
CLAIM 1
【請求項1】 飛び越し走査の画像情報を入力として、 マクロブロックレベルで適応的にフィールドベースまた はフレームベースの符号化処理を施す符号化装置におい て、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換手段と (second syntax) 、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定手段と、 前記直交変換手段の処理結果を量子化する量子化手段 と、 前記量子化手段の処理結果を可逆符号化する可逆符号化 手段とを含むことを特徴とする符号化装置。

JP2003319394A
CLAIM 20
【請求項20】 マクロブロックCのframe/field fla gに関するコンテキストモデルctx fifr flag(C)は、前 記マクロブロックCに隣接するマクロブロックA,Bそ れぞれのframe/field flag (four square partitions) の値a,bを用いて次式の ように定義されている ctx fifr flag(C)=a+2b ことを特徴とする請求項19に記載の符号化装置。

JP2003319394A
CLAIM 27
【請求項27】 飛び越し走査の画像情報を入力とし て、マクロブロックレベルで適応的にフィールドベース またはフレームベースの符号化処理を施すプログラム (rectangular partitions) で あって、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換ステップと、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定ステップと、 前記直交変換ステップの処理結果を量子化する量子化ス テップと、 前記量子化ステップの処理結果を可逆符号化する可逆符 号化ステップとを含むことを特徴とするコンピュータが 読み取り可能なプログラムが記録されている記録媒体。

JP2003319394A
CLAIM 29
【請求項29】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置において、 前記圧縮 (rectangular partitions) 画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号手段と、 前記量子化されている直交変換係数を逆量子化する逆量 子化手段と、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換手段と、 前記予測モード情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画像を生成する参照画 像生成手段とを含むことを特徴とする復号装置。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム, 前記圧縮) or into four square partitions (field flag) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (生成ステップ) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
JP2003319394A
CLAIM 1
【請求項1】 飛び越し走査の画像情報を入力として、 マクロブロックレベルで適応的にフィールドベースまた はフレームベースの符号化処理を施す符号化装置におい て、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換手段と (second syntax) 、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定手段と、 前記直交変換手段の処理結果を量子化する量子化手段 と、 前記量子化手段の処理結果を可逆符号化する可逆符号化 手段とを含むことを特徴とする符号化装置。

JP2003319394A
CLAIM 20
【請求項20】 マクロブロックCのframe/field fla gに関するコンテキストモデルctx fifr flag(C)は、前 記マクロブロックCに隣接するマクロブロックA,Bそ れぞれのframe/field flag (four square partitions) の値a,bを用いて次式の ように定義されている ctx fifr flag(C)=a+2b ことを特徴とする請求項19に記載の符号化装置。

JP2003319394A
CLAIM 27
【請求項27】 飛び越し走査の画像情報を入力とし て、マクロブロックレベルで適応的にフィールドベース またはフレームベースの符号化処理を施すプログラム (rectangular partitions) で あって、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換ステップと、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定ステップと、 前記直交変換ステップの処理結果を量子化する量子化ス テップと、 前記量子化ステップの処理結果を可逆符号化する可逆符 号化ステップとを含むことを特徴とするコンピュータが 読み取り可能なプログラムが記録されている記録媒体。

JP2003319394A
CLAIM 29
【請求項29】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置において、 前記圧縮 (rectangular partitions) 画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号手段と、 前記量子化されている直交変換係数を逆量子化する逆量 子化手段と、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換手段と、 前記予測モード情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画像を生成する参照画 像生成手段とを含むことを特徴とする復号装置。

JP2003319394A
CLAIM 32
【請求項32】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置の復号方法において、 前記圧縮画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号ステップと、 前記量子化されている直交変換係数を逆量子化する逆量 子化ステップと、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換ステップと、 前記予測モード情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画像を生成する参照画 像生成ステップ (generating syntax information) とを含むことを特徴とする復号方法。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US6968012B1

Filed: 2001-10-02     Issued: 2005-11-22

Methods for encoding digital video for decoding on low performance devices

(Original Assignee) Firepad Inc     (Current Assignee) Nordic Interactive Technologies LLC

Jens Meggers
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (memory block) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (corresponding blocks) ;

generate syntax information (identifying means) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US6968012B1
CLAIM 1
. A method of encoding a video stream , comprising : (a) receiving a current video frame of the video stream ;
(b) subdividing the current video frame into blocks of a selected block size , and comparing the blocks to corresponding blocks (block pattern) of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ;
(c) repeating (b) using at least one additional block size , such that the current video frame is separately subdivided and compared to the preceding video frame using multiple block sizes ;
(d) selecting from the multiple block sizes a block size that produces a best video compression result ;
and (e) encoding the video frame in the video stream with the block size selected in (d) .

US6968012B1
CLAIM 10
. The method as in claim 4 , wherein the contiguous memory block (video blocks) is compressed using an LZW-like compression algorithm .

US6968012B1
CLAIM 18
. A data processing system comprising : a server computer comprising a video encoder (second syntax element) application for encoding video streams using multiple block size detection method ;
a client device comprising a decoder application for decoding and displaying the encoded video streams ;
and a video encoder module for encoding video streams using multiple block size detection method comprising block detection component , crunching component , and compressing component .

US6968012B1
CLAIM 19
. A system for encoding and decoding video , the system comprising : identifying means (generate syntax information) for detecting pixel blocks which have changed from a previous decoded video frame ;
storing means for copying the changed pixel blocks and a preamble block ;
compressing means for compressing the changed pixel blocks and the preamble block ;
and identifying means for selecting a smallest encoded frame .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (memory block) .
US6968012B1
CLAIM 10
. The method as in claim 4 , wherein the contiguous memory block (video blocks) is compressed using an LZW-like compression algorithm .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (memory block) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (corresponding blocks) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US6968012B1
CLAIM 1
. A method of encoding a video stream , comprising : (a) receiving a current video frame of the video stream ;
(b) subdividing the current video frame into blocks of a selected block size , and comparing the blocks to corresponding blocks (block pattern) of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ;
(c) repeating (b) using at least one additional block size , such that the current video frame is separately subdivided and compared to the preceding video frame using multiple block sizes ;
(d) selecting from the multiple block sizes a block size that produces a best video compression result ;
and (e) encoding the video frame in the video stream with the block size selected in (d) .

US6968012B1
CLAIM 10
. The method as in claim 4 , wherein the contiguous memory block (video blocks) is compressed using an LZW-like compression algorithm .

US6968012B1
CLAIM 18
. A data processing system comprising : a server computer comprising a video encoder (second syntax element) application for encoding video streams using multiple block size detection method ;
a client device comprising a decoder application for decoding and displaying the encoded video streams ;
and a video encoder module for encoding video streams using multiple block size detection method comprising block detection component , crunching component , and compressing component .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (memory block) .
US6968012B1
CLAIM 10
. The method as in claim 4 , wherein the contiguous memory block (video blocks) is compressed using an LZW-like compression algorithm .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US6959116B2

Filed: 2001-09-18     Issued: 2005-10-25

Largest magnitude indices selection for (run, level) encoding of a block coded picture

(Original Assignee) EMC Corp     (Current Assignee) EMC Corp

Ugur Sezer, Sorin Faibish, Seyfullah H. Oguz, Wayne W. Duso
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (entropy encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US6959116B2
CLAIM 1
. A method of producing a (run , level) encoded picture from an original picture , said method comprising : producing respective sets of transform coefficients (square partitions) for blocks of pixels in the original picture ;
quantizing transform coefficients in the respective sets of transform coefficients to produce respective sets of non-zero quantization indices for the blocks of pixels , wherein the non-zero quantization indices for at least some of the blocks are produced by using a quantization step size that is not uniform within said at least some of the blocks ;
inspecting magnitudes of the non-zero quantization indices for selecting a limited number of largest magnitude non-zero quantization indices for the blocks of pixels in the respective sets of quantization indices to produce respective sets of selected quantization indices having non-zero levels for the blocks of pixels ;
and (run , level) encoding quantization indices from the respective sets of selected quantization indices to produce the (run , level) encoded picture .

US6959116B2
CLAIM 8
. The method as claimed in claim 7 , which includes , for each of the 8×8 blocks of the non-scalable MPEG-2 video : a .) parsing and copying a differential DC coefficient variable-length code (VLC) ;
b .) parsing and decoding (run , level) event variable-length codes (VLCs) up to an end-of-block marker to identify a respective set of non-zero quantization indices ;
c .) finding up to the selected number of non-zero quantization indices having the largest level magnitudes in the respective set of non-zero quantization indices to identify a respective set of largest magnitude non-zero quantization indices ;
and d .) applying (run , level) event formation and entropy encoding (second syntax) to the set of largest magnitude non-zero quantization indices .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (entropy encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US6959116B2
CLAIM 1
. A method of producing a (run , level) encoded picture from an original picture , said method comprising : producing respective sets of transform coefficients (square partitions) for blocks of pixels in the original picture ;
quantizing transform coefficients in the respective sets of transform coefficients to produce respective sets of non-zero quantization indices for the blocks of pixels , wherein the non-zero quantization indices for at least some of the blocks are produced by using a quantization step size that is not uniform within said at least some of the blocks ;
inspecting magnitudes of the non-zero quantization indices for selecting a limited number of largest magnitude non-zero quantization indices for the blocks of pixels in the respective sets of quantization indices to produce respective sets of selected quantization indices having non-zero levels for the blocks of pixels ;
and (run , level) encoding quantization indices from the respective sets of selected quantization indices to produce the (run , level) encoded picture .

US6959116B2
CLAIM 8
. The method as claimed in claim 7 , which includes , for each of the 8×8 blocks of the non-scalable MPEG-2 video : a . ) parsing and copying a differential DC coefficient variable-length code (VLC) ;
b . ) parsing and decoding (run , level) event variable-length codes (VLCs) up to an end-of-block marker to identify a respective set of non-zero quantization indices ;
c . ) finding up to the selected number of non-zero quantization indices having the largest level magnitudes in the respective set of non-zero quantization indices to identify a respective set of largest magnitude non-zero quantization indices ;
and d . ) applying (run , level) event formation and entropy encoding (second syntax) to the set of largest magnitude non-zero quantization indices .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US6529634B1

Filed: 1999-11-08     Issued: 2003-03-04

Contrast sensitive variance based adaptive block size DCT image compression

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Kadayam Thyagarajan, Michael J. Merritt
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (transforming means, main data) of size NxN , where N is an integer multiple (th column) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (transforming means, main data) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (minimum block size) ;

generate syntax information (Huffman code) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (block pattern) of pixel data .

US6529634B1
CLAIM 4
. An image compression system for compressing a block of pixel data , comprising : block size assignment means for selecting said block or subdivided blocks of said block to be compressed based on the variances of pixel values of said block of pixel data and subdivided blocks of said block of pixel data , wherein said block size assignment means determines a variance of pixel values for said block of pixel data , compares said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated , makes a decision to subdivide said block if said variance is greater than said predetermined threshold , if said decision is to subdivide said block , then repeats the determination of variance , the comparison with said predetermined threshold , and the decision to subdivide for each subdivided block until a predetermined criteria is satisfied , and designates as said block size assignment each block that is not further subdivided ;
transform means for transforming pixel data of said selected block or subdivided blocks into frequency domain data (video blocks, four square partitions) ;
quantizer means for quantizing said frequency domain data ;
serializer means for scanning said quantized data into a serialized stream of data ;
and variable length coding means for coding said serialized stream of data in preparation for transmission .

US6529634B1
CLAIM 11
. The system of claim 4 wherein said variable length coding means comprises a Huffman code (generate syntax information, generating syntax information) r .

US6529634B1
CLAIM 24
. The method of claim 1 , wherein said variance is determined by the following equation : var = 1 N 2  ∑ i = 0 N - 1  ∑ j = 0 N - 1  x i , j 2 - (1 N 2  ∑ i = 0 N - 1  ∑ j = 0 N - 1  x i , j) 2 where N=dimension of the block , and x ij is the pixel in the i th row , j th column (integer multiple) within the N×N block .

US6529634B1
CLAIM 30
. The system of claim 27 , wherein said transforming means (video blocks, four square partitions) utilizes the Discrete Cosine Transform .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (transforming means, main data) .
US6529634B1
CLAIM 4
. An image compression system for compressing a block of pixel data , comprising : block size assignment means for selecting said block or subdivided blocks of said block to be compressed based on the variances of pixel values of said block of pixel data and subdivided blocks of said block of pixel data , wherein said block size assignment means determines a variance of pixel values for said block of pixel data , compares said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated , makes a decision to subdivide said block if said variance is greater than said predetermined threshold , if said decision is to subdivide said block , then repeats the determination of variance , the comparison with said predetermined threshold , and the decision to subdivide for each subdivided block until a predetermined criteria is satisfied , and designates as said block size assignment each block that is not further subdivided ;
transform means for transforming pixel data of said selected block or subdivided blocks into frequency domain data (video blocks, four square partitions) ;
quantizer means for quantizing said frequency domain data ;
serializer means for scanning said quantized data into a serialized stream of data ;
and variable length coding means for coding said serialized stream of data in preparation for transmission .

US6529634B1
CLAIM 30
. The system of claim 27 , wherein said transforming means (video blocks, four square partitions) utilizes the Discrete Cosine Transform .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (transforming means, main data) of size NxN , where N is an integer multiple (th column) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (transforming means, main data) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (minimum block size) ;

generating syntax information (Huffman code) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (block pattern) of pixel data .

US6529634B1
CLAIM 4
. An image compression system for compressing a block of pixel data , comprising : block size assignment means for selecting said block or subdivided blocks of said block to be compressed based on the variances of pixel values of said block of pixel data and subdivided blocks of said block of pixel data , wherein said block size assignment means determines a variance of pixel values for said block of pixel data , compares said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated , makes a decision to subdivide said block if said variance is greater than said predetermined threshold , if said decision is to subdivide said block , then repeats the determination of variance , the comparison with said predetermined threshold , and the decision to subdivide for each subdivided block until a predetermined criteria is satisfied , and designates as said block size assignment each block that is not further subdivided ;
transform means for transforming pixel data of said selected block or subdivided blocks into frequency domain data (video blocks, four square partitions) ;
quantizer means for quantizing said frequency domain data ;
serializer means for scanning said quantized data into a serialized stream of data ;
and variable length coding means for coding said serialized stream of data in preparation for transmission .

US6529634B1
CLAIM 11
. The system of claim 4 wherein said variable length coding means comprises a Huffman code (generate syntax information, generating syntax information) r .

US6529634B1
CLAIM 24
. The method of claim 1 , wherein said variance is determined by the following equation : var = 1 N 2  ∑ i = 0 N - 1  ∑ j = 0 N - 1  x i , j 2 - (1 N 2  ∑ i = 0 N - 1  ∑ j = 0 N - 1  x i , j) 2 where N=dimension of the block , and x ij is the pixel in the i th row , j th column (integer multiple) within the N×N block .

US6529634B1
CLAIM 30
. The system of claim 27 , wherein said transforming means (video blocks, four square partitions) utilizes the Discrete Cosine Transform .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (transforming means, main data) .
US6529634B1
CLAIM 4
. An image compression system for compressing a block of pixel data , comprising : block size assignment means for selecting said block or subdivided blocks of said block to be compressed based on the variances of pixel values of said block of pixel data and subdivided blocks of said block of pixel data , wherein said block size assignment means determines a variance of pixel values for said block of pixel data , compares said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated , makes a decision to subdivide said block if said variance is greater than said predetermined threshold , if said decision is to subdivide said block , then repeats the determination of variance , the comparison with said predetermined threshold , and the decision to subdivide for each subdivided block until a predetermined criteria is satisfied , and designates as said block size assignment each block that is not further subdivided ;
transform means for transforming pixel data of said selected block or subdivided blocks into frequency domain data (video blocks, four square partitions) ;
quantizer means for quantizing said frequency domain data ;
serializer means for scanning said quantized data into a serialized stream of data ;
and variable length coding means for coding said serialized stream of data in preparation for transmission .

US6529634B1
CLAIM 30
. The system of claim 27 , wherein said transforming means (video blocks, four square partitions) utilizes the Discrete Cosine Transform .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US6539124B2

Filed: 1999-08-17     Issued: 2003-03-25

Quantizer selection based on region complexities derived using a rate distortion model

(Original Assignee) Sarnoff Corp     (Current Assignee) MediaTek Inc

Sriram Sethuraman, Ravi Krishnamurthy
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (absolute differences) , the minimum size value indicating a size of the smallest partition for the coded unit .
US6539124B2
CLAIM 7
. The invention of claim 6 , wherein the distortion measure S is based on a sum of absolute differences (minimum size value) (SAD) measure .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (absolute differences) , the minimum size value indicating a size of a smallest partition for the coded unit .
US6539124B2
CLAIM 7
. The invention of claim 6 , wherein the distortion measure S is based on a sum of absolute differences (minimum size value) (SAD) measure .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US6215910B1

Filed: 1996-03-28     Issued: 2001-04-10

Table-based compression with embedded coding

(Original Assignee) Microsoft Corp     (Current Assignee) V-XSTREAM Inc ; Microsoft Technology Licensing LLC

Navin Chaddha
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (embedded code) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US6215910B1
CLAIM 1
. A method of compressing an image comprising the steps of : dividing the image into blocks of vectors comprising bits , each vector in a block representing a pixel of the image ;
reducing the number of vectors that must be compressed by use of full search hierarchical lookup tables incorporating block transforms to provide indices ;
and mapping the indices to a set of embedded code (generate syntax information) s by use of a tree structured table .

US6215910B1
CLAIM 3
. The method of claim 2 wherein the hierarchical lookup tables contain distortion measures based on human visual sensitivity to quantization errors in different transform coefficients (square partitions) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US6215910B1
CLAIM 3
. The method of claim 2 wherein the hierarchical lookup tables contain distortion measures based on human visual sensitivity to quantization errors in different transform coefficients (square partitions) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
JPH08205140A

Filed: 1995-01-31     Issued: 1996-08-09

画像圧縮装置

(Original Assignee) Canon Inc; キヤノン株式会社     

Koji Hirabayashi, 康二 平林
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (ブロックサイズ, 小さいサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size (ブロックサイズ, 小さいサイズ) value , the minimum size value indicating a size of the smallest partition for the coded unit .
JPH08205140A
CLAIM 1
【請求項1】 所定の画素数から成るサイズを有するブ ロックデータについて冗長部分の大きさが所定値以上か 否かを判定し、所定値以下のときは上記ブロックデータ を上記サイズより小さいサイズ (maximum size, maximum size value, minimum size, minimum size value) を有するサブブロックに 分割し、所定値より大きいときは分割しないことを示す 分割情報を出力するブロックサイズ (maximum size, maximum size value, minimum size, minimum size value) 判定手段と (second syntax) 、 上記分割情報に基づいて上記ブロックデータを上記サブ ブロックに分割してサブブロックデータ又は上記ブロッ クデータを出力するサブブロック切出し手段と、 上記サブブロックデータ又は上記ブロックデータをその サイズに応じた処理により圧縮符号化する符号化手段と を備えた画像圧縮装置。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (ブロックサイズ, 小さいサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size (ブロックサイズ, 小さいサイズ) value , the minimum size value indicating a size of a smallest partition for the coded unit .
JPH08205140A
CLAIM 1
【請求項1】 所定の画素数から成るサイズを有するブ ロックデータについて冗長部分の大きさが所定値以上か 否かを判定し、所定値以下のときは上記ブロックデータ を上記サイズより小さいサイズ (maximum size, maximum size value, minimum size, minimum size value) を有するサブブロックに 分割し、所定値より大きいときは分割しないことを示す 分割情報を出力するブロックサイズ (maximum size, maximum size value, minimum size, minimum size value) 判定手段と (second syntax) 、 上記分割情報に基づいて上記ブロックデータを上記サブ ブロックに分割してサブブロックデータ又は上記ブロッ クデータを出力するサブブロック切出し手段と、 上記サブブロックデータ又は上記ブロックデータをその サイズに応じた処理により圧縮符号化する符号化手段と を備えた画像圧縮装置。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5107345A

Filed: 1991-05-28     Issued: 1992-04-21

Adaptive block size image compression method and system

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Chong U. Lee
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform, compressed image) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (count values) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (Huffman code) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US5107345A
CLAIM 17
. The system of claim 8 wherein said encoder means comprises : a Huffman code (generate syntax information, generating syntax information) r having an input coupled to said ordering means and an output coupled to said assembler means .

US5107345A
CLAIM 19
. A method for image signal compression of a block of pixel data comprising the steps of : performing a 16×16 discrete cosine transform (DCT) operation on an 16×16 block of pixel data so as to provide a corresponding 16×16 block of DCT coefficient values ;
performing four 8×8 DCT operations on said block of pixel data so as to provide a corresponding block of four 8×8 DCT coefficient value sub-blocks ;
performing sixteen 4×4 DCT operations on said block of pixel data so as to performing sixty-four 2×2 DCT operations on said block of pixel data so as to provide a corresponding block of sixty-four 2×2 DCT coefficient value sub-blocks ;
determining bit count values (rectangular partitions) corresponding to a number of bits required to respectively encode said 16×16 block of DCT coefficient values and predetermined groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values according to a predetermined coding format ;
determining , from said bit count values , ones of said 16×16 block of DCT coefficient values and said corresponding groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values requiring a lesser number of bits to encode according to said predetermined coding format so as to provide a corresponding selection value ;
selecting ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values in accordance with said selection value ;
generating a composite block of DCT coefficient values formed from said selected ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values ;
ordering said composite block of DCT coefficient values according to a predetermined ordering format so as to provide a corresponding output of ordered DCT coefficient values ;
encoding said ordered DCT coefficient values according to said predetermined coding format so as to provide corresponding coded values ;
and assembling said selection value and said coded values in a predetermined order as a coded image value representative of said block of pixel data .

US5107345A
CLAIM 23
. A system for decoding a compressed image (video blocks) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ;
decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ;
ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ;
inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ;
and combiner means for , receiving said one of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data .

US5107345A
CLAIM 28
. The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ;
second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ;
third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ;
and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform, compressed image) .
US5107345A
CLAIM 23
. A system for decoding a compressed image (video blocks) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ;
decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ;
ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ;
inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ;
and combiner means for , receiving said one of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data .

US5107345A
CLAIM 28
. The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ;
second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ;
third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ;
and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform, compressed image) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (count values) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (Huffman code) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US5107345A
CLAIM 17
. The system of claim 8 wherein said encoder means comprises : a Huffman code (generate syntax information, generating syntax information) r having an input coupled to said ordering means and an output coupled to said assembler means .

US5107345A
CLAIM 19
. A method for image signal compression of a block of pixel data comprising the steps of : performing a 16×16 discrete cosine transform (DCT) operation on an 16×16 block of pixel data so as to provide a corresponding 16×16 block of DCT coefficient values ;
performing four 8×8 DCT operations on said block of pixel data so as to provide a corresponding block of four 8×8 DCT coefficient value sub-blocks ;
performing sixteen 4×4 DCT operations on said block of pixel data so as to performing sixty-four 2×2 DCT operations on said block of pixel data so as to provide a corresponding block of sixty-four 2×2 DCT coefficient value sub-blocks ;
determining bit count values (rectangular partitions) corresponding to a number of bits required to respectively encode said 16×16 block of DCT coefficient values and predetermined groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values according to a predetermined coding format ;
determining , from said bit count values , ones of said 16×16 block of DCT coefficient values and said corresponding groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values requiring a lesser number of bits to encode according to said predetermined coding format so as to provide a corresponding selection value ;
selecting ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values in accordance with said selection value ;
generating a composite block of DCT coefficient values formed from said selected ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values ;
ordering said composite block of DCT coefficient values according to a predetermined ordering format so as to provide a corresponding output of ordered DCT coefficient values ;
encoding said ordered DCT coefficient values according to said predetermined coding format so as to provide corresponding coded values ;
and assembling said selection value and said coded values in a predetermined order as a coded image value representative of said block of pixel data .

US5107345A
CLAIM 23
. A system for decoding a compressed image (video blocks) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ;
decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ;
ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ;
inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ;
and combiner means for , receiving said one of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data .

US5107345A
CLAIM 28
. The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ;
second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ;
third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ;
and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform, compressed image) .
US5107345A
CLAIM 23
. A system for decoding a compressed image (video blocks) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ;
decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ;
ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ;
inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ;
and combiner means for , receiving said one of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data .

US5107345A
CLAIM 28
. The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ;
second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ;
third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ;
and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
JP2008219870A

Filed: 2008-01-28     Issued: 2008-09-18

動画像符号化方法及び動画像符号化装置

(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     

Teo Han Boon, Chong Soon Lim, Takahiro Nishi, Yoji Shibahara, Toshiyasu Sugio, Kazuya Takagi, Viktor Wahadaniah, ハン ブーン テオ, ワハダニア ビクター, チョン スーン リム, 敏康 杉尾, 陽司 柴原, 孝啓 西, 一也 高木
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (小さいサイズ, 大きいサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size (小さいサイズ, 大きいサイズ) value , the minimum size value indicating a size of the smallest partition for the coded unit .
JP2008219870A
CLAIM 4
前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (maximum size, minimum size, maximum size value, minimum size value) 又は所定の小さいサイズ (maximum size, minimum size, maximum size value, minimum size value) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。

JP2008219870A
CLAIM 9
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置に用いられる、コンピュータに実行させるためのプログラム (rectangular partitions) であって、 前記プログラムは、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出ステップと、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定ステップと、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化ステップと を含むことを特徴とするプログラム

JP2008219870A
CLAIM 10
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出手段と (second syntax) 、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定手段と、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化手段と を備えることを特徴とする動画像符号化装置。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (小さいサイズ, 大きいサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size (小さいサイズ, 大きいサイズ) value , the minimum size value indicating a size of a smallest partition for the coded unit .
JP2008219870A
CLAIM 4
前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (maximum size, minimum size, maximum size value, minimum size value) 又は所定の小さいサイズ (maximum size, minimum size, maximum size value, minimum size value) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。

JP2008219870A
CLAIM 9
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置に用いられる、コンピュータに実行させるためのプログラム (rectangular partitions) であって、 前記プログラムは、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出ステップと、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定ステップと、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化ステップと を含むことを特徴とするプログラム

JP2008219870A
CLAIM 10
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出手段と (second syntax) 、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定手段と、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化手段と を備えることを特徴とする動画像符号化装置。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2008086316A2

Filed: 2008-01-07     Issued: 2008-07-17

Variable length coding techniques for coded block patterns

(Original Assignee) Qualcomm Incorporated     

Marta Karczewicz, Hyukjune Chung, Phoom Sagetong
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2008086316A2
CLAIM 1
. A method comprising : selecting a variable length coding (VLC) table for a current video block from a plurality of tables based on a number of neighboring video blocks to the current video block that include non-zero transform coefficients ;
and coding one or more coded block pattern (block pattern) s (CBPs) of the current video block using the selected VLC table .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2008086316A2
CLAIM 1
. A method comprising : selecting a variable length coding (VLC) table for a current video block from a plurality of tables based on a number of neighboring video blocks to the current video block that include non-zero transform coefficients ;
and coding one or more coded block pattern (block pattern) s (CBPs) of the current video block using the selected VLC table .

WO2008086316A2
CLAIM 10
. The method of claim 1 , wherein the method is performed with respect to video blocks of an enhancement layer in scalable video coding (generating syntax information) (SVC) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2008054688A1

Filed: 2007-10-26     Issued: 2008-05-08

Method for decomposing a video sequence frame

(Original Assignee) Hewlett-Packard Development Company, L.P.     

Debargha Mukherjee, Sam Liu
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (different motion vectors) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2008054688A1
CLAIM 6
. The method (300) according to any of claims 1-5 , wherein partitioning (308) the block (122-126) into wedges (130a , 130b) further comprises partitioning (308) the block (122-126) into two wedges (130a , 130b) and a parallel band of variable thickness between the two wedges (130a , 130b) , wherein each of the two wedges (130a , 130b) and the parallel band are associated with different motion vectors (second syntax) (132) .

WO2008054688A1
CLAIM 8
. An encoder (112) for encoding a video sequence frame , wherein the video sequence frame is represented by a plurality of blocks (122-126) having a first set of dimensions , said encoder (112) comprising : a wedge partitioner (224) ;
a block divider (222) ;
a motion estimator (226) ;
and an error calculator (228) configured to calculate a first metric of the motion vectors (132) associated with partition (square partitions, four square partitions) ing the at least one block (122-126) into two wedges (130a , 130b) and to calculate a second metric of the motion vectors (132) associated with dividing the at least block into sub-blocks (122-126) , and for at least one block (122-126) , wherein a controller (204) is configured (a) to determine whether either both of the dimensions of the at least one block (122-126) equal a predetermined minimum dimension , (b) to implement the motion estimator (226) to estimate a motion vector (132) for the at least one block in response to the either or both of the dimensions equaling the predetermined minimum dimensions , and in response to both of the dimensions exceeding the predetermined minimum dimension , (c) to implement the motion estimator (226) to estimate a motion vector (!32) for the at least one block (122-126) , (d) to implement the wedge partitioner (224) to partition the at least one block (122-126) of the plurality of blocks (122-126) into two wedges (130a , 130b) , (e) to implement the block divider (222) to divide the at least one block (122-126) into sub-blocks (122-126) having a second set of dimensions , and to perform (a)-(e) for each of the sub-blocks (122-126) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (different motion vectors) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2008054688A1
CLAIM 6
. The method (300) according to any of claims 1-5 , wherein partitioning (308) the block (122-126) into wedges (130a , 130b) further comprises partitioning (308) the block (122-126) into two wedges (130a , 130b) and a parallel band of variable thickness between the two wedges (130a , 130b) , wherein each of the two wedges (130a , 130b) and the parallel band are associated with different motion vectors (second syntax) (132) .

WO2008054688A1
CLAIM 8
. An encoder (112) for encoding a video sequence frame , wherein the video sequence frame is represented by a plurality of blocks (122-126) having a first set of dimensions , said encoder (112) comprising : a wedge partitioner (224) ;
a block divider (222) ;
a motion estimator (226) ;
and an error calculator (228) configured to calculate a first metric of the motion vectors (132) associated with partition (square partitions, four square partitions) ing the at least one block (122-126) into two wedges (130a , 130b) and to calculate a second metric of the motion vectors (132) associated with dividing the at least block into sub-blocks (122-126) , and for at least one block (122-126) , wherein a controller (204) is configured (a) to determine whether either both of the dimensions of the at least one block (122-126) equal a predetermined minimum dimension , (b) to implement the motion estimator (226) to estimate a motion vector (132) for the at least one block in response to the either or both of the dimensions equaling the predetermined minimum dimensions , and in response to both of the dimensions exceeding the predetermined minimum dimension , (c) to implement the motion estimator (226) to estimate a motion vector (!32) for the at least one block (122-126) , (d) to implement the wedge partitioner (224) to partition the at least one block (122-126) of the plurality of blocks (122-126) into two wedges (130a , 130b) , (e) to implement the block divider (222) to divide the at least one block (122-126) into sub-blocks (122-126) having a second set of dimensions , and to perform (a)-(e) for each of the sub-blocks (122-126) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2008027192A2

Filed: 2007-08-16     Issued: 2008-03-06

Methods and apparatus for reduced resolution partitioning

(Original Assignee) Thomson Licensing     

Oscar Divorra Escoda, Peng Yin
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (n partitions) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (syntax information) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value (slice level) , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2008027192A2
CLAIM 2
. The apparatus of claim 1 , wherein partitions (square partitions, four square partitions) are obtained from a combination of top-down tree partitioning and bottom-up tree joining .

WO2008027192A2
CLAIM 3
. The apparatus of claim 2 , wherein said encoder (300) uses a block size of an initial partition to generate a tree-based frame partition , the block size being any of available block sizes capable of being used for tree-based frame partitioning relating to a particular video encoding (second syntax) standard or recommendation .

WO2008027192A2
CLAIM 8
. The apparatus of claim 7 , wherein said encoder (300) codes the high level syntax in at least one of a slice header level , a Supplemental Enhancement Information (SEI) level , a picture parameter set level , a sequence parameter set level , a network abstraction layer unit header level , a picture level , a slice level (minimum size value) , and a macroblock level .

WO2008027192A2
CLAIM 17
. The apparatus of claim 16 , wherein the at least one local super-block partition related syntax element operates as an adaptive predictor selector for direct prediction , and wherein at least one of motion information and a block mode type is directly predicted or derived based on neighboring macroblock data and super-block syntax information (syntax information) .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (syntax information) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
WO2008027192A2
CLAIM 17
. The apparatus of claim 16 , wherein the at least one local super-block partition related syntax element operates as an adaptive predictor selector for direct prediction , and wherein at least one of motion information and a block mode type is directly predicted or derived based on neighboring macroblock data and super-block syntax information (syntax information) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (n partitions) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (syntax information) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value (slice level) , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2008027192A2
CLAIM 2
. The apparatus of claim 1 , wherein partitions (square partitions, four square partitions) are obtained from a combination of top-down tree partitioning and bottom-up tree joining .

WO2008027192A2
CLAIM 3
. The apparatus of claim 2 , wherein said encoder (300) uses a block size of an initial partition to generate a tree-based frame partition , the block size being any of available block sizes capable of being used for tree-based frame partitioning relating to a particular video encoding (second syntax) standard or recommendation .

WO2008027192A2
CLAIM 8
. The apparatus of claim 7 , wherein said encoder (300) codes the high level syntax in at least one of a slice header level , a Supplemental Enhancement Information (SEI) level , a picture parameter set level , a sequence parameter set level , a network abstraction layer unit header level , a picture level , a slice level (minimum size value) , and a macroblock level .

WO2008027192A2
CLAIM 15
. The apparatus of claim 9 , wherein a block scanning order is equal or different from an existing block scan order imposed by a video coding (generating syntax information) recommendation or a video coding standard .

WO2008027192A2
CLAIM 17
. The apparatus of claim 16 , wherein the at least one local super-block partition related syntax element operates as an adaptive predictor selector for direct prediction , and wherein at least one of motion information and a block mode type is directly predicted or derived based on neighboring macroblock data and super-block syntax information (syntax information) .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (syntax information) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
WO2008027192A2
CLAIM 17
. The apparatus of claim 16 , wherein the at least one local super-block partition related syntax element operates as an adaptive predictor selector for direct prediction , and wherein at least one of motion information and a block mode type is directly predicted or derived based on neighboring macroblock data and super-block syntax information (syntax information) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20080049834A1

Filed: 2007-08-03     Issued: 2008-02-28

Sub-block transform coding of prediction residuals

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Thomas Holcomb, Chih-Lung Lin
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (motion prediction) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20080049834A1
CLAIM 56
. The method of claim 55 wherein the data for the one or more video frames comprises motion prediction (video blocks) residual data .

US20080049834A1
CLAIM 60
. A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 55 during video encoding (second syntax) , wherein the method further comprises outputting the one or more sub-block pattern codes .

US20080049834A1
CLAIM 62
. In a video encoder (second syntax element) , a computer-implemented method of processing one or more motion-predicted video frames , wherein the one or more motion-predicted video frames include plural blocks , the method comprising : using plural sub-block pattern codes during encoding of prediction residual data , wherein each of the plural sub-block pattern codes indicates presence or absence of information for plural sub-blocks of a corresponding block of the plural blocks ;
and outputting the plural sub-block pattern codes .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion prediction) .
US20080049834A1
CLAIM 56
. The method of claim 55 wherein the data for the one or more video frames comprises motion prediction (video blocks) residual data .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (motion prediction) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20080049834A1
CLAIM 56
. The method of claim 55 wherein the data for the one or more video frames comprises motion prediction (video blocks) residual data .

US20080049834A1
CLAIM 60
. A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 55 during video encoding (second syntax) , wherein the method further comprises outputting the one or more sub-block pattern codes .

US20080049834A1
CLAIM 62
. In a video encoder (second syntax element) , a computer-implemented method of processing one or more motion-predicted video frames , wherein the one or more motion-predicted video frames include plural blocks , the method comprising : using plural sub-block pattern codes during encoding of prediction residual data , wherein each of the plural sub-block pattern codes indicates presence or absence of information for plural sub-blocks of a corresponding block of the plural blocks ;
and outputting the plural sub-block pattern codes .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion prediction) .
US20080049834A1
CLAIM 56
. The method of claim 55 wherein the data for the one or more video frames comprises motion prediction (video blocks) residual data .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2008016605A2

Filed: 2007-07-31     Issued: 2008-02-07

Adaptive geometric partitioning for video decoding

(Original Assignee) Thomson Licensing     

Oscar Divorra Escoda, Peng Yin
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2008016605A2
CLAIM 18
. The apparatus of claim 16 , wherein said decoder (1100) decodes model parameters of at least one of the at least one parametric model and the at least one curve to determine the parametric model-based partitions along with partition (square partitions, four square partitions) s prediction data .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2008016605A2
CLAIM 15
. The apparatus of claim 1 , wherein said decoder (1100) is an extended version of an existing hybrid predictive decoder of an existing video coding (generating syntax information) standard or video coding recommendation .

WO2008016605A2
CLAIM 18
. The apparatus of claim 16 , wherein said decoder (1100) decodes model parameters of at least one of the at least one parametric model and the at least one curve to determine the parametric model-based partitions along with partition (square partitions, four square partitions) s prediction data .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2008002881A2

Filed: 2007-06-25     Issued: 2008-01-03

Reduction of errors during computation of inverse discrete cosine transform

(Original Assignee) Qualcomm Incorporated     

Harinath Garudadri, Yuriy Reznik
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2008002881A2
CLAIM 1
. A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ;
generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ;
generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ;
and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks) ;
causing a media presentation unit to output audible or visible signals based on the matrix of output values ;
wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform) .
WO2008002881A2
CLAIM 1
. A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ;
generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ;
generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ;
and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks) ;
causing a media presentation unit to output audible or visible signals based on the matrix of output values ;
wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2008002881A2
CLAIM 1
. A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ;
generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ;
generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ;
and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks) ;
causing a media presentation unit to output audible or visible signals based on the matrix of output values ;
wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform) .
WO2008002881A2
CLAIM 1
. A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ;
generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ;
generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ;
and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks) ;
causing a media presentation unit to output audible or visible signals based on the matrix of output values ;
wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision .

EP2347591B1
CLAIM 7
A computer-readable medium comprising instructions for causing a programmable processor (programmable processor) to perform the methods of any of claims 4-6 .
WO2008002881A2
CLAIM 81
. A computer-readable medium comprising instructions , wherein the instructions cause a programmable processor (programmable processor) to : generate a matrix of adjusted coefficients by left-shifting each coefficient in a matrix of source coefficients ;
generate a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of adjusted coefficients ;
generate a matrix of scaled coefficients by scaling each coefficient in the matrix of transformed coefficients ;
generate a matrix of biased coefficients by adding one or more bias values to one or more coefficients in the matrix of scaled coefficients ;
and generate a matrix of output coefficients by right-shifting coefficients in the matrix of biased coefficients by a first magnitude ;
wherein the matrix of source coefficients represents media data ;
wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal forward discrete cosine transform ;
wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of biased coefficients by the first magnitude and results from dividing the coefficients in the matrix of biased coefficients by 2 raised to the power of the first magnitude , without regard to precision .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2007117711A2

Filed: 2007-04-09     Issued: 2007-10-18

Dynamic selection of motion estimation search ranges and extended motion vector ranges

(Original Assignee) Microsoft Corporation     

Cheng Chang, Chih-Lung Lin, Thomas W. Holcomb
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (frequency transform) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax (prediction residuals) element representing a maximum size (current picture) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (entropy encoding) element (vertical components) representing a minimum size value (absolute values) , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2007117711A2
CLAIM 4
. The method of claim 1 further comprising : signaling a first motion vector range indicating the first search range ;
and signaling a second motion vector range indicating the second search range ;
whereby the signaling improves efficiency of entropy encoding (second syntax) of motion vector information from the first motion estimation and second motion estimation .

WO2007117711A2
CLAIM 16
. An encoder comprising : a motion estimator for performing motion estimation ;
a frequency transform (integer multiple) er for transforming prediction residuals (first syntax) into transform coefficients (square partitions) ;
a quantizer for quantizing the transform coefficients ;
an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ;
and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size, maximum size value) is based at least in part upon motion vector distribution information for plural previous motion vectors .

WO2007117711A2
CLAIM 19
. The encoder of claim 16 wherein the motion estimation controller : tracks each of the plural previous motion vectors in a histogram based upon a sum of absolute values (minimum size value) of horizontal and vertical components (second syntax element) ;
and using the histogram , counts how many of the plural previous motion vectors fall within each of plural intervals of a motion vector distribution .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (frequency transform) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (prediction residuals) element representing a maximum size (current picture) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (entropy encoding) element (vertical components) representing a minimum size value (absolute values) , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2007117711A2
CLAIM 4
. The method of claim 1 further comprising : signaling a first motion vector range indicating the first search range ;
and signaling a second motion vector range indicating the second search range ;
whereby the signaling improves efficiency of entropy encoding (second syntax) of motion vector information from the first motion estimation and second motion estimation .

WO2007117711A2
CLAIM 16
. An encoder comprising : a motion estimator for performing motion estimation ;
a frequency transform (integer multiple) er for transforming prediction residuals (first syntax) into transform coefficients (square partitions) ;
a quantizer for quantizing the transform coefficients ;
an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ;
and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size, maximum size value) is based at least in part upon motion vector distribution information for plural previous motion vectors .

WO2007117711A2
CLAIM 19
. The encoder of claim 16 wherein the motion estimation controller : tracks each of the plural previous motion vectors in a histogram based upon a sum of absolute values (minimum size value) of horizontal and vertical components (second syntax element) ;
and using the histogram , counts how many of the plural previous motion vectors fall within each of plural intervals of a motion vector distribution .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2007115325A2

Filed: 2007-04-04     Issued: 2007-10-11

Apparatus and method of enhanced frame interpolation in video compression

(Original Assignee) Qualcomm Incorporated     

Fang Shi, Vijayalakshmi R. Raveendran
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (receiving encoded data) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2007115325A2
CLAIM 42
. A method of processing multimedia data , the method comprising : receiving encoded data (integer multiple) indicative of an interpolated reference frame for at least a portion of a frame ;
generating at least a portion of said interpolated reference frame ;
and decoding at least a portion of the frame based at least in part on the generated portion of the interpolated reference frame .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (receiving encoded data) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2007115325A2
CLAIM 42
. A method of processing multimedia data , the method comprising : receiving encoded data (integer multiple) indicative of an interpolated reference frame for at least a portion of a frame ;
generating at least a portion of said interpolated reference frame ;
and decoding at least a portion of the frame based at least in part on the generated portion of the interpolated reference frame .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2007115126A1

Filed: 2007-03-29     Issued: 2007-10-11

Adaptive encoder-assisted frame rate up conversion

(Original Assignee) Qualcomm Incorporated     

Tao Tian, Fang Shi, Vijayalakshmi R. Raveendran
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2007115126A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

WO2007115126A1
CLAIM 3
. The method of claim 2 , wherein adjusting one or more parameters comprises adjusting a coded block pattern (block pattern) (CBP) parameter and a motion vector (MV) parameter for each of the blocks to indicate the selected FRUC mode .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2007115126A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

WO2007115126A1
CLAIM 3
. The method of claim 2 , wherein adjusting one or more parameters comprises adjusting a coded block pattern (block pattern) (CBP) parameter and a motion vector (MV) parameter for each of the blocks to indicate the selected FRUC mode .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2007112417A2

Filed: 2007-03-27     Issued: 2007-10-04

Method and system for coding and decoding information associated with video compression

(Original Assignee) Qualcomm Incorporated     

Yiliang Bao
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2007112417A2
CLAIM 1
. A method for coding information associated with video compression comprising : receiving a video sequence ;
processing the video sequence in a plurality of frames ;
processing each frame of the plurality of frames in a plurality of macrob locks ;
constructing a prediction of an original video signal , which is a part of a macroblock , in a current frame , from the video sequence ;
forming a residual signal by subtracting the prediction of the original video signal from the original video signal in the current frame ;
applying a transform to the residual signal ;
quantizing a plurality of transform coefficients (square partitions) ;
identifying a symbol of at least one syntax element that defines a characteristic of the residual signal ;
and coding symbols of the at least one syntax element of a same category together .

WO2007112417A2
CLAIM 2
. The method of claim 1 , wherein coding the symbols of the at least one syntax element comprises coding a coded block pattern (block pattern) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2007112417A2
CLAIM 1
. A method for coding information associated with video compression comprising : receiving a video sequence ;
processing the video sequence in a plurality of frames ;
processing each frame of the plurality of frames in a plurality of macrob locks ;
constructing a prediction of an original video signal , which is a part of a macroblock , in a current frame , from the video sequence ;
forming a residual signal by subtracting the prediction of the original video signal from the original video signal in the current frame ;
applying a transform to the residual signal ;
quantizing a plurality of transform coefficients (square partitions) ;
identifying a symbol of at least one syntax element that defines a characteristic of the residual signal ;
and coding symbols of the at least one syntax element of a same category together .

WO2007112417A2
CLAIM 2
. The method of claim 1 , wherein coding the symbols of the at least one syntax element comprises coding a coded block pattern (block pattern) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20070297518A1

Filed: 2007-03-22     Issued: 2007-12-27

Flag encoding method, flag decoding method, and apparatus thereof

(Original Assignee) Samsung Electronics Co Ltd     (Current Assignee) Samsung Electronics Co Ltd

Woo-jin Han, Bae-keun Lee, Tammy Lee, Kyo-hyuk Lee
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (flag value) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (minimum size) value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20070297518A1
CLAIM 1
. An apparatus for encoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a flag-assembling unit which collects flag value (video blocks) s allotted for the plurality of blocks and produces a flag bit string , based on a spatial correlation of the plurality of blocks ;
a maximum-run-determining unit which determines a maximum run of the flag bit string ;
and a converting unit which converts bits included in the flag bit string into a codeword with a size no greater than the maximum run by using a predetermined codeword table .

US20070297518A1
CLAIM 7
. The apparatus for encoding a flag of claim 1 , wherein the flag is a coded block pattern (block pattern) (Cbp) or a residual prediction flag .

US20070297518A1
CLAIM 24
. The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (flag value) .
US20070297518A1
CLAIM 1
. An apparatus for encoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a flag-assembling unit which collects flag value (video blocks) s allotted for the plurality of blocks and produces a flag bit string , based on a spatial correlation of the plurality of blocks ;
a maximum-run-determining unit which determines a maximum run of the flag bit string ;
and a converting unit which converts bits included in the flag bit string into a codeword with a size no greater than the maximum run by using a predetermined codeword table .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (flag value) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (minimum size) value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20070297518A1
CLAIM 1
. An apparatus for encoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a flag-assembling unit which collects flag value (video blocks) s allotted for the plurality of blocks and produces a flag bit string , based on a spatial correlation of the plurality of blocks ;
a maximum-run-determining unit which determines a maximum run of the flag bit string ;
and a converting unit which converts bits included in the flag bit string into a codeword with a size no greater than the maximum run by using a predetermined codeword table .

US20070297518A1
CLAIM 7
. The apparatus for encoding a flag of claim 1 , wherein the flag is a coded block pattern (block pattern) (Cbp) or a residual prediction flag .

US20070297518A1
CLAIM 24
. The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (flag value) .
US20070297518A1
CLAIM 1
. An apparatus for encoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a flag-assembling unit which collects flag value (video blocks) s allotted for the plurality of blocks and produces a flag bit string , based on a spatial correlation of the plurality of blocks ;
a maximum-run-determining unit which determines a maximum run of the flag bit string ;
and a converting unit which converts bits included in the flag bit string into a codeword with a size no greater than the maximum run by using a predetermined codeword table .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20070160303A1

Filed: 2006-12-20     Issued: 2007-07-12

Geometrical image representation and compression

(Original Assignee) Docomo Communications Labs USA Inc     (Current Assignee) NTT Docomo Inc

Onur Guleryuz, Arthur Cunha
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions (image processing operation) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20070160303A1
CLAIM 1
. A method comprising : creating a representation for image data that includes determining a geometric flow for image data ;
and performing an image processing operation (rectangular partitions) on data in the representation using the geometric flow .

US20070160303A1
CLAIM 26
. The method defined in claim 17 wherein encoding the prediction errors comprises entropy coding (square partitions) the prediction errors .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions (image processing operation) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20070160303A1
CLAIM 1
. A method comprising : creating a representation for image data that includes determining a geometric flow for image data ;
and performing an image processing operation (rectangular partitions) on data in the representation using the geometric flow .

US20070160303A1
CLAIM 26
. The method defined in claim 17 wherein encoding the prediction errors comprises entropy coding (square partitions) the prediction errors .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20070230563A1

Filed: 2006-11-10     Issued: 2007-10-04

Adaptive encoder-assisted frame rate up conversion

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Tao Tian, Fang Shi, Vijayalakshmi R. Raveendran
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20070230563A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US20070230563A1
CLAIM 3
. The method of claim 2 , wherein adjusting one or more parameters comprises adjusting a coded block pattern (block pattern) (CBP) parameter and a motion vector (MV) parameter for each of the blocks to indicate the selected FRUC mode .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20070230563A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US20070230563A1
CLAIM 3
. The method of claim 2 , wherein adjusting one or more parameters comprises adjusting a coded block pattern (block pattern) (CBP) parameter and a motion vector (MV) parameter for each of the blocks to indicate the selected FRUC mode .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
JP2008092150A

Filed: 2006-09-29     Issued: 2008-04-17

画像符号化装置、画像符号化方法

(Original Assignee) Canon Inc; キヤノン株式会社     

Tadashi Kaneko, 金子  唯史
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
JP2008092150A
CLAIM 1
動画像を構成する各フレームの画像を符号化する画像符号化装置であって、 符号化対象フレームの画像をマクロブロック毎に入力する入力手段と (second syntax) 、 前記符号化対象フレームの画像における符号化対象マクロブロックと、前記符号化対象フレームよりも前のフレームの画像において当該符号化対象マクロブロックに対応するマクロブロックと、の差分量を求める計算手段と、 前記差分量に基づいて、前記符号化対象マクロブロックに対する符号化単位を決定する決定手段と、 前記決定手段が決定した符号化単位でもって前記符号化対象マクロブロックに対する符号化を行う符号化手段と を備えることを特徴とする画像符号化装置。

JP2008092150A
CLAIM 7
コンピュータに請求項6に記載の画像処理方法を実行させるためのプログラム (rectangular partitions)

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
JP2008092150A
CLAIM 1
動画像を構成する各フレームの画像を符号化する画像符号化装置であって、 符号化対象フレームの画像をマクロブロック毎に入力する入力手段と (second syntax) 、 前記符号化対象フレームの画像における符号化対象マクロブロックと、前記符号化対象フレームよりも前のフレームの画像において当該符号化対象マクロブロックに対応するマクロブロックと、の差分量を求める計算手段と、 前記差分量に基づいて、前記符号化対象マクロブロックに対する符号化単位を決定する決定手段と、 前記決定手段が決定した符号化単位でもって前記符号化対象マクロブロックに対する符号化を行う符号化手段と を備えることを特徴とする画像符号化装置。

JP2008092150A
CLAIM 7
コンピュータに請求項6に記載の画像処理方法を実行させるためのプログラム (rectangular partitions)




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
CN1972449A

Filed: 2006-09-11     Issued: 2007-05-30

用于在单一描述和多描述之间切换的设备和方法

(Original Assignee) 首尔国立大学校产学协力财团; 三星电子株式会社     

金壹求, 赵南翊
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN (余弦变换) , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (的频率特性) , the minimum size value indicating a size of the smallest partition for the coded unit .
CN1972449A
CLAIM 1
. 一种视频代码转换器,包括:译码单元,译码压缩视频以为所述压缩视频的每一个帧产生视频报头、运动矢量和DCT(离散余弦变换 (size NxN) )系数;单一描述/多描述确定单元,基于在上述压缩视频的传输信道中错误发生的概率确定是否执行代码转换为单一描述或多描述;以及代码转换单元,当上述单一描述/多描述确定单元确定执行代码转换为单一描述时,通过将DCT系数和视频报头以及运动矢量一起编码,执行代码转换为单一描述,和当上述单一描述/多描述确定单元确定执行代码转换为多描述时,通过将DCT系数划分为若干DCT系数以及将上述DCT系数的每一个和视频报头以及运动矢量一起编码,执行代码转换为多描述。

CN1972449A
CLAIM 2
. 权利要求1中的视频代码转换器,其中上述代码转换单元包括:执行代码转换为单一描述的单一描述代码转换单元;以及执行代码译码为多描述的多描述代码转换单元,其中上述多描述代码转换单元包括:开始位置确定单元,根据DCT系数的频率特性 (minimum size value) 以预先确定或要求的扫描顺序确定上述DCT系数的划分的开始位置;DCT系数划分单元,使在上述开始位置之前的DCT系数的所有数据被包括在若干被划分的DCT系数的每一个中,和使在上述开始位置之后的DCT系数的数据交替地被包括在若干被划分的DCT系数的每一个中,从而产生多个被划分的DCT系数;以及压缩编码单元,将多个被划分的DCT系数的每一个和视频报头以及运动矢量一起编码。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN (余弦变换) , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (从而产生) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (的频率特性) , the minimum size value indicating a size of a smallest partition for the coded unit .
CN1972449A
CLAIM 1
. 一种视频代码转换器,包括:译码单元,译码压缩视频以为所述压缩视频的每一个帧产生视频报头、运动矢量和DCT(离散余弦变换 (size NxN) )系数;单一描述/多描述确定单元,基于在上述压缩视频的传输信道中错误发生的概率确定是否执行代码转换为单一描述或多描述;以及代码转换单元,当上述单一描述/多描述确定单元确定执行代码转换为单一描述时,通过将DCT系数和视频报头以及运动矢量一起编码,执行代码转换为单一描述,和当上述单一描述/多描述确定单元确定执行代码转换为多描述时,通过将DCT系数划分为若干DCT系数以及将上述DCT系数的每一个和视频报头以及运动矢量一起编码,执行代码转换为多描述。

CN1972449A
CLAIM 2
. 权利要求1中的视频代码转换器,其中上述代码转换单元包括:执行代码转换为单一描述的单一描述代码转换单元;以及执行代码译码为多描述的多描述代码转换单元,其中上述多描述代码转换单元包括:开始位置确定单元,根据DCT系数的频率特性 (minimum size value) 以预先确定或要求的扫描顺序确定上述DCT系数的划分的开始位置;DCT系数划分单元,使在上述开始位置之前的DCT系数的所有数据被包括在若干被划分的DCT系数的每一个中,和使在上述开始位置之后的DCT系数的数据交替地被包括在若干被划分的DCT系数的每一个中,从而产生 (generating syntax information) 多个被划分的DCT系数;以及压缩编码单元,将多个被划分的DCT系数的每一个和视频报头以及运动矢量一起编码。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20070171982A1

Filed: 2006-08-31     Issued: 2007-07-26

Methods for encoding digital video for decoding on low performance devices

(Original Assignee) WIMAXIMUM LLC     (Current Assignee) Nordic Interactive Technologies LLC ; WIMAXIMUM LLC

Jens Meggers
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (memory block) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (corresponding blocks) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20070171982A1
CLAIM 25
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding a video stream by : receiving a current video frame of the video stream ;
subdividing the current video frame into uniform blocks of a selected block size , and comparing the blocks to corresponding blocks (block pattern) of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ;
optionally repeating said subdividing or said comparing , or combinations thereof , using at least one additional block size , wherein the current video frame is capable of being separately subdivided or compared , or combinations thereof , to the preceding video frame using multiple block sizes ;
selecting from the multiple block sizes a uniform block size that is capable of producing a relatively better video frame compression result ;
and encoding the video frame in the video stream with the block size selected by said selecting .

US20070171982A1
CLAIM 34
. Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video blocks) using an LZW type compression algorithm .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (memory block) .
US20070171982A1
CLAIM 34
. Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video blocks) using an LZW type compression algorithm .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (memory block) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (corresponding blocks) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20070171982A1
CLAIM 25
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding a video stream by : receiving a current video frame of the video stream ;
subdividing the current video frame into uniform blocks of a selected block size , and comparing the blocks to corresponding blocks (block pattern) of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ;
optionally repeating said subdividing or said comparing , or combinations thereof , using at least one additional block size , wherein the current video frame is capable of being separately subdivided or compared , or combinations thereof , to the preceding video frame using multiple block sizes ;
selecting from the multiple block sizes a uniform block size that is capable of producing a relatively better video frame compression result ;
and encoding the video frame in the video stream with the block size selected by said selecting .

US20070171982A1
CLAIM 34
. Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video blocks) using an LZW type compression algorithm .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (memory block) .
US20070171982A1
CLAIM 34
. Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video blocks) using an LZW type compression algorithm .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
EP1771008A2

Filed: 2006-08-15     Issued: 2007-04-04

Adaptive method and system for encoding digital images for the internet

(Original Assignee) Micro Focus Software Inc     (Current Assignee) Oracle International Corp

Kasman E. Thomas
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (color values) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (predetermined threshold value) , the minimum size value indicating a size of the smallest partition for the coded unit .
EP1771008A2
CLAIM 1
A method of converting an image bitmap comprising : defining (34) a current rectangular region (120) of the image bitmap ;
quantifying (36) spectral characteristics of the current rectangular region ;
dividing (44) the current rectangular region into four rectangular sub-regions (124 , 125 , 126 , 127) in response to the spectral characteristics being greater than a predetermined threshold value (minimum size value) ;
tagging (40) the current rectangular sub-region as a terminal rectangular region in response to the spectral characteristics being less than or equal to the predetermined threshold value ;
tagging (52) each rectangular sub-region as a terminal rectangular region in response to the size of the rectangular sub-region being less than or equal to a predetermined threshold size ;
defining (50) one of the non-terminal rectangular sub-regions as the current rectangular region and repeating the above until all rectangular regions are tagged terminal ;
and determining (56) color characteristics for each of the terminal rectangular regions .

EP1771008A2
CLAIM 4
The method of claim 3 , wherein determining (42) a split point comprises determining (86 , 88) a center-of-gravity of the rectangular region based on color values (rectangular partitions) of pixels in the rectangular region (82) and an average luminance value of the rectangular region (84) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (color values) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (predetermined threshold value) , the minimum size value indicating a size of a smallest partition for the coded unit .
EP1771008A2
CLAIM 1
A method of converting an image bitmap comprising : defining (34) a current rectangular region (120) of the image bitmap ;
quantifying (36) spectral characteristics of the current rectangular region ;
dividing (44) the current rectangular region into four rectangular sub-regions (124 , 125 , 126 , 127) in response to the spectral characteristics being greater than a predetermined threshold value (minimum size value) ;
tagging (40) the current rectangular sub-region as a terminal rectangular region in response to the spectral characteristics being less than or equal to the predetermined threshold value ;
tagging (52) each rectangular sub-region as a terminal rectangular region in response to the size of the rectangular sub-region being less than or equal to a predetermined threshold size ;
defining (50) one of the non-terminal rectangular sub-regions as the current rectangular region and repeating the above until all rectangular regions are tagged terminal ;
and determining (56) color characteristics for each of the terminal rectangular regions .

EP1771008A2
CLAIM 4
The method of claim 3 , wherein determining (42) a split point comprises determining (86 , 88) a center-of-gravity of the rectangular region based on color values (rectangular partitions) of pixels in the rectangular region (82) and an average luminance value of the rectangular region (84) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2007027402A2

Filed: 2006-08-11     Issued: 2007-03-08

Multi-stage cabac decoding pipeline

(Original Assignee) Micronas Usa, Inc.     

Ankur Shah, Liang Peng
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (n bits) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (syntax elements) for the coded unit wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2007027402A2
CLAIM 1
What is claimed is : L A multi-stage content-adaptive binary arithmetic coding (CABAC) architecture device for decoding a video bit stream , comprising : a history ' ;
table lookup stage adapted to operate on Bin 0 only , for detecting macroblocks neighboring a current macroblock , and for retrieving previously stored syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) information for the neighboring macroblocks ;
a neighborhood information context increment stage adapted to operate on Bin 0 only , for parsing the retrieved syntax elements information for the neighboring macroblocks , and calculating a context index increment for each syntax element ;
a context final stage adapted to operate on all bins , for calculating a final context for a particular syntax element ;
a state MPS read stage adapted to operate on all bins , for storing and retrieving a probability s , tate index and a most probable symbol value using a stateMps value table and a context index as an offset into that table ;
a decode stage adapted to operate on all bins , for carrying out decode processes ;
a binarisation stage adapted to operate on all bins , for performing a bin match process for each syntax element ;
and a state MPS write-back stage adapted to operate on all bins , for writing updated probability state index and most probable symbol values back to the ' ;
stateMPS value table corresponding to a current context index , wherein if the current context index and a subsequent context index are the same , the updated probability state index and a most probable symbol values are forwarded to the decode stage of a subsequent bit .

WO2007027402A2
CLAIM 1
3 . The device of claim I wherein bits (integer multiple) of the 1 video bit stream are decoded in 2 an average of four clock cycles .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
WO2007027402A2
CLAIM 1
What is claimed is : L A multi-stage content-adaptive binary arithmetic coding (CABAC) architecture device for decoding a video bit stream , comprising : a history ' ;
table lookup stage adapted to operate on Bin 0 only , for detecting macroblocks neighboring a current macroblock , and for retrieving previously stored syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) information for the neighboring macroblocks ;
a neighborhood information context increment stage adapted to operate on Bin 0 only , for parsing the retrieved syntax elements information for the neighboring macroblocks , and calculating a context index increment for each syntax element ;
a context final stage adapted to operate on all bins , for calculating a final context for a particular syntax element ;
a state MPS read stage adapted to operate on all bins , for storing and retrieving a probability s , tate index and a most probable symbol value using a stateMps value table and a context index as an offset into that table ;
a decode stage adapted to operate on all bins , for carrying out decode processes ;
a binarisation stage adapted to operate on all bins , for performing a bin match process for each syntax element ;
and a state MPS write-back stage adapted to operate on all bins , for writing updated probability state index and most probable symbol values back to the ' ;
stateMPS value table corresponding to a current context index , wherein if the current context index and a subsequent context index are the same , the updated probability state index and a most probable symbol values are forwarded to the decode stage of a subsequent bit .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (n bits) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (syntax elements) for the coded unit , wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2007027402A2
CLAIM 1
What is claimed is : L A multi-stage content-adaptive binary arithmetic coding (CABAC) architecture device for decoding a video bit stream , comprising : a history ' ;
table lookup stage adapted to operate on Bin 0 only , for detecting macroblocks neighboring a current macroblock , and for retrieving previously stored syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) information for the neighboring macroblocks ;
a neighborhood information context increment stage adapted to operate on Bin 0 only , for parsing the retrieved syntax elements information for the neighboring macroblocks , and calculating a context index increment for each syntax element ;
a context final stage adapted to operate on all bins , for calculating a final context for a particular syntax element ;
a state MPS read stage adapted to operate on all bins , for storing and retrieving a probability s , tate index and a most probable symbol value using a stateMps value table and a context index as an offset into that table ;
a decode stage adapted to operate on all bins , for carrying out decode processes ;
a binarisation stage adapted to operate on all bins , for performing a bin match process for each syntax element ;
and a state MPS write-back stage adapted to operate on all bins , for writing updated probability state index and most probable symbol values back to the ' ;
stateMPS value table corresponding to a current context index , wherein if the current context index and a subsequent context index are the same , the updated probability state index and a most probable symbol values are forwarded to the decode stage of a subsequent bit .

WO2007027402A2
CLAIM 1
3 . The device of claim I wherein bits (integer multiple) of the 1 video bit stream are decoded in 2 an average of four clock cycles .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
WO2007027402A2
CLAIM 1
What is claimed is : L A multi-stage content-adaptive binary arithmetic coding (CABAC) architecture device for decoding a video bit stream , comprising : a history ' ;
table lookup stage adapted to operate on Bin 0 only , for detecting macroblocks neighboring a current macroblock , and for retrieving previously stored syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) information for the neighboring macroblocks ;
a neighborhood information context increment stage adapted to operate on Bin 0 only , for parsing the retrieved syntax elements information for the neighboring macroblocks , and calculating a context index increment for each syntax element ;
a context final stage adapted to operate on all bins , for calculating a final context for a particular syntax element ;
a state MPS read stage adapted to operate on all bins , for storing and retrieving a probability s , tate index and a most probable symbol value using a stateMps value table and a context index as an offset into that table ;
a decode stage adapted to operate on all bins , for carrying out decode processes ;
a binarisation stage adapted to operate on all bins , for performing a bin match process for each syntax element ;
and a state MPS write-back stage adapted to operate on all bins , for writing updated probability state index and most probable symbol values back to the ' ;
stateMPS value table corresponding to a current context index , wherein if the current context index and a subsequent context index are the same , the updated probability state index and a most probable symbol values are forwarded to the decode stage of a subsequent bit .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
JP2008011431A

Filed: 2006-06-30     Issued: 2008-01-17

画像符号化装置

(Original Assignee) Toshiba Corp; 株式会社東芝     

Hideji Michinaka, Hiromitsu Nakayama, Akihiro Ogami, Takaya Ogawa, Yoshinori Shigeta, Tatsuhiro Suzumura, Satoshi Takegawa, Kiwamu Watanabe, 啓満 中山, 晃弘 大上, 貴也 小川, 究 渡辺, 智 竹川, 良則 繁田, 秀治 道中, 竜広 鈴村
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
JP2008011431A
CLAIM 1
入力される2値シンボル列中の各2値シンボルに対する発生確率に基づく値を符号量の情報として出力する符号量情報出力手段と (second syntax) 、 前記符号量情報出力手段からの情報によって与えられる符号量を加算して、前記2値シンボル列を算術符号化した場合の発生符号量の推定値を出力する符号量累積加算手段と、を有する発生符号量推定部 を具備したことを特徴とする画像符号化装置。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (発生符号量, 発生確率) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
JP2008011431A
CLAIM 1
入力される2値シンボル列中の各2値シンボルに対する発生確率 (generating syntax information) に基づく値を符号量の情報として出力する符号量情報出力手段と (second syntax) 、 前記符号量情報出力手段からの情報によって与えられる符号量を加算して、前記2値シンボル列を算術符号化した場合の発生符号量 (generating syntax information) の推定値を出力する符号量累積加算手段と、を有する発生符号量推定部 を具備したことを特徴とする画像符号化装置。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2006128076A2

Filed: 2006-05-25     Issued: 2006-11-30

Fixed point integer division techniques for ac/dc prediction in video coding devices

(Original Assignee) Qualcomm Incorporated     

Shu Xiao, Junchen Du, Tao Shen
EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2006128076A2
CLAIM 2
. The coding device of claim 1 , wherein the coding device comprises a video coding (generating syntax information) device and the coding standard comprises an MPEG-4 video coding standard .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
CN101189641A

Filed: 2006-05-09     Issued: 2008-05-28

编码数字图像的像素或体素的方法及处理数字图像的方法

(Original Assignee) 布雷克成像有限公司     

I·戈里, M·马蒂乌齐
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (处理设) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (数值数据) value , the minimum size value indicating a size of the smallest partition for the coded unit .
CN101189641A
CLAIM 11
. 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size, minimum size value) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。

CN101189641A
CLAIM 40
. 根据权利要求39所述的图像处理设 (generate syntax information, generating syntax information) 备,其特征在于,所述计算 机硬件由所述成像设备的计算机硬件形成,并且,所述软件装置被存储或 加载于所述成像设备的所述计算机硬件中并可通过所述计算机硬件执行。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (处理设) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (数值数据) value , the minimum size value indicating a size of a smallest partition for the coded unit .
CN101189641A
CLAIM 11
. 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size, minimum size value) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。

CN101189641A
CLAIM 40
. 根据权利要求39所述的图像处理设 (generate syntax information, generating syntax information) 备,其特征在于,所述计算 机硬件由所述成像设备的计算机硬件形成,并且,所述软件装置被存储或 加载于所述成像设备的所述计算机硬件中并可通过所述计算机硬件执行。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20060233254A1

Filed: 2006-04-18     Issued: 2006-10-19

Method and apparatus for adaptively selecting context model for entropy coding

(Original Assignee) Samsung Electronics Co Ltd     (Current Assignee) Samsung Electronics Co Ltd

Bae-keun Lee, Kyo-hyuk Lee, Sang-Chang Cha, Woo-jin Han
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (generates data) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generate syntax information (layer decoder) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20060233254A1
CLAIM 1
. A method of coding a residual prediction flag indicating whether residual data for an enhancement layer block of a multi-layered video signal is predicted from residual data for a lower layer block corresponding to the residual data for the enhancement layer block , the method comprising : calculating characteristic data for the lower layer block , wherein the characteristic data comprises at least one of energy of the residual data of the lower layer block and a value of a coded block pattern (block pattern) (CBP) of the lower layer block ;
determining a coding method for the residual prediction flag according to the characteristic data ;
and coding the residual prediction flag using the determined coding method .

US20060233254A1
CLAIM 30
. A video encoder (second syntax element) for coding a residual prediction flag indicating whether residual data for an enhancement layer block of a multi-layered video signal is predicted from residual data for a lower layer block corresponding to the residual data for the enhancement layer block , the encoder comprising : a lower layer encoding unit which generates characteristic data for the lower layer block , wherein the characteristic data comprises at least one of energy of the residual data of the lower layer block and a value of a coded block pattern (CBP) of the lower layer block ;
an enhancement layer encoding unit which codes the residual prediction flag using a coding method for the residual prediction flag , wherein the method is determined according to the characteristic data .

US20060233254A1
CLAIM 64
. The coding method of claim 55 , wherein the coding of the data for the enhancement layer data is performed by entropy coding (square partitions) .

US20060233254A1
CLAIM 78
. A video encoder comprising : a base layer encoder which obtains a base layer frame from an input frame ;
and an enhancement layer encoder which generates data (video blocks) for an enhancement layer predicted from reference data comprising at least one of data for the base layer frame , data for a second enhancement layer at a different temporal position and data for a different region of the enhancement layer , wherein the enhancement layer encoder comprises a transformer which codes the data for the enhancement layer according to a result of determining whether the data for the enhancement layer is predictable from the reference data .

US20060233254A1
CLAIM 89
. A video decoder comprising : a base layer decoder (syntax information, generating syntax information) which decodes an input base layer frame ;
and an enhancement layer decoder which decodes data for an enhancement layer by referring to the base layer frame , wherein the enhancement layer decoder comprises an inverse transformer which decodes the data for the enhancement layer according to a result of determining whether data for the base layer frame is needed to predict the data for the enhancement layer .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (layer decoder) comprises a fixed-length code corresponding to the size of the plurality of video blocks (generates data) .
US20060233254A1
CLAIM 78
. A video encoder comprising : a base layer encoder which obtains a base layer frame from an input frame ;
and an enhancement layer encoder which generates data (video blocks) for an enhancement layer predicted from reference data comprising at least one of data for the base layer frame , data for a second enhancement layer at a different temporal position and data for a different region of the enhancement layer , wherein the enhancement layer encoder comprises a transformer which codes the data for the enhancement layer according to a result of determining whether the data for the enhancement layer is predictable from the reference data .

US20060233254A1
CLAIM 89
. A video decoder comprising : a base layer decoder (syntax information, generating syntax information) which decodes an input base layer frame ;
and an enhancement layer decoder which decodes data for an enhancement layer by referring to the base layer frame , wherein the enhancement layer decoder comprises an inverse transformer which decodes the data for the enhancement layer according to a result of determining whether data for the base layer frame is needed to predict the data for the enhancement layer .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (generates data) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generating syntax information (layer decoder) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20060233254A1
CLAIM 1
. A method of coding a residual prediction flag indicating whether residual data for an enhancement layer block of a multi-layered video signal is predicted from residual data for a lower layer block corresponding to the residual data for the enhancement layer block , the method comprising : calculating characteristic data for the lower layer block , wherein the characteristic data comprises at least one of energy of the residual data of the lower layer block and a value of a coded block pattern (block pattern) (CBP) of the lower layer block ;
determining a coding method for the residual prediction flag according to the characteristic data ;
and coding the residual prediction flag using the determined coding method .

US20060233254A1
CLAIM 30
. A video encoder (second syntax element) for coding a residual prediction flag indicating whether residual data for an enhancement layer block of a multi-layered video signal is predicted from residual data for a lower layer block corresponding to the residual data for the enhancement layer block , the encoder comprising : a lower layer encoding unit which generates characteristic data for the lower layer block , wherein the characteristic data comprises at least one of energy of the residual data of the lower layer block and a value of a coded block pattern (CBP) of the lower layer block ;
an enhancement layer encoding unit which codes the residual prediction flag using a coding method for the residual prediction flag , wherein the method is determined according to the characteristic data .

US20060233254A1
CLAIM 64
. The coding method of claim 55 , wherein the coding of the data for the enhancement layer data is performed by entropy coding (square partitions) .

US20060233254A1
CLAIM 78
. A video encoder comprising : a base layer encoder which obtains a base layer frame from an input frame ;
and an enhancement layer encoder which generates data (video blocks) for an enhancement layer predicted from reference data comprising at least one of data for the base layer frame , data for a second enhancement layer at a different temporal position and data for a different region of the enhancement layer , wherein the enhancement layer encoder comprises a transformer which codes the data for the enhancement layer according to a result of determining whether the data for the enhancement layer is predictable from the reference data .

US20060233254A1
CLAIM 89
. A video decoder comprising : a base layer decoder (syntax information, generating syntax information) which decodes an input base layer frame ;
and an enhancement layer decoder which decodes data for an enhancement layer by referring to the base layer frame , wherein the enhancement layer decoder comprises an inverse transformer which decodes the data for the enhancement layer according to a result of determining whether data for the base layer frame is needed to predict the data for the enhancement layer .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (layer decoder) comprises a fixed-length code corresponding to the size of the plurality of video blocks (generates data) .
US20060233254A1
CLAIM 78
. A video encoder comprising : a base layer encoder which obtains a base layer frame from an input frame ;
and an enhancement layer encoder which generates data (video blocks) for an enhancement layer predicted from reference data comprising at least one of data for the base layer frame , data for a second enhancement layer at a different temporal position and data for a different region of the enhancement layer , wherein the enhancement layer encoder comprises a transformer which codes the data for the enhancement layer according to a result of determining whether the data for the enhancement layer is predictable from the reference data .

US20060233254A1
CLAIM 89
. A video decoder comprising : a base layer decoder (syntax information, generating syntax information) which decodes an input base layer frame ;
and an enhancement layer decoder which decodes data for an enhancement layer by referring to the base layer frame , wherein the enhancement layer decoder comprises an inverse transformer which decodes the data for the enhancement layer according to a result of determining whether data for the base layer frame is needed to predict the data for the enhancement layer .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
CN101204092A

Filed: 2006-02-13     Issued: 2008-06-18

从低分辨率图像导出高分辨率图像的编码信息的方法以及实施所述方法的编码和解码设备

(Original Assignee) 汤姆森许可贸易公司     

纪尧姆·布瓦松, 尼古拉斯·比尔丹, 爱德华·弗朗索瓦, 帕特里克·洛佩兹, 格温埃利·马康, 热罗姆·维耶龙
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (的尺寸) value , the minimum size value indicating a size of the smallest partition for the coded unit .
CN101204092A
CLAIM 15
、 如权利要求1至14之一所述的方法,其中,所述第一尺寸块 具有8*8像素的尺寸 (minimum size, minimum size value) ,所述宏块具有16*16像素的尺寸,以及所述第二尺寸块具有4*4像素的尺寸

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (的尺寸) value , the minimum size value indicating a size of a smallest partition for the coded unit .
CN101204092A
CLAIM 15
、 如权利要求1至14之一所述的方法,其中,所述第一尺寸块 具有8*8像素的尺寸 (minimum size, minimum size value) ,所述宏块具有16*16像素的尺寸,以及所述第二尺寸块具有4*4像素的尺寸




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20060120450A1

Filed: 2005-12-01     Issued: 2006-06-08

Method and apparatus for multi-layered video encoding and decoding

(Original Assignee) Samsung Electronics Co Ltd     (Current Assignee) Samsung Electronics Co Ltd

Woo-jin Han, Sang-Chang Cha, Ho-Jin Ha
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions (intra-prediction modes) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax (prediction residuals) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20060120450A1
CLAIM 1
. A method for encoding video based on a multi-layer structure , comprising : performing intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
performing prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
selecting one of the two prediction residuals (first syntax) that offers higher coding efficiency ;
and encoding the selected prediction residual .

US20060120450A1
CLAIM 2
. The method of claim 1 , wherein the intra-prediction is performed according to 8 directional intra-prediction modes (rectangular partitions) .

US20060120450A1
CLAIM 8
. The method of claim 1 , wherein the encoding of the selected prediction residual comprises : performing spatial transform on the selected prediction residual to create transform coefficients (square partitions) ;
quantizing the transform coefficients to generate quantization coefficients ;
and losslessly encoding the quantization coefficients .

US20060120450A1
CLAIM 20
. A video encoder (second syntax element) comprising : a unit configured to perform intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
a unit configured to perform prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
a unit configured to select one of the two prediction residuals that offers higher coding efficiency ;
and a unit configured to encode the selected prediction residual .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions (intra-prediction modes) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (prediction residuals) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20060120450A1
CLAIM 1
. A method for encoding video based on a multi-layer structure , comprising : performing intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
performing prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
selecting one of the two prediction residuals (first syntax) that offers higher coding efficiency ;
and encoding the selected prediction residual .

US20060120450A1
CLAIM 2
. The method of claim 1 , wherein the intra-prediction is performed according to 8 directional intra-prediction modes (rectangular partitions) .

US20060120450A1
CLAIM 8
. The method of claim 1 , wherein the encoding of the selected prediction residual comprises : performing spatial transform on the selected prediction residual to create transform coefficients (square partitions) ;
quantizing the transform coefficients to generate quantization coefficients ;
and losslessly encoding the quantization coefficients .

US20060120450A1
CLAIM 20
. A video encoder (second syntax element) comprising : a unit configured to perform intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
a unit configured to perform prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
a unit configured to select one of the two prediction residuals that offers higher coding efficiency ;
and a unit configured to encode the selected prediction residual .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20080025391A1

Filed: 2005-11-09     Issued: 2008-01-31

Transcoding Method And Device

(Original Assignee) Siemens AG     (Current Assignee) Siemens AG

Peter Amon, Jurgen Pandel
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (syntax elements, network element) for the coded unit wherein the syntax information includes : a first syntax (syntax elements, network element) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element (syntax elements, network element) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20080025391A1
CLAIM 13
. The method as claimed in claim 12 , wherein the third piece of page data and the third piece of transform coefficient data of the encoded digital video output signal are encoded in accordance with a first video encoding (second syntax) standard and the first and second piece of page data and the first and second piece of transform coefficient data are based on a second video encoding standard .

US20080025391A1
CLAIM 15
. The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data .

US20080025391A1
CLAIM 28
. A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (syntax elements, network element) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
US20080025391A1
CLAIM 15
. The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data .

US20080025391A1
CLAIM 28
. A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (syntax elements, network element) for the coded unit , wherein the syntax information includes : a first syntax (syntax elements, network element) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element (syntax elements, network element) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20080025391A1
CLAIM 13
. The method as claimed in claim 12 , wherein the third piece of page data and the third piece of transform coefficient data of the encoded digital video output signal are encoded in accordance with a first video encoding (second syntax) standard and the first and second piece of page data and the first and second piece of transform coefficient data are based on a second video encoding standard .

US20080025391A1
CLAIM 15
. The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data .

US20080025391A1
CLAIM 28
. A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (syntax elements, network element) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
US20080025391A1
CLAIM 15
. The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data .

US20080025391A1
CLAIM 28
. A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20070206679A1

Filed: 2005-09-06     Issued: 2007-09-06

Motion Image Encoding Method and Motion Image Decoding Method

(Original Assignee) Panasonic Corp     (Current Assignee) Dolby International AB

Chong Lim, Sheng Shen, Shinya Kadono
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (predetermined threshold value) , the minimum size value indicating a size of the smallest partition for the coded unit .
US20070206679A1
CLAIM 6
. The video coding method according to claim 5 , wherein in the implicit mode , the transform block size is determined to be 4×4 pixels in the case where a degree to which the motion vectors differ from each other is greater than or equal to a predetermined threshold value (minimum size value) , and the transform block size is determined to be 8×8 pixels in the case where the degree to which the motion vectors differ from each other is less than the predetermined threshold value .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (predetermined threshold value) , the minimum size value indicating a size of a smallest partition for the coded unit .
US20070206679A1
CLAIM 1
. A video coding (generating syntax information) method for coding an input picture on a block-by-block basis so as to generate a coded stream , said method comprising : determining a notification method for notifying a transform block size to be used in orthogonal transform of a current block ;
creating an adaptive block transform mode indicating the notification method ;
and adding the adaptive block transform mode to the coded stream .

US20070206679A1
CLAIM 6
. The video coding method according to claim 5 , wherein in the implicit mode , the transform block size is determined to be 4×4 pixels in the case where a degree to which the motion vectors differ from each other is greater than or equal to a predetermined threshold value (minimum size value) , and the transform block size is determined to be 8×8 pixels in the case where the degree to which the motion vectors differ from each other is less than the predetermined threshold value .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20060126740A1

Filed: 2005-05-25     Issued: 2006-06-15

Shared pipeline architecture for motion vector prediction and residual decoding

(Original Assignee) Micronas USA Inc     (Current Assignee) TDK Micronas GmbH

Teng Lin, Weimin Zeng
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (d log) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax (data state) element (motion vector differences) representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (motion vector differences) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20060126740A1
CLAIM 3
. The system of claim 1 wherein B-type picture decoding has a dual channel read write structure , and log (video blocks) ic index ID searching is performed on a block by block basis .

US20060126740A1
CLAIM 7
. The system of claim 5 wherein N=24 , and the N-bits designate current macro block properties including at least one of : intra prediction , inter prediction , skip mode , raw data mode , macro block ID , slice ID , direct mode , macro block split mode , sub block split mode which can be down to 4×4 for luma and 2×2 for chroma and all corresponding intra prediction flag and motion vector differences (first syntax element, second syntax element) on X and Y direction , picture reference index for forward and backward predictions , CABAC bit map and level , and CALVC run level information .

US20060126740A1
CLAIM 9
. The system of claim 1 wherein the first stage includes a PCM raw data state (first syntax) machine for carrying put a PCM raw data mode .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (d log) .
US20060126740A1
CLAIM 3
. The system of claim 1 wherein B-type picture decoding has a dual channel read write structure , and log (video blocks) ic index ID searching is performed on a block by block basis .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (d log) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (data state) element (motion vector differences) representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (motion vector differences) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20060126740A1
CLAIM 3
. The system of claim 1 wherein B-type picture decoding has a dual channel read write structure , and log (video blocks) ic index ID searching is performed on a block by block basis .

US20060126740A1
CLAIM 7
. The system of claim 5 wherein N=24 , and the N-bits designate current macro block properties including at least one of : intra prediction , inter prediction , skip mode , raw data mode , macro block ID , slice ID , direct mode , macro block split mode , sub block split mode which can be down to 4×4 for luma and 2×2 for chroma and all corresponding intra prediction flag and motion vector differences (first syntax element, second syntax element) on X and Y direction , picture reference index for forward and backward predictions , CABAC bit map and level , and CALVC run level information .

US20060126740A1
CLAIM 9
. The system of claim 1 wherein the first stage includes a PCM raw data state (first syntax) machine for carrying put a PCM raw data mode .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (d log) .
US20060126740A1
CLAIM 3
. The system of claim 1 wherein B-type picture decoding has a dual channel read write structure , and log (video blocks) ic index ID searching is performed on a block by block basis .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20050025246A1

Filed: 2004-07-16     Issued: 2005-02-03

Decoding jointly coded transform type and subblock pattern information

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Thomas Holcomb
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (first VLC) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (syntax elements) for the coded unit wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20050025246A1
CLAIM 14
. The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) .

US20050025246A1
CLAIM 17
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ;
decoding the first VLC ;
receiving one or more other VLCs signaled at block level ;
and decoding each of the one or more other VLCs .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks (first VLC) .
US20050025246A1
CLAIM 14
. The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) .

US20050025246A1
CLAIM 17
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ;
decoding the first VLC ;
receiving one or more other VLCs signaled at block level ;
and decoding each of the one or more other VLCs .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (first VLC) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (syntax elements) for the coded unit , wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20050025246A1
CLAIM 14
. The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) .

US20050025246A1
CLAIM 17
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ;
decoding the first VLC ;
receiving one or more other VLCs signaled at block level ;
and decoding each of the one or more other VLCs .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks (first VLC) .
US20050025246A1
CLAIM 14
. The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) .

US20050025246A1
CLAIM 17
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ;
decoding the first VLC ;
receiving one or more other VLCs signaled at block level ;
and decoding each of the one or more other VLCs .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20050053292A1

Filed: 2004-06-29     Issued: 2005-03-10

Advanced bi-directional predictive coding of interlaced video

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Kunal Mukerjee, Thomas Holcomb
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second motion vector predictor) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (block pattern) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US20050053292A1
CLAIM 28
. A computer-readable medium having stored thereon computer-executable instructions for performing the method of claim 16 in a video encoder (second syntax element) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second motion vector predictor) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (block pattern) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US20050053292A1
CLAIM 28
. A computer-readable medium having stored thereon computer-executable instructions for performing the method of claim 16 in a video encoder (second syntax element) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
CN1810037A

Filed: 2004-06-17     Issued: 2006-07-26

帧间的快速模式确定编码

(Original Assignee) 汤姆森许可贸易公司     

尹鹏, 亚历山德罗斯·迈克尔·图拉普斯, 吉尔·麦克唐纳·布瓦斯
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (一种视频编码方法, 针对帧) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (尺寸的) value , the minimum size value indicating a size of the smallest partition for the coded unit .
CN1810037A
CLAIM 1
. 一种视频编码方法 (video blocks) ,用于选择帧间编码的帧的当前宏块的模式,该方法包括以下步骤的至少一个:针对宏块模式的子集来检查第一模式,响应检查的第一模式的运动矢量信息,选择性地检查其它模式,并且响应检查的模式,选择针对当前宏块的模式;检查至少一个相邻宏块的宏块模式,并且响应所述至少一个检查的相邻宏块的宏块模式,选择针对当前宏块的模式;检查宏块模式的子集的成本,如果检查的成本满足预设标准,进一步只检查帧内编码模式,并且响应检查的模式来选择针对当前宏块的模式;以及响应检查的宏块模式来调整及早停止阈值,并且如果满足调整的及早停止阈值,则响应检查的宏块模式来选择针对当前宏块的模式。

CN1810037A
CLAIM 7
. 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。

CN1810037A
CLAIM 10
. 根据权利要求1所述的视频编码方法,还包括:最初针对帧 (video blocks) 间模式和帧内模式的子集执行模式检查;响应模式检查,计算复杂度测量;以及使用复杂度测量来确定是否应该执行其它的帧间模式或帧内模式。

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (一种视频编码方法, 针对帧) .
CN1810037A
CLAIM 1
. 一种视频编码方法 (video blocks) ,用于选择帧间编码的帧的当前宏块的模式,该方法包括以下步骤的至少一个:针对宏块模式的子集来检查第一模式,响应检查的第一模式的运动矢量信息,选择性地检查其它模式,并且响应检查的模式,选择针对当前宏块的模式;检查至少一个相邻宏块的宏块模式,并且响应所述至少一个检查的相邻宏块的宏块模式,选择针对当前宏块的模式;检查宏块模式的子集的成本,如果检查的成本满足预设标准,进一步只检查帧内编码模式,并且响应检查的模式来选择针对当前宏块的模式;以及响应检查的宏块模式来调整及早停止阈值,并且如果满足调整的及早停止阈值,则响应检查的宏块模式来选择针对当前宏块的模式。

CN1810037A
CLAIM 10
. 根据权利要求1所述的视频编码方法,还包括:最初针对帧 (video blocks) 间模式和帧内模式的子集执行模式检查;响应模式检查,计算复杂度测量;以及使用复杂度测量来确定是否应该执行其它的帧间模式或帧内模式。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (一种视频编码方法, 针对帧) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (尺寸的) value , the minimum size value indicating a size of a smallest partition for the coded unit .
CN1810037A
CLAIM 1
. 一种视频编码方法 (video blocks) ,用于选择帧间编码的帧的当前宏块的模式,该方法包括以下步骤的至少一个:针对宏块模式的子集来检查第一模式,响应检查的第一模式的运动矢量信息,选择性地检查其它模式,并且响应检查的模式,选择针对当前宏块的模式;检查至少一个相邻宏块的宏块模式,并且响应所述至少一个检查的相邻宏块的宏块模式,选择针对当前宏块的模式;检查宏块模式的子集的成本,如果检查的成本满足预设标准,进一步只检查帧内编码模式,并且响应检查的模式来选择针对当前宏块的模式;以及响应检查的宏块模式来调整及早停止阈值,并且如果满足调整的及早停止阈值,则响应检查的宏块模式来选择针对当前宏块的模式。

CN1810037A
CLAIM 7
. 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。

CN1810037A
CLAIM 10
. 根据权利要求1所述的视频编码方法,还包括:最初针对帧 (video blocks) 间模式和帧内模式的子集执行模式检查;响应模式检查,计算复杂度测量;以及使用复杂度测量来确定是否应该执行其它的帧间模式或帧内模式。

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (一种视频编码方法, 针对帧) .
CN1810037A
CLAIM 1
. 一种视频编码方法 (video blocks) ,用于选择帧间编码的帧的当前宏块的模式,该方法包括以下步骤的至少一个:针对宏块模式的子集来检查第一模式,响应检查的第一模式的运动矢量信息,选择性地检查其它模式,并且响应检查的模式,选择针对当前宏块的模式;检查至少一个相邻宏块的宏块模式,并且响应所述至少一个检查的相邻宏块的宏块模式,选择针对当前宏块的模式;检查宏块模式的子集的成本,如果检查的成本满足预设标准,进一步只检查帧内编码模式,并且响应检查的模式来选择针对当前宏块的模式;以及响应检查的宏块模式来调整及早停止阈值,并且如果满足调整的及早停止阈值,则响应检查的宏块模式来选择针对当前宏块的模式。

CN1810037A
CLAIM 10
. 根据权利要求1所述的视频编码方法,还包括:最初针对帧 (video blocks) 间模式和帧内模式的子集执行模式检查;响应模式检查,计算复杂度测量;以及使用复杂度测量来确定是否应该执行其它的帧间模式或帧内模式。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20050135484A1

Filed: 2004-06-07     Issued: 2005-06-23

Method of encoding mode determination, method of motion estimation and encoding apparatus

(Original Assignee) Samsung Electronics Co Ltd; Sejong University Industry-Academy Cooperation Group     (Current Assignee) Samsung Electronics Co Ltd ; Sejong University Industry-Academy Cooperation Group

Yung-Lyul Lee, Ki-hoon Han
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (current picture) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (other ones) value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20050135484A1
CLAIM 2
. The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size, maximum size value) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ;
and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison .

US20050135484A1
CLAIM 16
. A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ;
determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ;
calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size) of the plurality of modes ;
selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ;
and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (current picture) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (other ones) value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20050135484A1
CLAIM 2
. The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size, maximum size value) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ;
and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison .

US20050135484A1
CLAIM 16
. A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ;
determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ;
calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size) of the plurality of modes ;
selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ;
and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
CN1857001A

Filed: 2004-05-20     Issued: 2006-11-01

混合视频压缩方法

(Original Assignee) Amt先进多媒体科技公司     

P·托思, J·卡罗利
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (块进行编码) of size NxN (余弦变换) , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (的尺寸, 尺寸的) value , the minimum size value indicating a size of the smallest partition for the coded unit .
CN1857001A
CLAIM 5
. 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size, minimum size value) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。

CN1857001A
CLAIM 6
. 按照权利要求1-5中任何一条权利要求的方法,其特征在于:应用离散余弦变换 (size NxN) (DCT)作为将空间域表示转换成频率域表示的变换。

CN1857001A
CLAIM 14
. 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size, minimum size value) 和可用于传输编码数据的带宽而修改的。

CN1857001A
CLAIM 23
. 用于对数字编码的视频帧序列进行压缩的方法,其中某些帧的信息内容根据前导或后继帧(参考帧)的内容进行编码,该方法进一步包括如下步骤:a . 将所述的待编码帧划分成块,b . 在包含所述待编码块的帧之前或后续的参考图像中为给定的待编码块搜索一个匹配的参考块,c . 通过将匹配的参考块与待编码块进行比较来执行一个可压缩性分析,d . 根据可压缩性分析的结果选择最佳参考块,以及e . 使用刚刚选择的最佳参考块对所述的块进行编码 (video blocks) ,其特征在于,在步骤b中,在搜索参考块的过程中:i)将待编码块划分成子块,ii)分析子块的内容,iii)根据预定义的标准,选择预定数量的子块,优选地,至少为两块,iv)使用所选择的子块执行参考块搜索,所述的搜索是在所选择的参考帧中的特定搜索范围内执行的,以便搜索包含与所选择的子块区别最小的子块的参考块,在所述的搜索过程中,所选择块的相对位置保持不变,以及v)使用所选择的子块来选择最佳参考块作为搜索结果。

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (块进行编码) .
CN1857001A
CLAIM 23
. 用于对数字编码的视频帧序列进行压缩的方法,其中某些帧的信息内容根据前导或后继帧(参考帧)的内容进行编码,该方法进一步包括如下步骤:a . 将所述的待编码帧划分成块,b . 在包含所述待编码块的帧之前或后续的参考图像中为给定的待编码块搜索一个匹配的参考块,c . 通过将匹配的参考块与待编码块进行比较来执行一个可压缩性分析,d . 根据可压缩性分析的结果选择最佳参考块,以及e . 使用刚刚选择的最佳参考块对所述的块进行编码 (video blocks) ,其特征在于,在步骤b中,在搜索参考块的过程中:i)将待编码块划分成子块,ii)分析子块的内容,iii)根据预定义的标准,选择预定数量的子块,优选地,至少为两块,iv)使用所选择的子块执行参考块搜索,所述的搜索是在所选择的参考帧中的特定搜索范围内执行的,以便搜索包含与所选择的子块区别最小的子块的参考块,在所述的搜索过程中,所选择块的相对位置保持不变,以及v)使用所选择的子块来选择最佳参考块作为搜索结果。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (块进行编码) of size NxN (余弦变换) , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (从而产生) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (的尺寸, 尺寸的) value , the minimum size value indicating a size of a smallest partition for the coded unit .
CN1857001A
CLAIM 5
. 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size, minimum size value) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。

CN1857001A
CLAIM 6
. 按照权利要求1-5中任何一条权利要求的方法,其特征在于:应用离散余弦变换 (size NxN) (DCT)作为将空间域表示转换成频率域表示的变换。

CN1857001A
CLAIM 14
. 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size, minimum size value) 和可用于传输编码数据的带宽而修改的。

CN1857001A
CLAIM 23
. 用于对数字编码的视频帧序列进行压缩的方法,其中某些帧的信息内容根据前导或后继帧(参考帧)的内容进行编码,该方法进一步包括如下步骤:a . 将所述的待编码帧划分成块,b . 在包含所述待编码块的帧之前或后续的参考图像中为给定的待编码块搜索一个匹配的参考块,c . 通过将匹配的参考块与待编码块进行比较来执行一个可压缩性分析,d . 根据可压缩性分析的结果选择最佳参考块,以及e . 使用刚刚选择的最佳参考块对所述的块进行编码 (video blocks) ,其特征在于,在步骤b中,在搜索参考块的过程中:i)将待编码块划分成子块,ii)分析子块的内容,iii)根据预定义的标准,选择预定数量的子块,优选地,至少为两块,iv)使用所选择的子块执行参考块搜索,所述的搜索是在所选择的参考帧中的特定搜索范围内执行的,以便搜索包含与所选择的子块区别最小的子块的参考块,在所述的搜索过程中,所选择块的相对位置保持不变,以及v)使用所选择的子块来选择最佳参考块作为搜索结果。

CN1857001A
CLAIM 33
. 按照权利要求23-32中任何一条权利要求的方法,其特征在于:在差分块的信息内容上执行一个将空间表示转换成频率表示的变换(DCT或哈达马变换),从而产生 (generating syntax information) 多元素的二维变换块(DCT或哈达马系数矩阵),并且用熵编码对变换块的信息内容进行编码。

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (块进行编码) .
CN1857001A
CLAIM 23
. 用于对数字编码的视频帧序列进行压缩的方法,其中某些帧的信息内容根据前导或后继帧(参考帧)的内容进行编码,该方法进一步包括如下步骤:a . 将所述的待编码帧划分成块,b . 在包含所述待编码块的帧之前或后续的参考图像中为给定的待编码块搜索一个匹配的参考块,c . 通过将匹配的参考块与待编码块进行比较来执行一个可压缩性分析,d . 根据可压缩性分析的结果选择最佳参考块,以及e . 使用刚刚选择的最佳参考块对所述的块进行编码 (video blocks) ,其特征在于,在步骤b中,在搜索参考块的过程中:i)将待编码块划分成子块,ii)分析子块的内容,iii)根据预定义的标准,选择预定数量的子块,优选地,至少为两块,iv)使用所选择的子块执行参考块搜索,所述的搜索是在所选择的参考帧中的特定搜索范围内执行的,以便搜索包含与所选择的子块区别最小的子块的参考块,在所述的搜索过程中,所选择块的相对位置保持不变,以及v)使用所选择的子块来选择最佳参考块作为搜索结果。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
WO2004104930A2

Filed: 2004-05-20     Issued: 2004-12-02

Hybrid video compression method

(Original Assignee) Amt Advanced Multimedia Technology Ab     

Peter Toth, József KÁROLY
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (given block) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (subsequent frame) ;

generate syntax information (processing element, data elements) for the coded unit wherein the syntax information includes : a first syntax (processing element, data elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (processing element, data elements) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding (square partitions) , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 4
. The method according to Claim 3 , characterised by that the variance an using the following formula : M M ∑ pixel - (∑ pixel j) 2 variance = J=0 j=0 M where M is the number of elements in the given block (video blocks) or sub-block and pixel(i) is an element of the uncompressed (original) block , with the computed variance value being compared with a given threshold value to establish if the variance exceeds said given threshold value .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frame (block pattern) s (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

WO2004104930A2
CLAIM 35
. Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ;
characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network .

WO2004104930A2
CLAIM 46
. The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (processing element, data elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks (given block) .
WO2004104930A2
CLAIM 4
. The method according to Claim 3 , characterised by that the variance an using the following formula : M M ∑ pixel - (∑ pixel j) 2 variance = J=0 j=0 M where M is the number of elements in the given block (video blocks) or sub-block and pixel(i) is an element of the uncompressed (original) block , with the computed variance value being compared with a given threshold value to establish if the variance exceeds said given threshold value .

WO2004104930A2
CLAIM 35
. Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ;
characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network .

WO2004104930A2
CLAIM 46
. The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (given block) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (subsequent frame) ;

generating syntax information (processing element, data elements) for the coded unit , wherein the syntax information includes : a first syntax (processing element, data elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (processing element, data elements) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding (square partitions) , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 4
. The method according to Claim 3 , characterised by that the variance an using the following formula : M M ∑ pixel - (∑ pixel j) 2 variance = J=0 j=0 M where M is the number of elements in the given block (video blocks) or sub-block and pixel(i) is an element of the uncompressed (original) block , with the computed variance value being compared with a given threshold value to establish if the variance exceeds said given threshold value .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frame (block pattern) s (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

WO2004104930A2
CLAIM 35
. Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ;
characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network .

WO2004104930A2
CLAIM 46
. The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (processing element, data elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks (given block) .
WO2004104930A2
CLAIM 4
. The method according to Claim 3 , characterised by that the variance an using the following formula : M M ∑ pixel - (∑ pixel j) 2 variance = J=0 j=0 M where M is the number of elements in the given block (video blocks) or sub-block and pixel(i) is an element of the uncompressed (original) block , with the computed variance value being compared with a given threshold value to establish if the variance exceeds said given threshold value .

WO2004104930A2
CLAIM 35
. Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ;
characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network .

WO2004104930A2
CLAIM 46
. The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window .

EP2347591B1
CLAIM 7
A computer-readable medium comprising instructions for causing a programmable processor (processing element, data elements) to perform the methods of any of claims 4-6 .
WO2004104930A2
CLAIM 35
. Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ;
characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network .

WO2004104930A2
CLAIM 46
. The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20070053436A1

Filed: 2004-05-04     Issued: 2007-03-08

Encoding video information using block based adaptive scan order

(Original Assignee) Koninklijke Philips NV     (Current Assignee) Koninklijke Philips NV

Lambertus Van Eggelen
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (preceding image) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (corresponding data blocks) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20070053436A1
CLAIM 2
. A method according to claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ;
spatial scaling aspect ratio of one or more image frames present in the video information ;
pulldown material being present in the data of one or more of the image frames ;
one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ;
a degree of temporal motion occurring in a series of the image frames ;
and statistical data relating to earlier selected scanning routes and their associated data compression performance .

US20070053436A1
CLAIM 3
. A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (preceding image) .
US20070053436A1
CLAIM 2
. A method according to claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ;
spatial scaling aspect ratio of one or more image frames present in the video information ;
pulldown material being present in the data of one or more of the image frames ;
one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ;
a degree of temporal motion occurring in a series of the image frames ;
and statistical data relating to earlier selected scanning routes and their associated data compression performance .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (preceding image) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (corresponding data blocks) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20070053436A1
CLAIM 2
. A method according to claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ;
spatial scaling aspect ratio of one or more image frames present in the video information ;
pulldown material being present in the data of one or more of the image frames ;
one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ;
a degree of temporal motion occurring in a series of the image frames ;
and statistical data relating to earlier selected scanning routes and their associated data compression performance .

US20070053436A1
CLAIM 3
. A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (preceding image) .
US20070053436A1
CLAIM 2
. A method according to claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ;
spatial scaling aspect ratio of one or more image frames present in the video information ;
pulldown material being present in the data of one or more of the image frames ;
one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ;
a degree of temporal motion occurring in a series of the image frames ;
and statistical data relating to earlier selected scanning routes and their associated data compression performance .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20040234144A1

Filed: 2004-02-18     Issued: 2004-11-25

Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program

(Original Assignee) NTT Docomo Inc     (Current Assignee) NTT Docomo Inc

Kazuo Sugimoto, Satoru Adachi, Sadaatsu Kato, Minoru Etoh, Mitsuru Kobayashi, Hiroyuki Yamaguchi
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (absolute values) , the minimum size value indicating a size of the smallest partition for the coded unit .
US20040234144A1
CLAIM 1
. An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US20040234144A1
CLAIM 6
. The image encoding apparatus according to claim 1 , wherein , on the occasion of encoding the quantized transform coefficients in the coding target macroblock , the coding procedure provider provides a coding procedure of setting a smaller deviation of code lengths with increase in absolute values (minimum size value) of the quantized transform coefficients in a neighboring macroblock to the coding target macroblock .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (absolute values) , the minimum size value indicating a size of a smallest partition for the coded unit .
US20040234144A1
CLAIM 1
. An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US20040234144A1
CLAIM 6
. The image encoding apparatus according to claim 1 , wherein , on the occasion of encoding the quantized transform coefficients in the coding target macroblock , the coding procedure provider provides a coding procedure of setting a smaller deviation of code lengths with increase in absolute values (minimum size value) of the quantized transform coefficients in a neighboring macroblock to the coding target macroblock .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20050276323A1

Filed: 2003-09-26     Issued: 2005-12-15

Real-time video coding/decoding

(Original Assignee) Vanguard Video LLC     (Current Assignee) Beamr Imaging Ltd

Alexey Martemyanov, Nick Terterov, Alexander Zheludkov, Alexey Dolgoborodov, Vladimir Semenyuk, Eugene Neimark, Irena Terterov
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients, entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax (motion vector refinement, performing texture, entropy encoding, based mode) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (motion vector refinement, performing texture, entropy encoding, based mode) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20050276323A1
CLAIM 1
. A video codec for encoding/decoding digitized sequence of video frames with high compression efficiency , comprising : means for frame encoding ;
means for setting and storing codec setting parameters ;
means for controlling desired frame encoding time and CPU loading ;
means for rate control including size of frame encoding output bitstream ;
and means for arithmetic coding of quantized transform coefficients (square partitions) .

US20050276323A1
CLAIM 8
. The video codec of claim 7 , comprising means for macroblock encoding , comprising : means for intra prediction , at least one means for inter prediction , means for selecting macroblock type and encoding setting , means for calculating macroblock texture prediction and prediction error , means for performing texture (second syntax, first syntax, first syntax element) prediction error transform and transform coefficient quantization ;
means calculating motion vector prediction and prediction error ;
and means for entropy encoding (second syntax, first syntax, first syntax element) providing arithmetic context encoding of motion vectors , header parameters and transform coefficients .

US20050276323A1
CLAIM 24
. The method of claim 23 , comprising wavelet transform , wherein resulting wavelet transform coefficients are compressed by context-based entropy coding (square partitions) .

US20050276323A1
CLAIM 30
. The method of claim 28 , wherein the motion estimation comprises : calculating motion vectors MV(wb , hb , CF , RF) with integer-pel accuracy using previously calculated motion data ;
calculating motion vectors MV(wb , hb , CF , RF) [block x][block y] performing inverse logarithmic motion search with parameters block x , block y , current range ;
performing motion vector refinement (second syntax, first syntax, first syntax element) choosing from sets of neighboring motion vectors MVNeiborhood (wb , hb , CF , RF)[block x][block y] elements (mvx , mvy) that provide minimum value of motion vector weight function Q(mvx , mvy , CF , RF , wb , hb , block x , block y) ;
performing motion vector estimation with quarter-pel accuracy based on results of motion vector estimation with integer-pel accuracy by changing components of the integer-pel accuracy motion vector MV(wb , hb , CF , RF) [block x][block y] in range [−¾ ;
+¾] with a step ¼ ;
and calculating motion vectors MV(wb , hb , CF , RF) with quarter-pel accuracy by sequentially applying the motion estimation steps with integer-pel accuracy and with quarter-pel accuracy , wherein CF—current frame with horizontal coordinate x and vertical coordinate y ;
RF—reference frame with horizontal coordinate x and vertical coordinate y ;
wb—width of the blocks for which motion estimation is performed ;
hb—height of the blocks for which motion estimation is performed ;
W—a multiple of wb , current and reference frame width ;
H—a multiple of hb , current and reference frame height ;
Q(mvx , mvy , CF , RF , wb , hb , block x , block y)—motion vector weight calculation function ;
MV(wb , hb , CF , RF)[block x][block y]—motion vector (i . e . pair (mvx , mvy) of integers) corresponding to the frame CF and reference frame RF for a block of width wb , height hb , which left-top corner is located at a pixel with horizontal coordinate block x and vertical coordinate block y ;
MV(wb , hb , CF , RF)—a set of motion vectors MV(wb , hb , CF , RF)[block x][block y] for : block x=0 , wb , 2 wb , 3 wb , . . . , block x< ;
W , and block y=0 , hb , 2·hb , 3·hb , . . . , block y< ;
H . MVNeighborhood (wb , hb , CF , RF)[block x][block y]—a set of neighboring motion vectors MV(wb , hb , CF , RF)[nx][ny] , where nx may be equal to block x−wb , block x , block x+wb , and ny may be equal to block y−hb , block y , block y+hb , and nx≧0 , ny≧0 , nx≦W−wb , ny≦H−hb .

US20050276323A1
CLAIM 34
. A method of decoding of sequence of video frames encoded according to claim 13 , comprising steps of : arithmetic decoding ;
decoding coded block pattern (block pattern) of macroblock mode and texture using arithmetic context-based mode (second syntax, first syntax, first syntax element) ling ;
decoding texture prediction error using arithmetic context-based modeling ;
calculating prediction for motion vectors ;
and decoding motion vectors using context-based arithmetic modeling .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients, entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (motion vector refinement, performing texture, entropy encoding, based mode) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (motion vector refinement, performing texture, entropy encoding, based mode) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20050276323A1
CLAIM 1
. A video codec for encoding/decoding digitized sequence of video frames with high compression efficiency , comprising : means for frame encoding ;
means for setting and storing codec setting parameters ;
means for controlling desired frame encoding time and CPU loading ;
means for rate control including size of frame encoding output bitstream ;
and means for arithmetic coding of quantized transform coefficients (square partitions) .

US20050276323A1
CLAIM 8
. The video codec of claim 7 , comprising means for macroblock encoding , comprising : means for intra prediction , at least one means for inter prediction , means for selecting macroblock type and encoding setting , means for calculating macroblock texture prediction and prediction error , means for performing texture (second syntax, first syntax, first syntax element) prediction error transform and transform coefficient quantization ;
means calculating motion vector prediction and prediction error ;
and means for entropy encoding (second syntax, first syntax, first syntax element) providing arithmetic context encoding of motion vectors , header parameters and transform coefficients .

US20050276323A1
CLAIM 24
. The method of claim 23 , comprising wavelet transform , wherein resulting wavelet transform coefficients are compressed by context-based entropy coding (square partitions) .

US20050276323A1
CLAIM 30
. The method of claim 28 , wherein the motion estimation comprises : calculating motion vectors MV(wb , hb , CF , RF) with integer-pel accuracy using previously calculated motion data ;
calculating motion vectors MV(wb , hb , CF , RF) [block x][block y] performing inverse logarithmic motion search with parameters block x , block y , current range ;
performing motion vector refinement (second syntax, first syntax, first syntax element) choosing from sets of neighboring motion vectors MVNeiborhood (wb , hb , CF , RF)[block x][block y] elements (mvx , mvy) that provide minimum value of motion vector weight function Q(mvx , mvy , CF , RF , wb , hb , block x , block y) ;
performing motion vector estimation with quarter-pel accuracy based on results of motion vector estimation with integer-pel accuracy by changing components of the integer-pel accuracy motion vector MV(wb , hb , CF , RF) [block x][block y] in range [−¾ ;
+¾] with a step ¼ ;
and calculating motion vectors MV(wb , hb , CF , RF) with quarter-pel accuracy by sequentially applying the motion estimation steps with integer-pel accuracy and with quarter-pel accuracy , wherein CF—current frame with horizontal coordinate x and vertical coordinate y ;
RF—reference frame with horizontal coordinate x and vertical coordinate y ;
wb—width of the blocks for which motion estimation is performed ;
hb—height of the blocks for which motion estimation is performed ;
W—a multiple of wb , current and reference frame width ;
H—a multiple of hb , current and reference frame height ;
Q(mvx , mvy , CF , RF , wb , hb , block x , block y)—motion vector weight calculation function ;
MV(wb , hb , CF , RF)[block x][block y]—motion vector (i . e . pair (mvx , mvy) of integers) corresponding to the frame CF and reference frame RF for a block of width wb , height hb , which left-top corner is located at a pixel with horizontal coordinate block x and vertical coordinate block y ;
MV(wb , hb , CF , RF)—a set of motion vectors MV(wb , hb , CF , RF)[block x][block y] for : block x=0 , wb , 2 wb , 3 wb , . . . , block x< ;
W , and block y=0 , hb , 2·hb , 3·hb , . . . , block y< ;
H . MVNeighborhood (wb , hb , CF , RF)[block x][block y]—a set of neighboring motion vectors MV(wb , hb , CF , RF)[nx][ny] , where nx may be equal to block x−wb , block x , block x+wb , and ny may be equal to block y−hb , block y , block y+hb , and nx≧0 , ny≧0 , nx≦W−wb , ny≦H−hb .

US20050276323A1
CLAIM 34
. A method of decoding of sequence of video frames encoded according to claim 13 , comprising steps of : arithmetic decoding ;
decoding coded block pattern (block pattern) of macroblock mode and texture using arithmetic context-based mode (second syntax, first syntax, first syntax element) ling ;
decoding texture prediction error using arithmetic context-based modeling ;
calculating prediction for motion vectors ;
and decoding motion vectors using context-based arithmetic modeling .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
CN101087424A

Filed: 2003-05-27     Issued: 2007-12-12

图像内预测模式估计、通信和组织的方法和系统

(Original Assignee) 夏普株式会社     

孙式军, L·J·柯罗夫斯基
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (的左侧) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (平均值) , the minimum size value indicating a size of the smallest partition for the coded unit .
CN101087424A
CLAIM 2
. 一种用于估计象素内预测模式的方法,该方法包括:用于使用邻接块中的一个象素的象素值来预测要被预测的目标块的象素值的内预测步骤;以及用于估计用于目标块的预测模式的预测模式估计步骤,其中所述内预测步骤使用至少一个使用等于邻接于目标块并且在目标块之上的第一个块的象素值的预测值的垂直预测模式、使用等于邻接于目标块的左侧 (rectangular partitions) 的第二个块的象素值的预测值的水平预测模式和使用等于第一个块和第二个块的象素值的平均值 (minimum size value) 的预测值的DC预测模式,利用逐渐增加的数字按照垂直预测模式、水平预测模式和DC预测模式的顺序对于预测模式进行编号,以及预测模式估计步骤确定一个具有第一个块的预测模式和第二个块的预测模式中的较低模式编号的预测模式作为用于目标块的预测模式。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (的左侧) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (平均值) , the minimum size value indicating a size of a smallest partition for the coded unit .
CN101087424A
CLAIM 2
. 一种用于估计象素内预测模式的方法,该方法包括:用于使用邻接块中的一个象素的象素值来预测要被预测的目标块的象素值的内预测步骤;以及用于估计用于目标块的预测模式的预测模式估计步骤,其中所述内预测步骤使用至少一个使用等于邻接于目标块并且在目标块之上的第一个块的象素值的预测值的垂直预测模式、使用等于邻接于目标块的左侧 (rectangular partitions) 的第二个块的象素值的预测值的水平预测模式和使用等于第一个块和第二个块的象素值的平均值 (minimum size value) 的预测值的DC预测模式,利用逐渐增加的数字按照垂直预测模式、水平预测模式和DC预测模式的顺序对于预测模式进行编号,以及预测模式估计步骤确定一个具有第一个块的预测模式和第二个块的预测模式中的较低模式编号的预测模式作为用于目标块的预测模式。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
CN1457196A

Filed: 2003-04-25     Issued: 2003-11-19

基于时空域相关性运动矢量预测的视频编码方法

(Original Assignee) 北京工业大学     

薛金柱, 沈兰荪
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (一个视频) , the minimum size value indicating a size of the smallest partition for the coded unit .
CN1457196A
CLAIM 1
. 基于时空域相关性运动矢量预测的视频编码方法,是由摄像机(1),将目标物状态转换成视频信号置于采集卡(2)中;采集卡将视频信号转换成数字视频序列,并存放在视频缓存中,这些数字视频序列或是以视频序列文件形式存放在计算机硬盘中的视频数据,作为该系统的输入用于压缩;计算机(3)存放原始视频序列和执行视频编码程序,并生成压缩后的码流文件,本发明的特征在于:计算机首先从采集卡的视频存储器或存放在计算机硬盘上的视频序列文件中读一帧视频数据到计算机的缓冲区中,通过执行视频编码子程序对该帧进行编码;视频编码子程序在视频编码时,一个输入的视频序列帧被分为16×16的宏块,编码过程是以宏块为单位进行的;计算机开始执行视频编码子程序后,首先对编码进行初始化;然后计算机按照从上到下、从左到右的次序先对第一个宏块做运动估计与补偿;运动估计与补偿之后得到该宏块的运动矢量和残差图像,对其中的运动矢量,首先执行运动矢量子程序进行预测,然后对预测差值进行上下文自适应的变长编码(CAVLC);而对残差图像则首先进行DCT变换,然后对DCT系数进行量化和CAVLC编码;之后输出该宏块的压缩码流放入计算机的缓冲区中;最后进行解码、生成相应参考宏块;该宏块编码结束后,对当前编码帧的所有宏块循环执行上述编码过程,完成对一个视频 (minimum size value) 序列帧的编码;一帧编码结束后循环编码下一帧,至到视频序列的最后一帧编码结束,并生成压缩后的码流文件,系统执行程序结束;在上述的视频编码子程序中,运动矢量预测子程序中所采用的预测方法,是根据相邻宏块间运动相关性的不同,综合利用了时间和空间域上的相关性对其进行预测编码的。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (基于运动) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (一个视频) , the minimum size value indicating a size of a smallest partition for the coded unit .
CN1457196A
CLAIM 1
. 基于时空域相关性运动矢量预测的视频编码方法,是由摄像机(1),将目标物状态转换成视频信号置于采集卡(2)中;采集卡将视频信号转换成数字视频序列,并存放在视频缓存中,这些数字视频序列或是以视频序列文件形式存放在计算机硬盘中的视频数据,作为该系统的输入用于压缩;计算机(3)存放原始视频序列和执行视频编码程序,并生成压缩后的码流文件,本发明的特征在于:计算机首先从采集卡的视频存储器或存放在计算机硬盘上的视频序列文件中读一帧视频数据到计算机的缓冲区中,通过执行视频编码子程序对该帧进行编码;视频编码子程序在视频编码时,一个输入的视频序列帧被分为16×16的宏块,编码过程是以宏块为单位进行的;计算机开始执行视频编码子程序后,首先对编码进行初始化;然后计算机按照从上到下、从左到右的次序先对第一个宏块做运动估计与补偿;运动估计与补偿之后得到该宏块的运动矢量和残差图像,对其中的运动矢量,首先执行运动矢量子程序进行预测,然后对预测差值进行上下文自适应的变长编码(CAVLC);而对残差图像则首先进行DCT变换,然后对DCT系数进行量化和CAVLC编码;之后输出该宏块的压缩码流放入计算机的缓冲区中;最后进行解码、生成相应参考宏块;该宏块编码结束后,对当前编码帧的所有宏块循环执行上述编码过程,完成对一个视频 (minimum size value) 序列帧的编码;一帧编码结束后循环编码下一帧,至到视频序列的最后一帧编码结束,并生成压缩后的码流文件,系统执行程序结束;在上述的视频编码子程序中,运动矢量预测子程序中所采用的预测方法,是根据相邻宏块间运动相关性的不同,综合利用了时间和空间域上的相关性对其进行预测编码的。

CN1457196A
CLAIM 2
. 根据权利要求1所述的基于时空域相关性运动矢量预测的视频编码方法,其特征在于,视频编码子程序采用了基于运动 (generating syntax information) 补偿/DCT的混合编码方法,除运动矢量预测子程序这一部分外,采用了国际视频编码标准H . 264 JM6 . 0的编码框架。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
EP1478190A1

Filed: 2003-04-25     Issued: 2004-11-17

Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program

(Original Assignee) NTT Docomo Inc     (Current Assignee) NTT Docomo Inc

Satoru c/o NTT DoCoMo Inc. ADACHI, Minoru c/o NTT DoCoMo Inc. ETOH, Sadaatsu c/o NTT DoCoMo Inc. KATO, Mitsuru c/o NTT DoCoMo Inc. KOBAYASHI, Kazuo c/o NTT DoCoMo Inc. SUGIMOTO, Hiroyuki NTT DoCoMo Inc. YAMAGUCHI
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (absolute values) , the minimum size value indicating a size of the smallest partition for the coded unit .
EP1478190A1
CLAIM 1
An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
   wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

EP1478190A1
CLAIM 6
The image encoding apparatus according to Claim 1 , wherein , on the occasion of encoding the quantized transform coefficients in the coding target macroblock , the coding procedure provider provides a coding procedure of setting a smaller deviation of code lengths with increase in absolute values (minimum size value) of the quantized transform coefficients in a neighboring macroblock to the coding target macroblock .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (absolute values) , the minimum size value indicating a size of a smallest partition for the coded unit .
EP1478190A1
CLAIM 1
An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
   wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

EP1478190A1
CLAIM 6
The image encoding apparatus according to Claim 1 , wherein , on the occasion of encoding the quantized transform coefficients in the coding target macroblock , the coding procedure provider provides a coding procedure of setting a smaller deviation of code lengths with increase in absolute values (minimum size value) of the quantized transform coefficients in a neighboring macroblock to the coding target macroblock .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
CN1647541A

Filed: 2003-04-23     Issued: 2005-07-27

用于在视频编码系统中指示量化器参数的方法与设备

(Original Assignee) 诺基亚有限公司     

J·莱恩马
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax (多个分) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (一个视频) , the minimum size value indicating a size of the smallest partition for the coded unit .
CN1647541A
CLAIM 6
. 根据权利要求5的编码方法,其特征在于:已编码比特流从一个视频 (minimum size value) 编码设备发射到一个相应的视频解码设备。

CN1647541A
CLAIM 13
. 根据权利要求8到12任何一个的编码方法,其特征在于:所述数字视频序列的一帧被划分到多个像素块,所述多个像素块被归组到一个或多个分 (first syntax) 段,并且其中应用到所述转换系数值组的量化级从一帧的一分段到另一分段进行调整,以使对于一帧的一个特定分段,应用到转换系数组的实际量化级不同于缺省量化级。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (多个分) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (一个视频) , the minimum size value indicating a size of a smallest partition for the coded unit .
CN1647541A
CLAIM 6
. 根据权利要求5的编码方法,其特征在于:已编码比特流从一个视频 (minimum size value) 编码设备发射到一个相应的视频解码设备。

CN1647541A
CLAIM 13
. 根据权利要求8到12任何一个的编码方法,其特征在于:所述数字视频序列的一帧被划分到多个像素块,所述多个像素块被归组到一个或多个分 (first syntax) 段,并且其中应用到所述转换系数值组的量化级从一帧的一分段到另一分段进行调整,以使对于一帧的一个特定分段,应用到转换系数组的实际量化级不同于缺省量化级。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20030179940A1

Filed: 2003-03-19     Issued: 2003-09-25

Efficient macroblock header coding for video compression

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Chih-Lung Lin, Ming-Chieh Lee
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (color values) or into four square partitions (chrominance values) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (length coding table, block pattern) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (chrominance values) , the minimum size value indicating a size of the smallest partition for the coded unit .
US20030179940A1
CLAIM 1
. In a video coder for coding video images in a block format , a method for improving compression of the video images comprising : for a macroblock in a video frame , determining whether texture values for the color values (rectangular partitions) of the macroblock are coded and setting the coded block parameters corresponding to the colors to indicate whether or not the texture values are coded ;
forming a combined parameter representing all of the coded block parameters for the macroblock ;
determining a single variable length code for the combined parameter of the macroblock ;
and repeating the above-steps for macroblocks in the video image .

US20030179940A1
CLAIM 2
. The method of claim 1 wherein : the texture values are chrominance values (four square partitions, minimum size value) U and V , and luminance values Y , the macroblock includes one block for U , one block for V and foul blocks for Y ;
and the coded block parameters include one bit each for U and V indicating whether the corresponding U and V blocks are coded , and four bits for Y indicating whether the four corresponding Y blocks are coded .

US20030179940A1
CLAIM 4
. The method of claim 1 further including : selecting a predictor for the coded block parameters ;
and computing an exclusive OR between the predictor and the coded block parameters to compute predicted coded block parameters , the predicted coded block parameters forming at least a part of the combined coded block parameter for the macroblock ;
wherein the step of determining the single variable length code includes looking up the combined coded block parameter in a variable length coding table (block pattern) to find the single variable length code for the combined coded block parameter .

US20030179940A1
CLAIM 13
. The method of claim 12 wherein the first variable length coding table stores entries for variable length codes , each representing a combined macroblock parameter that includes coded block pattern (block pattern) s for chrominance and luminance ;
and wherein the second variable length coding table stores entries for variable length codes , each representing a combined macroblock parameter that includes coded block patterns for chrominance and luminance .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (color values) or into four square partitions (chrominance values) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (length coding table, block pattern) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (chrominance values) , the minimum size value indicating a size of a smallest partition for the coded unit .
US20030179940A1
CLAIM 1
. In a video coder for coding video images in a block format , a method for improving compression of the video images comprising : for a macroblock in a video frame , determining whether texture values for the color values (rectangular partitions) of the macroblock are coded and setting the coded block parameters corresponding to the colors to indicate whether or not the texture values are coded ;
forming a combined parameter representing all of the coded block parameters for the macroblock ;
determining a single variable length code for the combined parameter of the macroblock ;
and repeating the above-steps for macroblocks in the video image .

US20030179940A1
CLAIM 2
. The method of claim 1 wherein : the texture values are chrominance values (four square partitions, minimum size value) U and V , and luminance values Y , the macroblock includes one block for U , one block for V and foul blocks for Y ;
and the coded block parameters include one bit each for U and V indicating whether the corresponding U and V blocks are coded , and four bits for Y indicating whether the four corresponding Y blocks are coded .

US20030179940A1
CLAIM 4
. The method of claim 1 further including : selecting a predictor for the coded block parameters ;
and computing an exclusive OR between the predictor and the coded block parameters to compute predicted coded block parameters , the predicted coded block parameters forming at least a part of the combined coded block parameter for the macroblock ;
wherein the step of determining the single variable length code includes looking up the combined coded block parameter in a variable length coding table (block pattern) to find the single variable length code for the combined coded block parameter .

US20030179940A1
CLAIM 13
. The method of claim 12 wherein the first variable length coding table stores entries for variable length codes , each representing a combined macroblock parameter that includes coded block pattern (block pattern) s for chrominance and luminance ;
and wherein the second variable length coding table stores entries for variable length codes , each representing a combined macroblock parameter that includes coded block patterns for chrominance and luminance .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US20030156648A1

Filed: 2002-12-17     Issued: 2003-08-21

Sub-block transform coding of prediction residuals

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Thomas Holcomb, Chih-Lung Lin
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (motion prediction, main data) of size NxN , where N is an integer multiple (n bits) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (motion prediction, main data) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US20030156648A1
CLAIM 15
. A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 1 during video encoding (second syntax) .

US20030156648A1
CLAIM 20
. The method of claim 19 wherein for at least part of the prediction residual data the selected transform size is different than motion prediction (video blocks, four square partitions) block size .

US20030156648A1
CLAIM 29
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : evaluating plural available transform sizes for spatial domain data (video blocks, four square partitions) for one or more video frames ;
selecting one or more of the plural available transform sizes based at least in part upon results of the evaluating ;
and encoding the spatial domain data using transforms of the selected one or more transform sizes .

US20030156648A1
CLAIM 37
. In a computer system , a computer-implemented method of processing one or more video frames , the method comprising : switching transform sizes from among plural available transform sizes for data for the one or more video frames , wherein bits (integer multiple) tream syntax allows the switching at plural different bitstream levels for the one or more video frames ;
and processing the data for the one or more video frames with one or more transforms of switched transform size .

US20030156648A1
CLAIM 46
. A video encoder (second syntax element) comprising : means for switching frequency transform sizes from among plural available frequency transform sizes for prediction residual data for one or more motion-predicted video frames , wherein bitstream syntax allows the switching at plural different bitstream levels for the one or more motion-predicted video frames ;
and means for processing the prediction residual data with one or more frequency transforms of switched frequency transform size .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion prediction, main data) .
US20030156648A1
CLAIM 20
. The method of claim 19 wherein for at least part of the prediction residual data the selected transform size is different than motion prediction (video blocks, four square partitions) block size .

US20030156648A1
CLAIM 29
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : evaluating plural available transform sizes for spatial domain data (video blocks, four square partitions) for one or more video frames ;
selecting one or more of the plural available transform sizes based at least in part upon results of the evaluating ;
and encoding the spatial domain data using transforms of the selected one or more transform sizes .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (motion prediction, main data) of size NxN , where N is an integer multiple (n bits) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (motion prediction, main data) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US20030156648A1
CLAIM 15
. A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 1 during video encoding (second syntax) .

US20030156648A1
CLAIM 20
. The method of claim 19 wherein for at least part of the prediction residual data the selected transform size is different than motion prediction (video blocks, four square partitions) block size .

US20030156648A1
CLAIM 29
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : evaluating plural available transform sizes for spatial domain data (video blocks, four square partitions) for one or more video frames ;
selecting one or more of the plural available transform sizes based at least in part upon results of the evaluating ;
and encoding the spatial domain data using transforms of the selected one or more transform sizes .

US20030156648A1
CLAIM 37
. In a computer system , a computer-implemented method of processing one or more video frames , the method comprising : switching transform sizes from among plural available transform sizes for data for the one or more video frames , wherein bits (integer multiple) tream syntax allows the switching at plural different bitstream levels for the one or more video frames ;
and processing the data for the one or more video frames with one or more transforms of switched transform size .

US20030156648A1
CLAIM 46
. A video encoder (second syntax element) comprising : means for switching frequency transform sizes from among plural available frequency transform sizes for prediction residual data for one or more motion-predicted video frames , wherein bitstream syntax allows the switching at plural different bitstream levels for the one or more motion-predicted video frames ;
and means for processing the prediction residual data with one or more frequency transforms of switched frequency transform size .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion prediction, main data) .
US20030156648A1
CLAIM 20
. The method of claim 19 wherein for at least part of the prediction residual data the selected transform size is different than motion prediction (video blocks, four square partitions) block size .

US20030156648A1
CLAIM 29
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : evaluating plural available transform sizes for spatial domain data (video blocks, four square partitions) for one or more video frames ;
selecting one or more of the plural available transform sizes based at least in part upon results of the evaluating ;
and encoding the spatial domain data using transforms of the selected one or more transform sizes .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US7200275B2

Filed: 2002-12-16     Issued: 2007-04-03

Skip macroblock coding

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Sridhar Srinivasan, Pohsiang Hsu
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US7200275B2
CLAIM 2
. A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 1 during video encoding (second syntax) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US7200275B2
CLAIM 2
. A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 1 during video encoding (second syntax) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US7206456B2

Filed: 2002-11-26     Issued: 2007-04-17

Video coding and decoding

(Original Assignee) Nokia Oyj     (Current Assignee) Nokia Technologies Oy

Miska Hannuksela, Ye-Kui Wang
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (coding elements) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US7206456B2
CLAIM 1
. A method of video encoding (second syntax) comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ;
determining at least one shape within a picture ;
selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
determining as at least one separate coding object the selected at least one subset of the coding blocks ;
determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
encoding the at least one separate coding object ;
and encoding as one coding object the background object , wherein the encoding the background coding object comprises the sub-step of defining coding slices in a scan-order so that the slices are composed by consecutive coding blocks skipping those basic coding objects which are included in the at least one separate coding object .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (coding elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
US7206456B2
CLAIM 1
. A method of video encoding comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ;
determining at least one shape within a picture ;
selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
determining as at least one separate coding object the selected at least one subset of the coding blocks ;
determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
encoding the at least one separate coding object ;
and encoding as one coding object the background object , wherein the encoding the background coding object comprises the sub-step of defining coding slices in a scan-order so that the slices are composed by consecutive coding blocks skipping those basic coding objects which are included in the at least one separate coding object .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (coding elements) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US7206456B2
CLAIM 1
. A method of video encoding (second syntax) comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ;
determining at least one shape within a picture ;
selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
determining as at least one separate coding object the selected at least one subset of the coding blocks ;
determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
encoding the at least one separate coding object ;
and encoding as one coding object the background object , wherein the encoding the background coding object comprises the sub-step of defining coding slices in a scan-order so that the slices are composed by consecutive coding blocks skipping those basic coding objects which are included in the at least one separate coding object .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (coding elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
US7206456B2
CLAIM 1
. A method of video encoding comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ;
determining at least one shape within a picture ;
selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
determining as at least one separate coding object the selected at least one subset of the coding blocks ;
determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
encoding the at least one separate coding object ;
and encoding as one coding object the background object , wherein the encoding the background coding object comprises the sub-step of defining coding slices in a scan-order so that the slices are composed by consecutive coding blocks skipping those basic coding objects which are included in the at least one separate coding object .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
JP2002359852A

Filed: 2002-03-22     Issued: 2002-12-13

画像予測復号化装置及び方法

(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     

Sen Bun Chun, Mei Shen Shen, Thiow Keng Tan, メイ・シェン シェン, セン・ブン チュン, ケン・タン ティオ
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size (量子化ステップサイズ) value , the minimum size value indicating a size of the smallest partition for the coded unit .
JP2002359852A
CLAIM 1
【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と (second syntax) 、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size (量子化ステップサイズ) value , the minimum size value indicating a size of a smallest partition for the coded unit .
JP2002359852A
CLAIM 1
【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と (second syntax) 、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
JPH11164305A

Filed: 1998-02-25     Issued: 1999-06-18

動画像符号化方法、動画像符号化装置および動画像復号装置

(Original Assignee) Mitsubishi Electric Corp; 三菱電機株式会社     

Kotaro Asai, Yoshimi Isu, Shunichi Sekiguchi, 芳美 井須, 光太郎 浅井, 俊一 関口
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (のクラス) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (示す複数) , the minimum size value indicating a size of the smallest partition for the coded unit .
JPH11164305A
CLAIM 8
【請求項8】 前記分割処理部は、 各領域に関し、その領域の画像の特性を示す複数 (minimum size value) の数値 の線形和をその領域のアクティビティとして算出するア クティビティ算出部と、 算出されたアクティビティとあらかじめ設定された基準 値を比較する分割判定部と、 を備え、比較の結果、アクティビティが基準値を上回る 領域がさらに小さな領域へ分割されることを特徴とする 請求項4、5のいずれかに記載の動画像符号化装置。

JPH11164305A
CLAIM 12
【請求項12】 前記クラス識別部は、複数の領域にま たがる被写体構造に注目して各領域のクラス (rectangular partitions) を決定する ことを特徴とする請求項11に記載の動画像符号化装 置。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (のクラス) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (示す複数) , the minimum size value indicating a size of a smallest partition for the coded unit .
JPH11164305A
CLAIM 8
【請求項8】 前記分割処理部は、 各領域に関し、その領域の画像の特性を示す複数 (minimum size value) の数値 の線形和をその領域のアクティビティとして算出するア クティビティ算出部と、 算出されたアクティビティとあらかじめ設定された基準 値を比較する分割判定部と、 を備え、比較の結果、アクティビティが基準値を上回る 領域がさらに小さな領域へ分割されることを特徴とする 請求項4、5のいずれかに記載の動画像符号化装置。

JPH11164305A
CLAIM 12
【請求項12】 前記クラス識別部は、複数の領域にま たがる被写体構造に注目して各領域のクラス (rectangular partitions) を決定する ことを特徴とする請求項11に記載の動画像符号化装 置。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5748789A

Filed: 1996-10-31     Issued: 1998-05-05

Transparent block skipping in object-based video coding systems

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Ming-Chieh Lee, Wei-ge Chen
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US5748789A
CLAIM 17
. In an object-based video coding method , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a transformation block partially covered by a first object , evaluating the shape of the first object to determine whether a sub-transformation block in the partially covered transformation block is transparent based on the shape of the first object , and if so , then skipping texture coding for the sub-transformation block and encoding a coded block pattern (block pattern) bit or bits only for non-transparent sub-transformation blocks in the partially covered transformation block ;
and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a transformation block partially covered by the first object , evaluating the shape of the first object to determine whether a sub-transformation block in the partially covered transformation block is transparent based on the shape of the first object , and if so , then skipping texture decoding for the sub-transformation block .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ;

generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US5748789A
CLAIM 1
. In an object-based video coding (generating syntax information) method , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a first object , evaluating the shape of the first object to determine whether a transformation block in the first object is transparent based on the shape for the first object , and if so , then skipping texture coding for the transformation block ;
and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a first object , evaluating whether a transformation block in the first object is transparent based on the shape of the first object , and if so , then skipping texture decoding for the transformation block .

US5748789A
CLAIM 17
. In an object-based video coding method , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a transformation block partially covered by a first object , evaluating the shape of the first object to determine whether a sub-transformation block in the partially covered transformation block is transparent based on the shape of the first object , and if so , then skipping texture coding for the sub-transformation block and encoding a coded block pattern (block pattern) bit or bits only for non-transparent sub-transformation blocks in the partially covered transformation block ;
and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a transformation block partially covered by the first object , evaluating the shape of the first object to determine whether a sub-transformation block in the partially covered transformation block is transparent based on the shape of the first object , and if so , then skipping texture decoding for the sub-transformation block .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US6125143A

Filed: 1996-10-18     Issued: 2000-09-26

Picture encoding device and method thereof, picture decoding device and method thereof, and recording medium

(Original Assignee) Sony Corp     (Current Assignee) Ironworks Patents LLC

Teruhiko Suzuki, Masami Ogata, Tak Yen Tong
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (mean value) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (mean value) value , the minimum size value indicating a size of the smallest partition for the coded unit .
US6125143A
CLAIM 2
. A picture encoding device according to claim 1 , further comprising transform means for performing prescribed transform processing on said picture data using a plurality of said encoding block units and outputting transform coefficients (square partitions) , wherein said encoding means encodes said transform coefficients outputted by said transform means using said encoding block units .

US6125143A
CLAIM 21
. A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ;
and mean value (maximum size, minimum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (mean value) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size (mean value) value , the minimum size value indicating a size of a smallest partition for the coded unit .
US6125143A
CLAIM 2
. A picture encoding device according to claim 1 , further comprising transform means for performing prescribed transform processing on said picture data using a plurality of said encoding block units and outputting transform coefficients (square partitions) , wherein said encoding means encodes said transform coefficients outputted by said transform means using said encoding block units .

US6125143A
CLAIM 21
. A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ;
and mean value (maximum size, minimum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5691770A

Filed: 1996-04-24     Issued: 1997-11-25

Device and method for coding video pictures

(Original Assignee) US Philips Corp     (Current Assignee) US Philips Corp

Gerrit J. Keesman, Willem M. J. Coene, Eduard W. Salomons
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (spatial frequency) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (same amplitude) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US5691770A
CLAIM 4
. A device as claimed in claim 1 , wherein the amplitude distribution is obtained by counting , for each spatial frequency (integer multiple) (i) , the number of times (h i , n) when a coefficient indicative of said spatial frequency assumes the same amplitude (rectangular partitions) (n) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (spatial frequency) of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (same amplitude) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US5691770A
CLAIM 4
. A device as claimed in claim 1 , wherein the amplitude distribution is obtained by counting , for each spatial frequency (integer multiple) (i) , the number of times (h i , n) when a coefficient indicative of said spatial frequency assumes the same amplitude (rectangular partitions) (n) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
EP0727910A2

Filed: 1996-02-05     Issued: 1996-08-21

Temporal-spatial error concealment apparatus and method for video signal processors

(Original Assignee) Thomson Multimedia SA     (Current Assignee) Technicolor SA

Max Chien, Wilson Kwok, Huifang Sun
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (bad block) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (block pattern) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (bad block) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (block pattern) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5576767A

Filed: 1995-09-21     Issued: 1996-11-19

Interframe video encoding and decoding system

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Chong U. Lee, Donald Pian
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (additional reference) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference (block pattern) blocks of data .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (additional reference) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference (block pattern) blocks of data .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5768434A

Filed: 1995-06-07     Issued: 1998-06-16

Quadtree-structured walsh transform coding

(Original Assignee) National Semiconductor Corp     (Current Assignee) National Semiconductor Corp

Xiaonong Ran
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (second partition, first partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (two blocks) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (predetermined threshold value, absolute values) , the minimum size value indicating a size of the smallest partition for the coded unit .
US5768434A
CLAIM 11
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , N and M are integers with N< ;
M , and for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
each block in the partition corresponding to the integer M consists of x 2 values , x being a power of two ;
each block in the partition corresponding to an integer M-1 consists of 4*x 2 values ;
and partitioning the two-dimensional array to form the quadtree partition , further comprises the steps of : performing a Walsh transform on each block of the partition corresponding to the integer M to derive y Walsh transform coefficients , where y is an integer less than or equal to x 2 ;
determining an error for each block in the partition corresponding to the integer M-1 , wherein the error depends on a first set of 4*x 2 values generated by an inverse Walsh transform of a 2x-by-2x matrix having at most y non-zero coefficients , and a second set of 4*x 2 values generated by inverse Walsh transforms of four x-by-x matrices , each of the four x-by-x matrices containing the y Walsh transform coefficients derived for a block which is in the partition corresponding to M and is a subset of the block for which the error is being determined ;
and including in the quadtree partition four blocks from the partition corresponding to the integer M , wherein a union of the four blocks equals a block having an error greater than a predetermined threshold value (minimum size value) .

US5768434A
CLAIM 14
. The method of claim 12 , wherein step (c) further comprises determining a sum of absolute values (minimum size value) of differences between values in the block of the selected partition and corresponding values generated by the inverse Walsh transform of the matrix having no more than x 2 non-zero coefficients .

US5768434A
CLAIM 16
. The method of claim 6 , wherein the step of generating signals which identify the blocks in the quadtree partition comprises the sequential steps of : providing a signal for each block which is in a first partition (square partitions) from a series of uniform partitions , wherein the signal for a block has a first value if the block is in the quadtree partition and has a second value otherwise ;
and for each block in a second partition (square partitions) from the series of uniform partitions : providing a signal having the first value if the block in the quadtree partition ;
and providing a signal having the second value if the block is not in the quadtree partition and is not a subset of a block in the quadtree partition .

US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks , wherein no two blocks (block pattern) in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (second partition, first partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (two blocks) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value (predetermined threshold value, absolute values) , the minimum size value indicating a size of a smallest partition for the coded unit .
US5768434A
CLAIM 11
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , N and M are integers with N< ;
M , and for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
each block in the partition corresponding to the integer M consists of x 2 values , x being a power of two ;
each block in the partition corresponding to an integer M-1 consists of 4*x 2 values ;
and partitioning the two-dimensional array to form the quadtree partition , further comprises the steps of : performing a Walsh transform on each block of the partition corresponding to the integer M to derive y Walsh transform coefficients , where y is an integer less than or equal to x 2 ;
determining an error for each block in the partition corresponding to the integer M-1 , wherein the error depends on a first set of 4*x 2 values generated by an inverse Walsh transform of a 2x-by-2x matrix having at most y non-zero coefficients , and a second set of 4*x 2 values generated by inverse Walsh transforms of four x-by-x matrices , each of the four x-by-x matrices containing the y Walsh transform coefficients derived for a block which is in the partition corresponding to M and is a subset of the block for which the error is being determined ;
and including in the quadtree partition four blocks from the partition corresponding to the integer M , wherein a union of the four blocks equals a block having an error greater than a predetermined threshold value (minimum size value) .

US5768434A
CLAIM 14
. The method of claim 12 , wherein step (c) further comprises determining a sum of absolute values (minimum size value) of differences between values in the block of the selected partition and corresponding values generated by the inverse Walsh transform of the matrix having no more than x 2 non-zero coefficients .

US5768434A
CLAIM 16
. The method of claim 6 , wherein the step of generating signals which identify the blocks in the quadtree partition comprises the sequential steps of : providing a signal for each block which is in a first partition (square partitions) from a series of uniform partitions , wherein the signal for a block has a first value if the block is in the quadtree partition and has a second value otherwise ;
and for each block in a second partition (square partitions) from the series of uniform partitions : providing a signal having the first value if the block in the quadtree partition ;
and providing a signal having the second value if the block is not in the quadtree partition and is not a subset of a block in the quadtree partition .

US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks , wherein no two blocks (block pattern) in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5666461A

Filed: 1995-05-30     Issued: 1997-09-09

High efficiency encoding and decoding of picture signals and recording medium containing same

(Original Assignee) Sony Corp     (Current Assignee) Sony Corp

Katsuji Igarashi, Jun Yonemitsu, Yoichi Yagasaki, Yasushi Fujinami, Tomoyuki Sato, Motoki Kato, Teruhiko Suzuki
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (picture portion) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US5666461A
CLAIM 1
. A recording medium having data recorded thereon which represents : (i) an encoded picture signal having a plurality of macroblocks and including picture portion (video blocks) s having been frame-based and/or field-based type predictively encoded and frame-based and/or field-based type orthogonally transformed processed in which each of said picture portions includes at least one of said macroblocks and in which said encoded picture signal represents a sequence of pictures such that said sequence of pictures provides a substantially continuous video image when reproduced from said recording medium , and (ii) header information which includes (a) information identifying the type of predictive encoding utilized for each respective picture portion , (b) information identifying the type of orthogonal transformation processing utilized for each respective picture portion , and (c) macroblock address information indicating the respective location of each macroblock , in which the type of predictive encoding utilized for each respective picture portion is selected from among the frame-based type predictive encoding and field-based type predictive encoding and in which the type of orthogonal transformation processing is selected from among the frame-based type orthogonal transformation processing and field-based orthogonal transformation processing so as to provide a relatively high efficient encoding .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (picture portion) .
US5666461A
CLAIM 1
. A recording medium having data recorded thereon which represents : (i) an encoded picture signal having a plurality of macroblocks and including picture portion (video blocks) s having been frame-based and/or field-based type predictively encoded and frame-based and/or field-based type orthogonally transformed processed in which each of said picture portions includes at least one of said macroblocks and in which said encoded picture signal represents a sequence of pictures such that said sequence of pictures provides a substantially continuous video image when reproduced from said recording medium , and (ii) header information which includes (a) information identifying the type of predictive encoding utilized for each respective picture portion , (b) information identifying the type of orthogonal transformation processing utilized for each respective picture portion , and (c) macroblock address information indicating the respective location of each macroblock , in which the type of predictive encoding utilized for each respective picture portion is selected from among the frame-based type predictive encoding and field-based type predictive encoding and in which the type of orthogonal transformation processing is selected from among the frame-based type orthogonal transformation processing and field-based orthogonal transformation processing so as to provide a relatively high efficient encoding .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (picture portion) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US5666461A
CLAIM 1
. A recording medium having data recorded thereon which represents : (i) an encoded picture signal having a plurality of macroblocks and including picture portion (video blocks) s having been frame-based and/or field-based type predictively encoded and frame-based and/or field-based type orthogonally transformed processed in which each of said picture portions includes at least one of said macroblocks and in which said encoded picture signal represents a sequence of pictures such that said sequence of pictures provides a substantially continuous video image when reproduced from said recording medium , and (ii) header information which includes (a) information identifying the type of predictive encoding utilized for each respective picture portion , (b) information identifying the type of orthogonal transformation processing utilized for each respective picture portion , and (c) macroblock address information indicating the respective location of each macroblock , in which the type of predictive encoding utilized for each respective picture portion is selected from among the frame-based type predictive encoding and field-based type predictive encoding and in which the type of orthogonal transformation processing is selected from among the frame-based type orthogonal transformation processing and field-based orthogonal transformation processing so as to provide a relatively high efficient encoding .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (picture portion) .
US5666461A
CLAIM 1
. A recording medium having data recorded thereon which represents : (i) an encoded picture signal having a plurality of macroblocks and including picture portion (video blocks) s having been frame-based and/or field-based type predictively encoded and frame-based and/or field-based type orthogonally transformed processed in which each of said picture portions includes at least one of said macroblocks and in which said encoded picture signal represents a sequence of pictures such that said sequence of pictures provides a substantially continuous video image when reproduced from said recording medium , and (ii) header information which includes (a) information identifying the type of predictive encoding utilized for each respective picture portion , (b) information identifying the type of orthogonal transformation processing utilized for each respective picture portion , and (c) macroblock address information indicating the respective location of each macroblock , in which the type of predictive encoding utilized for each respective picture portion is selected from among the frame-based type predictive encoding and field-based type predictive encoding and in which the type of orthogonal transformation processing is selected from among the frame-based type orthogonal transformation processing and field-based orthogonal transformation processing so as to provide a relatively high efficient encoding .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5793897A

Filed: 1994-12-16     Issued: 1998-08-11

Adaptive variable-length coding and decoding methods for image data

(Original Assignee) Samsung Electronics Co Ltd     (Current Assignee) Samsung Electronics Co Ltd

Jae-moon Jo, Je-chang Jeong
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (length coding table) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US5793897A
CLAIM 1
. An adaptive variable-length coding method whereby quantized orthogonal transform coefficients (square partitions) are scanned in a zigzag pattern , are modified into run , level data and then are variable-length coded in a coding system for image data , said method comprising the steps of : setting a plurality of variable-length coding table (block pattern) s having different patterns of a regular region and an escape region according to statistical characteristics of said run , level data ;
selecting one of said plurality of variable-length coding tables according to intra/inter mode information of the currently processed block , zigzag scanning position and quantization step size ;
and variable-length coding the orthogonal transform coefficients according to said selected variable-length coding table , wherein said selecting step has the selecting range of a plurality of variable-length coding tables having different patterns of a regular region and an escape region according to said intra/inter mode information of the currently Processed block .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (length coding table) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US5793897A
CLAIM 1
. An adaptive variable-length coding method whereby quantized orthogonal transform coefficients (square partitions) are scanned in a zigzag pattern , are modified into run , level data and then are variable-length coded in a coding system for image data , said method comprising the steps of : setting a plurality of variable-length coding table (block pattern) s having different patterns of a regular region and an escape region according to statistical characteristics of said run , level data ;
selecting one of said plurality of variable-length coding tables according to intra/inter mode information of the currently processed block , zigzag scanning position and quantization step size ;
and variable-length coding the orthogonal transform coefficients according to said selected variable-length coding table , wherein said selecting step has the selecting range of a plurality of variable-length coding tables having different patterns of a regular region and an escape region according to said intra/inter mode information of the currently Processed block .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5623311A

Filed: 1994-10-28     Issued: 1997-04-22

MPEG video decoder having a high bandwidth memory

(Original Assignee) Panasonic Corp of North America     (Current Assignee) Panasonic Corp of North America

Larry Phillips, Shuji Inoue, Edwin R. Meyer
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (chrominance values) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (luminance component) representing a minimum size value (chrominance values) , the minimum size value indicating a size of the smallest partition for the coded unit .
US5623311A
CLAIM 1
. Apparatus for decoding a data stream including a plurality of variable length encoded data values representing an image where the data stream represents video signals having luminance component (second syntax element) s and chrominance components , the apparatus comprising : a memory for storing image data where the memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation to the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow image data representing corresponding chrominance and luminance components to be accessed concurrently using the first and second channels , respectively , and to allow image data representing first and second contiguous luminance components to be accessed concurrently using the first and second channels , respectively ;
means for receiving the data stream and for storing the data stream into a buffer area of the memory via the single memory port ;
decoding means including : (a) means for fetching the variable length encoded data stream from the memory via the single memory port ;
(b) means for processing the fetched variable length encoded data stream to produce first decoded values ;
(c) means for fetching reference image data from the memory via the single memory port ;
(d) means for combining the fetched reference image data with the first decoded values to produce second decoded values ;
and (e) means for storing the second decoded values in the memory via the single memory port ;
and means for fetching image data from the memory , via the single memory port , for display .

US5623311A
CLAIM 2
. Apparatus for decoding a data stream including a plurality of variable length encoded data values representing an image where the data stream represents video signals having luminance components and chrominance components , the apparatus comprising : a memory for storing image data where the memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation to the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow components of the image data to be accessed concurrently using the first and second channels ;
means for receiving the data stream and for storing the data stream into a buffer area of the memory via the single memory port ;
decoding means including : (a) means for fetching the variable length encoded data stream from the memory via the single memory port ;
(b) means for (1) processing the fetched variable length encoded data stream to produce first decoded values and (2) generating luminance values and corresponding chrominance values (four square partitions, minimum size value) as the first decoded values ;
(c) means for fetching reference image data from the memory via the single memory port ;
(d) means for combining the fetched reference image data with the first decoded values to produce second decoded values ;
and (e) means (1) for storing the second decoded values in the memory via the single memory port and (2) for storing luminance components of the second decoded values using one of the first and second channels and storing corresponding chrominance components of the second decoded values using the other one of the first and second channels ;
and means (1) for fetching image data from the memory , via the single memory port , for display and (2) for fetching luminance components of the image data using one of the first and second channels and for fetching the corresponding chrominance components of the image data using the other one of the first and second channels .

US5623311A
CLAIM 5
. A circuit for decoding video signals encoded according to the MPEG-2 standard , consisting essentially of : an image memory for storing image data where the image memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation which uses the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow respectively different components of the image data to be accessed concurrently using the first and second channels ;
a decoding processor including : a variable length decoder coupled to the image memory via the single memory port ;
an inverse quantization processor ;
an inverse discrete cosine transform (video blocks) processor ;
and a motion compensation processor , coupled to the image memory via the single memory port ;
and a block to raster converter , coupled to the image memory via the single memory port .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform) .
US5623311A
CLAIM 5
. A circuit for decoding video signals encoded according to the MPEG-2 standard , consisting essentially of : an image memory for storing image data where the image memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation which uses the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow respectively different components of the image data to be accessed concurrently using the first and second channels ;
a decoding processor including : a variable length decoder coupled to the image memory via the single memory port ;
an inverse quantization processor ;
an inverse discrete cosine transform (video blocks) processor ;
and a motion compensation processor , coupled to the image memory via the single memory port ;
and a block to raster converter , coupled to the image memory via the single memory port .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (chrominance values) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element (luminance component) representing a minimum size value (chrominance values) , the minimum size value indicating a size of a smallest partition for the coded unit .
US5623311A
CLAIM 1
. Apparatus for decoding a data stream including a plurality of variable length encoded data values representing an image where the data stream represents video signals having luminance component (second syntax element) s and chrominance components , the apparatus comprising : a memory for storing image data where the memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation to the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow image data representing corresponding chrominance and luminance components to be accessed concurrently using the first and second channels , respectively , and to allow image data representing first and second contiguous luminance components to be accessed concurrently using the first and second channels , respectively ;
means for receiving the data stream and for storing the data stream into a buffer area of the memory via the single memory port ;
decoding means including : (a) means for fetching the variable length encoded data stream from the memory via the single memory port ;
(b) means for processing the fetched variable length encoded data stream to produce first decoded values ;
(c) means for fetching reference image data from the memory via the single memory port ;
(d) means for combining the fetched reference image data with the first decoded values to produce second decoded values ;
and (e) means for storing the second decoded values in the memory via the single memory port ;
and means for fetching image data from the memory , via the single memory port , for display .

US5623311A
CLAIM 2
. Apparatus for decoding a data stream including a plurality of variable length encoded data values representing an image where the data stream represents video signals having luminance components and chrominance components , the apparatus comprising : a memory for storing image data where the memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation to the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow components of the image data to be accessed concurrently using the first and second channels ;
means for receiving the data stream and for storing the data stream into a buffer area of the memory via the single memory port ;
decoding means including : (a) means for fetching the variable length encoded data stream from the memory via the single memory port ;
(b) means for (1) processing the fetched variable length encoded data stream to produce first decoded values and (2) generating luminance values and corresponding chrominance values (four square partitions, minimum size value) as the first decoded values ;
(c) means for fetching reference image data from the memory via the single memory port ;
(d) means for combining the fetched reference image data with the first decoded values to produce second decoded values ;
and (e) means (1) for storing the second decoded values in the memory via the single memory port and (2) for storing luminance components of the second decoded values using one of the first and second channels and storing corresponding chrominance components of the second decoded values using the other one of the first and second channels ;
and means (1) for fetching image data from the memory , via the single memory port , for display and (2) for fetching luminance components of the image data using one of the first and second channels and for fetching the corresponding chrominance components of the image data using the other one of the first and second channels .

US5623311A
CLAIM 5
. A circuit for decoding video signals encoded according to the MPEG-2 standard , consisting essentially of : an image memory for storing image data where the image memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation which uses the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow respectively different components of the image data to be accessed concurrently using the first and second channels ;
a decoding processor including : a variable length decoder coupled to the image memory via the single memory port ;
an inverse quantization processor ;
an inverse discrete cosine transform (video blocks) processor ;
and a motion compensation processor , coupled to the image memory via the single memory port ;
and a block to raster converter , coupled to the image memory via the single memory port .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform) .
US5623311A
CLAIM 5
. A circuit for decoding video signals encoded according to the MPEG-2 standard , consisting essentially of : an image memory for storing image data where the image memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation which uses the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow respectively different components of the image data to be accessed concurrently using the first and second channels ;
a decoding processor including : a variable length decoder coupled to the image memory via the single memory port ;
an inverse quantization processor ;
an inverse discrete cosine transform (video blocks) processor ;
and a motion compensation processor , coupled to the image memory via the single memory port ;
and a block to raster converter , coupled to the image memory via the single memory port .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5594504A

Filed: 1994-07-06     Issued: 1997-01-14

Predictive video coding using a motion vector updating routine

(Original Assignee) Nokia of America Corp     (Current Assignee) Nokia of America Corp

Touradj Ebrahimi
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (motion estimation unit) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (value z) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US5594504A
CLAIM 3
. The method of claim 2 wherein the initial motion vector assigned to each smaller region in the first segmentation level has the value z (square partitions, four square partitions) ero .

US5594504A
CLAIM 4
. The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ;
(b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ;
and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level .

US5594504A
CLAIM 19
. A system for performing motion field estimation and coding of a video image , said system comprising : a motion estimation unit (video blocks) for refining a motion vector assigned to a block of pixels in a current frame by assigning to the block of pixels a best motion vector selected from among a plurality of candidate motion vectors , wherein said plurality of candidate motion vectors includes an initial motion vector assigned to said block of pixels , a matched motion vector obtained by performing a block matching technique for said block of pixels , and motion vectors of regions neighboring said block of pixels , said motion estimation unit further computing a smallest matching error from among the plurality of candidate vectors and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions ;
a merge and label unit for merging adjacent regions of the current frame that have similar motion vectors to form merged regions and for assigning to each pixel in the current frame a region label ;
a first region label memory unit for storing a motion vector associated with each merged region ;
a second region label memory unit for storing the region label assigned to each pixel in the current frame ;
a region shape representation unit for forming a set of contour pixels that defines the merged regions ;
a contour coding unit for encoding the set of contour pixels ;
and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the system .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion estimation unit) .
US5594504A
CLAIM 19
. A system for performing motion field estimation and coding of a video image , said system comprising : a motion estimation unit (video blocks) for refining a motion vector assigned to a block of pixels in a current frame by assigning to the block of pixels a best motion vector selected from among a plurality of candidate motion vectors , wherein said plurality of candidate motion vectors includes an initial motion vector assigned to said block of pixels , a matched motion vector obtained by performing a block matching technique for said block of pixels , and motion vectors of regions neighboring said block of pixels , said motion estimation unit further computing a smallest matching error from among the plurality of candidate vectors and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions ;
a merge and label unit for merging adjacent regions of the current frame that have similar motion vectors to form merged regions and for assigning to each pixel in the current frame a region label ;
a first region label memory unit for storing a motion vector associated with each merged region ;
a second region label memory unit for storing the region label assigned to each pixel in the current frame ;
a region shape representation unit for forming a set of contour pixels that defines the merged regions ;
a contour coding unit for encoding the set of contour pixels ;
and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the system .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (motion estimation unit) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (value z) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US5594504A
CLAIM 3
. The method of claim 2 wherein the initial motion vector assigned to each smaller region in the first segmentation level has the value z (square partitions, four square partitions) ero .

US5594504A
CLAIM 4
. The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ;
(b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ;
and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level .

US5594504A
CLAIM 19
. A system for performing motion field estimation and coding of a video image , said system comprising : a motion estimation unit (video blocks) for refining a motion vector assigned to a block of pixels in a current frame by assigning to the block of pixels a best motion vector selected from among a plurality of candidate motion vectors , wherein said plurality of candidate motion vectors includes an initial motion vector assigned to said block of pixels , a matched motion vector obtained by performing a block matching technique for said block of pixels , and motion vectors of regions neighboring said block of pixels , said motion estimation unit further computing a smallest matching error from among the plurality of candidate vectors and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions ;
a merge and label unit for merging adjacent regions of the current frame that have similar motion vectors to form merged regions and for assigning to each pixel in the current frame a region label ;
a first region label memory unit for storing a motion vector associated with each merged region ;
a second region label memory unit for storing the region label assigned to each pixel in the current frame ;
a region shape representation unit for forming a set of contour pixels that defines the merged regions ;
a contour coding unit for encoding the set of contour pixels ;
and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the system .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion estimation unit) .
US5594504A
CLAIM 19
. A system for performing motion field estimation and coding of a video image , said system comprising : a motion estimation unit (video blocks) for refining a motion vector assigned to a block of pixels in a current frame by assigning to the block of pixels a best motion vector selected from among a plurality of candidate motion vectors , wherein said plurality of candidate motion vectors includes an initial motion vector assigned to said block of pixels , a matched motion vector obtained by performing a block matching technique for said block of pixels , and motion vectors of regions neighboring said block of pixels , said motion estimation unit further computing a smallest matching error from among the plurality of candidate vectors and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions ;
a merge and label unit for merging adjacent regions of the current frame that have similar motion vectors to form merged regions and for assigning to each pixel in the current frame a region label ;
a first region label memory unit for storing a motion vector associated with each merged region ;
a second region label memory unit for storing the region label assigned to each pixel in the current frame ;
a region shape representation unit for forming a set of contour pixels that defines the merged regions ;
a contour coding unit for encoding the set of contour pixels ;
and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the system .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5422676A

Filed: 1993-10-22     Issued: 1995-06-06

System for coding an image representative signal

(Original Assignee) Deutsche Thomson Brandt GmbH     

Carsten Herpel, Heinz-Werner Keesen
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second blocks) ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (block pattern) .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second blocks) ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (block pattern) .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
US5091782A

Filed: 1990-04-09     Issued: 1992-02-25

Apparatus and method for adaptively compressing successive blocks of digital video

(Original Assignee) Arris Technology Inc     (Current Assignee) CIF LICENSING LLC

Edward A. Krause, Woo H. Paik
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (transforming means) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
US5091782A
CLAIM 8
. Apparatus in accordance with claim 1 wherein : said first and second compressing means produce and quantize respective first and second arrays of transform coefficients (square partitions) for the set of pixel data ;
and said error evaluating means determines the error between the quantized transform coefficients and the unquantized transform coefficients of each array .

US5091782A
CLAIM 25
. Apparatus in accordance with claim 24 wherein the inverse transforming means (video blocks) is shared by said first and second means .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (transforming means) .
US5091782A
CLAIM 25
. Apparatus in accordance with claim 24 wherein the inverse transforming means (video blocks) is shared by said first and second means .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (transforming means) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
US5091782A
CLAIM 8
. Apparatus in accordance with claim 1 wherein : said first and second compressing means produce and quantize respective first and second arrays of transform coefficients (square partitions) for the set of pixel data ;
and said error evaluating means determines the error between the quantized transform coefficients and the unquantized transform coefficients of each array .

US5091782A
CLAIM 25
. Apparatus in accordance with claim 24 wherein the inverse transforming means (video blocks) is shared by said first and second means .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (transforming means) .
US5091782A
CLAIM 25
. Apparatus in accordance with claim 24 wherein the inverse transforming means (video blocks) is shared by said first and second means .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
EP1623577A1

Filed: 2004-05-04     Issued: 2006-02-08

Encoding of video information using block based adaptive scan order

(Original Assignee) Koninklijke Philips NV     (Current Assignee) Koninklijke Philips NV

Lambertus A. Van Eggelen
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (preceding image) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (corresponding data blocks) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
EP1623577A1
CLAIM 2
. A method according to Claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ;
spatial scaling aspect ratio of one or more image frames present in the video information ;
- pulldown material being present in the data of one or more of the image frames ;
one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ;
a degree of temporal motion occurring in a series of the image frames ;
and statistical data relating to earlier selected scanning routes and their associated data compression performance .

EP1623577A1
CLAIM 3
. A method according to Claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (preceding image) .
EP1623577A1
CLAIM 2
. A method according to Claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ;
spatial scaling aspect ratio of one or more image frames present in the video information ;
- pulldown material being present in the data of one or more of the image frames ;
one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ;
a degree of temporal motion occurring in a series of the image frames ;
and statistical data relating to earlier selected scanning routes and their associated data compression performance .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (preceding image) of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (corresponding data blocks) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
EP1623577A1
CLAIM 2
. A method according to Claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ;
spatial scaling aspect ratio of one or more image frames present in the video information ;
- pulldown material being present in the data of one or more of the image frames ;
one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ;
a degree of temporal motion occurring in a series of the image frames ;
and statistical data relating to earlier selected scanning routes and their associated data compression performance .

EP1623577A1
CLAIM 3
. A method according to Claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (preceding image) .
EP1623577A1
CLAIM 2
. A method according to Claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ;
spatial scaling aspect ratio of one or more image frames present in the video information ;
- pulldown material being present in the data of one or more of the image frames ;
one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ;
a degree of temporal motion occurring in a series of the image frames ;
and statistical data relating to earlier selected scanning routes and their associated data compression performance .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
JP3910594B2

Filed: 2004-02-20     Issued: 2007-04-25

画像符号化装置

(Original Assignee) 三菱電機株式会社     

芳美 井須, 光太郎 浅井, 博文 西川, 由里 長谷川, 俊一 関口, 慎一 黒田
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (ビットストリーム) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
JP3910594B2
CLAIM 1
画像信号を H.263符号化方式 で符号化し 、H.263符号化方式の 第1のヘッダ情報を有する第1の符号化ビットストリーム (syntax information) を生成する符号化手段と (second syntax) 、 上記第1のヘッダ情報の前に、 MPEG4符号化方式 で符号化された第2の符号化ビットストリームと互換性を確保するための 第2の ヘッダ情報を多重化するヘッダ情報多重化手段とを備え、 上記ヘッダ情報多重化手段は、互換性を確保するための 上記第2の ヘッダ情報として、 MPEG4符号化方式 のビデオ・オブジェクト・スタート・コードと MPEG4符号化方式 のビデオ・オブジェクト識別番号含む 上記第2の ヘッダ情報を多重化することを特徴とする画像符号化装置。

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (ビットストリーム) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
JP3910594B2
CLAIM 1
画像信号を H.263符号化方式 で符号化し 、H.263符号化方式の 第1のヘッダ情報を有する第1の符号化ビットストリーム (syntax information) を生成する符号化手段と、 上記第1のヘッダ情報の前に、 MPEG4符号化方式 で符号化された第2の符号化ビットストリームと互換性を確保するための 第2の ヘッダ情報を多重化するヘッダ情報多重化手段とを備え、 上記ヘッダ情報多重化手段は、互換性を確保するための 上記第2の ヘッダ情報として、 MPEG4符号化方式 のビデオ・オブジェクト・スタート・コードと MPEG4符号化方式 のビデオ・オブジェクト識別番号含む 上記第2の ヘッダ情報を多重化することを特徴とする画像符号化装置。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (ビットストリーム) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
JP3910594B2
CLAIM 1
画像信号を H.263符号化方式 で符号化し 、H.263符号化方式の 第1のヘッダ情報を有する第1の符号化ビットストリーム (syntax information) を生成する符号化手段と (second syntax) 、 上記第1のヘッダ情報の前に、 MPEG4符号化方式 で符号化された第2の符号化ビットストリームと互換性を確保するための 第2の ヘッダ情報を多重化するヘッダ情報多重化手段とを備え、 上記ヘッダ情報多重化手段は、互換性を確保するための 上記第2の ヘッダ情報として、 MPEG4符号化方式 のビデオ・オブジェクト・スタート・コードと MPEG4符号化方式 のビデオ・オブジェクト識別番号含む 上記第2の ヘッダ情報を多重化することを特徴とする画像符号化装置。

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (ビットストリーム) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
JP3910594B2
CLAIM 1
画像信号を H.263符号化方式 で符号化し 、H.263符号化方式の 第1のヘッダ情報を有する第1の符号化ビットストリーム (syntax information) を生成する符号化手段と、 上記第1のヘッダ情報の前に、 MPEG4符号化方式 で符号化された第2の符号化ビットストリームと互換性を確保するための 第2の ヘッダ情報を多重化するヘッダ情報多重化手段とを備え、 上記ヘッダ情報多重化手段は、互換性を確保するための 上記第2の ヘッダ情報として、 MPEG4符号化方式 のビデオ・オブジェクト・スタート・コードと MPEG4符号化方式 のビデオ・オブジェクト識別番号含む 上記第2の ヘッダ情報を多重化することを特徴とする画像符号化装置。




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
GB2382940A

Filed: 2001-11-27     Issued: 2003-06-11

Encoding objects and background blocks

(Original Assignee) Nokia Oyj     (Current Assignee) Nokia Oyj

Miska Hannuksela, Ye-Kui Wang
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (coding elements) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
GB2382940A
CLAIM 1
. A method of video encoding (second syntax) comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a 5 predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ;
determining at least one shape within a picture ;
selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
10 determining as at least one separate coding object the selected at least one subset of the coding blocks ;
determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
15 encoding the at least one separate coding object ;
and encoding as one coding object the background object .

GB2382940A
CLAIM 13
. A video encoder (second syntax element) comprising : means for dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding 5 elements ;
means for determining at least one shape within a picture ;
means for selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
means for determining as at least one separate coding object the selected 10 at least one subset of the coding blocks ;
means for determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
means for encoding the at least one separate coding object ;
and 15 means for encoding as one coding object the background object .

EP2347591B1
CLAIM 2
The apparatus of claim 1 , wherein the syntax information (coding elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
GB2382940A
CLAIM 1
. A method of video encoding comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a 5 predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ;
determining at least one shape within a picture ;
selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
10 determining as at least one separate coding object the selected at least one subset of the coding blocks ;
determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
15 encoding the at least one separate coding object ;
and encoding as one coding object the background object .

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information (coding elements) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
GB2382940A
CLAIM 1
. A method of video encoding (second syntax) comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a 5 predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ;
determining at least one shape within a picture ;
selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
10 determining as at least one separate coding object the selected at least one subset of the coding blocks ;
determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
15 encoding the at least one separate coding object ;
and encoding as one coding object the background object .

GB2382940A
CLAIM 13
. A video encoder (second syntax element) comprising : means for dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding 5 elements ;
means for determining at least one shape within a picture ;
means for selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
means for determining as at least one separate coding object the selected 10 at least one subset of the coding blocks ;
means for determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
means for encoding the at least one separate coding object ;
and 15 means for encoding as one coding object the background object .

GB2382940A
CLAIM 17
. A video signal comprising : a plurality of compressed video coding (generating syntax information) blocks corresponding to at least one 5 separate coding object corresponding to a part of a video encoded picture , the part of the video encoded picture having a variable size and position ;
an identifier for each of the at least one separate coding object ;
position and size information of the at least one separate coding object ;
and 10 a plurality of compressed video coding blocks corresponding to a background object that corresponds to a set of the coding blocks that corresponds to the video encoded picture excluding the at least one separate coding object . cat . ' ;
. 5 . , : - a . L

EP2347591B1
CLAIM 5
The method of claim 4 , wherein the syntax information (coding elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks .
GB2382940A
CLAIM 1
. A method of video encoding comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a 5 predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ;
determining at least one shape within a picture ;
selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ;
10 determining as at least one separate coding object the selected at least one subset of the coding blocks ;
determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ;
15 encoding the at least one separate coding object ;
and encoding as one coding object the background object .




EP2347591B1

Filed: 2008-10-03     Issued: 2020-04-08

Video coding with large macroblocks

(Original Assignee) Velos Media International Ltd     (Current Assignee) Velos Media International Ltd

Peisong Chen, Yan Ye, Marta Karczewicz
JPH082106B2

Filed: 1986-11-10     Issued: 1996-01-10

動画像信号のハイブリツド符号化方式

(Original Assignee) 国際電信電話株式会社; 富士通株式会社; 日本電信電話株式会社; 日本電気株式会社     

洋一 加藤, 睦 太田, 康宏 小杉, 好律 羽鳥
EP2347591B1
CLAIM 1
An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generate syntax information (変換部) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (ブロックサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit .
JPH082106B2
CLAIM 2
【請求項2】前記有効/無効判定部は、有効/無効の判 定を直交変換を行なうブロックサイズ (maximum size, maximum size value) より小さいブロッ クサイズのサブブロックに分割し、サブブロック単位で 評価量Smを計算して閾値T 1 と比較し、当該ブロック中に 1つでも有効なサブブロックがある場合には当該ブロッ クを有効ブロック,その他の場合には無効ブロックと判 定することを特徴とする特許請求の範囲第1項記載の動 画像信号のハイブリッド符号化方式。

JPH082106B2
CLAIM 6
【請求項6】前記有効/無効判定部は有効サブブロック が当該ブロック内の1個或いは , 2個である時に後続の直 交変換部 (generate syntax information) において1/2M×1/2N , 1/2M×N又はM×1/2Nの 直交変換を行なうように指示情報を送出することを特徴 とする特許請求の範囲第4項記載の動画像信号のハイブ リッド符号化方式。

EP2347591B1
CLAIM 4
A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ;

wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ;

generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (ブロックサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ;

a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit .
JPH082106B2
CLAIM 2
【請求項2】前記有効/無効判定部は、有効/無効の判 定を直交変換を行なうブロックサイズ (maximum size, maximum size value) より小さいブロッ クサイズのサブブロックに分割し、サブブロック単位で 評価量Smを計算して閾値T 1 と比較し、当該ブロック中に 1つでも有効なサブブロックがある場合には当該ブロッ クを有効ブロック,その他の場合には無効ブロックと判 定することを特徴とする特許請求の範囲第1項記載の動 画像信号のハイブリッド符号化方式。