Ground | References | Owner of the Reference | Title | Semantic Mapping | Challenged Claims | ||||||
---|---|---|---|---|---|---|---|---|---|---|---|
1 | 2 | 4 | 5 | 7 | |||||||
1 | KR100809298B1 (이교혁, 2008) | (Original Assignee) 삼성전자주식회사 | 플래그 인코딩 방법, 플래그 디코딩 방법, 및 상기 방법을이용한 장치 | ▪ block pattern ≈ block pattern ▪ length code ≈ 워드를 | X | X | |||||
2 | US20080002770A1 (Kemal Ugur, 2008) | (Original Assignee) Nokia Oyj (Current Assignee) Nokia Oyj | Methods, apparatus, and a computer program product for providing a fast inter mode decision for video encoding in resource constrained devices | ▪ video blocks ≈ first encoding mode ▪ square partitions, four square partitions ≈ second partition, first partition | X | X | X | X | |||
3 | US20060203905A1 (Shih-Chang Hsia, 2006) | (Original Assignee) National Kaohsiung First University of Science and Technology (Current Assignee) National Kaohsiung First University of Science and Technology | Video coding system | ▪ minimum size, minimum size value ≈ horizontal resolution ▪ maximum size ≈ processing speed ▪ second syntax element ≈ video encoder ▪ length code ≈ partial data ▪ block pattern ≈ two blocks, same time | X | X | |||||
4 | US7362804B2 (Pavel Novotny, 2008) | (Original Assignee) LSI Corp (Current Assignee) Avago Technologies International Sales Pte Ltd | Graphical symbols for H.264 bitstream syntax elements | ▪ first syntax, syntax information ≈ syntax elements ▪ maximum size value ≈ maximum values | X | X | X | X | |||
5 | US7289674B2 (Marta Karczewicz, 2007) | (Original Assignee) Nokia Oyj (Current Assignee) Nokia Technologies Oy | Spatial prediction based intra coding | ▪ square partitions ≈ ascending order ▪ block pattern ≈ second blocks | X | X | |||||
6 | US6980596B2 (Limin Wang, 2005) | (Original Assignee) Arris Technology Inc (Current Assignee) Google Technology Holdings LLC | Macroblock level adaptive frame/field coding for digital video content | ▪ four square partitions ≈ processing two ▪ readable medium ≈ said blocks ▪ rectangular partitions ≈ frame mode | X | X | |||||
7 | US6795584B2 (Marta Karczewicz, 2004) | (Original Assignee) Nokia Oyj (Current Assignee) Nokia Technologies Oy | Context-based adaptive variable length coding for adaptive block transforms | ▪ readable medium comprising instructions ≈ software product ▪ readable medium ≈ readable medium ▪ square partitions ≈ entropy coding | X | X | |||||
8 | JP2003319394A (Kuhn Peter, 2003) | (Original Assignee) Sony Corp; ソニー株式会社 | 符号化装置および方法、復号装置および方法、記録媒体、並びにプログラム | ▪ generating syntax information ≈ 生成ステップ ▪ rectangular partitions ≈ プログラム, 前記圧縮 ▪ four square partitions ≈ field flag ▪ second syntax ≈ 手段と ▪ length code ≈ 逆符号 | X | X | |||||
9 | US6968012B1 (Jens Meggers, 2005) | (Original Assignee) Firepad Inc (Current Assignee) Nordic Interactive Technologies LLC | Methods for encoding digital video for decoding on low performance devices | ▪ block pattern ≈ corresponding blocks ▪ generate syntax information ≈ identifying means ▪ second syntax element ≈ video encoder ▪ video blocks ≈ memory block | X | X | X | X | |||
10 | US6959116B2 (Ugur Sezer, 2005) | (Original Assignee) EMC Corp (Current Assignee) EMC Corp | Largest magnitude indices selection for (run, level) encoding of a block coded picture | ▪ square partitions ≈ transform coefficients ▪ second syntax ≈ entropy encoding ▪ length code ≈ length code | X | X | |||||
11 | US6529634B1 (Kadayam Thyagarajan, 2003) | (Original Assignee) Qualcomm Inc (Current Assignee) Qualcomm Inc | Contrast sensitive variance based adaptive block size DCT image compression | ▪ block pattern ≈ minimum block size ▪ video blocks, four square partitions ≈ transforming means, main data ▪ readable medium ≈ readable medium, said blocks ▪ generate syntax information, generating syntax information ≈ Huffman code ▪ integer multiple ≈ th column | X | X | X | X | |||
12 | US6539124B2 (Sriram Sethuraman, 2003) | (Original Assignee) Sarnoff Corp (Current Assignee) MediaTek Inc | Quantizer selection based on region complexities derived using a rate distortion model | ▪ minimum size value ≈ absolute differences ▪ readable medium ≈ readable medium | X | X | |||||
13 | US6215910B1 (Navin Chaddha, 2001) | (Original Assignee) Microsoft Corp (Current Assignee) V-XSTREAM Inc ; Microsoft Technology Licensing LLC | Table-based compression with embedded coding | ▪ square partitions ≈ transform coefficients ▪ readable medium ≈ readable medium ▪ generate syntax information ≈ embedded code ▪ readable medium comprising instructions ≈ first stage | X | X | |||||
14 | JPH08205140A (Koji Hirabayashi, 1996) | (Original Assignee) Canon Inc; キヤノン株式会社 | 画像圧縮装置 | ▪ maximum size, maximum size value ≈ ブロックサイズ, 小さいサイズ ▪ second syntax ≈ 手段と | X | X | |||||
15 | US5107345A (Chong U. Lee, 1992) | (Original Assignee) Qualcomm Inc (Current Assignee) Qualcomm Inc | Adaptive block size image compression method and system | ▪ video blocks ≈ inverse discrete cosine transform, compressed image ▪ generate syntax information, generating syntax information ≈ Huffman code ▪ rectangular partitions ≈ count values ▪ length code ≈ length code | X | X | X | X | |||
16 | JP2008219870A (Teo Han Boon, 2008) | (Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社 | 動画像符号化方法及び動画像符号化装置 | ▪ maximum size, minimum size ≈ 小さいサイズ, 大きいサイズ ▪ rectangular partitions ≈ プログラム ▪ second syntax ≈ 手段と | X | X | |||||
17 | WO2008086316A2 (Marta Karczewicz, 2008) | (Original Assignee) Qualcomm Incorporated | Variable length coding techniques for coded block patterns | ▪ readable medium comprising instructions ≈ readable medium comprising instructions ▪ block pattern ≈ block pattern ▪ generating syntax information ≈ video coding | X | X | |||||
18 | WO2008054688A1 (Debargha Mukherjee, 2008) | (Original Assignee) Hewlett-Packard Development Company, L.P. | Method for decomposing a video sequence frame | ▪ second syntax ≈ different motion vectors ▪ square partitions, four square partitions ≈ th partition | X | X | |||||
19 | WO2008027192A2 (Oscar Divorra Escoda, 2008) | (Original Assignee) Thomson Licensing | Methods and apparatus for reduced resolution partitioning | ▪ syntax information ≈ syntax information ▪ second syntax ≈ video encoding ▪ square partitions, four square partitions ≈ n partitions ▪ generating syntax information ≈ video coding ▪ minimum size value ≈ slice level | X | X | X | X | |||
20 | US20080049834A1 (Thomas Holcomb, 2008) | (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC | Sub-block transform coding of prediction residuals | ▪ video blocks ≈ motion prediction ▪ readable medium ≈ readable medium ▪ second syntax ≈ video encoding ▪ second syntax element ≈ video encoder | X | X | X | X | |||
21 | WO2008016605A2 (Oscar Divorra Escoda, 2008) | (Original Assignee) Thomson Licensing | Adaptive geometric partitioning for video decoding | ▪ square partitions, four square partitions ≈ th partition ▪ length code ≈ based coding ▪ generating syntax information ≈ video coding | X | X | |||||
22 | WO2008002881A2 (Harinath Garudadri, 2008) | (Original Assignee) Qualcomm Incorporated | Reduction of errors during computation of inverse discrete cosine transform | ▪ readable medium comprising instructions ≈ readable medium comprising instructions ▪ video blocks ≈ inverse discrete cosine transform ▪ programmable processor ≈ programmable processor | X | X | X | X | X | ||
23 | WO2007117711A2 (Cheng Chang, 2007) | (Original Assignee) Microsoft Corporation | Dynamic selection of motion estimation search ranges and extended motion vector ranges | ▪ square partitions ≈ transform coefficients ▪ first syntax ≈ prediction residuals ▪ integer multiple ≈ frequency transform ▪ second syntax element ≈ vertical components ▪ second syntax ≈ entropy encoding ▪ maximum size, maximum size value ≈ current picture ▪ minimum size value ≈ absolute values | X | X | |||||
24 | WO2007115325A2 (Fang Shi, 2007) | (Original Assignee) Qualcomm Incorporated | Apparatus and method of enhanced frame interpolation in video compression | ▪ readable medium comprising instructions ≈ readable medium comprising instructions ▪ integer multiple ≈ receiving encoded data | X | X | |||||
25 | WO2007115126A1 (Tao Tian, 2007) | (Original Assignee) Qualcomm Incorporated | Adaptive encoder-assisted frame rate up conversion | ▪ readable medium ≈ readable medium ▪ second syntax ≈ video encoding ▪ block pattern ≈ block pattern | X | X | |||||
26 | WO2007112417A2 (Yiliang Bao, 2007) | (Original Assignee) Qualcomm Incorporated | Method and system for coding and decoding information associated with video compression | ▪ square partitions ≈ transform coefficients ▪ readable medium ≈ readable medium ▪ block pattern ≈ block pattern | X | X | |||||
27 | US20070297518A1 (Woo-jin Han, 2007) | (Original Assignee) Samsung Electronics Co Ltd (Current Assignee) Samsung Electronics Co Ltd | Flag encoding method, flag decoding method, and apparatus thereof | ▪ block pattern ≈ block pattern ▪ minimum size ≈ minimum size ▪ video blocks ≈ flag value | X | X | X | X | |||
28 | US20070160303A1 (Onur Guleryuz, 2007) | (Original Assignee) Docomo Communications Labs USA Inc (Current Assignee) NTT Docomo Inc | Geometrical image representation and compression | ▪ rectangular partitions ≈ image processing operation ▪ square partitions ≈ entropy coding | X | X | |||||
29 | US20070230563A1 (Tao Tian, 2007) | (Original Assignee) Qualcomm Inc (Current Assignee) Qualcomm Inc | Adaptive encoder-assisted frame rate up conversion | ▪ readable medium ≈ readable medium ▪ second syntax ≈ video encoding ▪ block pattern ≈ block pattern | X | X | |||||
30 | JP2008092150A (Tadashi Kaneko, 2008) | (Original Assignee) Canon Inc; キヤノン株式会社 | 画像符号化装置、画像符号化方法 | ▪ rectangular partitions ≈ プログラム ▪ second syntax ≈ 手段と | X | X | |||||
31 | CN1972449A (金壹求, 2007) | (Original Assignee) 首尔国立大学校产学协力财团; 三星电子株式会社 | 用于在单一描述和多描述之间切换的设备和方法 | ▪ minimum size value ≈ 的频率特性 ▪ size NxN ≈ 余弦变换 ▪ generating syntax information ≈ 从而产生 | X | X | |||||
32 | US20070171982A1 (Jens Meggers, 2007) | (Original Assignee) WIMAXIMUM LLC (Current Assignee) Nordic Interactive Technologies LLC ; WIMAXIMUM LLC | Methods for encoding digital video for decoding on low performance devices | ▪ block pattern ≈ corresponding blocks ▪ video blocks ≈ memory block | X | X | X | X | |||
33 | EP1771008A2 (Kasman E. Thomas, 2007) | (Original Assignee) Micro Focus Software Inc (Current Assignee) Oracle International Corp | Adaptive method and system for encoding digital images for the internet | ▪ minimum size value ≈ predetermined threshold value ▪ rectangular partitions ≈ color values | X | X | |||||
34 | WO2007027402A2 (Ankur Shah, 2007) | (Original Assignee) Micronas Usa, Inc. | Multi-stage cabac decoding pipeline | ▪ first syntax, syntax information ≈ syntax elements ▪ length code ≈ clock cycle ▪ integer multiple ≈ n bits | X | X | X | X | |||
35 | JP2008011431A (Hideji Michinaka, 2008) | (Original Assignee) Toshiba Corp; 株式会社東芝 | 画像符号化装置 | ▪ generating syntax information ≈ 発生符号量, 発生確率 ▪ second syntax ≈ 手段と | X | X | |||||
36 | WO2006128076A2 (Shu Xiao, 2006) | (Original Assignee) Qualcomm Incorporated | Fixed point integer division techniques for ac/dc prediction in video coding devices | ▪ readable medium comprising instructions ≈ readable medium comprising instructions ▪ generating syntax information ≈ video coding | X | ||||||
37 | CN101189641A (I·戈里, 2008) | (Original Assignee) 布雷克成像有限公司 | 编码数字图像的像素或体素的方法及处理数字图像的方法 | ▪ minimum size, minimum size value ≈ 数值数据 ▪ generate syntax information, generating syntax information ≈ 处理设 | X | X | |||||
38 | US20060233254A1 (Bae-keun Lee, 2006) | (Original Assignee) Samsung Electronics Co Ltd (Current Assignee) Samsung Electronics Co Ltd | Method and apparatus for adaptively selecting context model for entropy coding | ▪ square partitions ≈ entropy coding ▪ video blocks ≈ generates data ▪ syntax information, generating syntax information ≈ layer decoder ▪ block pattern ≈ block pattern ▪ second syntax element ≈ video encoder | X | X | X | X | |||
39 | CN101204092A (纪尧姆·布瓦松, 2008) | (Original Assignee) 汤姆森许可贸易公司 | 从低分辨率图像导出高分辨率图像的编码信息的方法以及实施所述方法的编码和解码设备 | ▪ length code ≈ 第二编码装置 ▪ minimum size, minimum size value ≈ 的尺寸 | X | X | |||||
40 | US20060120450A1 (Woo-jin Han, 2006) | (Original Assignee) Samsung Electronics Co Ltd (Current Assignee) Samsung Electronics Co Ltd | Method and apparatus for multi-layered video encoding and decoding | ▪ rectangular partitions ≈ intra-prediction modes ▪ square partitions ≈ transform coefficients ▪ first syntax ≈ prediction residuals ▪ second syntax element ≈ video encoder | X | X | |||||
41 | US20080025391A1 (Peter Amon, 2008) | (Original Assignee) Siemens AG (Current Assignee) Siemens AG | Transcoding Method And Device | ▪ first syntax, syntax information ≈ syntax elements, network element ▪ second syntax ≈ video encoding | X | X | X | X | |||
42 | US20070206679A1 (Chong Lim, 2007) | (Original Assignee) Panasonic Corp (Current Assignee) Dolby International AB | Motion Image Encoding Method and Motion Image Decoding Method | ▪ minimum size value ≈ predetermined threshold value ▪ generating syntax information ≈ video coding | X | X | |||||
43 | US20060126740A1 (Teng Lin, 2006) | (Original Assignee) Micronas USA Inc (Current Assignee) TDK Micronas GmbH | Shared pipeline architecture for motion vector prediction and residual decoding | ▪ first syntax element, second syntax element ≈ motion vector differences ▪ first syntax ≈ data state ▪ length code ≈ bit data ▪ video blocks ≈ d log | X | X | X | X | |||
44 | US20050025246A1 (Thomas Holcomb, 2005) | (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC | Decoding jointly coded transform type and subblock pattern information | ▪ first syntax, syntax information ≈ syntax elements ▪ readable medium ≈ readable medium ▪ video blocks ≈ first VLC | X | X | X | X | |||
45 | US20050053292A1 (Kunal Mukerjee, 2005) | (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC | Advanced bi-directional predictive coding of interlaced video | ▪ block pattern ≈ second motion vector predictor ▪ readable medium ≈ readable medium ▪ second syntax element ≈ video encoder | X | X | |||||
46 | CN1810037A (尹鹏, 2006) | (Original Assignee) 汤姆森许可贸易公司 | 帧间的快速模式确定编码 | ▪ video blocks ≈ 一种视频编码方法, 针对帧 ▪ minimum size ≈ 尺寸的 | X | X | X | X | |||
47 | US20050135484A1 (Yung-Lyul Lee, 2005) | (Original Assignee) Samsung Electronics Co Ltd; Sejong University Industry-Academy Cooperation Group (Current Assignee) Samsung Electronics Co Ltd ; Sejong University Industry-Academy Cooperation Group | Method of encoding mode determination, method of motion estimation and encoding apparatus | ▪ maximum size, maximum size value ≈ current picture ▪ minimum size ≈ other ones | X | X | |||||
48 | CN1857001A (P·托思, 2006) | (Original Assignee) Amt先进多媒体科技公司 | 混合视频压缩方法 | ▪ video blocks ≈ 块进行编码 ▪ size NxN ≈ 余弦变换 ▪ generating syntax information ≈ 从而产生 ▪ minimum size, minimum size value ≈ 的尺寸, 尺寸的 | X | X | X | X | |||
49 | WO2004104930A2 (Peter Toth, 2004) | (Original Assignee) Amt Advanced Multimedia Technology Ab | Hybrid video compression method | ▪ first syntax, programmable processor ≈ processing element, data elements ▪ block pattern ≈ subsequent frame ▪ square partitions ≈ entropy coding ▪ length code ≈ quality values ▪ video blocks ≈ given block | X | X | X | X | X | ||
50 | US20070053436A1 (Lambertus Van Eggelen, 2007) | (Original Assignee) Koninklijke Philips NV (Current Assignee) Koninklijke Philips NV | Encoding video information using block based adaptive scan order | ▪ maximum size ≈ corresponding data blocks ▪ video blocks ≈ preceding image | X | X | X | X | |||
51 | US20040234144A1 (Kazuo Sugimoto, 2004) | (Original Assignee) NTT Docomo Inc (Current Assignee) NTT Docomo Inc | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program | ▪ maximum size ≈ predetermined shape ▪ minimum size value ≈ absolute values | X | X | |||||
52 | US20050276323A1 (Alexey Martemyanov, 2005) | (Original Assignee) Vanguard Video LLC (Current Assignee) Beamr Imaging Ltd | Real-time video coding/decoding | ▪ length code ≈ high compression efficiency, height h ▪ second syntax, first syntax ≈ motion vector refinement, performing texture ▪ square partitions ≈ transform coefficients, entropy coding ▪ block pattern ≈ block pattern | X | X | |||||
53 | CN101087424A (孙式军, 2007) | (Original Assignee) 夏普株式会社 | 图像内预测模式估计、通信和组织的方法和系统 | ▪ rectangular partitions ≈ 的左侧 ▪ minimum size value ≈ 平均值 | X | X | |||||
54 | CN1457196A (薛金柱, 2003) | (Original Assignee) 北京工业大学 | 基于时空域相关性运动矢量预测的视频编码方法 | ▪ minimum size value ≈ 一个视频 ▪ generating syntax information ≈ 基于运动 | X | X | |||||
55 | EP1478190A1 (Satoru c/o NTT DoCoMo Inc. ADACHI, 2004) | (Original Assignee) NTT Docomo Inc (Current Assignee) NTT Docomo Inc | Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program | ▪ maximum size ≈ predetermined shape ▪ minimum size value ≈ absolute values | X | X | |||||
56 | CN1647541A (J·莱恩马, 2005) | (Original Assignee) 诺基亚有限公司 | 用于在视频编码系统中指示量化器参数的方法与设备 | ▪ minimum size value ≈ 一个视频 ▪ first syntax ≈ 多个分 | X | X | |||||
57 | US20030179940A1 (Chih-Lung Lin, 2003) | (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC | Efficient macroblock header coding for video compression | ▪ block pattern ≈ length coding table, block pattern ▪ four square partitions, minimum size value ≈ chrominance values ▪ readable medium ≈ readable medium ▪ rectangular partitions ≈ color values ▪ length code ≈ length code | X | X | |||||
58 | US20030156648A1 (Thomas Holcomb, 2003) | (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC | Sub-block transform coding of prediction residuals | ▪ video blocks, four square partitions ≈ motion prediction, main data ▪ second syntax ≈ video encoding ▪ second syntax element ≈ video encoder ▪ integer multiple ≈ n bits | X | X | X | X | |||
59 | US7200275B2 (Sridhar Srinivasan, 2007) | (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC | Skip macroblock coding | ▪ readable medium ≈ readable medium ▪ second syntax ≈ video encoding | X | X | |||||
60 | US7206456B2 (Miska Hannuksela, 2007) | (Original Assignee) Nokia Oyj (Current Assignee) Nokia Technologies Oy | Video coding and decoding | ▪ syntax information ≈ coding elements ▪ second syntax ≈ video encoding | X | X | X | X | |||
61 | JP2002359852A (Sen Bun Chun, 2002) | (Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社 | 画像予測復号化装置及び方法 | ▪ minimum size ≈ 量子化ステップサイズ ▪ second syntax ≈ 手段と ▪ length code ≈ 長復号 | X | X | |||||
62 | JPH11164305A (Kotaro Asai, 1999) | (Original Assignee) Mitsubishi Electric Corp; 三菱電機株式会社 | 動画像符号化方法、動画像符号化装置および動画像復号装置 | ▪ rectangular partitions ≈ のクラス ▪ minimum size value ≈ 示す複数 | X | X | |||||
63 | US5748789A (Ming-Chieh Lee, 1998) | (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC | Transparent block skipping in object-based video coding systems | ▪ readable medium ≈ readable medium ▪ block pattern ≈ block pattern ▪ generating syntax information ≈ video coding ▪ length code ≈ length code | X | X | |||||
64 | US6125143A (Teruhiko Suzuki, 2000) | (Original Assignee) Sony Corp (Current Assignee) Ironworks Patents LLC | Picture encoding device and method thereof, picture decoding device and method thereof, and recording medium | ▪ square partitions ≈ transform coefficients ▪ maximum size, minimum size ≈ mean value | X | X | |||||
65 | US5691770A (Gerrit J. Keesman, 1997) | (Original Assignee) US Philips Corp (Current Assignee) US Philips Corp | Device and method for coding video pictures | ▪ integer multiple ≈ spatial frequency ▪ rectangular partitions ≈ same amplitude | X | X | |||||
66 | EP0727910A2 (Max Chien, 1996) | (Original Assignee) Thomson Multimedia SA (Current Assignee) Technicolor SA | Temporal-spatial error concealment apparatus and method for video signal processors | ▪ readable medium ≈ said blocks ▪ block pattern ≈ bad block | X | X | |||||
67 | US5576767A (Chong U. Lee, 1996) | (Original Assignee) Qualcomm Inc (Current Assignee) Qualcomm Inc | Interframe video encoding and decoding system | ▪ block pattern ≈ additional reference ▪ readable medium ≈ said blocks | X | X | |||||
68 | US5768434A (Xiaonong Ran, 1998) | (Original Assignee) National Semiconductor Corp (Current Assignee) National Semiconductor Corp | Quadtree-structured walsh transform coding | ▪ minimum size value ≈ predetermined threshold value, absolute values ▪ square partitions ≈ second partition, first partition ▪ block pattern ≈ two blocks | X | X | |||||
69 | US5666461A (Katsuji Igarashi, 1997) | (Original Assignee) Sony Corp (Current Assignee) Sony Corp | High efficiency encoding and decoding of picture signals and recording medium containing same | ▪ readable medium ≈ said recording medium ▪ video blocks ≈ picture portion | X | X | X | X | |||
70 | US5793897A (Jae-moon Jo, 1998) | (Original Assignee) Samsung Electronics Co Ltd (Current Assignee) Samsung Electronics Co Ltd | Adaptive variable-length coding and decoding methods for image data | ▪ square partitions ≈ transform coefficients ▪ block pattern ≈ length coding table ▪ length code ≈ length code | X | X | |||||
71 | US5623311A (Larry Phillips, 1997) | (Original Assignee) Panasonic Corp of North America (Current Assignee) Panasonic Corp of North America | MPEG video decoder having a high bandwidth memory | ▪ video blocks ≈ inverse discrete cosine transform ▪ second syntax element ≈ luminance component ▪ four square partitions, minimum size value ≈ chrominance values | X | X | X | X | |||
72 | US5594504A (Touradj Ebrahimi, 1997) | (Original Assignee) Nokia of America Corp (Current Assignee) Nokia of America Corp | Predictive video coding using a motion vector updating routine | ▪ video blocks ≈ motion estimation unit ▪ maximum size ≈ predetermined shape ▪ square partitions, four square partitions ≈ value z | X | X | X | X | |||
73 | US5422676A (Carsten Herpel, 1995) | (Original Assignee) Deutsche Thomson Brandt GmbH | System for coding an image representative signal | ▪ block pattern ≈ second blocks ▪ readable medium ≈ said blocks | X | X | |||||
74 | US5091782A (Edward A. Krause, 1992) | (Original Assignee) Arris Technology Inc (Current Assignee) CIF LICENSING LLC | Apparatus and method for adaptively compressing successive blocks of digital video | ▪ square partitions ≈ transform coefficients ▪ video blocks ≈ transforming means ▪ readable medium ≈ said blocks | X | X | X | X | |||
75 | EP1623577A1 (Lambertus A. Van Eggelen, 2006) | (Original Assignee) Koninklijke Philips NV (Current Assignee) Koninklijke Philips NV | Encoding of video information using block based adaptive scan order | ▪ maximum size ≈ corresponding data blocks ▪ video blocks ≈ preceding image | X | X | X | X | |||
76 | JP3910594B2 (芳美 井須, 2007) | (Original Assignee) 三菱電機株式会社 | 画像符号化装置 | ▪ syntax information ≈ ビットストリーム ▪ second syntax ≈ 手段と | X | X | X | X | |||
77 | GB2382940A (Miska Hannuksela, 2003) | (Original Assignee) Nokia Oyj (Current Assignee) Nokia Oyj | Encoding objects and background blocks | ▪ syntax information ≈ coding elements ▪ second syntax ≈ video encoding ▪ second syntax element ≈ video encoder ▪ generating syntax information ≈ video coding | X | X | X | X | |||
78 | JPH082106B2 (洋一 加藤, 1996) | (Original Assignee) 国際電信電話株式会社; 富士通株式会社; 日本電信電話株式会社; 日本電気株式会社 | 動画像信号のハイブリツド符号化方式 | ▪ maximum size, maximum size value ≈ ブロックサイズ ▪ generate syntax information ≈ 変換部 | X | X |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | KR100809298B1 Filed: 2006-08-16 Issued: 2008-03-04 플래그 인코딩 방법, 플래그 디코딩 방법, 및 상기 방법을이용한 장치 (Original Assignee) 삼성전자주식회사 이교혁, 이배근, 이태미, 한우진 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
KR100809298B1 CLAIM 7 제1항에 있어서 , 상기 플래그는 Cbp(coded block pattern (block pattern) ) 또는 잔차 예측 플래그인 플래그 인코딩 장치 . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
KR100809298B1 CLAIM 7 제1항에 있어서 , 상기 플래그는 Cbp(coded block pattern (block pattern) ) 또는 잔차 예측 플래그인 플래그 인코딩 장치 . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20080002770A1 Filed: 2006-06-30 Issued: 2008-01-03 Methods, apparatus, and a computer program product for providing a fast inter mode decision for video encoding in resource constrained devices (Original Assignee) Nokia Oyj (Current Assignee) Nokia Oyj Kemal Ugur, Jani Lainema |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (first encoding mode) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (second partition, first partition, th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20080002770A1 CLAIM 1 . A method of selecting a mode for encoding a macroblock using motion compensated prediction , the method comprising : extracting at least one motion vector from at least one macroblock of a video frame , the at least one macroblock comprising a first plurality of inter modes having a plurality of block sizes ; generating at least one prediction for the macroblock based on the at least one motion vector by analyzing a reference frame ; and comparing a distortion value to a first predetermined threshold and selecting a first encoding mode (video blocks) among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold . US20080002770A1 CLAIM 7 . A method according to claim 4 , further comprising : determining whether the summation of a first distortion value and a second distortion value exceeds a fourth predetermined threshold , wherein the first distortion value and the second distortion value correspond to a first partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of a second block size among the plurality of block sizes ; estimating the motion corresponding to the first partition when the summation of the first distortion value and the second distortion value exceeds the fourth predetermined threshold ; and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the first partition when the summation of the first distortion value and the second distortion value is less than the fourth predetermined threshold . US20080002770A1 CLAIM 8 . A method according to claim 7 , further comprising : determining whether the summation of a third distortion value and a fourth distortion value exceeds the fourth predetermined threshold , wherein the third distortion value and the fourth distortion value correspond to a second partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of the second block size among the plurality of block sizes ; estimating the motion corresponding to the second partition when the summation of the third distortion value and the fourth distortion value exceeds the fourth predetermined threshold ; and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the second partition when the summation of the third distortion value and the fourth distortion value is less than the fourth predetermined threshold . US20080002770A1 CLAIM 10 . A method according to claim 9 , further comprising : determining whether the summation of a sixth distortion value and a seventh distortion value exceeds the fourth predetermined threshold , wherein the sixth distortion value and the seventh distortion value corresponds to a fourth partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of the third block size among the plurality of block sizes ; estimating the motion corresponding to the fourth partition when the summation of the sixth distortion value and the seventh distortion value exceeds the fourth predetermined threshold ; and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the fourth partition when the summation of the sixth distortion value and the seventh distortion value is less than the fourth predetermined threshold . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (first encoding mode) . |
US20080002770A1 CLAIM 1 . A method of selecting a mode for encoding a macroblock using motion compensated prediction , the method comprising : extracting at least one motion vector from at least one macroblock of a video frame , the at least one macroblock comprising a first plurality of inter modes having a plurality of block sizes ; generating at least one prediction for the macroblock based on the at least one motion vector by analyzing a reference frame ; and comparing a distortion value to a first predetermined threshold and selecting a first encoding mode (video blocks) among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (first encoding mode) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (second partition, first partition, th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20080002770A1 CLAIM 1 . A method of selecting a mode for encoding a macroblock using motion compensated prediction , the method comprising : extracting at least one motion vector from at least one macroblock of a video frame , the at least one macroblock comprising a first plurality of inter modes having a plurality of block sizes ; generating at least one prediction for the macroblock based on the at least one motion vector by analyzing a reference frame ; and comparing a distortion value to a first predetermined threshold and selecting a first encoding mode (video blocks) among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold . US20080002770A1 CLAIM 7 . A method according to claim 4 , further comprising : determining whether the summation of a first distortion value and a second distortion value exceeds a fourth predetermined threshold , wherein the first distortion value and the second distortion value correspond to a first partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of a second block size among the plurality of block sizes ; estimating the motion corresponding to the first partition when the summation of the first distortion value and the second distortion value exceeds the fourth predetermined threshold ; and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the first partition when the summation of the first distortion value and the second distortion value is less than the fourth predetermined threshold . US20080002770A1 CLAIM 8 . A method according to claim 7 , further comprising : determining whether the summation of a third distortion value and a fourth distortion value exceeds the fourth predetermined threshold , wherein the third distortion value and the fourth distortion value correspond to a second partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of the second block size among the plurality of block sizes ; estimating the motion corresponding to the second partition when the summation of the third distortion value and the fourth distortion value exceeds the fourth predetermined threshold ; and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the second partition when the summation of the third distortion value and the fourth distortion value is less than the fourth predetermined threshold . US20080002770A1 CLAIM 10 . A method according to claim 9 , further comprising : determining whether the summation of a sixth distortion value and a seventh distortion value exceeds the fourth predetermined threshold , wherein the sixth distortion value and the seventh distortion value corresponds to a fourth partition (square partitions, four square partitions) of the at least one macroblock when the at least one macroblock consists of the third block size among the plurality of block sizes ; estimating the motion corresponding to the fourth partition when the summation of the sixth distortion value and the seventh distortion value exceeds the fourth predetermined threshold ; and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the fourth partition when the summation of the sixth distortion value and the seventh distortion value is less than the fourth predetermined threshold . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (first encoding mode) . |
US20080002770A1 CLAIM 1 . A method of selecting a mode for encoding a macroblock using motion compensated prediction , the method comprising : extracting at least one motion vector from at least one macroblock of a video frame , the at least one macroblock comprising a first plurality of inter modes having a plurality of block sizes ; generating at least one prediction for the macroblock based on the at least one motion vector by analyzing a reference frame ; and comparing a distortion value to a first predetermined threshold and selecting a first encoding mode (video blocks) among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20060203905A1 Filed: 2005-03-14 Issued: 2006-09-14 Video coding system (Original Assignee) National Kaohsiung First University of Science and Technology (Current Assignee) National Kaohsiung First University of Science and Technology Shih-Chang Hsia |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (two blocks, same time) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (processing speed) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size (horizontal resolution) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20060203905A1 CLAIM 6 . A video coding system with the apparatus for interface to apply the new memory addressing , comprising The memory addressing control , address decoder and internal storage cell capable of being merged into one body as a memory core to implement full video encoder (second syntax element) ; the system includes six address generators (AG 1 ˜AG 6) ; the internal storage cell being consisted with input memory M 1 and frame memory M 2 . US20060203905A1 CLAIM 7 . The video coding system as claimed in claim 6 , further comprising the timing schedule . MB 1 , MB 2 . . . are continuous macro-blocks . For real-time processing , a pipelined schedule could be employed . In the first time , the motion estimation for MB 1 block is performed , where other MBs are idle . As motion vector of MB 1 is found , the DCT processor can transform the differential values of input pixel (from M 1 memory) and the reference frame (from M 2 memory) in the second time . At the same time (block pattern) , MB 2 is processed in the motion estimation engine . In the third time , the DCT coefficients of MB 1 could be performed by quantization and de-quantization procedures . Then the pixels are reconstructed from IDCT , and written into the frame memory for motion compensation . Simultaneously , the motion estimation for MB 3 and DCT transformation for MB 2 are fulfilled . US20060203905A1 CLAIM 8 . The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, minimum size value) . Two banks are executed with interlaced operations for real-time data access . US20060203905A1 CLAIM 12 . The video coding system as claimed in claim 8 , wherein there are two blocks (block pattern) delaying between input and output in the frame memory . The write address (WA) could easily find from the read address (RA) added the offset value . US20060203905A1 CLAIM 20 . The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (two blocks, same time) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (processing speed) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size (horizontal resolution) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20060203905A1 CLAIM 6 . A video coding system with the apparatus for interface to apply the new memory addressing , comprising The memory addressing control , address decoder and internal storage cell capable of being merged into one body as a memory core to implement full video encoder (second syntax element) ; the system includes six address generators (AG 1 ˜AG 6) ; the internal storage cell being consisted with input memory M 1 and frame memory M 2 . US20060203905A1 CLAIM 7 . The video coding system as claimed in claim 6 , further comprising the timing schedule . MB 1 , MB 2 . . . are continuous macro-blocks . For real-time processing , a pipelined schedule could be employed . In the first time , the motion estimation for MB 1 block is performed , where other MBs are idle . As motion vector of MB 1 is found , the DCT processor can transform the differential values of input pixel (from M 1 memory) and the reference frame (from M 2 memory) in the second time . At the same time (block pattern) , MB 2 is processed in the motion estimation engine . In the third time , the DCT coefficients of MB 1 could be performed by quantization and de-quantization procedures . Then the pixels are reconstructed from IDCT , and written into the frame memory for motion compensation . Simultaneously , the motion estimation for MB 3 and DCT transformation for MB 2 are fulfilled . US20060203905A1 CLAIM 8 . The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, minimum size value) . Two banks are executed with interlaced operations for real-time data access . US20060203905A1 CLAIM 12 . The video coding system as claimed in claim 8 , wherein there are two blocks (block pattern) delaying between input and output in the frame memory . The write address (WA) could easily find from the read address (RA) added the offset value . US20060203905A1 CLAIM 20 . The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US7362804B2 Filed: 2003-11-24 Issued: 2008-04-22 Graphical symbols for H.264 bitstream syntax elements (Original Assignee) LSI Corp (Current Assignee) Avago Technologies International Sales Pte Ltd Pavel Novotny, Guy Cote, Lowell L. Winger, Simon Booth |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (syntax elements) for the coded unit wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value (maximum values) , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US7362804B2 CLAIM 1 . An apparatus comprising : a first circuit configured to generate a decoded video signal and syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) in response to an encoded bitstream ; and a second circuit configured to generate one or more overlay images in response to said syntax elements , wherein (a) said one or more overlay images are displayed overlaying a corresponding decoded video image , (b) said one or more overlay images comprise one or more graphic symbols representing said syntax elements of said encoded bitstream and (c) said one or more overlay images comprise at least one image selected from the group consisting of (i) an image displaying statistics about said decoded video signal including resolution , frame rate , bit rate , a bit rate graph , bitstream errors , peak signal to noise ratio (PSNR) , average PSNR and a PSNR graph , wherein one or more scales of said bit rate graph and said PSNR graph are dynamically adjusted to display minimum and maximum values (maximum size value) , (ii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing partitioning and encoding information for the respective macroblock is shown , (iii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock size parameter for the respective macroblock and a number representing a quantization parameter for the respective macroblock are shown , and (iv) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock level adaptive frame/field (MBAFF) coding mode of the respective macroblock is shown . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
US7362804B2 CLAIM 1 . An apparatus comprising : a first circuit configured to generate a decoded video signal and syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) in response to an encoded bitstream ; and a second circuit configured to generate one or more overlay images in response to said syntax elements , wherein (a) said one or more overlay images are displayed overlaying a corresponding decoded video image , (b) said one or more overlay images comprise one or more graphic symbols representing said syntax elements of said encoded bitstream and (c) said one or more overlay images comprise at least one image selected from the group consisting of (i) an image displaying statistics about said decoded video signal including resolution , frame rate , bit rate , a bit rate graph , bitstream errors , peak signal to noise ratio (PSNR) , average PSNR and a PSNR graph , wherein one or more scales of said bit rate graph and said PSNR graph are dynamically adjusted to display minimum and maximum values , (ii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing partitioning and encoding information for the respective macroblock is shown , (iii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock size parameter for the respective macroblock and a number representing a quantization parameter for the respective macroblock are shown , and (iv) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock level adaptive frame/field (MBAFF) coding mode of the respective macroblock is shown . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (syntax elements) for the coded unit , wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value (maximum values) , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US7362804B2 CLAIM 1 . An apparatus comprising : a first circuit configured to generate a decoded video signal and syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) in response to an encoded bitstream ; and a second circuit configured to generate one or more overlay images in response to said syntax elements , wherein (a) said one or more overlay images are displayed overlaying a corresponding decoded video image , (b) said one or more overlay images comprise one or more graphic symbols representing said syntax elements of said encoded bitstream and (c) said one or more overlay images comprise at least one image selected from the group consisting of (i) an image displaying statistics about said decoded video signal including resolution , frame rate , bit rate , a bit rate graph , bitstream errors , peak signal to noise ratio (PSNR) , average PSNR and a PSNR graph , wherein one or more scales of said bit rate graph and said PSNR graph are dynamically adjusted to display minimum and maximum values (maximum size value) , (ii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing partitioning and encoding information for the respective macroblock is shown , (iii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock size parameter for the respective macroblock and a number representing a quantization parameter for the respective macroblock are shown , and (iv) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock level adaptive frame/field (MBAFF) coding mode of the respective macroblock is shown . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
US7362804B2 CLAIM 1 . An apparatus comprising : a first circuit configured to generate a decoded video signal and syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) in response to an encoded bitstream ; and a second circuit configured to generate one or more overlay images in response to said syntax elements , wherein (a) said one or more overlay images are displayed overlaying a corresponding decoded video image , (b) said one or more overlay images comprise one or more graphic symbols representing said syntax elements of said encoded bitstream and (c) said one or more overlay images comprise at least one image selected from the group consisting of (i) an image displaying statistics about said decoded video signal including resolution , frame rate , bit rate , a bit rate graph , bitstream errors , peak signal to noise ratio (PSNR) , average PSNR and a PSNR graph , wherein one or more scales of said bit rate graph and said PSNR graph are dynamically adjusted to display minimum and maximum values , (ii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing partitioning and encoding information for the respective macroblock is shown , (iii) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock size parameter for the respective macroblock and a number representing a quantization parameter for the respective macroblock are shown , and (iv) an image displaying a grid representing a number of macroblocks of said decoded video image , wherein for each macroblock a graphic symbol representing a macroblock level adaptive frame/field (MBAFF) coding mode of the respective macroblock is shown . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US7289674B2 Filed: 2003-06-10 Issued: 2007-10-30 Spatial prediction based intra coding (Original Assignee) Nokia Oyj (Current Assignee) Nokia Technologies Oy Marta Karczewicz |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (ascending order) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second blocks) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US7289674B2 CLAIM 8 . The method of claim 1 , wherein the derived prediction modes are representable by a plurality of different integers , each integer corresponding to a derived prediction mode , and wherein the integers are arranged in an ascending order (square partitions) of the integers for providing an ordered set , said method further comprising removing the integers corresponding to the first prediction modes from the ordered set for providing a modified ordered set , so as to allow the decoding device to determine the order of the second group based on the modified ordered set . US7289674B2 CLAIM 11 . The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (block pattern) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (ascending order) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second blocks) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US7289674B2 CLAIM 8 . The method of claim 1 , wherein the derived prediction modes are representable by a plurality of different integers , each integer corresponding to a derived prediction mode , and wherein the integers are arranged in an ascending order (square partitions) of the integers for providing an ordered set , said method further comprising removing the integers corresponding to the first prediction modes from the ordered set for providing a modified ordered set , so as to allow the decoding device to determine the order of the second group based on the modified ordered set . US7289674B2 CLAIM 11 . The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (block pattern) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US6980596B2 Filed: 2002-11-20 Issued: 2005-12-27 Macroblock level adaptive frame/field coding for digital video content (Original Assignee) Arris Technology Inc (Current Assignee) Google Technology Holdings LLC Limin Wang, Rajeev Gandhi, Krit Panusopone, Ajay Luthra |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (frame mode) or into four square partitions (processing two) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US6980596B2 CLAIM 1 . A method of encoding or decoding digital video content , said digital video content comprising a stream of pictures which can each be intra , predicted , or bi-predicted pictures , each of said pictures comprising macroblocks , said method comprising the steps of : selecting to encode or decode a group of neighboring macroblocks in a frame mode (rectangular partitions) or a field mode ; arranging said group of neighboring macroblocks for processing into frame macroblocks or field macroblocks according to the selected one of said frame mode or said field mode ; and encoding or decoding said frame macroblocks or said field macroblocks , wherein , the step of arranging said group of neighboring macroblocks for processing into frame macroblocks or field macroblocks includes the step of : in frame mode , jointly processing two (four square partitions) fields in said group of neighboring macroblocks , said group of neighboring macroblocks being divided into frame macroblocks , and each of said frame macroblocks including both top and bottom field pixels ; in said field mode , separately processing two fields of said group of neighboring macroblocks , said group of neighboring macroblocks being split into field macroblocks , and each of said field macroblocks including either top or bottom field pixels , and wherein said frame macroblocks and said field macroblocks are the same size . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (frame mode) or into four square partitions (processing two) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US6980596B2 CLAIM 1 . A method of encoding or decoding digital video content , said digital video content comprising a stream of pictures which can each be intra , predicted , or bi-predicted pictures , each of said pictures comprising macroblocks , said method comprising the steps of : selecting to encode or decode a group of neighboring macroblocks in a frame mode (rectangular partitions) or a field mode ; arranging said group of neighboring macroblocks for processing into frame macroblocks or field macroblocks according to the selected one of said frame mode or said field mode ; and encoding or decoding said frame macroblocks or said field macroblocks , wherein , the step of arranging said group of neighboring macroblocks for processing into frame macroblocks or field macroblocks includes the step of : in frame mode , jointly processing two (four square partitions) fields in said group of neighboring macroblocks , said group of neighboring macroblocks being divided into frame macroblocks , and each of said frame macroblocks including both top and bottom field pixels ; in said field mode , separately processing two fields of said group of neighboring macroblocks , said group of neighboring macroblocks being split into field macroblocks , and each of said field macroblocks including either top or bottom field pixels , and wherein said frame macroblocks and said field macroblocks are the same size . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US6795584B2 Filed: 2002-10-03 Issued: 2004-09-21 Context-based adaptive variable length coding for adaptive block transforms (Original Assignee) Nokia Oyj (Current Assignee) Nokia Technologies Oy Marta Karczewicz, Justin Ridge |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US6795584B2 CLAIM 12 . An image encoder for receiving image data and providing a bitstream indicative of the image data , characterized by : means for forming at least a block of transform coefficients from the image data , by means for scanning the block of transform coefficients for forming an ordered sequence of transform coefficients from the block , by a software program stored on a computer readable medium for sub-sampling the ordered sequence of transform coefficients in order to form a plurality of sub-sampled sequences of transform coefficients , by means for entropy coding (square partitions) the sub-sampled sequences of transform coefficients for provided signals indicative of the encoded transform coefficients , and by means , for providing the bitstream based on the signals . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US6795584B2 CLAIM 12 . An image encoder for receiving image data and providing a bitstream indicative of the image data , characterized by : means for forming at least a block of transform coefficients from the image data , by means for scanning the block of transform coefficients for forming an ordered sequence of transform coefficients from the block , by a software program stored on a computer readable medium for sub-sampling the ordered sequence of transform coefficients in order to form a plurality of sub-sampled sequences of transform coefficients , by means for entropy coding (square partitions) the sub-sampled sequences of transform coefficients for provided signals indicative of the encoded transform coefficients , and by means , for providing the bitstream based on the signals . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | JP2003319394A Filed: 2002-04-26 Issued: 2003-11-07 符号化装置および方法、復号装置および方法、記録媒体、並びにプログラム (Original Assignee) Sony Corp; ソニー株式会社 Kuhn Peter, Kazufumi Sato, Teruhiko Suzuki, Yoichi Yagasaki, クーン ピーター, 数史 佐藤, 陽一 矢ヶ崎, 輝彦 鈴木 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム, 前記圧縮) or into four square partitions (field flag) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
JP2003319394A CLAIM 1 【請求項1】 飛び越し走査の画像情報を入力として、 マクロブロックレベルで適応的にフィールドベースまた はフレームベースの符号化処理を施す符号化装置におい て、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換手段と (second syntax) 、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定手段と、 前記直交変換手段の処理結果を量子化する量子化手段 と、 前記量子化手段の処理結果を可逆符号化する可逆符号化 手段とを含むことを特徴とする符号化装置。 JP2003319394A CLAIM 20 【請求項20】 マクロブロックCのframe/field fla gに関するコンテキストモデルctx fifr flag(C)は、前 記マクロブロックCに隣接するマクロブロックA,Bそ れぞれのframe/field flag (four square partitions) の値a,bを用いて次式の ように定義されている ctx fifr flag(C)=a+2b ことを特徴とする請求項19に記載の符号化装置。 JP2003319394A CLAIM 27 【請求項27】 飛び越し走査の画像情報を入力とし て、マクロブロックレベルで適応的にフィールドベース またはフレームベースの符号化処理を施すプログラム (rectangular partitions) で あって、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換ステップと、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定ステップと、 前記直交変換ステップの処理結果を量子化する量子化ス テップと、 前記量子化ステップの処理結果を可逆符号化する可逆符 号化ステップとを含むことを特徴とするコンピュータが 読み取り可能なプログラムが記録されている記録媒体。 JP2003319394A CLAIM 29 【請求項29】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置において、 前記圧縮 (rectangular partitions) 画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号手段と、 前記量子化されている直交変換係数を逆量子化する逆量 子化手段と、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換手段と、 前記予測モード情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画像を生成する参照画 像生成手段とを含むことを特徴とする復号装置。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム, 前記圧縮) or into four square partitions (field flag) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (生成ステップ) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
JP2003319394A CLAIM 1 【請求項1】 飛び越し走査の画像情報を入力として、 マクロブロックレベルで適応的にフィールドベースまた はフレームベースの符号化処理を施す符号化装置におい て、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換手段と (second syntax) 、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定手段と、 前記直交変換手段の処理結果を量子化する量子化手段 と、 前記量子化手段の処理結果を可逆符号化する可逆符号化 手段とを含むことを特徴とする符号化装置。 JP2003319394A CLAIM 20 【請求項20】 マクロブロックCのframe/field fla gに関するコンテキストモデルctx fifr flag(C)は、前 記マクロブロックCに隣接するマクロブロックA,Bそ れぞれのframe/field flag (four square partitions) の値a,bを用いて次式の ように定義されている ctx fifr flag(C)=a+2b ことを特徴とする請求項19に記載の符号化装置。 JP2003319394A CLAIM 27 【請求項27】 飛び越し走査の画像情報を入力とし て、マクロブロックレベルで適応的にフィールドベース またはフレームベースの符号化処理を施すプログラム (rectangular partitions) で あって、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換ステップと、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定ステップと、 前記直交変換ステップの処理結果を量子化する量子化ス テップと、 前記量子化ステップの処理結果を可逆符号化する可逆符 号化ステップとを含むことを特徴とするコンピュータが 読み取り可能なプログラムが記録されている記録媒体。 JP2003319394A CLAIM 29 【請求項29】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置において、 前記圧縮 (rectangular partitions) 画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号手段と、 前記量子化されている直交変換係数を逆量子化する逆量 子化手段と、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換手段と、 前記予測モード情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画像を生成する参照画 像生成手段とを含むことを特徴とする復号装置。 JP2003319394A CLAIM 32 【請求項32】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置の復号方法において、 前記圧縮画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号ステップと、 前記量子化されている直交変換係数を逆量子化する逆量 子化ステップと、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換ステップと、 前記予測モード情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画像を生成する参照画 像生成ステップ (generating syntax information) とを含むことを特徴とする復号方法。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US6968012B1 Filed: 2001-10-02 Issued: 2005-11-22 Methods for encoding digital video for decoding on low performance devices (Original Assignee) Firepad Inc (Current Assignee) Nordic Interactive Technologies LLC Jens Meggers |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (memory block) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (corresponding blocks) ; generate syntax information (identifying means) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US6968012B1 CLAIM 1 . A method of encoding a video stream , comprising : (a) receiving a current video frame of the video stream ; (b) subdividing the current video frame into blocks of a selected block size , and comparing the blocks to corresponding blocks (block pattern) of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ; (c) repeating (b) using at least one additional block size , such that the current video frame is separately subdivided and compared to the preceding video frame using multiple block sizes ; (d) selecting from the multiple block sizes a block size that produces a best video compression result ; and (e) encoding the video frame in the video stream with the block size selected in (d) . US6968012B1 CLAIM 10 . The method as in claim 4 , wherein the contiguous memory block (video blocks) is compressed using an LZW-like compression algorithm . US6968012B1 CLAIM 18 . A data processing system comprising : a server computer comprising a video encoder (second syntax element) application for encoding video streams using multiple block size detection method ; a client device comprising a decoder application for decoding and displaying the encoded video streams ; and a video encoder module for encoding video streams using multiple block size detection method comprising block detection component , crunching component , and compressing component . US6968012B1 CLAIM 19 . A system for encoding and decoding video , the system comprising : identifying means (generate syntax information) for detecting pixel blocks which have changed from a previous decoded video frame ; storing means for copying the changed pixel blocks and a preamble block ; compressing means for compressing the changed pixel blocks and the preamble block ; and identifying means for selecting a smallest encoded frame . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (memory block) . |
US6968012B1 CLAIM 10 . The method as in claim 4 , wherein the contiguous memory block (video blocks) is compressed using an LZW-like compression algorithm . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (memory block) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (corresponding blocks) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US6968012B1 CLAIM 1 . A method of encoding a video stream , comprising : (a) receiving a current video frame of the video stream ; (b) subdividing the current video frame into blocks of a selected block size , and comparing the blocks to corresponding blocks (block pattern) of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ; (c) repeating (b) using at least one additional block size , such that the current video frame is separately subdivided and compared to the preceding video frame using multiple block sizes ; (d) selecting from the multiple block sizes a block size that produces a best video compression result ; and (e) encoding the video frame in the video stream with the block size selected in (d) . US6968012B1 CLAIM 10 . The method as in claim 4 , wherein the contiguous memory block (video blocks) is compressed using an LZW-like compression algorithm . US6968012B1 CLAIM 18 . A data processing system comprising : a server computer comprising a video encoder (second syntax element) application for encoding video streams using multiple block size detection method ; a client device comprising a decoder application for decoding and displaying the encoded video streams ; and a video encoder module for encoding video streams using multiple block size detection method comprising block detection component , crunching component , and compressing component . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (memory block) . |
US6968012B1 CLAIM 10 . The method as in claim 4 , wherein the contiguous memory block (video blocks) is compressed using an LZW-like compression algorithm . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US6959116B2 Filed: 2001-09-18 Issued: 2005-10-25 Largest magnitude indices selection for (run, level) encoding of a block coded picture (Original Assignee) EMC Corp (Current Assignee) EMC Corp Ugur Sezer, Sorin Faibish, Seyfullah H. Oguz, Wayne W. Duso |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (entropy encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US6959116B2 CLAIM 1 . A method of producing a (run , level) encoded picture from an original picture , said method comprising : producing respective sets of transform coefficients (square partitions) for blocks of pixels in the original picture ; quantizing transform coefficients in the respective sets of transform coefficients to produce respective sets of non-zero quantization indices for the blocks of pixels , wherein the non-zero quantization indices for at least some of the blocks are produced by using a quantization step size that is not uniform within said at least some of the blocks ; inspecting magnitudes of the non-zero quantization indices for selecting a limited number of largest magnitude non-zero quantization indices for the blocks of pixels in the respective sets of quantization indices to produce respective sets of selected quantization indices having non-zero levels for the blocks of pixels ; and (run , level) encoding quantization indices from the respective sets of selected quantization indices to produce the (run , level) encoded picture . US6959116B2 CLAIM 8 . The method as claimed in claim 7 , which includes , for each of the 8×8 blocks of the non-scalable MPEG-2 video : a .) parsing and copying a differential DC coefficient variable-length code (VLC) ; b .) parsing and decoding (run , level) event variable-length codes (VLCs) up to an end-of-block marker to identify a respective set of non-zero quantization indices ; c .) finding up to the selected number of non-zero quantization indices having the largest level magnitudes in the respective set of non-zero quantization indices to identify a respective set of largest magnitude non-zero quantization indices ; and d .) applying (run , level) event formation and entropy encoding (second syntax) to the set of largest magnitude non-zero quantization indices . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (entropy encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US6959116B2 CLAIM 1 . A method of producing a (run , level) encoded picture from an original picture , said method comprising : producing respective sets of transform coefficients (square partitions) for blocks of pixels in the original picture ; quantizing transform coefficients in the respective sets of transform coefficients to produce respective sets of non-zero quantization indices for the blocks of pixels , wherein the non-zero quantization indices for at least some of the blocks are produced by using a quantization step size that is not uniform within said at least some of the blocks ; inspecting magnitudes of the non-zero quantization indices for selecting a limited number of largest magnitude non-zero quantization indices for the blocks of pixels in the respective sets of quantization indices to produce respective sets of selected quantization indices having non-zero levels for the blocks of pixels ; and (run , level) encoding quantization indices from the respective sets of selected quantization indices to produce the (run , level) encoded picture . US6959116B2 CLAIM 8 . The method as claimed in claim 7 , which includes , for each of the 8×8 blocks of the non-scalable MPEG-2 video : a . ) parsing and copying a differential DC coefficient variable-length code (VLC) ; b . ) parsing and decoding (run , level) event variable-length codes (VLCs) up to an end-of-block marker to identify a respective set of non-zero quantization indices ; c . ) finding up to the selected number of non-zero quantization indices having the largest level magnitudes in the respective set of non-zero quantization indices to identify a respective set of largest magnitude non-zero quantization indices ; and d . ) applying (run , level) event formation and entropy encoding (second syntax) to the set of largest magnitude non-zero quantization indices . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US6529634B1 Filed: 1999-11-08 Issued: 2003-03-04 Contrast sensitive variance based adaptive block size DCT image compression (Original Assignee) Qualcomm Inc (Current Assignee) Qualcomm Inc Kadayam Thyagarajan, Michael J. Merritt |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (transforming means, main data) of size NxN , where N is an integer multiple (th column) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (transforming means, main data) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (minimum block size) ; generate syntax information (Huffman code) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US6529634B1 CLAIM 3 . The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (block pattern) of pixel data . US6529634B1 CLAIM 4 . An image compression system for compressing a block of pixel data , comprising : block size assignment means for selecting said block or subdivided blocks of said block to be compressed based on the variances of pixel values of said block of pixel data and subdivided blocks of said block of pixel data , wherein said block size assignment means determines a variance of pixel values for said block of pixel data , compares said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated , makes a decision to subdivide said block if said variance is greater than said predetermined threshold , if said decision is to subdivide said block , then repeats the determination of variance , the comparison with said predetermined threshold , and the decision to subdivide for each subdivided block until a predetermined criteria is satisfied , and designates as said block size assignment each block that is not further subdivided ; transform means for transforming pixel data of said selected block or subdivided blocks into frequency domain data (video blocks, four square partitions) ; quantizer means for quantizing said frequency domain data ; serializer means for scanning said quantized data into a serialized stream of data ; and variable length coding means for coding said serialized stream of data in preparation for transmission . US6529634B1 CLAIM 11 . The system of claim 4 wherein said variable length coding means comprises a Huffman code (generate syntax information, generating syntax information) r . US6529634B1 CLAIM 24 . The method of claim 1 , wherein said variance is determined by the following equation : var = 1 N 2 ∑ i = 0 N - 1 ∑ j = 0 N - 1 x i , j 2 - (1 N 2 ∑ i = 0 N - 1 ∑ j = 0 N - 1 x i , j) 2 where N=dimension of the block , and x ij is the pixel in the i th row , j th column (integer multiple) within the N×N block . US6529634B1 CLAIM 30 . The system of claim 27 , wherein said transforming means (video blocks, four square partitions) utilizes the Discrete Cosine Transform . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (transforming means, main data) . |
US6529634B1 CLAIM 4 . An image compression system for compressing a block of pixel data , comprising : block size assignment means for selecting said block or subdivided blocks of said block to be compressed based on the variances of pixel values of said block of pixel data and subdivided blocks of said block of pixel data , wherein said block size assignment means determines a variance of pixel values for said block of pixel data , compares said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated , makes a decision to subdivide said block if said variance is greater than said predetermined threshold , if said decision is to subdivide said block , then repeats the determination of variance , the comparison with said predetermined threshold , and the decision to subdivide for each subdivided block until a predetermined criteria is satisfied , and designates as said block size assignment each block that is not further subdivided ; transform means for transforming pixel data of said selected block or subdivided blocks into frequency domain data (video blocks, four square partitions) ; quantizer means for quantizing said frequency domain data ; serializer means for scanning said quantized data into a serialized stream of data ; and variable length coding means for coding said serialized stream of data in preparation for transmission . US6529634B1 CLAIM 30 . The system of claim 27 , wherein said transforming means (video blocks, four square partitions) utilizes the Discrete Cosine Transform . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (transforming means, main data) of size NxN , where N is an integer multiple (th column) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (transforming means, main data) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (minimum block size) ; generating syntax information (Huffman code) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US6529634B1 CLAIM 3 . The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (block pattern) of pixel data . US6529634B1 CLAIM 4 . An image compression system for compressing a block of pixel data , comprising : block size assignment means for selecting said block or subdivided blocks of said block to be compressed based on the variances of pixel values of said block of pixel data and subdivided blocks of said block of pixel data , wherein said block size assignment means determines a variance of pixel values for said block of pixel data , compares said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated , makes a decision to subdivide said block if said variance is greater than said predetermined threshold , if said decision is to subdivide said block , then repeats the determination of variance , the comparison with said predetermined threshold , and the decision to subdivide for each subdivided block until a predetermined criteria is satisfied , and designates as said block size assignment each block that is not further subdivided ; transform means for transforming pixel data of said selected block or subdivided blocks into frequency domain data (video blocks, four square partitions) ; quantizer means for quantizing said frequency domain data ; serializer means for scanning said quantized data into a serialized stream of data ; and variable length coding means for coding said serialized stream of data in preparation for transmission . US6529634B1 CLAIM 11 . The system of claim 4 wherein said variable length coding means comprises a Huffman code (generate syntax information, generating syntax information) r . US6529634B1 CLAIM 24 . The method of claim 1 , wherein said variance is determined by the following equation : var = 1 N 2 ∑ i = 0 N - 1 ∑ j = 0 N - 1 x i , j 2 - (1 N 2 ∑ i = 0 N - 1 ∑ j = 0 N - 1 x i , j) 2 where N=dimension of the block , and x ij is the pixel in the i th row , j th column (integer multiple) within the N×N block . US6529634B1 CLAIM 30 . The system of claim 27 , wherein said transforming means (video blocks, four square partitions) utilizes the Discrete Cosine Transform . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (transforming means, main data) . |
US6529634B1 CLAIM 4 . An image compression system for compressing a block of pixel data , comprising : block size assignment means for selecting said block or subdivided blocks of said block to be compressed based on the variances of pixel values of said block of pixel data and subdivided blocks of said block of pixel data , wherein said block size assignment means determines a variance of pixel values for said block of pixel data , compares said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated , makes a decision to subdivide said block if said variance is greater than said predetermined threshold , if said decision is to subdivide said block , then repeats the determination of variance , the comparison with said predetermined threshold , and the decision to subdivide for each subdivided block until a predetermined criteria is satisfied , and designates as said block size assignment each block that is not further subdivided ; transform means for transforming pixel data of said selected block or subdivided blocks into frequency domain data (video blocks, four square partitions) ; quantizer means for quantizing said frequency domain data ; serializer means for scanning said quantized data into a serialized stream of data ; and variable length coding means for coding said serialized stream of data in preparation for transmission . US6529634B1 CLAIM 30 . The system of claim 27 , wherein said transforming means (video blocks, four square partitions) utilizes the Discrete Cosine Transform . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US6539124B2 Filed: 1999-08-17 Issued: 2003-03-25 Quantizer selection based on region complexities derived using a rate distortion model (Original Assignee) Sarnoff Corp (Current Assignee) MediaTek Inc Sriram Sethuraman, Ravi Krishnamurthy |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (absolute differences) , the minimum size value indicating a size of the smallest partition for the coded unit . |
US6539124B2 CLAIM 7 . The invention of claim 6 , wherein the distortion measure S is based on a sum of absolute differences (minimum size value) (SAD) measure . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (absolute differences) , the minimum size value indicating a size of a smallest partition for the coded unit . |
US6539124B2 CLAIM 7 . The invention of claim 6 , wherein the distortion measure S is based on a sum of absolute differences (minimum size value) (SAD) measure . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US6215910B1 Filed: 1996-03-28 Issued: 2001-04-10 Table-based compression with embedded coding (Original Assignee) Microsoft Corp (Current Assignee) V-XSTREAM Inc ; Microsoft Technology Licensing LLC Navin Chaddha |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (embedded code) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US6215910B1 CLAIM 1 . A method of compressing an image comprising the steps of : dividing the image into blocks of vectors comprising bits , each vector in a block representing a pixel of the image ; reducing the number of vectors that must be compressed by use of full search hierarchical lookup tables incorporating block transforms to provide indices ; and mapping the indices to a set of embedded code (generate syntax information) s by use of a tree structured table . US6215910B1 CLAIM 3 . The method of claim 2 wherein the hierarchical lookup tables contain distortion measures based on human visual sensitivity to quantization errors in different transform coefficients (square partitions) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US6215910B1 CLAIM 3 . The method of claim 2 wherein the hierarchical lookup tables contain distortion measures based on human visual sensitivity to quantization errors in different transform coefficients (square partitions) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | JPH08205140A Filed: 1995-01-31 Issued: 1996-08-09 画像圧縮装置 (Original Assignee) Canon Inc; キヤノン株式会社 Koji Hirabayashi, 康二 平林 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (ブロックサイズ, 小さいサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size (ブロックサイズ, 小さいサイズ) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
JPH08205140A CLAIM 1 【請求項1】 所定の画素数から成るサイズを有するブ ロックデータについて冗長部分の大きさが所定値以上か 否かを判定し、所定値以下のときは上記ブロックデータ を上記サイズより小さいサイズ (maximum size, maximum size value, minimum size, minimum size value) を有するサブブロックに 分割し、所定値より大きいときは分割しないことを示す 分割情報を出力するブロックサイズ (maximum size, maximum size value, minimum size, minimum size value) 判定手段と (second syntax) 、 上記分割情報に基づいて上記ブロックデータを上記サブ ブロックに分割してサブブロックデータ又は上記ブロッ クデータを出力するサブブロック切出し手段と、 上記サブブロックデータ又は上記ブロックデータをその サイズに応じた処理により圧縮符号化する符号化手段と を備えた画像圧縮装置。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (ブロックサイズ, 小さいサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size (ブロックサイズ, 小さいサイズ) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
JPH08205140A CLAIM 1 【請求項1】 所定の画素数から成るサイズを有するブ ロックデータについて冗長部分の大きさが所定値以上か 否かを判定し、所定値以下のときは上記ブロックデータ を上記サイズより小さいサイズ (maximum size, maximum size value, minimum size, minimum size value) を有するサブブロックに 分割し、所定値より大きいときは分割しないことを示す 分割情報を出力するブロックサイズ (maximum size, maximum size value, minimum size, minimum size value) 判定手段と (second syntax) 、 上記分割情報に基づいて上記ブロックデータを上記サブ ブロックに分割してサブブロックデータ又は上記ブロッ クデータを出力するサブブロック切出し手段と、 上記サブブロックデータ又は上記ブロックデータをその サイズに応じた処理により圧縮符号化する符号化手段と を備えた画像圧縮装置。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5107345A Filed: 1991-05-28 Issued: 1992-04-21 Adaptive block size image compression method and system (Original Assignee) Qualcomm Inc (Current Assignee) Qualcomm Inc Chong U. Lee |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform, compressed image) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (count values) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (Huffman code) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5107345A CLAIM 17 . The system of claim 8 wherein said encoder means comprises : a Huffman code (generate syntax information, generating syntax information) r having an input coupled to said ordering means and an output coupled to said assembler means . US5107345A CLAIM 19 . A method for image signal compression of a block of pixel data comprising the steps of : performing a 16×16 discrete cosine transform (DCT) operation on an 16×16 block of pixel data so as to provide a corresponding 16×16 block of DCT coefficient values ; performing four 8×8 DCT operations on said block of pixel data so as to provide a corresponding block of four 8×8 DCT coefficient value sub-blocks ; performing sixteen 4×4 DCT operations on said block of pixel data so as to performing sixty-four 2×2 DCT operations on said block of pixel data so as to provide a corresponding block of sixty-four 2×2 DCT coefficient value sub-blocks ; determining bit count values (rectangular partitions) corresponding to a number of bits required to respectively encode said 16×16 block of DCT coefficient values and predetermined groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values according to a predetermined coding format ; determining , from said bit count values , ones of said 16×16 block of DCT coefficient values and said corresponding groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values requiring a lesser number of bits to encode according to said predetermined coding format so as to provide a corresponding selection value ; selecting ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values in accordance with said selection value ; generating a composite block of DCT coefficient values formed from said selected ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values ; ordering said composite block of DCT coefficient values according to a predetermined ordering format so as to provide a corresponding output of ordered DCT coefficient values ; encoding said ordered DCT coefficient values according to said predetermined coding format so as to provide corresponding coded values ; and assembling said selection value and said coded values in a predetermined order as a coded image value representative of said block of pixel data . US5107345A CLAIM 23 . A system for decoding a compressed image (video blocks) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ; decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ; ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ; inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ; and combiner means for , receiving said one of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data . US5107345A CLAIM 28 . The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ; second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ; third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ; and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform, compressed image) . |
US5107345A CLAIM 23 . A system for decoding a compressed image (video blocks) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ; decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ; ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ; inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ; and combiner means for , receiving said one of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data . US5107345A CLAIM 28 . The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ; second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ; third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ; and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform, compressed image) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (count values) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (Huffman code) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5107345A CLAIM 17 . The system of claim 8 wherein said encoder means comprises : a Huffman code (generate syntax information, generating syntax information) r having an input coupled to said ordering means and an output coupled to said assembler means . US5107345A CLAIM 19 . A method for image signal compression of a block of pixel data comprising the steps of : performing a 16×16 discrete cosine transform (DCT) operation on an 16×16 block of pixel data so as to provide a corresponding 16×16 block of DCT coefficient values ; performing four 8×8 DCT operations on said block of pixel data so as to provide a corresponding block of four 8×8 DCT coefficient value sub-blocks ; performing sixteen 4×4 DCT operations on said block of pixel data so as to performing sixty-four 2×2 DCT operations on said block of pixel data so as to provide a corresponding block of sixty-four 2×2 DCT coefficient value sub-blocks ; determining bit count values (rectangular partitions) corresponding to a number of bits required to respectively encode said 16×16 block of DCT coefficient values and predetermined groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values according to a predetermined coding format ; determining , from said bit count values , ones of said 16×16 block of DCT coefficient values and said corresponding groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values requiring a lesser number of bits to encode according to said predetermined coding format so as to provide a corresponding selection value ; selecting ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values in accordance with said selection value ; generating a composite block of DCT coefficient values formed from said selected ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values ; ordering said composite block of DCT coefficient values according to a predetermined ordering format so as to provide a corresponding output of ordered DCT coefficient values ; encoding said ordered DCT coefficient values according to said predetermined coding format so as to provide corresponding coded values ; and assembling said selection value and said coded values in a predetermined order as a coded image value representative of said block of pixel data . US5107345A CLAIM 23 . A system for decoding a compressed image (video blocks) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ; decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ; ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ; inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ; and combiner means for , receiving said one of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data . US5107345A CLAIM 28 . The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ; second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ; third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ; and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform, compressed image) . |
US5107345A CLAIM 23 . A system for decoding a compressed image (video blocks) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ; decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ; ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ; inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ; and combiner means for , receiving said one of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data . US5107345A CLAIM 28 . The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ; second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ; third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ; and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | JP2008219870A Filed: 2008-01-28 Issued: 2008-09-18 動画像符号化方法及び動画像符号化装置 (Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社 Teo Han Boon, Chong Soon Lim, Takahiro Nishi, Yoji Shibahara, Toshiyasu Sugio, Kazuya Takagi, Viktor Wahadaniah, ハン ブーン テオ, ワハダニア ビクター, チョン スーン リム, 敏康 杉尾, 陽司 柴原, 孝啓 西, 一也 高木 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (小さいサイズ, 大きいサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size (小さいサイズ, 大きいサイズ) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
JP2008219870A CLAIM 4 前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (maximum size, minimum size, maximum size value, minimum size value) 又は所定の小さいサイズ (maximum size, minimum size, maximum size value, minimum size value) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。 JP2008219870A CLAIM 9 既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置に用いられる、コンピュータに実行させるためのプログラム (rectangular partitions) であって、 前記プログラムは、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出ステップと、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定ステップと、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化ステップと を含むことを特徴とするプログラム。 JP2008219870A CLAIM 10 既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出手段と (second syntax) 、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定手段と、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化手段と を備えることを特徴とする動画像符号化装置。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (小さいサイズ, 大きいサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size (小さいサイズ, 大きいサイズ) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
JP2008219870A CLAIM 4 前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (maximum size, minimum size, maximum size value, minimum size value) 又は所定の小さいサイズ (maximum size, minimum size, maximum size value, minimum size value) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。 JP2008219870A CLAIM 9 既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置に用いられる、コンピュータに実行させるためのプログラム (rectangular partitions) であって、 前記プログラムは、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出ステップと、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定ステップと、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化ステップと を含むことを特徴とするプログラム。 JP2008219870A CLAIM 10 既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出手段と (second syntax) 、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定手段と、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化手段と を備えることを特徴とする動画像符号化装置。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2008086316A2 Filed: 2008-01-07 Issued: 2008-07-17 Variable length coding techniques for coded block patterns (Original Assignee) Qualcomm Incorporated Marta Karczewicz, Hyukjune Chung, Phoom Sagetong |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2008086316A2 CLAIM 1 . A method comprising : selecting a variable length coding (VLC) table for a current video block from a plurality of tables based on a number of neighboring video blocks to the current video block that include non-zero transform coefficients ; and coding one or more coded block pattern (block pattern) s (CBPs) of the current video block using the selected VLC table . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2008086316A2 CLAIM 1 . A method comprising : selecting a variable length coding (VLC) table for a current video block from a plurality of tables based on a number of neighboring video blocks to the current video block that include non-zero transform coefficients ; and coding one or more coded block pattern (block pattern) s (CBPs) of the current video block using the selected VLC table . WO2008086316A2 CLAIM 10 . The method of claim 1 , wherein the method is performed with respect to video blocks of an enhancement layer in scalable video coding (generating syntax information) (SVC) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2008054688A1 Filed: 2007-10-26 Issued: 2008-05-08 Method for decomposing a video sequence frame (Original Assignee) Hewlett-Packard Development Company, L.P. Debargha Mukherjee, Sam Liu |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (different motion vectors) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2008054688A1 CLAIM 6 . The method (300) according to any of claims 1-5 , wherein partitioning (308) the block (122-126) into wedges (130a , 130b) further comprises partitioning (308) the block (122-126) into two wedges (130a , 130b) and a parallel band of variable thickness between the two wedges (130a , 130b) , wherein each of the two wedges (130a , 130b) and the parallel band are associated with different motion vectors (second syntax) (132) . WO2008054688A1 CLAIM 8 . An encoder (112) for encoding a video sequence frame , wherein the video sequence frame is represented by a plurality of blocks (122-126) having a first set of dimensions , said encoder (112) comprising : a wedge partitioner (224) ; a block divider (222) ; a motion estimator (226) ; and an error calculator (228) configured to calculate a first metric of the motion vectors (132) associated with partition (square partitions, four square partitions) ing the at least one block (122-126) into two wedges (130a , 130b) and to calculate a second metric of the motion vectors (132) associated with dividing the at least block into sub-blocks (122-126) , and for at least one block (122-126) , wherein a controller (204) is configured (a) to determine whether either both of the dimensions of the at least one block (122-126) equal a predetermined minimum dimension , (b) to implement the motion estimator (226) to estimate a motion vector (132) for the at least one block in response to the either or both of the dimensions equaling the predetermined minimum dimensions , and in response to both of the dimensions exceeding the predetermined minimum dimension , (c) to implement the motion estimator (226) to estimate a motion vector (!32) for the at least one block (122-126) , (d) to implement the wedge partitioner (224) to partition the at least one block (122-126) of the plurality of blocks (122-126) into two wedges (130a , 130b) , (e) to implement the block divider (222) to divide the at least one block (122-126) into sub-blocks (122-126) having a second set of dimensions , and to perform (a)-(e) for each of the sub-blocks (122-126) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (different motion vectors) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2008054688A1 CLAIM 6 . The method (300) according to any of claims 1-5 , wherein partitioning (308) the block (122-126) into wedges (130a , 130b) further comprises partitioning (308) the block (122-126) into two wedges (130a , 130b) and a parallel band of variable thickness between the two wedges (130a , 130b) , wherein each of the two wedges (130a , 130b) and the parallel band are associated with different motion vectors (second syntax) (132) . WO2008054688A1 CLAIM 8 . An encoder (112) for encoding a video sequence frame , wherein the video sequence frame is represented by a plurality of blocks (122-126) having a first set of dimensions , said encoder (112) comprising : a wedge partitioner (224) ; a block divider (222) ; a motion estimator (226) ; and an error calculator (228) configured to calculate a first metric of the motion vectors (132) associated with partition (square partitions, four square partitions) ing the at least one block (122-126) into two wedges (130a , 130b) and to calculate a second metric of the motion vectors (132) associated with dividing the at least block into sub-blocks (122-126) , and for at least one block (122-126) , wherein a controller (204) is configured (a) to determine whether either both of the dimensions of the at least one block (122-126) equal a predetermined minimum dimension , (b) to implement the motion estimator (226) to estimate a motion vector (132) for the at least one block in response to the either or both of the dimensions equaling the predetermined minimum dimensions , and in response to both of the dimensions exceeding the predetermined minimum dimension , (c) to implement the motion estimator (226) to estimate a motion vector (!32) for the at least one block (122-126) , (d) to implement the wedge partitioner (224) to partition the at least one block (122-126) of the plurality of blocks (122-126) into two wedges (130a , 130b) , (e) to implement the block divider (222) to divide the at least one block (122-126) into sub-blocks (122-126) having a second set of dimensions , and to perform (a)-(e) for each of the sub-blocks (122-126) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2008027192A2 Filed: 2007-08-16 Issued: 2008-03-06 Methods and apparatus for reduced resolution partitioning (Original Assignee) Thomson Licensing Oscar Divorra Escoda, Peng Yin |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (n partitions) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (syntax information) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value (slice level) , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2008027192A2 CLAIM 2 . The apparatus of claim 1 , wherein partitions (square partitions, four square partitions) are obtained from a combination of top-down tree partitioning and bottom-up tree joining . WO2008027192A2 CLAIM 3 . The apparatus of claim 2 , wherein said encoder (300) uses a block size of an initial partition to generate a tree-based frame partition , the block size being any of available block sizes capable of being used for tree-based frame partitioning relating to a particular video encoding (second syntax) standard or recommendation . WO2008027192A2 CLAIM 8 . The apparatus of claim 7 , wherein said encoder (300) codes the high level syntax in at least one of a slice header level , a Supplemental Enhancement Information (SEI) level , a picture parameter set level , a sequence parameter set level , a network abstraction layer unit header level , a picture level , a slice level (minimum size value) , and a macroblock level . WO2008027192A2 CLAIM 17 . The apparatus of claim 16 , wherein the at least one local super-block partition related syntax element operates as an adaptive predictor selector for direct prediction , and wherein at least one of motion information and a block mode type is directly predicted or derived based on neighboring macroblock data and super-block syntax information (syntax information) . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (syntax information) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
WO2008027192A2 CLAIM 17 . The apparatus of claim 16 , wherein the at least one local super-block partition related syntax element operates as an adaptive predictor selector for direct prediction , and wherein at least one of motion information and a block mode type is directly predicted or derived based on neighboring macroblock data and super-block syntax information (syntax information) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (n partitions) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (syntax information) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value (slice level) , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2008027192A2 CLAIM 2 . The apparatus of claim 1 , wherein partitions (square partitions, four square partitions) are obtained from a combination of top-down tree partitioning and bottom-up tree joining . WO2008027192A2 CLAIM 3 . The apparatus of claim 2 , wherein said encoder (300) uses a block size of an initial partition to generate a tree-based frame partition , the block size being any of available block sizes capable of being used for tree-based frame partitioning relating to a particular video encoding (second syntax) standard or recommendation . WO2008027192A2 CLAIM 8 . The apparatus of claim 7 , wherein said encoder (300) codes the high level syntax in at least one of a slice header level , a Supplemental Enhancement Information (SEI) level , a picture parameter set level , a sequence parameter set level , a network abstraction layer unit header level , a picture level , a slice level (minimum size value) , and a macroblock level . WO2008027192A2 CLAIM 15 . The apparatus of claim 9 , wherein a block scanning order is equal or different from an existing block scan order imposed by a video coding (generating syntax information) recommendation or a video coding standard . WO2008027192A2 CLAIM 17 . The apparatus of claim 16 , wherein the at least one local super-block partition related syntax element operates as an adaptive predictor selector for direct prediction , and wherein at least one of motion information and a block mode type is directly predicted or derived based on neighboring macroblock data and super-block syntax information (syntax information) . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (syntax information) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
WO2008027192A2 CLAIM 17 . The apparatus of claim 16 , wherein the at least one local super-block partition related syntax element operates as an adaptive predictor selector for direct prediction , and wherein at least one of motion information and a block mode type is directly predicted or derived based on neighboring macroblock data and super-block syntax information (syntax information) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20080049834A1 Filed: 2007-08-03 Issued: 2008-02-28 Sub-block transform coding of prediction residuals (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC Thomas Holcomb, Chih-Lung Lin |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (motion prediction) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20080049834A1 CLAIM 56 . The method of claim 55 wherein the data for the one or more video frames comprises motion prediction (video blocks) residual data . US20080049834A1 CLAIM 60 . A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 55 during video encoding (second syntax) , wherein the method further comprises outputting the one or more sub-block pattern codes . US20080049834A1 CLAIM 62 . In a video encoder (second syntax element) , a computer-implemented method of processing one or more motion-predicted video frames , wherein the one or more motion-predicted video frames include plural blocks , the method comprising : using plural sub-block pattern codes during encoding of prediction residual data , wherein each of the plural sub-block pattern codes indicates presence or absence of information for plural sub-blocks of a corresponding block of the plural blocks ; and outputting the plural sub-block pattern codes . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion prediction) . |
US20080049834A1 CLAIM 56 . The method of claim 55 wherein the data for the one or more video frames comprises motion prediction (video blocks) residual data . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (motion prediction) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20080049834A1 CLAIM 56 . The method of claim 55 wherein the data for the one or more video frames comprises motion prediction (video blocks) residual data . US20080049834A1 CLAIM 60 . A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 55 during video encoding (second syntax) , wherein the method further comprises outputting the one or more sub-block pattern codes . US20080049834A1 CLAIM 62 . In a video encoder (second syntax element) , a computer-implemented method of processing one or more motion-predicted video frames , wherein the one or more motion-predicted video frames include plural blocks , the method comprising : using plural sub-block pattern codes during encoding of prediction residual data , wherein each of the plural sub-block pattern codes indicates presence or absence of information for plural sub-blocks of a corresponding block of the plural blocks ; and outputting the plural sub-block pattern codes . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion prediction) . |
US20080049834A1 CLAIM 56 . The method of claim 55 wherein the data for the one or more video frames comprises motion prediction (video blocks) residual data . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2008016605A2 Filed: 2007-07-31 Issued: 2008-02-07 Adaptive geometric partitioning for video decoding (Original Assignee) Thomson Licensing Oscar Divorra Escoda, Peng Yin |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2008016605A2 CLAIM 18 . The apparatus of claim 16 , wherein said decoder (1100) decodes model parameters of at least one of the at least one parametric model and the at least one curve to determine the parametric model-based partitions along with partition (square partitions, four square partitions) s prediction data . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (th partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2008016605A2 CLAIM 15 . The apparatus of claim 1 , wherein said decoder (1100) is an extended version of an existing hybrid predictive decoder of an existing video coding (generating syntax information) standard or video coding recommendation . WO2008016605A2 CLAIM 18 . The apparatus of claim 16 , wherein said decoder (1100) decodes model parameters of at least one of the at least one parametric model and the at least one curve to determine the parametric model-based partitions along with partition (square partitions, four square partitions) s prediction data . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2008002881A2 Filed: 2007-06-25 Issued: 2008-01-03 Reduction of errors during computation of inverse discrete cosine transform (Original Assignee) Qualcomm Incorporated Harinath Garudadri, Yuriy Reznik |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2008002881A2 CLAIM 1 . A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ; generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ; generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ; and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks) ; causing a media presentation unit to output audible or visible signals based on the matrix of output values ; wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform) . |
WO2008002881A2 CLAIM 1 . A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ; generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ; generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ; and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks) ; causing a media presentation unit to output audible or visible signals based on the matrix of output values ; wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2008002881A2 CLAIM 1 . A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ; generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ; generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ; and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks) ; causing a media presentation unit to output audible or visible signals based on the matrix of output values ; wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform) . |
WO2008002881A2 CLAIM 1 . A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ; generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ; generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ; and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks) ; causing a media presentation unit to output audible or visible signals based on the matrix of output values ; wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision . |
EP2347591B1 CLAIM 7 A computer-readable medium comprising instructions for causing a programmable processor (programmable processor) to perform the methods of any of claims 4-6 . |
WO2008002881A2 CLAIM 81 . A computer-readable medium comprising instructions , wherein the instructions cause a programmable processor (programmable processor) to : generate a matrix of adjusted coefficients by left-shifting each coefficient in a matrix of source coefficients ; generate a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of adjusted coefficients ; generate a matrix of scaled coefficients by scaling each coefficient in the matrix of transformed coefficients ; generate a matrix of biased coefficients by adding one or more bias values to one or more coefficients in the matrix of scaled coefficients ; and generate a matrix of output coefficients by right-shifting coefficients in the matrix of biased coefficients by a first magnitude ; wherein the matrix of source coefficients represents media data ; wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal forward discrete cosine transform ; wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations of coefficients in the matrix of biased coefficients by the first magnitude and results from dividing the coefficients in the matrix of biased coefficients by 2 raised to the power of the first magnitude , without regard to precision . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2007117711A2 Filed: 2007-04-09 Issued: 2007-10-18 Dynamic selection of motion estimation search ranges and extended motion vector ranges (Original Assignee) Microsoft Corporation Cheng Chang, Chih-Lung Lin, Thomas W. Holcomb |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (frequency transform) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax (prediction residuals) element representing a maximum size (current picture) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (entropy encoding) element (vertical components) representing a minimum size value (absolute values) , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2007117711A2 CLAIM 4 . The method of claim 1 further comprising : signaling a first motion vector range indicating the first search range ; and signaling a second motion vector range indicating the second search range ; whereby the signaling improves efficiency of entropy encoding (second syntax) of motion vector information from the first motion estimation and second motion estimation . WO2007117711A2 CLAIM 16 . An encoder comprising : a motion estimator for performing motion estimation ; a frequency transform (integer multiple) er for transforming prediction residuals (first syntax) into transform coefficients (square partitions) ; a quantizer for quantizing the transform coefficients ; an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ; and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size, maximum size value) is based at least in part upon motion vector distribution information for plural previous motion vectors . WO2007117711A2 CLAIM 19 . The encoder of claim 16 wherein the motion estimation controller : tracks each of the plural previous motion vectors in a histogram based upon a sum of absolute values (minimum size value) of horizontal and vertical components (second syntax element) ; and using the histogram , counts how many of the plural previous motion vectors fall within each of plural intervals of a motion vector distribution . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (frequency transform) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (prediction residuals) element representing a maximum size (current picture) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (entropy encoding) element (vertical components) representing a minimum size value (absolute values) , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2007117711A2 CLAIM 4 . The method of claim 1 further comprising : signaling a first motion vector range indicating the first search range ; and signaling a second motion vector range indicating the second search range ; whereby the signaling improves efficiency of entropy encoding (second syntax) of motion vector information from the first motion estimation and second motion estimation . WO2007117711A2 CLAIM 16 . An encoder comprising : a motion estimator for performing motion estimation ; a frequency transform (integer multiple) er for transforming prediction residuals (first syntax) into transform coefficients (square partitions) ; a quantizer for quantizing the transform coefficients ; an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ; and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size, maximum size value) is based at least in part upon motion vector distribution information for plural previous motion vectors . WO2007117711A2 CLAIM 19 . The encoder of claim 16 wherein the motion estimation controller : tracks each of the plural previous motion vectors in a histogram based upon a sum of absolute values (minimum size value) of horizontal and vertical components (second syntax element) ; and using the histogram , counts how many of the plural previous motion vectors fall within each of plural intervals of a motion vector distribution . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2007115325A2 Filed: 2007-04-04 Issued: 2007-10-11 Apparatus and method of enhanced frame interpolation in video compression (Original Assignee) Qualcomm Incorporated Fang Shi, Vijayalakshmi R. Raveendran |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (receiving encoded data) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2007115325A2 CLAIM 42 . A method of processing multimedia data , the method comprising : receiving encoded data (integer multiple) indicative of an interpolated reference frame for at least a portion of a frame ; generating at least a portion of said interpolated reference frame ; and decoding at least a portion of the frame based at least in part on the generated portion of the interpolated reference frame . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (receiving encoded data) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2007115325A2 CLAIM 42 . A method of processing multimedia data , the method comprising : receiving encoded data (integer multiple) indicative of an interpolated reference frame for at least a portion of a frame ; generating at least a portion of said interpolated reference frame ; and decoding at least a portion of the frame based at least in part on the generated portion of the interpolated reference frame . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2007115126A1 Filed: 2007-03-29 Issued: 2007-10-11 Adaptive encoder-assisted frame rate up conversion (Original Assignee) Qualcomm Incorporated Tao Tian, Fang Shi, Vijayalakshmi R. Raveendran |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2007115126A1 CLAIM 1 . A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ; encoding at least a portion of a video frame using the FRUC frame as a reference ; selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ; and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode . WO2007115126A1 CLAIM 3 . The method of claim 2 , wherein adjusting one or more parameters comprises adjusting a coded block pattern (block pattern) (CBP) parameter and a motion vector (MV) parameter for each of the blocks to indicate the selected FRUC mode . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2007115126A1 CLAIM 1 . A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ; encoding at least a portion of a video frame using the FRUC frame as a reference ; selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ; and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode . WO2007115126A1 CLAIM 3 . The method of claim 2 , wherein adjusting one or more parameters comprises adjusting a coded block pattern (block pattern) (CBP) parameter and a motion vector (MV) parameter for each of the blocks to indicate the selected FRUC mode . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2007112417A2 Filed: 2007-03-27 Issued: 2007-10-04 Method and system for coding and decoding information associated with video compression (Original Assignee) Qualcomm Incorporated Yiliang Bao |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2007112417A2 CLAIM 1 . A method for coding information associated with video compression comprising : receiving a video sequence ; processing the video sequence in a plurality of frames ; processing each frame of the plurality of frames in a plurality of macrob locks ; constructing a prediction of an original video signal , which is a part of a macroblock , in a current frame , from the video sequence ; forming a residual signal by subtracting the prediction of the original video signal from the original video signal in the current frame ; applying a transform to the residual signal ; quantizing a plurality of transform coefficients (square partitions) ; identifying a symbol of at least one syntax element that defines a characteristic of the residual signal ; and coding symbols of the at least one syntax element of a same category together . WO2007112417A2 CLAIM 2 . The method of claim 1 , wherein coding the symbols of the at least one syntax element comprises coding a coded block pattern (block pattern) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2007112417A2 CLAIM 1 . A method for coding information associated with video compression comprising : receiving a video sequence ; processing the video sequence in a plurality of frames ; processing each frame of the plurality of frames in a plurality of macrob locks ; constructing a prediction of an original video signal , which is a part of a macroblock , in a current frame , from the video sequence ; forming a residual signal by subtracting the prediction of the original video signal from the original video signal in the current frame ; applying a transform to the residual signal ; quantizing a plurality of transform coefficients (square partitions) ; identifying a symbol of at least one syntax element that defines a characteristic of the residual signal ; and coding symbols of the at least one syntax element of a same category together . WO2007112417A2 CLAIM 2 . The method of claim 1 , wherein coding the symbols of the at least one syntax element comprises coding a coded block pattern (block pattern) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20070297518A1 Filed: 2007-03-22 Issued: 2007-12-27 Flag encoding method, flag decoding method, and apparatus thereof (Original Assignee) Samsung Electronics Co Ltd (Current Assignee) Samsung Electronics Co Ltd Woo-jin Han, Bae-keun Lee, Tammy Lee, Kyo-hyuk Lee |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (flag value) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (minimum size) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20070297518A1 CLAIM 1 . An apparatus for encoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a flag-assembling unit which collects flag value (video blocks) s allotted for the plurality of blocks and produces a flag bit string , based on a spatial correlation of the plurality of blocks ; a maximum-run-determining unit which determines a maximum run of the flag bit string ; and a converting unit which converts bits included in the flag bit string into a codeword with a size no greater than the maximum run by using a predetermined codeword table . US20070297518A1 CLAIM 7 . The apparatus for encoding a flag of claim 1 , wherein the flag is a coded block pattern (block pattern) (Cbp) or a residual prediction flag . US20070297518A1 CLAIM 24 . The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (flag value) . |
US20070297518A1 CLAIM 1 . An apparatus for encoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a flag-assembling unit which collects flag value (video blocks) s allotted for the plurality of blocks and produces a flag bit string , based on a spatial correlation of the plurality of blocks ; a maximum-run-determining unit which determines a maximum run of the flag bit string ; and a converting unit which converts bits included in the flag bit string into a codeword with a size no greater than the maximum run by using a predetermined codeword table . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (flag value) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (minimum size) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20070297518A1 CLAIM 1 . An apparatus for encoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a flag-assembling unit which collects flag value (video blocks) s allotted for the plurality of blocks and produces a flag bit string , based on a spatial correlation of the plurality of blocks ; a maximum-run-determining unit which determines a maximum run of the flag bit string ; and a converting unit which converts bits included in the flag bit string into a codeword with a size no greater than the maximum run by using a predetermined codeword table . US20070297518A1 CLAIM 7 . The apparatus for encoding a flag of claim 1 , wherein the flag is a coded block pattern (block pattern) (Cbp) or a residual prediction flag . US20070297518A1 CLAIM 24 . The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (flag value) . |
US20070297518A1 CLAIM 1 . An apparatus for encoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a flag-assembling unit which collects flag value (video blocks) s allotted for the plurality of blocks and produces a flag bit string , based on a spatial correlation of the plurality of blocks ; a maximum-run-determining unit which determines a maximum run of the flag bit string ; and a converting unit which converts bits included in the flag bit string into a codeword with a size no greater than the maximum run by using a predetermined codeword table . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20070160303A1 Filed: 2006-12-20 Issued: 2007-07-12 Geometrical image representation and compression (Original Assignee) Docomo Communications Labs USA Inc (Current Assignee) NTT Docomo Inc Onur Guleryuz, Arthur Cunha |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions (image processing operation) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20070160303A1 CLAIM 1 . A method comprising : creating a representation for image data that includes determining a geometric flow for image data ; and performing an image processing operation (rectangular partitions) on data in the representation using the geometric flow . US20070160303A1 CLAIM 26 . The method defined in claim 17 wherein encoding the prediction errors comprises entropy coding (square partitions) the prediction errors . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions (image processing operation) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20070160303A1 CLAIM 1 . A method comprising : creating a representation for image data that includes determining a geometric flow for image data ; and performing an image processing operation (rectangular partitions) on data in the representation using the geometric flow . US20070160303A1 CLAIM 26 . The method defined in claim 17 wherein encoding the prediction errors comprises entropy coding (square partitions) the prediction errors . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20070230563A1 Filed: 2006-11-10 Issued: 2007-10-04 Adaptive encoder-assisted frame rate up conversion (Original Assignee) Qualcomm Inc (Current Assignee) Qualcomm Inc Tao Tian, Fang Shi, Vijayalakshmi R. Raveendran |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20070230563A1 CLAIM 1 . A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ; encoding at least a portion of a video frame using the FRUC frame as a reference ; selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ; and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode . US20070230563A1 CLAIM 3 . The method of claim 2 , wherein adjusting one or more parameters comprises adjusting a coded block pattern (block pattern) (CBP) parameter and a motion vector (MV) parameter for each of the blocks to indicate the selected FRUC mode . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20070230563A1 CLAIM 1 . A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ; encoding at least a portion of a video frame using the FRUC frame as a reference ; selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ; and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode . US20070230563A1 CLAIM 3 . The method of claim 2 , wherein adjusting one or more parameters comprises adjusting a coded block pattern (block pattern) (CBP) parameter and a motion vector (MV) parameter for each of the blocks to indicate the selected FRUC mode . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | JP2008092150A Filed: 2006-09-29 Issued: 2008-04-17 画像符号化装置、画像符号化方法 (Original Assignee) Canon Inc; キヤノン株式会社 Tadashi Kaneko, 金子 唯史 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
JP2008092150A CLAIM 1 動画像を構成する各フレームの画像を符号化する画像符号化装置であって、 符号化対象フレームの画像をマクロブロック毎に入力する入力手段と (second syntax) 、 前記符号化対象フレームの画像における符号化対象マクロブロックと、前記符号化対象フレームよりも前のフレームの画像において当該符号化対象マクロブロックに対応するマクロブロックと、の差分量を求める計算手段と、 前記差分量に基づいて、前記符号化対象マクロブロックに対する符号化単位を決定する決定手段と、 前記決定手段が決定した符号化単位でもって前記符号化対象マクロブロックに対する符号化を行う符号化手段と を備えることを特徴とする画像符号化装置。 JP2008092150A CLAIM 7 コンピュータに請求項6に記載の画像処理方法を実行させるためのプログラム (rectangular partitions) 。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (プログラム) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
JP2008092150A CLAIM 1 動画像を構成する各フレームの画像を符号化する画像符号化装置であって、 符号化対象フレームの画像をマクロブロック毎に入力する入力手段と (second syntax) 、 前記符号化対象フレームの画像における符号化対象マクロブロックと、前記符号化対象フレームよりも前のフレームの画像において当該符号化対象マクロブロックに対応するマクロブロックと、の差分量を求める計算手段と、 前記差分量に基づいて、前記符号化対象マクロブロックに対する符号化単位を決定する決定手段と、 前記決定手段が決定した符号化単位でもって前記符号化対象マクロブロックに対する符号化を行う符号化手段と を備えることを特徴とする画像符号化装置。 JP2008092150A CLAIM 7 コンピュータに請求項6に記載の画像処理方法を実行させるためのプログラム (rectangular partitions) 。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | CN1972449A Filed: 2006-09-11 Issued: 2007-05-30 用于在单一描述和多描述之间切换的设备和方法 (Original Assignee) 首尔国立大学校产学协力财团; 三星电子株式会社 金壹求, 赵南翊 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN (余弦变换) , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (的频率特性) , the minimum size value indicating a size of the smallest partition for the coded unit . |
CN1972449A CLAIM 1 . 一种视频代码转换器,包括:译码单元,译码压缩视频以为所述压缩视频的每一个帧产生视频报头、运动矢量和DCT(离散余弦变换 (size NxN) )系数;单一描述/多描述确定单元,基于在上述压缩视频的传输信道中错误发生的概率确定是否执行代码转换为单一描述或多描述;以及代码转换单元,当上述单一描述/多描述确定单元确定执行代码转换为单一描述时,通过将DCT系数和视频报头以及运动矢量一起编码,执行代码转换为单一描述,和当上述单一描述/多描述确定单元确定执行代码转换为多描述时,通过将DCT系数划分为若干DCT系数以及将上述DCT系数的每一个和视频报头以及运动矢量一起编码,执行代码转换为多描述。 CN1972449A CLAIM 2 . 权利要求1中的视频代码转换器,其中上述代码转换单元包括:执行代码转换为单一描述的单一描述代码转换单元;以及执行代码译码为多描述的多描述代码转换单元,其中上述多描述代码转换单元包括:开始位置确定单元,根据DCT系数的频率特性 (minimum size value) 以预先确定或要求的扫描顺序确定上述DCT系数的划分的开始位置;DCT系数划分单元,使在上述开始位置之前的DCT系数的所有数据被包括在若干被划分的DCT系数的每一个中,和使在上述开始位置之后的DCT系数的数据交替地被包括在若干被划分的DCT系数的每一个中,从而产生多个被划分的DCT系数;以及压缩编码单元,将多个被划分的DCT系数的每一个和视频报头以及运动矢量一起编码。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN (余弦变换) , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (从而产生) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (的频率特性) , the minimum size value indicating a size of a smallest partition for the coded unit . |
CN1972449A CLAIM 1 . 一种视频代码转换器,包括:译码单元,译码压缩视频以为所述压缩视频的每一个帧产生视频报头、运动矢量和DCT(离散余弦变换 (size NxN) )系数;单一描述/多描述确定单元,基于在上述压缩视频的传输信道中错误发生的概率确定是否执行代码转换为单一描述或多描述;以及代码转换单元,当上述单一描述/多描述确定单元确定执行代码转换为单一描述时,通过将DCT系数和视频报头以及运动矢量一起编码,执行代码转换为单一描述,和当上述单一描述/多描述确定单元确定执行代码转换为多描述时,通过将DCT系数划分为若干DCT系数以及将上述DCT系数的每一个和视频报头以及运动矢量一起编码,执行代码转换为多描述。 CN1972449A CLAIM 2 . 权利要求1中的视频代码转换器,其中上述代码转换单元包括:执行代码转换为单一描述的单一描述代码转换单元;以及执行代码译码为多描述的多描述代码转换单元,其中上述多描述代码转换单元包括:开始位置确定单元,根据DCT系数的频率特性 (minimum size value) 以预先确定或要求的扫描顺序确定上述DCT系数的划分的开始位置;DCT系数划分单元,使在上述开始位置之前的DCT系数的所有数据被包括在若干被划分的DCT系数的每一个中,和使在上述开始位置之后的DCT系数的数据交替地被包括在若干被划分的DCT系数的每一个中,从而产生 (generating syntax information) 多个被划分的DCT系数;以及压缩编码单元,将多个被划分的DCT系数的每一个和视频报头以及运动矢量一起编码。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20070171982A1 Filed: 2006-08-31 Issued: 2007-07-26 Methods for encoding digital video for decoding on low performance devices (Original Assignee) WIMAXIMUM LLC (Current Assignee) Nordic Interactive Technologies LLC ; WIMAXIMUM LLC Jens Meggers |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (memory block) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (corresponding blocks) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20070171982A1 CLAIM 25 . An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding a video stream by : receiving a current video frame of the video stream ; subdividing the current video frame into uniform blocks of a selected block size , and comparing the blocks to corresponding blocks (block pattern) of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ; optionally repeating said subdividing or said comparing , or combinations thereof , using at least one additional block size , wherein the current video frame is capable of being separately subdivided or compared , or combinations thereof , to the preceding video frame using multiple block sizes ; selecting from the multiple block sizes a uniform block size that is capable of producing a relatively better video frame compression result ; and encoding the video frame in the video stream with the block size selected by said selecting . US20070171982A1 CLAIM 34 . Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video blocks) using an LZW type compression algorithm . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (memory block) . |
US20070171982A1 CLAIM 34 . Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video blocks) using an LZW type compression algorithm . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (memory block) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (corresponding blocks) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20070171982A1 CLAIM 25 . An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding a video stream by : receiving a current video frame of the video stream ; subdividing the current video frame into uniform blocks of a selected block size , and comparing the blocks to corresponding blocks (block pattern) of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ; optionally repeating said subdividing or said comparing , or combinations thereof , using at least one additional block size , wherein the current video frame is capable of being separately subdivided or compared , or combinations thereof , to the preceding video frame using multiple block sizes ; selecting from the multiple block sizes a uniform block size that is capable of producing a relatively better video frame compression result ; and encoding the video frame in the video stream with the block size selected by said selecting . US20070171982A1 CLAIM 34 . Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video blocks) using an LZW type compression algorithm . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (memory block) . |
US20070171982A1 CLAIM 34 . Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video blocks) using an LZW type compression algorithm . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | EP1771008A2 Filed: 2006-08-15 Issued: 2007-04-04 Adaptive method and system for encoding digital images for the internet (Original Assignee) Micro Focus Software Inc (Current Assignee) Oracle International Corp Kasman E. Thomas |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (color values) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (predetermined threshold value) , the minimum size value indicating a size of the smallest partition for the coded unit . |
EP1771008A2 CLAIM 1 A method of converting an image bitmap comprising : defining (34) a current rectangular region (120) of the image bitmap ; quantifying (36) spectral characteristics of the current rectangular region ; dividing (44) the current rectangular region into four rectangular sub-regions (124 , 125 , 126 , 127) in response to the spectral characteristics being greater than a predetermined threshold value (minimum size value) ; tagging (40) the current rectangular sub-region as a terminal rectangular region in response to the spectral characteristics being less than or equal to the predetermined threshold value ; tagging (52) each rectangular sub-region as a terminal rectangular region in response to the size of the rectangular sub-region being less than or equal to a predetermined threshold size ; defining (50) one of the non-terminal rectangular sub-regions as the current rectangular region and repeating the above until all rectangular regions are tagged terminal ; and determining (56) color characteristics for each of the terminal rectangular regions . EP1771008A2 CLAIM 4 The method of claim 3 , wherein determining (42) a split point comprises determining (86 , 88) a center-of-gravity of the rectangular region based on color values (rectangular partitions) of pixels in the rectangular region (82) and an average luminance value of the rectangular region (84) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (color values) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (predetermined threshold value) , the minimum size value indicating a size of a smallest partition for the coded unit . |
EP1771008A2 CLAIM 1 A method of converting an image bitmap comprising : defining (34) a current rectangular region (120) of the image bitmap ; quantifying (36) spectral characteristics of the current rectangular region ; dividing (44) the current rectangular region into four rectangular sub-regions (124 , 125 , 126 , 127) in response to the spectral characteristics being greater than a predetermined threshold value (minimum size value) ; tagging (40) the current rectangular sub-region as a terminal rectangular region in response to the spectral characteristics being less than or equal to the predetermined threshold value ; tagging (52) each rectangular sub-region as a terminal rectangular region in response to the size of the rectangular sub-region being less than or equal to a predetermined threshold size ; defining (50) one of the non-terminal rectangular sub-regions as the current rectangular region and repeating the above until all rectangular regions are tagged terminal ; and determining (56) color characteristics for each of the terminal rectangular regions . EP1771008A2 CLAIM 4 The method of claim 3 , wherein determining (42) a split point comprises determining (86 , 88) a center-of-gravity of the rectangular region based on color values (rectangular partitions) of pixels in the rectangular region (82) and an average luminance value of the rectangular region (84) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2007027402A2 Filed: 2006-08-11 Issued: 2007-03-08 Multi-stage cabac decoding pipeline (Original Assignee) Micronas Usa, Inc. Ankur Shah, Liang Peng |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (n bits) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (syntax elements) for the coded unit wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2007027402A2 CLAIM 1 What is claimed is : L A multi-stage content-adaptive binary arithmetic coding (CABAC) architecture device for decoding a video bit stream , comprising : a history ' ; table lookup stage adapted to operate on Bin 0 only , for detecting macroblocks neighboring a current macroblock , and for retrieving previously stored syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) information for the neighboring macroblocks ; a neighborhood information context increment stage adapted to operate on Bin 0 only , for parsing the retrieved syntax elements information for the neighboring macroblocks , and calculating a context index increment for each syntax element ; a context final stage adapted to operate on all bins , for calculating a final context for a particular syntax element ; a state MPS read stage adapted to operate on all bins , for storing and retrieving a probability s , tate index and a most probable symbol value using a stateMps value table and a context index as an offset into that table ; a decode stage adapted to operate on all bins , for carrying out decode processes ; a binarisation stage adapted to operate on all bins , for performing a bin match process for each syntax element ; and a state MPS write-back stage adapted to operate on all bins , for writing updated probability state index and most probable symbol values back to the ' ; stateMPS value table corresponding to a current context index , wherein if the current context index and a subsequent context index are the same , the updated probability state index and a most probable symbol values are forwarded to the decode stage of a subsequent bit . WO2007027402A2 CLAIM 1 3 . The device of claim I wherein bits (integer multiple) of the 1 video bit stream are decoded in 2 an average of four clock cycles . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
WO2007027402A2 CLAIM 1 What is claimed is : L A multi-stage content-adaptive binary arithmetic coding (CABAC) architecture device for decoding a video bit stream , comprising : a history ' ; table lookup stage adapted to operate on Bin 0 only , for detecting macroblocks neighboring a current macroblock , and for retrieving previously stored syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) information for the neighboring macroblocks ; a neighborhood information context increment stage adapted to operate on Bin 0 only , for parsing the retrieved syntax elements information for the neighboring macroblocks , and calculating a context index increment for each syntax element ; a context final stage adapted to operate on all bins , for calculating a final context for a particular syntax element ; a state MPS read stage adapted to operate on all bins , for storing and retrieving a probability s , tate index and a most probable symbol value using a stateMps value table and a context index as an offset into that table ; a decode stage adapted to operate on all bins , for carrying out decode processes ; a binarisation stage adapted to operate on all bins , for performing a bin match process for each syntax element ; and a state MPS write-back stage adapted to operate on all bins , for writing updated probability state index and most probable symbol values back to the ' ; stateMPS value table corresponding to a current context index , wherein if the current context index and a subsequent context index are the same , the updated probability state index and a most probable symbol values are forwarded to the decode stage of a subsequent bit . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (n bits) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (syntax elements) for the coded unit , wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2007027402A2 CLAIM 1 What is claimed is : L A multi-stage content-adaptive binary arithmetic coding (CABAC) architecture device for decoding a video bit stream , comprising : a history ' ; table lookup stage adapted to operate on Bin 0 only , for detecting macroblocks neighboring a current macroblock , and for retrieving previously stored syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) information for the neighboring macroblocks ; a neighborhood information context increment stage adapted to operate on Bin 0 only , for parsing the retrieved syntax elements information for the neighboring macroblocks , and calculating a context index increment for each syntax element ; a context final stage adapted to operate on all bins , for calculating a final context for a particular syntax element ; a state MPS read stage adapted to operate on all bins , for storing and retrieving a probability s , tate index and a most probable symbol value using a stateMps value table and a context index as an offset into that table ; a decode stage adapted to operate on all bins , for carrying out decode processes ; a binarisation stage adapted to operate on all bins , for performing a bin match process for each syntax element ; and a state MPS write-back stage adapted to operate on all bins , for writing updated probability state index and most probable symbol values back to the ' ; stateMPS value table corresponding to a current context index , wherein if the current context index and a subsequent context index are the same , the updated probability state index and a most probable symbol values are forwarded to the decode stage of a subsequent bit . WO2007027402A2 CLAIM 1 3 . The device of claim I wherein bits (integer multiple) of the 1 video bit stream are decoded in 2 an average of four clock cycles . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
WO2007027402A2 CLAIM 1 What is claimed is : L A multi-stage content-adaptive binary arithmetic coding (CABAC) architecture device for decoding a video bit stream , comprising : a history ' ; table lookup stage adapted to operate on Bin 0 only , for detecting macroblocks neighboring a current macroblock , and for retrieving previously stored syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) information for the neighboring macroblocks ; a neighborhood information context increment stage adapted to operate on Bin 0 only , for parsing the retrieved syntax elements information for the neighboring macroblocks , and calculating a context index increment for each syntax element ; a context final stage adapted to operate on all bins , for calculating a final context for a particular syntax element ; a state MPS read stage adapted to operate on all bins , for storing and retrieving a probability s , tate index and a most probable symbol value using a stateMps value table and a context index as an offset into that table ; a decode stage adapted to operate on all bins , for carrying out decode processes ; a binarisation stage adapted to operate on all bins , for performing a bin match process for each syntax element ; and a state MPS write-back stage adapted to operate on all bins , for writing updated probability state index and most probable symbol values back to the ' ; stateMPS value table corresponding to a current context index , wherein if the current context index and a subsequent context index are the same , the updated probability state index and a most probable symbol values are forwarded to the decode stage of a subsequent bit . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | JP2008011431A Filed: 2006-06-30 Issued: 2008-01-17 画像符号化装置 (Original Assignee) Toshiba Corp; 株式会社東芝 Hideji Michinaka, Hiromitsu Nakayama, Akihiro Ogami, Takaya Ogawa, Yoshinori Shigeta, Tatsuhiro Suzumura, Satoshi Takegawa, Kiwamu Watanabe, 啓満 中山, 晃弘 大上, 貴也 小川, 究 渡辺, 智 竹川, 良則 繁田, 秀治 道中, 竜広 鈴村 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
JP2008011431A CLAIM 1 入力される2値シンボル列中の各2値シンボルに対する発生確率に基づく値を符号量の情報として出力する符号量情報出力手段と (second syntax) 、 前記符号量情報出力手段からの情報によって与えられる符号量を加算して、前記2値シンボル列を算術符号化した場合の発生符号量の推定値を出力する符号量累積加算手段と、を有する発生符号量推定部 を具備したことを特徴とする画像符号化装置。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (発生符号量, 発生確率) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
JP2008011431A CLAIM 1 入力される2値シンボル列中の各2値シンボルに対する発生確率 (generating syntax information) に基づく値を符号量の情報として出力する符号量情報出力手段と (second syntax) 、 前記符号量情報出力手段からの情報によって与えられる符号量を加算して、前記2値シンボル列を算術符号化した場合の発生符号量 (generating syntax information) の推定値を出力する符号量累積加算手段と、を有する発生符号量推定部 を具備したことを特徴とする画像符号化装置。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2006128076A2 Filed: 2006-05-25 Issued: 2006-11-30 Fixed point integer division techniques for ac/dc prediction in video coding devices (Original Assignee) Qualcomm Incorporated Shu Xiao, Junchen Du, Tao Shen |
---|---|
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2006128076A2 CLAIM 2 . The coding device of claim 1 , wherein the coding device comprises a video coding (generating syntax information) device and the coding standard comprises an MPEG-4 video coding standard . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | CN101189641A Filed: 2006-05-09 Issued: 2008-05-28 编码数字图像的像素或体素的方法及处理数字图像的方法 (Original Assignee) 布雷克成像有限公司 I·戈里, M·马蒂乌齐 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (处理设) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (数值数据) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
CN101189641A CLAIM 11 . 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size, minimum size value) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。 CN101189641A CLAIM 40 . 根据权利要求39所述的图像处理设 (generate syntax information, generating syntax information) 备,其特征在于,所述计算 机硬件由所述成像设备的计算机硬件形成,并且,所述软件装置被存储或 加载于所述成像设备的所述计算机硬件中并可通过所述计算机硬件执行。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (处理设) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (数值数据) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
CN101189641A CLAIM 11 . 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size, minimum size value) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。 CN101189641A CLAIM 40 . 根据权利要求39所述的图像处理设 (generate syntax information, generating syntax information) 备,其特征在于,所述计算 机硬件由所述成像设备的计算机硬件形成,并且,所述软件装置被存储或 加载于所述成像设备的所述计算机硬件中并可通过所述计算机硬件执行。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20060233254A1 Filed: 2006-04-18 Issued: 2006-10-19 Method and apparatus for adaptively selecting context model for entropy coding (Original Assignee) Samsung Electronics Co Ltd (Current Assignee) Samsung Electronics Co Ltd Bae-keun Lee, Kyo-hyuk Lee, Sang-Chang Cha, Woo-jin Han |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (generates data) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generate syntax information (layer decoder) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20060233254A1 CLAIM 1 . A method of coding a residual prediction flag indicating whether residual data for an enhancement layer block of a multi-layered video signal is predicted from residual data for a lower layer block corresponding to the residual data for the enhancement layer block , the method comprising : calculating characteristic data for the lower layer block , wherein the characteristic data comprises at least one of energy of the residual data of the lower layer block and a value of a coded block pattern (block pattern) (CBP) of the lower layer block ; determining a coding method for the residual prediction flag according to the characteristic data ; and coding the residual prediction flag using the determined coding method . US20060233254A1 CLAIM 30 . A video encoder (second syntax element) for coding a residual prediction flag indicating whether residual data for an enhancement layer block of a multi-layered video signal is predicted from residual data for a lower layer block corresponding to the residual data for the enhancement layer block , the encoder comprising : a lower layer encoding unit which generates characteristic data for the lower layer block , wherein the characteristic data comprises at least one of energy of the residual data of the lower layer block and a value of a coded block pattern (CBP) of the lower layer block ; an enhancement layer encoding unit which codes the residual prediction flag using a coding method for the residual prediction flag , wherein the method is determined according to the characteristic data . US20060233254A1 CLAIM 64 . The coding method of claim 55 , wherein the coding of the data for the enhancement layer data is performed by entropy coding (square partitions) . US20060233254A1 CLAIM 78 . A video encoder comprising : a base layer encoder which obtains a base layer frame from an input frame ; and an enhancement layer encoder which generates data (video blocks) for an enhancement layer predicted from reference data comprising at least one of data for the base layer frame , data for a second enhancement layer at a different temporal position and data for a different region of the enhancement layer , wherein the enhancement layer encoder comprises a transformer which codes the data for the enhancement layer according to a result of determining whether the data for the enhancement layer is predictable from the reference data . US20060233254A1 CLAIM 89 . A video decoder comprising : a base layer decoder (syntax information, generating syntax information) which decodes an input base layer frame ; and an enhancement layer decoder which decodes data for an enhancement layer by referring to the base layer frame , wherein the enhancement layer decoder comprises an inverse transformer which decodes the data for the enhancement layer according to a result of determining whether data for the base layer frame is needed to predict the data for the enhancement layer . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (layer decoder) comprises a fixed-length code corresponding to the size of the plurality of video blocks (generates data) . |
US20060233254A1 CLAIM 78 . A video encoder comprising : a base layer encoder which obtains a base layer frame from an input frame ; and an enhancement layer encoder which generates data (video blocks) for an enhancement layer predicted from reference data comprising at least one of data for the base layer frame , data for a second enhancement layer at a different temporal position and data for a different region of the enhancement layer , wherein the enhancement layer encoder comprises a transformer which codes the data for the enhancement layer according to a result of determining whether the data for the enhancement layer is predictable from the reference data . US20060233254A1 CLAIM 89 . A video decoder comprising : a base layer decoder (syntax information, generating syntax information) which decodes an input base layer frame ; and an enhancement layer decoder which decodes data for an enhancement layer by referring to the base layer frame , wherein the enhancement layer decoder comprises an inverse transformer which decodes the data for the enhancement layer according to a result of determining whether data for the base layer frame is needed to predict the data for the enhancement layer . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (generates data) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generating syntax information (layer decoder) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20060233254A1 CLAIM 1 . A method of coding a residual prediction flag indicating whether residual data for an enhancement layer block of a multi-layered video signal is predicted from residual data for a lower layer block corresponding to the residual data for the enhancement layer block , the method comprising : calculating characteristic data for the lower layer block , wherein the characteristic data comprises at least one of energy of the residual data of the lower layer block and a value of a coded block pattern (block pattern) (CBP) of the lower layer block ; determining a coding method for the residual prediction flag according to the characteristic data ; and coding the residual prediction flag using the determined coding method . US20060233254A1 CLAIM 30 . A video encoder (second syntax element) for coding a residual prediction flag indicating whether residual data for an enhancement layer block of a multi-layered video signal is predicted from residual data for a lower layer block corresponding to the residual data for the enhancement layer block , the encoder comprising : a lower layer encoding unit which generates characteristic data for the lower layer block , wherein the characteristic data comprises at least one of energy of the residual data of the lower layer block and a value of a coded block pattern (CBP) of the lower layer block ; an enhancement layer encoding unit which codes the residual prediction flag using a coding method for the residual prediction flag , wherein the method is determined according to the characteristic data . US20060233254A1 CLAIM 64 . The coding method of claim 55 , wherein the coding of the data for the enhancement layer data is performed by entropy coding (square partitions) . US20060233254A1 CLAIM 78 . A video encoder comprising : a base layer encoder which obtains a base layer frame from an input frame ; and an enhancement layer encoder which generates data (video blocks) for an enhancement layer predicted from reference data comprising at least one of data for the base layer frame , data for a second enhancement layer at a different temporal position and data for a different region of the enhancement layer , wherein the enhancement layer encoder comprises a transformer which codes the data for the enhancement layer according to a result of determining whether the data for the enhancement layer is predictable from the reference data . US20060233254A1 CLAIM 89 . A video decoder comprising : a base layer decoder (syntax information, generating syntax information) which decodes an input base layer frame ; and an enhancement layer decoder which decodes data for an enhancement layer by referring to the base layer frame , wherein the enhancement layer decoder comprises an inverse transformer which decodes the data for the enhancement layer according to a result of determining whether data for the base layer frame is needed to predict the data for the enhancement layer . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (layer decoder) comprises a fixed-length code corresponding to the size of the plurality of video blocks (generates data) . |
US20060233254A1 CLAIM 78 . A video encoder comprising : a base layer encoder which obtains a base layer frame from an input frame ; and an enhancement layer encoder which generates data (video blocks) for an enhancement layer predicted from reference data comprising at least one of data for the base layer frame , data for a second enhancement layer at a different temporal position and data for a different region of the enhancement layer , wherein the enhancement layer encoder comprises a transformer which codes the data for the enhancement layer according to a result of determining whether the data for the enhancement layer is predictable from the reference data . US20060233254A1 CLAIM 89 . A video decoder comprising : a base layer decoder (syntax information, generating syntax information) which decodes an input base layer frame ; and an enhancement layer decoder which decodes data for an enhancement layer by referring to the base layer frame , wherein the enhancement layer decoder comprises an inverse transformer which decodes the data for the enhancement layer according to a result of determining whether data for the base layer frame is needed to predict the data for the enhancement layer . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | CN101204092A Filed: 2006-02-13 Issued: 2008-06-18 从低分辨率图像导出高分辨率图像的编码信息的方法以及实施所述方法的编码和解码设备 (Original Assignee) 汤姆森许可贸易公司 纪尧姆·布瓦松, 尼古拉斯·比尔丹, 爱德华·弗朗索瓦, 帕特里克·洛佩兹, 格温埃利·马康, 热罗姆·维耶龙 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (的尺寸) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
CN101204092A CLAIM 15、 如权利要求1至14之一所述的方法,其中,所述第一尺寸块 具有8*8像素的尺寸 (minimum size, minimum size value) ,所述宏块具有16*16像素的尺寸,以及所述第二尺寸块具有4*4像素的尺寸。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (的尺寸) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
CN101204092A CLAIM 15、 如权利要求1至14之一所述的方法,其中,所述第一尺寸块 具有8*8像素的尺寸 (minimum size, minimum size value) ,所述宏块具有16*16像素的尺寸,以及所述第二尺寸块具有4*4像素的尺寸。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20060120450A1 Filed: 2005-12-01 Issued: 2006-06-08 Method and apparatus for multi-layered video encoding and decoding (Original Assignee) Samsung Electronics Co Ltd (Current Assignee) Samsung Electronics Co Ltd Woo-jin Han, Sang-Chang Cha, Ho-Jin Ha |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions (intra-prediction modes) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax (prediction residuals) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20060120450A1 CLAIM 1 . A method for encoding video based on a multi-layer structure , comprising : performing intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ; performing prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ; selecting one of the two prediction residuals (first syntax) that offers higher coding efficiency ; and encoding the selected prediction residual . US20060120450A1 CLAIM 2 . The method of claim 1 , wherein the intra-prediction is performed according to 8 directional intra-prediction modes (rectangular partitions) . US20060120450A1 CLAIM 8 . The method of claim 1 , wherein the encoding of the selected prediction residual comprises : performing spatial transform on the selected prediction residual to create transform coefficients (square partitions) ; quantizing the transform coefficients to generate quantization coefficients ; and losslessly encoding the quantization coefficients . US20060120450A1 CLAIM 20 . A video encoder (second syntax element) comprising : a unit configured to perform intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ; a unit configured to perform prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ; a unit configured to select one of the two prediction residuals that offers higher coding efficiency ; and a unit configured to encode the selected prediction residual . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions (intra-prediction modes) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (prediction residuals) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20060120450A1 CLAIM 1 . A method for encoding video based on a multi-layer structure , comprising : performing intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ; performing prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ; selecting one of the two prediction residuals (first syntax) that offers higher coding efficiency ; and encoding the selected prediction residual . US20060120450A1 CLAIM 2 . The method of claim 1 , wherein the intra-prediction is performed according to 8 directional intra-prediction modes (rectangular partitions) . US20060120450A1 CLAIM 8 . The method of claim 1 , wherein the encoding of the selected prediction residual comprises : performing spatial transform on the selected prediction residual to create transform coefficients (square partitions) ; quantizing the transform coefficients to generate quantization coefficients ; and losslessly encoding the quantization coefficients . US20060120450A1 CLAIM 20 . A video encoder (second syntax element) comprising : a unit configured to perform intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ; a unit configured to perform prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ; a unit configured to select one of the two prediction residuals that offers higher coding efficiency ; and a unit configured to encode the selected prediction residual . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20080025391A1 Filed: 2005-11-09 Issued: 2008-01-31 Transcoding Method And Device (Original Assignee) Siemens AG (Current Assignee) Siemens AG Peter Amon, Jurgen Pandel |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (syntax elements, network element) for the coded unit wherein the syntax information includes : a first syntax (syntax elements, network element) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element (syntax elements, network element) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20080025391A1 CLAIM 13 . The method as claimed in claim 12 , wherein the third piece of page data and the third piece of transform coefficient data of the encoded digital video output signal are encoded in accordance with a first video encoding (second syntax) standard and the first and second piece of page data and the first and second piece of transform coefficient data are based on a second video encoding standard . US20080025391A1 CLAIM 15 . The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data . US20080025391A1 CLAIM 28 . A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (syntax elements, network element) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
US20080025391A1 CLAIM 15 . The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data . US20080025391A1 CLAIM 28 . A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (syntax elements, network element) for the coded unit , wherein the syntax information includes : a first syntax (syntax elements, network element) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element (syntax elements, network element) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20080025391A1 CLAIM 13 . The method as claimed in claim 12 , wherein the third piece of page data and the third piece of transform coefficient data of the encoded digital video output signal are encoded in accordance with a first video encoding (second syntax) standard and the first and second piece of page data and the first and second piece of transform coefficient data are based on a second video encoding standard . US20080025391A1 CLAIM 15 . The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data . US20080025391A1 CLAIM 28 . A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (syntax elements, network element) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
US20080025391A1 CLAIM 15 . The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data . US20080025391A1 CLAIM 28 . A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20070206679A1 Filed: 2005-09-06 Issued: 2007-09-06 Motion Image Encoding Method and Motion Image Decoding Method (Original Assignee) Panasonic Corp (Current Assignee) Dolby International AB Chong Lim, Sheng Shen, Shinya Kadono |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (predetermined threshold value) , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20070206679A1 CLAIM 6 . The video coding method according to claim 5 , wherein in the implicit mode , the transform block size is determined to be 4×4 pixels in the case where a degree to which the motion vectors differ from each other is greater than or equal to a predetermined threshold value (minimum size value) , and the transform block size is determined to be 8×8 pixels in the case where the degree to which the motion vectors differ from each other is less than the predetermined threshold value . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (predetermined threshold value) , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20070206679A1 CLAIM 1 . A video coding (generating syntax information) method for coding an input picture on a block-by-block basis so as to generate a coded stream , said method comprising : determining a notification method for notifying a transform block size to be used in orthogonal transform of a current block ; creating an adaptive block transform mode indicating the notification method ; and adding the adaptive block transform mode to the coded stream . US20070206679A1 CLAIM 6 . The video coding method according to claim 5 , wherein in the implicit mode , the transform block size is determined to be 4×4 pixels in the case where a degree to which the motion vectors differ from each other is greater than or equal to a predetermined threshold value (minimum size value) , and the transform block size is determined to be 8×8 pixels in the case where the degree to which the motion vectors differ from each other is less than the predetermined threshold value . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20060126740A1 Filed: 2005-05-25 Issued: 2006-06-15 Shared pipeline architecture for motion vector prediction and residual decoding (Original Assignee) Micronas USA Inc (Current Assignee) TDK Micronas GmbH Teng Lin, Weimin Zeng |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (d log) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax (data state) element (motion vector differences) representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (motion vector differences) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20060126740A1 CLAIM 3 . The system of claim 1 wherein B-type picture decoding has a dual channel read write structure , and log (video blocks) ic index ID searching is performed on a block by block basis . US20060126740A1 CLAIM 7 . The system of claim 5 wherein N=24 , and the N-bits designate current macro block properties including at least one of : intra prediction , inter prediction , skip mode , raw data mode , macro block ID , slice ID , direct mode , macro block split mode , sub block split mode which can be down to 4×4 for luma and 2×2 for chroma and all corresponding intra prediction flag and motion vector differences (first syntax element, second syntax element) on X and Y direction , picture reference index for forward and backward predictions , CABAC bit map and level , and CALVC run level information . US20060126740A1 CLAIM 9 . The system of claim 1 wherein the first stage includes a PCM raw data state (first syntax) machine for carrying put a PCM raw data mode . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (d log) . |
US20060126740A1 CLAIM 3 . The system of claim 1 wherein B-type picture decoding has a dual channel read write structure , and log (video blocks) ic index ID searching is performed on a block by block basis . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (d log) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (data state) element (motion vector differences) representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (motion vector differences) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20060126740A1 CLAIM 3 . The system of claim 1 wherein B-type picture decoding has a dual channel read write structure , and log (video blocks) ic index ID searching is performed on a block by block basis . US20060126740A1 CLAIM 7 . The system of claim 5 wherein N=24 , and the N-bits designate current macro block properties including at least one of : intra prediction , inter prediction , skip mode , raw data mode , macro block ID , slice ID , direct mode , macro block split mode , sub block split mode which can be down to 4×4 for luma and 2×2 for chroma and all corresponding intra prediction flag and motion vector differences (first syntax element, second syntax element) on X and Y direction , picture reference index for forward and backward predictions , CABAC bit map and level , and CALVC run level information . US20060126740A1 CLAIM 9 . The system of claim 1 wherein the first stage includes a PCM raw data state (first syntax) machine for carrying put a PCM raw data mode . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (d log) . |
US20060126740A1 CLAIM 3 . The system of claim 1 wherein B-type picture decoding has a dual channel read write structure , and log (video blocks) ic index ID searching is performed on a block by block basis . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20050025246A1 Filed: 2004-07-16 Issued: 2005-02-03 Decoding jointly coded transform type and subblock pattern information (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC Thomas Holcomb |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (first VLC) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (syntax elements) for the coded unit wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20050025246A1 CLAIM 14 . The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) . US20050025246A1 CLAIM 17 . A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ; decoding the first VLC ; receiving one or more other VLCs signaled at block level ; and decoding each of the one or more other VLCs . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks (first VLC) . |
US20050025246A1 CLAIM 14 . The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) . US20050025246A1 CLAIM 17 . A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ; decoding the first VLC ; receiving one or more other VLCs signaled at block level ; and decoding each of the one or more other VLCs . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (first VLC) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (syntax elements) for the coded unit , wherein the syntax information includes : a first syntax (syntax elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (syntax elements) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20050025246A1 CLAIM 14 . The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) . US20050025246A1 CLAIM 17 . A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ; decoding the first VLC ; receiving one or more other VLCs signaled at block level ; and decoding each of the one or more other VLCs . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (syntax elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks (first VLC) . |
US20050025246A1 CLAIM 14 . The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, syntax information, first syntax element, second syntax element, generate syntax information, generating syntax information) . US20050025246A1 CLAIM 17 . A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ; decoding the first VLC ; receiving one or more other VLCs signaled at block level ; and decoding each of the one or more other VLCs . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20050053292A1 Filed: 2004-06-29 Issued: 2005-03-10 Advanced bi-directional predictive coding of interlaced video (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC Kunal Mukerjee, Thomas Holcomb |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second motion vector predictor) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20050053292A1 CLAIM 8 . The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ; and calculating a second motion vector predictor (block pattern) based at least in part on one or more of the second set of plural motion vector predictor candidates . US20050053292A1 CLAIM 28 . A computer-readable medium having stored thereon computer-executable instructions for performing the method of claim 16 in a video encoder (second syntax element) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second motion vector predictor) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (video encoder) representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20050053292A1 CLAIM 8 . The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ; and calculating a second motion vector predictor (block pattern) based at least in part on one or more of the second set of plural motion vector predictor candidates . US20050053292A1 CLAIM 28 . A computer-readable medium having stored thereon computer-executable instructions for performing the method of claim 16 in a video encoder (second syntax element) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | CN1810037A Filed: 2004-06-17 Issued: 2006-07-26 帧间的快速模式确定编码 (Original Assignee) 汤姆森许可贸易公司 尹鹏, 亚历山德罗斯·迈克尔·图拉普斯, 吉尔·麦克唐纳·布瓦斯 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (一种视频编码方法, 针对帧) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (尺寸的) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
CN1810037A CLAIM 1 . 一种视频编码方法 (video blocks) ,用于选择帧间编码的帧的当前宏块的模式,该方法包括以下步骤的至少一个:针对宏块模式的子集来检查第一模式,响应检查的第一模式的运动矢量信息,选择性地检查其它模式,并且响应检查的模式,选择针对当前宏块的模式;检查至少一个相邻宏块的宏块模式,并且响应所述至少一个检查的相邻宏块的宏块模式,选择针对当前宏块的模式;检查宏块模式的子集的成本,如果检查的成本满足预设标准,进一步只检查帧内编码模式,并且响应检查的模式来选择针对当前宏块的模式;以及响应检查的宏块模式来调整及早停止阈值,并且如果满足调整的及早停止阈值,则响应检查的宏块模式来选择针对当前宏块的模式。 CN1810037A CLAIM 7 . 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。 CN1810037A CLAIM 10 . 根据权利要求1所述的视频编码方法,还包括:最初针对帧 (video blocks) 间模式和帧内模式的子集执行模式检查;响应模式检查,计算复杂度测量;以及使用复杂度测量来确定是否应该执行其它的帧间模式或帧内模式。 |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (一种视频编码方法, 针对帧) . |
CN1810037A CLAIM 1 . 一种视频编码方法 (video blocks) ,用于选择帧间编码的帧的当前宏块的模式,该方法包括以下步骤的至少一个:针对宏块模式的子集来检查第一模式,响应检查的第一模式的运动矢量信息,选择性地检查其它模式,并且响应检查的模式,选择针对当前宏块的模式;检查至少一个相邻宏块的宏块模式,并且响应所述至少一个检查的相邻宏块的宏块模式,选择针对当前宏块的模式;检查宏块模式的子集的成本,如果检查的成本满足预设标准,进一步只检查帧内编码模式,并且响应检查的模式来选择针对当前宏块的模式;以及响应检查的宏块模式来调整及早停止阈值,并且如果满足调整的及早停止阈值,则响应检查的宏块模式来选择针对当前宏块的模式。 CN1810037A CLAIM 10 . 根据权利要求1所述的视频编码方法,还包括:最初针对帧 (video blocks) 间模式和帧内模式的子集执行模式检查;响应模式检查,计算复杂度测量;以及使用复杂度测量来确定是否应该执行其它的帧间模式或帧内模式。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (一种视频编码方法, 针对帧) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (尺寸的) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
CN1810037A CLAIM 1 . 一种视频编码方法 (video blocks) ,用于选择帧间编码的帧的当前宏块的模式,该方法包括以下步骤的至少一个:针对宏块模式的子集来检查第一模式,响应检查的第一模式的运动矢量信息,选择性地检查其它模式,并且响应检查的模式,选择针对当前宏块的模式;检查至少一个相邻宏块的宏块模式,并且响应所述至少一个检查的相邻宏块的宏块模式,选择针对当前宏块的模式;检查宏块模式的子集的成本,如果检查的成本满足预设标准,进一步只检查帧内编码模式,并且响应检查的模式来选择针对当前宏块的模式;以及响应检查的宏块模式来调整及早停止阈值,并且如果满足调整的及早停止阈值,则响应检查的宏块模式来选择针对当前宏块的模式。 CN1810037A CLAIM 7 . 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。 CN1810037A CLAIM 10 . 根据权利要求1所述的视频编码方法,还包括:最初针对帧 (video blocks) 间模式和帧内模式的子集执行模式检查;响应模式检查,计算复杂度测量;以及使用复杂度测量来确定是否应该执行其它的帧间模式或帧内模式。 |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (一种视频编码方法, 针对帧) . |
CN1810037A CLAIM 1 . 一种视频编码方法 (video blocks) ,用于选择帧间编码的帧的当前宏块的模式,该方法包括以下步骤的至少一个:针对宏块模式的子集来检查第一模式,响应检查的第一模式的运动矢量信息,选择性地检查其它模式,并且响应检查的模式,选择针对当前宏块的模式;检查至少一个相邻宏块的宏块模式,并且响应所述至少一个检查的相邻宏块的宏块模式,选择针对当前宏块的模式;检查宏块模式的子集的成本,如果检查的成本满足预设标准,进一步只检查帧内编码模式,并且响应检查的模式来选择针对当前宏块的模式;以及响应检查的宏块模式来调整及早停止阈值,并且如果满足调整的及早停止阈值,则响应检查的宏块模式来选择针对当前宏块的模式。 CN1810037A CLAIM 10 . 根据权利要求1所述的视频编码方法,还包括:最初针对帧 (video blocks) 间模式和帧内模式的子集执行模式检查;响应模式检查,计算复杂度测量;以及使用复杂度测量来确定是否应该执行其它的帧间模式或帧内模式。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20050135484A1 Filed: 2004-06-07 Issued: 2005-06-23 Method of encoding mode determination, method of motion estimation and encoding apparatus (Original Assignee) Samsung Electronics Co Ltd; Sejong University Industry-Academy Cooperation Group (Current Assignee) Samsung Electronics Co Ltd ; Sejong University Industry-Academy Cooperation Group Yung-Lyul Lee, Ki-hoon Han |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (current picture) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (other ones) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20050135484A1 CLAIM 2 . The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size, maximum size value) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ; and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison . US20050135484A1 CLAIM 16 . A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ; determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ; calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size) of the plurality of modes ; selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ; and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (current picture) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (other ones) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20050135484A1 CLAIM 2 . The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size, maximum size value) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ; and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison . US20050135484A1 CLAIM 16 . A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ; determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ; calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size) of the plurality of modes ; selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ; and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | CN1857001A Filed: 2004-05-20 Issued: 2006-11-01 混合视频压缩方法 (Original Assignee) Amt先进多媒体科技公司 P·托思, J·卡罗利 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (块进行编码) of size NxN (余弦变换) , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (的尺寸, 尺寸的) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
CN1857001A CLAIM 5 . 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size, minimum size value) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。 CN1857001A CLAIM 6 . 按照权利要求1-5中任何一条权利要求的方法,其特征在于:应用离散余弦变换 (size NxN) (DCT)作为将空间域表示转换成频率域表示的变换。 CN1857001A CLAIM 14 . 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size, minimum size value) 和可用于传输编码数据的带宽而修改的。 CN1857001A CLAIM 23 . 用于对数字编码的视频帧序列进行压缩的方法,其中某些帧的信息内容根据前导或后继帧(参考帧)的内容进行编码,该方法进一步包括如下步骤:a . 将所述的待编码帧划分成块,b . 在包含所述待编码块的帧之前或后续的参考图像中为给定的待编码块搜索一个匹配的参考块,c . 通过将匹配的参考块与待编码块进行比较来执行一个可压缩性分析,d . 根据可压缩性分析的结果选择最佳参考块,以及e . 使用刚刚选择的最佳参考块对所述的块进行编码 (video blocks) ,其特征在于,在步骤b中,在搜索参考块的过程中:i)将待编码块划分成子块,ii)分析子块的内容,iii)根据预定义的标准,选择预定数量的子块,优选地,至少为两块,iv)使用所选择的子块执行参考块搜索,所述的搜索是在所选择的参考帧中的特定搜索范围内执行的,以便搜索包含与所选择的子块区别最小的子块的参考块,在所述的搜索过程中,所选择块的相对位置保持不变,以及v)使用所选择的子块来选择最佳参考块作为搜索结果。 |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (块进行编码) . |
CN1857001A CLAIM 23 . 用于对数字编码的视频帧序列进行压缩的方法,其中某些帧的信息内容根据前导或后继帧(参考帧)的内容进行编码,该方法进一步包括如下步骤:a . 将所述的待编码帧划分成块,b . 在包含所述待编码块的帧之前或后续的参考图像中为给定的待编码块搜索一个匹配的参考块,c . 通过将匹配的参考块与待编码块进行比较来执行一个可压缩性分析,d . 根据可压缩性分析的结果选择最佳参考块,以及e . 使用刚刚选择的最佳参考块对所述的块进行编码 (video blocks) ,其特征在于,在步骤b中,在搜索参考块的过程中:i)将待编码块划分成子块,ii)分析子块的内容,iii)根据预定义的标准,选择预定数量的子块,优选地,至少为两块,iv)使用所选择的子块执行参考块搜索,所述的搜索是在所选择的参考帧中的特定搜索范围内执行的,以便搜索包含与所选择的子块区别最小的子块的参考块,在所述的搜索过程中,所选择块的相对位置保持不变,以及v)使用所选择的子块来选择最佳参考块作为搜索结果。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (块进行编码) of size NxN (余弦变换) , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (从而产生) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (的尺寸, 尺寸的) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
CN1857001A CLAIM 5 . 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size, minimum size value) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。 CN1857001A CLAIM 6 . 按照权利要求1-5中任何一条权利要求的方法,其特征在于:应用离散余弦变换 (size NxN) (DCT)作为将空间域表示转换成频率域表示的变换。 CN1857001A CLAIM 14 . 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size, minimum size value) 和可用于传输编码数据的带宽而修改的。 CN1857001A CLAIM 23 . 用于对数字编码的视频帧序列进行压缩的方法,其中某些帧的信息内容根据前导或后继帧(参考帧)的内容进行编码,该方法进一步包括如下步骤:a . 将所述的待编码帧划分成块,b . 在包含所述待编码块的帧之前或后续的参考图像中为给定的待编码块搜索一个匹配的参考块,c . 通过将匹配的参考块与待编码块进行比较来执行一个可压缩性分析,d . 根据可压缩性分析的结果选择最佳参考块,以及e . 使用刚刚选择的最佳参考块对所述的块进行编码 (video blocks) ,其特征在于,在步骤b中,在搜索参考块的过程中:i)将待编码块划分成子块,ii)分析子块的内容,iii)根据预定义的标准,选择预定数量的子块,优选地,至少为两块,iv)使用所选择的子块执行参考块搜索,所述的搜索是在所选择的参考帧中的特定搜索范围内执行的,以便搜索包含与所选择的子块区别最小的子块的参考块,在所述的搜索过程中,所选择块的相对位置保持不变,以及v)使用所选择的子块来选择最佳参考块作为搜索结果。 CN1857001A CLAIM 33 . 按照权利要求23-32中任何一条权利要求的方法,其特征在于:在差分块的信息内容上执行一个将空间表示转换成频率表示的变换(DCT或哈达马变换),从而产生 (generating syntax information) 多元素的二维变换块(DCT或哈达马系数矩阵),并且用熵编码对变换块的信息内容进行编码。 |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (块进行编码) . |
CN1857001A CLAIM 23 . 用于对数字编码的视频帧序列进行压缩的方法,其中某些帧的信息内容根据前导或后继帧(参考帧)的内容进行编码,该方法进一步包括如下步骤:a . 将所述的待编码帧划分成块,b . 在包含所述待编码块的帧之前或后续的参考图像中为给定的待编码块搜索一个匹配的参考块,c . 通过将匹配的参考块与待编码块进行比较来执行一个可压缩性分析,d . 根据可压缩性分析的结果选择最佳参考块,以及e . 使用刚刚选择的最佳参考块对所述的块进行编码 (video blocks) ,其特征在于,在步骤b中,在搜索参考块的过程中:i)将待编码块划分成子块,ii)分析子块的内容,iii)根据预定义的标准,选择预定数量的子块,优选地,至少为两块,iv)使用所选择的子块执行参考块搜索,所述的搜索是在所选择的参考帧中的特定搜索范围内执行的,以便搜索包含与所选择的子块区别最小的子块的参考块,在所述的搜索过程中,所选择块的相对位置保持不变,以及v)使用所选择的子块来选择最佳参考块作为搜索结果。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | WO2004104930A2 Filed: 2004-05-20 Issued: 2004-12-02 Hybrid video compression method (Original Assignee) Amt Advanced Multimedia Technology Ab Peter Toth, József KÁROLY |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (given block) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (subsequent frame) ; generate syntax information (processing element, data elements) for the coded unit wherein the syntax information includes : a first syntax (processing element, data elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (processing element, data elements) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
WO2004104930A2 CLAIM 1 . Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ; representation , and finally e , encoding the information content of the transformed blocks by entropy coding (square partitions) , characterised by that * . ' ; * - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c . WO2004104930A2 CLAIM 4 . The method according to Claim 3 , characterised by that the variance an using the following formula : M M ∑ pixel - (∑ pixel j) 2 variance = J=0 j=0 M where M is the number of elements in the given block (video blocks) or sub-block and pixel(i) is an element of the uncompressed (original) block , with the computed variance value being compared with a given threshold value to establish if the variance exceeds said given threshold value . WO2004104930A2 CLAIM 23 . Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frame (block pattern) s (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks . WO2004104930A2 CLAIM 35 . Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ; characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network . WO2004104930A2 CLAIM 46 . The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (processing element, data elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks (given block) . |
WO2004104930A2 CLAIM 4 . The method according to Claim 3 , characterised by that the variance an using the following formula : M M ∑ pixel - (∑ pixel j) 2 variance = J=0 j=0 M where M is the number of elements in the given block (video blocks) or sub-block and pixel(i) is an element of the uncompressed (original) block , with the computed variance value being compared with a given threshold value to establish if the variance exceeds said given threshold value . WO2004104930A2 CLAIM 35 . Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ; characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network . WO2004104930A2 CLAIM 46 . The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (given block) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (subsequent frame) ; generating syntax information (processing element, data elements) for the coded unit , wherein the syntax information includes : a first syntax (processing element, data elements) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (processing element, data elements) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
WO2004104930A2 CLAIM 1 . Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ; representation , and finally e , encoding the information content of the transformed blocks by entropy coding (square partitions) , characterised by that * . ' ; * - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c . WO2004104930A2 CLAIM 4 . The method according to Claim 3 , characterised by that the variance an using the following formula : M M ∑ pixel - (∑ pixel j) 2 variance = J=0 j=0 M where M is the number of elements in the given block (video blocks) or sub-block and pixel(i) is an element of the uncompressed (original) block , with the computed variance value being compared with a given threshold value to establish if the variance exceeds said given threshold value . WO2004104930A2 CLAIM 23 . Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frame (block pattern) s (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks . WO2004104930A2 CLAIM 35 . Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ; characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network . WO2004104930A2 CLAIM 46 . The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (processing element, data elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks (given block) . |
WO2004104930A2 CLAIM 4 . The method according to Claim 3 , characterised by that the variance an using the following formula : M M ∑ pixel - (∑ pixel j) 2 variance = J=0 j=0 M where M is the number of elements in the given block (video blocks) or sub-block and pixel(i) is an element of the uncompressed (original) block , with the computed variance value being compared with a given threshold value to establish if the variance exceeds said given threshold value . WO2004104930A2 CLAIM 35 . Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ; characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network . WO2004104930A2 CLAIM 46 . The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window . |
EP2347591B1 CLAIM 7 A computer-readable medium comprising instructions for causing a programmable processor (processing element, data elements) to perform the methods of any of claims 4-6 . |
WO2004104930A2 CLAIM 35 . Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ; characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network . WO2004104930A2 CLAIM 46 . The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (first syntax, programmable processor, syntax information, generate syntax information, generating syntax information, second syntax) stored in the time window . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20070053436A1 Filed: 2004-05-04 Issued: 2007-03-08 Encoding video information using block based adaptive scan order (Original Assignee) Koninklijke Philips NV (Current Assignee) Koninklijke Philips NV Lambertus Van Eggelen |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (preceding image) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (corresponding data blocks) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20070053436A1 CLAIM 2 . A method according to claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ; spatial scaling aspect ratio of one or more image frames present in the video information ; pulldown material being present in the data of one or more of the image frames ; one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ; a degree of temporal motion occurring in a series of the image frames ; and statistical data relating to earlier selected scanning routes and their associated data compression performance . US20070053436A1 CLAIM 3 . A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (preceding image) . |
US20070053436A1 CLAIM 2 . A method according to claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ; spatial scaling aspect ratio of one or more image frames present in the video information ; pulldown material being present in the data of one or more of the image frames ; one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ; a degree of temporal motion occurring in a series of the image frames ; and statistical data relating to earlier selected scanning routes and their associated data compression performance . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (preceding image) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (corresponding data blocks) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20070053436A1 CLAIM 2 . A method according to claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ; spatial scaling aspect ratio of one or more image frames present in the video information ; pulldown material being present in the data of one or more of the image frames ; one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ; a degree of temporal motion occurring in a series of the image frames ; and statistical data relating to earlier selected scanning routes and their associated data compression performance . US20070053436A1 CLAIM 3 . A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (preceding image) . |
US20070053436A1 CLAIM 2 . A method according to claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ; spatial scaling aspect ratio of one or more image frames present in the video information ; pulldown material being present in the data of one or more of the image frames ; one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ; a degree of temporal motion occurring in a series of the image frames ; and statistical data relating to earlier selected scanning routes and their associated data compression performance . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20040234144A1 Filed: 2004-02-18 Issued: 2004-11-25 Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program (Original Assignee) NTT Docomo Inc (Current Assignee) NTT Docomo Inc Kazuo Sugimoto, Satoru Adachi, Sadaatsu Kato, Minoru Etoh, Mitsuru Kobayashi, Hiroyuki Yamaguchi |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (absolute values) , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20040234144A1 CLAIM 1 . An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ; a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ; a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ; a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ; a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ; and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ; wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ; a coding procedure provider for providing one or more types of coding procedures ; and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data . US20040234144A1 CLAIM 6 . The image encoding apparatus according to claim 1 , wherein , on the occasion of encoding the quantized transform coefficients in the coding target macroblock , the coding procedure provider provides a coding procedure of setting a smaller deviation of code lengths with increase in absolute values (minimum size value) of the quantized transform coefficients in a neighboring macroblock to the coding target macroblock . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (absolute values) , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20040234144A1 CLAIM 1 . An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ; a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ; a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ; a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ; a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ; and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ; wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ; a coding procedure provider for providing one or more types of coding procedures ; and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data . US20040234144A1 CLAIM 6 . The image encoding apparatus according to claim 1 , wherein , on the occasion of encoding the quantized transform coefficients in the coding target macroblock , the coding procedure provider provides a coding procedure of setting a smaller deviation of code lengths with increase in absolute values (minimum size value) of the quantized transform coefficients in a neighboring macroblock to the coding target macroblock . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20050276323A1 Filed: 2003-09-26 Issued: 2005-12-15 Real-time video coding/decoding (Original Assignee) Vanguard Video LLC (Current Assignee) Beamr Imaging Ltd Alexey Martemyanov, Nick Terterov, Alexander Zheludkov, Alexey Dolgoborodov, Vladimir Semenyuk, Eugene Neimark, Irena Terterov |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients, entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax (motion vector refinement, performing texture, entropy encoding, based mode) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (motion vector refinement, performing texture, entropy encoding, based mode) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20050276323A1 CLAIM 1 . A video codec for encoding/decoding digitized sequence of video frames with high compression efficiency , comprising : means for frame encoding ; means for setting and storing codec setting parameters ; means for controlling desired frame encoding time and CPU loading ; means for rate control including size of frame encoding output bitstream ; and means for arithmetic coding of quantized transform coefficients (square partitions) . US20050276323A1 CLAIM 8 . The video codec of claim 7 , comprising means for macroblock encoding , comprising : means for intra prediction , at least one means for inter prediction , means for selecting macroblock type and encoding setting , means for calculating macroblock texture prediction and prediction error , means for performing texture (second syntax, first syntax, first syntax element) prediction error transform and transform coefficient quantization ; means calculating motion vector prediction and prediction error ; and means for entropy encoding (second syntax, first syntax, first syntax element) providing arithmetic context encoding of motion vectors , header parameters and transform coefficients . US20050276323A1 CLAIM 24 . The method of claim 23 , comprising wavelet transform , wherein resulting wavelet transform coefficients are compressed by context-based entropy coding (square partitions) . US20050276323A1 CLAIM 30 . The method of claim 28 , wherein the motion estimation comprises : calculating motion vectors MV(wb , hb , CF , RF) with integer-pel accuracy using previously calculated motion data ; calculating motion vectors MV(wb , hb , CF , RF) [block x][block y] performing inverse logarithmic motion search with parameters block x , block y , current range ; performing motion vector refinement (second syntax, first syntax, first syntax element) choosing from sets of neighboring motion vectors MVNeiborhood (wb , hb , CF , RF)[block x][block y] elements (mvx , mvy) that provide minimum value of motion vector weight function Q(mvx , mvy , CF , RF , wb , hb , block x , block y) ; performing motion vector estimation with quarter-pel accuracy based on results of motion vector estimation with integer-pel accuracy by changing components of the integer-pel accuracy motion vector MV(wb , hb , CF , RF) [block x][block y] in range [−¾ ; +¾] with a step ¼ ; and calculating motion vectors MV(wb , hb , CF , RF) with quarter-pel accuracy by sequentially applying the motion estimation steps with integer-pel accuracy and with quarter-pel accuracy , wherein CF—current frame with horizontal coordinate x and vertical coordinate y ; RF—reference frame with horizontal coordinate x and vertical coordinate y ; wb—width of the blocks for which motion estimation is performed ; hb—height of the blocks for which motion estimation is performed ; W—a multiple of wb , current and reference frame width ; H—a multiple of hb , current and reference frame height ; Q(mvx , mvy , CF , RF , wb , hb , block x , block y)—motion vector weight calculation function ; MV(wb , hb , CF , RF)[block x][block y]—motion vector (i . e . pair (mvx , mvy) of integers) corresponding to the frame CF and reference frame RF for a block of width wb , height hb , which left-top corner is located at a pixel with horizontal coordinate block x and vertical coordinate block y ; MV(wb , hb , CF , RF)—a set of motion vectors MV(wb , hb , CF , RF)[block x][block y] for : block x=0 , wb , 2 wb , 3 wb , . . . , block x< ; W , and block y=0 , hb , 2·hb , 3·hb , . . . , block y< ; H . MVNeighborhood (wb , hb , CF , RF)[block x][block y]—a set of neighboring motion vectors MV(wb , hb , CF , RF)[nx][ny] , where nx may be equal to block x−wb , block x , block x+wb , and ny may be equal to block y−hb , block y , block y+hb , and nx≧0 , ny≧0 , nx≦W−wb , ny≦H−hb . US20050276323A1 CLAIM 34 . A method of decoding of sequence of video frames encoded according to claim 13 , comprising steps of : arithmetic decoding ; decoding coded block pattern (block pattern) of macroblock mode and texture using arithmetic context-based mode (second syntax, first syntax, first syntax element) ling ; decoding texture prediction error using arithmetic context-based modeling ; calculating prediction for motion vectors ; and decoding motion vectors using context-based arithmetic modeling . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients, entropy coding) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (motion vector refinement, performing texture, entropy encoding, based mode) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (motion vector refinement, performing texture, entropy encoding, based mode) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20050276323A1 CLAIM 1 . A video codec for encoding/decoding digitized sequence of video frames with high compression efficiency , comprising : means for frame encoding ; means for setting and storing codec setting parameters ; means for controlling desired frame encoding time and CPU loading ; means for rate control including size of frame encoding output bitstream ; and means for arithmetic coding of quantized transform coefficients (square partitions) . US20050276323A1 CLAIM 8 . The video codec of claim 7 , comprising means for macroblock encoding , comprising : means for intra prediction , at least one means for inter prediction , means for selecting macroblock type and encoding setting , means for calculating macroblock texture prediction and prediction error , means for performing texture (second syntax, first syntax, first syntax element) prediction error transform and transform coefficient quantization ; means calculating motion vector prediction and prediction error ; and means for entropy encoding (second syntax, first syntax, first syntax element) providing arithmetic context encoding of motion vectors , header parameters and transform coefficients . US20050276323A1 CLAIM 24 . The method of claim 23 , comprising wavelet transform , wherein resulting wavelet transform coefficients are compressed by context-based entropy coding (square partitions) . US20050276323A1 CLAIM 30 . The method of claim 28 , wherein the motion estimation comprises : calculating motion vectors MV(wb , hb , CF , RF) with integer-pel accuracy using previously calculated motion data ; calculating motion vectors MV(wb , hb , CF , RF) [block x][block y] performing inverse logarithmic motion search with parameters block x , block y , current range ; performing motion vector refinement (second syntax, first syntax, first syntax element) choosing from sets of neighboring motion vectors MVNeiborhood (wb , hb , CF , RF)[block x][block y] elements (mvx , mvy) that provide minimum value of motion vector weight function Q(mvx , mvy , CF , RF , wb , hb , block x , block y) ; performing motion vector estimation with quarter-pel accuracy based on results of motion vector estimation with integer-pel accuracy by changing components of the integer-pel accuracy motion vector MV(wb , hb , CF , RF) [block x][block y] in range [−¾ ; +¾] with a step ¼ ; and calculating motion vectors MV(wb , hb , CF , RF) with quarter-pel accuracy by sequentially applying the motion estimation steps with integer-pel accuracy and with quarter-pel accuracy , wherein CF—current frame with horizontal coordinate x and vertical coordinate y ; RF—reference frame with horizontal coordinate x and vertical coordinate y ; wb—width of the blocks for which motion estimation is performed ; hb—height of the blocks for which motion estimation is performed ; W—a multiple of wb , current and reference frame width ; H—a multiple of hb , current and reference frame height ; Q(mvx , mvy , CF , RF , wb , hb , block x , block y)—motion vector weight calculation function ; MV(wb , hb , CF , RF)[block x][block y]—motion vector (i . e . pair (mvx , mvy) of integers) corresponding to the frame CF and reference frame RF for a block of width wb , height hb , which left-top corner is located at a pixel with horizontal coordinate block x and vertical coordinate block y ; MV(wb , hb , CF , RF)—a set of motion vectors MV(wb , hb , CF , RF)[block x][block y] for : block x=0 , wb , 2 wb , 3 wb , . . . , block x< ; W , and block y=0 , hb , 2·hb , 3·hb , . . . , block y< ; H . MVNeighborhood (wb , hb , CF , RF)[block x][block y]—a set of neighboring motion vectors MV(wb , hb , CF , RF)[nx][ny] , where nx may be equal to block x−wb , block x , block x+wb , and ny may be equal to block y−hb , block y , block y+hb , and nx≧0 , ny≧0 , nx≦W−wb , ny≦H−hb . US20050276323A1 CLAIM 34 . A method of decoding of sequence of video frames encoded according to claim 13 , comprising steps of : arithmetic decoding ; decoding coded block pattern (block pattern) of macroblock mode and texture using arithmetic context-based mode (second syntax, first syntax, first syntax element) ling ; decoding texture prediction error using arithmetic context-based modeling ; calculating prediction for motion vectors ; and decoding motion vectors using context-based arithmetic modeling . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | CN101087424A Filed: 2003-05-27 Issued: 2007-12-12 图像内预测模式估计、通信和组织的方法和系统 (Original Assignee) 夏普株式会社 孙式军, L·J·柯罗夫斯基 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (的左侧) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (平均值) , the minimum size value indicating a size of the smallest partition for the coded unit . |
CN101087424A CLAIM 2 . 一种用于估计象素内预测模式的方法,该方法包括:用于使用邻接块中的一个象素的象素值来预测要被预测的目标块的象素值的内预测步骤;以及用于估计用于目标块的预测模式的预测模式估计步骤,其中所述内预测步骤使用至少一个使用等于邻接于目标块并且在目标块之上的第一个块的象素值的预测值的垂直预测模式、使用等于邻接于目标块的左侧 (rectangular partitions) 的第二个块的象素值的预测值的水平预测模式和使用等于第一个块和第二个块的象素值的平均值 (minimum size value) 的预测值的DC预测模式,利用逐渐增加的数字按照垂直预测模式、水平预测模式和DC预测模式的顺序对于预测模式进行编号,以及预测模式估计步骤确定一个具有第一个块的预测模式和第二个块的预测模式中的较低模式编号的预测模式作为用于目标块的预测模式。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (的左侧) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (平均值) , the minimum size value indicating a size of a smallest partition for the coded unit . |
CN101087424A CLAIM 2 . 一种用于估计象素内预测模式的方法,该方法包括:用于使用邻接块中的一个象素的象素值来预测要被预测的目标块的象素值的内预测步骤;以及用于估计用于目标块的预测模式的预测模式估计步骤,其中所述内预测步骤使用至少一个使用等于邻接于目标块并且在目标块之上的第一个块的象素值的预测值的垂直预测模式、使用等于邻接于目标块的左侧 (rectangular partitions) 的第二个块的象素值的预测值的水平预测模式和使用等于第一个块和第二个块的象素值的平均值 (minimum size value) 的预测值的DC预测模式,利用逐渐增加的数字按照垂直预测模式、水平预测模式和DC预测模式的顺序对于预测模式进行编号,以及预测模式估计步骤确定一个具有第一个块的预测模式和第二个块的预测模式中的较低模式编号的预测模式作为用于目标块的预测模式。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | CN1457196A Filed: 2003-04-25 Issued: 2003-11-19 基于时空域相关性运动矢量预测的视频编码方法 (Original Assignee) 北京工业大学 薛金柱, 沈兰荪 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (一个视频) , the minimum size value indicating a size of the smallest partition for the coded unit . |
CN1457196A CLAIM 1 . 基于时空域相关性运动矢量预测的视频编码方法,是由摄像机(1),将目标物状态转换成视频信号置于采集卡(2)中;采集卡将视频信号转换成数字视频序列,并存放在视频缓存中,这些数字视频序列或是以视频序列文件形式存放在计算机硬盘中的视频数据,作为该系统的输入用于压缩;计算机(3)存放原始视频序列和执行视频编码程序,并生成压缩后的码流文件,本发明的特征在于:计算机首先从采集卡的视频存储器或存放在计算机硬盘上的视频序列文件中读一帧视频数据到计算机的缓冲区中,通过执行视频编码子程序对该帧进行编码;视频编码子程序在视频编码时,一个输入的视频序列帧被分为16×16的宏块,编码过程是以宏块为单位进行的;计算机开始执行视频编码子程序后,首先对编码进行初始化;然后计算机按照从上到下、从左到右的次序先对第一个宏块做运动估计与补偿;运动估计与补偿之后得到该宏块的运动矢量和残差图像,对其中的运动矢量,首先执行运动矢量子程序进行预测,然后对预测差值进行上下文自适应的变长编码(CAVLC);而对残差图像则首先进行DCT变换,然后对DCT系数进行量化和CAVLC编码;之后输出该宏块的压缩码流放入计算机的缓冲区中;最后进行解码、生成相应参考宏块;该宏块编码结束后,对当前编码帧的所有宏块循环执行上述编码过程,完成对一个视频 (minimum size value) 序列帧的编码;一帧编码结束后循环编码下一帧,至到视频序列的最后一帧编码结束,并生成压缩后的码流文件,系统执行程序结束;在上述的视频编码子程序中,运动矢量预测子程序中所采用的预测方法,是根据相邻宏块间运动相关性的不同,综合利用了时间和空间域上的相关性对其进行预测编码的。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (基于运动) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (一个视频) , the minimum size value indicating a size of a smallest partition for the coded unit . |
CN1457196A CLAIM 1 . 基于时空域相关性运动矢量预测的视频编码方法,是由摄像机(1),将目标物状态转换成视频信号置于采集卡(2)中;采集卡将视频信号转换成数字视频序列,并存放在视频缓存中,这些数字视频序列或是以视频序列文件形式存放在计算机硬盘中的视频数据,作为该系统的输入用于压缩;计算机(3)存放原始视频序列和执行视频编码程序,并生成压缩后的码流文件,本发明的特征在于:计算机首先从采集卡的视频存储器或存放在计算机硬盘上的视频序列文件中读一帧视频数据到计算机的缓冲区中,通过执行视频编码子程序对该帧进行编码;视频编码子程序在视频编码时,一个输入的视频序列帧被分为16×16的宏块,编码过程是以宏块为单位进行的;计算机开始执行视频编码子程序后,首先对编码进行初始化;然后计算机按照从上到下、从左到右的次序先对第一个宏块做运动估计与补偿;运动估计与补偿之后得到该宏块的运动矢量和残差图像,对其中的运动矢量,首先执行运动矢量子程序进行预测,然后对预测差值进行上下文自适应的变长编码(CAVLC);而对残差图像则首先进行DCT变换,然后对DCT系数进行量化和CAVLC编码;之后输出该宏块的压缩码流放入计算机的缓冲区中;最后进行解码、生成相应参考宏块;该宏块编码结束后,对当前编码帧的所有宏块循环执行上述编码过程,完成对一个视频 (minimum size value) 序列帧的编码;一帧编码结束后循环编码下一帧,至到视频序列的最后一帧编码结束,并生成压缩后的码流文件,系统执行程序结束;在上述的视频编码子程序中,运动矢量预测子程序中所采用的预测方法,是根据相邻宏块间运动相关性的不同,综合利用了时间和空间域上的相关性对其进行预测编码的。 CN1457196A CLAIM 2 . 根据权利要求1所述的基于时空域相关性运动矢量预测的视频编码方法,其特征在于,视频编码子程序采用了基于运动 (generating syntax information) 补偿/DCT的混合编码方法,除运动矢量预测子程序这一部分外,采用了国际视频编码标准H . 264 JM6 . 0的编码框架。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | EP1478190A1 Filed: 2003-04-25 Issued: 2004-11-17 Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program (Original Assignee) NTT Docomo Inc (Current Assignee) NTT Docomo Inc Satoru c/o NTT DoCoMo Inc. ADACHI, Minoru c/o NTT DoCoMo Inc. ETOH, Sadaatsu c/o NTT DoCoMo Inc. KATO, Mitsuru c/o NTT DoCoMo Inc. KOBAYASHI, Kazuo c/o NTT DoCoMo Inc. SUGIMOTO, Hiroyuki NTT DoCoMo Inc. YAMAGUCHI |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (absolute values) , the minimum size value indicating a size of the smallest partition for the coded unit . |
EP1478190A1 CLAIM 1 An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ; a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ; a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ; a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ; a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ; and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ; wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ; a coding procedure provider for providing one or more types of coding procedures ; and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data . EP1478190A1 CLAIM 6 The image encoding apparatus according to Claim 1 , wherein , on the occasion of encoding the quantized transform coefficients in the coding target macroblock , the coding procedure provider provides a coding procedure of setting a smaller deviation of code lengths with increase in absolute values (minimum size value) of the quantized transform coefficients in a neighboring macroblock to the coding target macroblock . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (absolute values) , the minimum size value indicating a size of a smallest partition for the coded unit . |
EP1478190A1 CLAIM 1 An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ; a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ; a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ; a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ; a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ; and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ; wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ; a coding procedure provider for providing one or more types of coding procedures ; and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data . EP1478190A1 CLAIM 6 The image encoding apparatus according to Claim 1 , wherein , on the occasion of encoding the quantized transform coefficients in the coding target macroblock , the coding procedure provider provides a coding procedure of setting a smaller deviation of code lengths with increase in absolute values (minimum size value) of the quantized transform coefficients in a neighboring macroblock to the coding target macroblock . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | CN1647541A Filed: 2003-04-23 Issued: 2005-07-27 用于在视频编码系统中指示量化器参数的方法与设备 (Original Assignee) 诺基亚有限公司 J·莱恩马 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax (多个分) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (一个视频) , the minimum size value indicating a size of the smallest partition for the coded unit . |
CN1647541A CLAIM 6 . 根据权利要求5的编码方法,其特征在于:已编码比特流从一个视频 (minimum size value) 编码设备发射到一个相应的视频解码设备。 CN1647541A CLAIM 13 . 根据权利要求8到12任何一个的编码方法,其特征在于:所述数字视频序列的一帧被划分到多个像素块,所述多个像素块被归组到一个或多个分 (first syntax) 段,并且其中应用到所述转换系数值组的量化级从一帧的一分段到另一分段进行调整,以使对于一帧的一个特定分段,应用到转换系数组的实际量化级不同于缺省量化级。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax (多个分) element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (一个视频) , the minimum size value indicating a size of a smallest partition for the coded unit . |
CN1647541A CLAIM 6 . 根据权利要求5的编码方法,其特征在于:已编码比特流从一个视频 (minimum size value) 编码设备发射到一个相应的视频解码设备。 CN1647541A CLAIM 13 . 根据权利要求8到12任何一个的编码方法,其特征在于:所述数字视频序列的一帧被划分到多个像素块,所述多个像素块被归组到一个或多个分 (first syntax) 段,并且其中应用到所述转换系数值组的量化级从一帧的一分段到另一分段进行调整,以使对于一帧的一个特定分段,应用到转换系数组的实际量化级不同于缺省量化级。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20030179940A1 Filed: 2003-03-19 Issued: 2003-09-25 Efficient macroblock header coding for video compression (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC Chih-Lung Lin, Ming-Chieh Lee |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (color values) or into four square partitions (chrominance values) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (length coding table, block pattern) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (chrominance values) , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20030179940A1 CLAIM 1 . In a video coder for coding video images in a block format , a method for improving compression of the video images comprising : for a macroblock in a video frame , determining whether texture values for the color values (rectangular partitions) of the macroblock are coded and setting the coded block parameters corresponding to the colors to indicate whether or not the texture values are coded ; forming a combined parameter representing all of the coded block parameters for the macroblock ; determining a single variable length code for the combined parameter of the macroblock ; and repeating the above-steps for macroblocks in the video image . US20030179940A1 CLAIM 2 . The method of claim 1 wherein : the texture values are chrominance values (four square partitions, minimum size value) U and V , and luminance values Y , the macroblock includes one block for U , one block for V and foul blocks for Y ; and the coded block parameters include one bit each for U and V indicating whether the corresponding U and V blocks are coded , and four bits for Y indicating whether the four corresponding Y blocks are coded . US20030179940A1 CLAIM 4 . The method of claim 1 further including : selecting a predictor for the coded block parameters ; and computing an exclusive OR between the predictor and the coded block parameters to compute predicted coded block parameters , the predicted coded block parameters forming at least a part of the combined coded block parameter for the macroblock ; wherein the step of determining the single variable length code includes looking up the combined coded block parameter in a variable length coding table (block pattern) to find the single variable length code for the combined coded block parameter . US20030179940A1 CLAIM 13 . The method of claim 12 wherein the first variable length coding table stores entries for variable length codes , each representing a combined macroblock parameter that includes coded block pattern (block pattern) s for chrominance and luminance ; and wherein the second variable length coding table stores entries for variable length codes , each representing a combined macroblock parameter that includes coded block patterns for chrominance and luminance . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (color values) or into four square partitions (chrominance values) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (length coding table, block pattern) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (chrominance values) , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20030179940A1 CLAIM 1 . In a video coder for coding video images in a block format , a method for improving compression of the video images comprising : for a macroblock in a video frame , determining whether texture values for the color values (rectangular partitions) of the macroblock are coded and setting the coded block parameters corresponding to the colors to indicate whether or not the texture values are coded ; forming a combined parameter representing all of the coded block parameters for the macroblock ; determining a single variable length code for the combined parameter of the macroblock ; and repeating the above-steps for macroblocks in the video image . US20030179940A1 CLAIM 2 . The method of claim 1 wherein : the texture values are chrominance values (four square partitions, minimum size value) U and V , and luminance values Y , the macroblock includes one block for U , one block for V and foul blocks for Y ; and the coded block parameters include one bit each for U and V indicating whether the corresponding U and V blocks are coded , and four bits for Y indicating whether the four corresponding Y blocks are coded . US20030179940A1 CLAIM 4 . The method of claim 1 further including : selecting a predictor for the coded block parameters ; and computing an exclusive OR between the predictor and the coded block parameters to compute predicted coded block parameters , the predicted coded block parameters forming at least a part of the combined coded block parameter for the macroblock ; wherein the step of determining the single variable length code includes looking up the combined coded block parameter in a variable length coding table (block pattern) to find the single variable length code for the combined coded block parameter . US20030179940A1 CLAIM 13 . The method of claim 12 wherein the first variable length coding table stores entries for variable length codes , each representing a combined macroblock parameter that includes coded block pattern (block pattern) s for chrominance and luminance ; and wherein the second variable length coding table stores entries for variable length codes , each representing a combined macroblock parameter that includes coded block patterns for chrominance and luminance . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US20030156648A1 Filed: 2002-12-17 Issued: 2003-08-21 Sub-block transform coding of prediction residuals (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC Thomas Holcomb, Chih-Lung Lin |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (motion prediction, main data) of size NxN , where N is an integer multiple (n bits) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (motion prediction, main data) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US20030156648A1 CLAIM 15 . A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 1 during video encoding (second syntax) . US20030156648A1 CLAIM 20 . The method of claim 19 wherein for at least part of the prediction residual data the selected transform size is different than motion prediction (video blocks, four square partitions) block size . US20030156648A1 CLAIM 29 . A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : evaluating plural available transform sizes for spatial domain data (video blocks, four square partitions) for one or more video frames ; selecting one or more of the plural available transform sizes based at least in part upon results of the evaluating ; and encoding the spatial domain data using transforms of the selected one or more transform sizes . US20030156648A1 CLAIM 37 . In a computer system , a computer-implemented method of processing one or more video frames , the method comprising : switching transform sizes from among plural available transform sizes for data for the one or more video frames , wherein bits (integer multiple) tream syntax allows the switching at plural different bitstream levels for the one or more video frames ; and processing the data for the one or more video frames with one or more transforms of switched transform size . US20030156648A1 CLAIM 46 . A video encoder (second syntax element) comprising : means for switching frequency transform sizes from among plural available frequency transform sizes for prediction residual data for one or more motion-predicted video frames , wherein bitstream syntax allows the switching at plural different bitstream levels for the one or more motion-predicted video frames ; and means for processing the prediction residual data with one or more frequency transforms of switched frequency transform size . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion prediction, main data) . |
US20030156648A1 CLAIM 20 . The method of claim 19 wherein for at least part of the prediction residual data the selected transform size is different than motion prediction (video blocks, four square partitions) block size . US20030156648A1 CLAIM 29 . A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : evaluating plural available transform sizes for spatial domain data (video blocks, four square partitions) for one or more video frames ; selecting one or more of the plural available transform sizes based at least in part upon results of the evaluating ; and encoding the spatial domain data using transforms of the selected one or more transform sizes . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (motion prediction, main data) of size NxN , where N is an integer multiple (n bits) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (motion prediction, main data) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US20030156648A1 CLAIM 15 . A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 1 during video encoding (second syntax) . US20030156648A1 CLAIM 20 . The method of claim 19 wherein for at least part of the prediction residual data the selected transform size is different than motion prediction (video blocks, four square partitions) block size . US20030156648A1 CLAIM 29 . A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : evaluating plural available transform sizes for spatial domain data (video blocks, four square partitions) for one or more video frames ; selecting one or more of the plural available transform sizes based at least in part upon results of the evaluating ; and encoding the spatial domain data using transforms of the selected one or more transform sizes . US20030156648A1 CLAIM 37 . In a computer system , a computer-implemented method of processing one or more video frames , the method comprising : switching transform sizes from among plural available transform sizes for data for the one or more video frames , wherein bits (integer multiple) tream syntax allows the switching at plural different bitstream levels for the one or more video frames ; and processing the data for the one or more video frames with one or more transforms of switched transform size . US20030156648A1 CLAIM 46 . A video encoder (second syntax element) comprising : means for switching frequency transform sizes from among plural available frequency transform sizes for prediction residual data for one or more motion-predicted video frames , wherein bitstream syntax allows the switching at plural different bitstream levels for the one or more motion-predicted video frames ; and means for processing the prediction residual data with one or more frequency transforms of switched frequency transform size . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion prediction, main data) . |
US20030156648A1 CLAIM 20 . The method of claim 19 wherein for at least part of the prediction residual data the selected transform size is different than motion prediction (video blocks, four square partitions) block size . US20030156648A1 CLAIM 29 . A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : evaluating plural available transform sizes for spatial domain data (video blocks, four square partitions) for one or more video frames ; selecting one or more of the plural available transform sizes based at least in part upon results of the evaluating ; and encoding the spatial domain data using transforms of the selected one or more transform sizes . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US7200275B2 Filed: 2002-12-16 Issued: 2007-04-03 Skip macroblock coding (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC Sridhar Srinivasan, Pohsiang Hsu |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US7200275B2 CLAIM 2 . A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 1 during video encoding (second syntax) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US7200275B2 CLAIM 2 . A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 1 during video encoding (second syntax) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US7206456B2 Filed: 2002-11-26 Issued: 2007-04-17 Video coding and decoding (Original Assignee) Nokia Oyj (Current Assignee) Nokia Technologies Oy Miska Hannuksela, Ye-Kui Wang |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (coding elements) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US7206456B2 CLAIM 1 . A method of video encoding (second syntax) comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ; determining at least one shape within a picture ; selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; determining as at least one separate coding object the selected at least one subset of the coding blocks ; determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; encoding the at least one separate coding object ; and encoding as one coding object the background object , wherein the encoding the background coding object comprises the sub-step of defining coding slices in a scan-order so that the slices are composed by consecutive coding blocks skipping those basic coding objects which are included in the at least one separate coding object . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (coding elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
US7206456B2 CLAIM 1 . A method of video encoding comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ; determining at least one shape within a picture ; selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; determining as at least one separate coding object the selected at least one subset of the coding blocks ; determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; encoding the at least one separate coding object ; and encoding as one coding object the background object , wherein the encoding the background coding object comprises the sub-step of defining coding slices in a scan-order so that the slices are composed by consecutive coding blocks skipping those basic coding objects which are included in the at least one separate coding object . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (coding elements) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US7206456B2 CLAIM 1 . A method of video encoding (second syntax) comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ; determining at least one shape within a picture ; selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; determining as at least one separate coding object the selected at least one subset of the coding blocks ; determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; encoding the at least one separate coding object ; and encoding as one coding object the background object , wherein the encoding the background coding object comprises the sub-step of defining coding slices in a scan-order so that the slices are composed by consecutive coding blocks skipping those basic coding objects which are included in the at least one separate coding object . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (coding elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
US7206456B2 CLAIM 1 . A method of video encoding comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ; determining at least one shape within a picture ; selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; determining as at least one separate coding object the selected at least one subset of the coding blocks ; determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; encoding the at least one separate coding object ; and encoding as one coding object the background object , wherein the encoding the background coding object comprises the sub-step of defining coding slices in a scan-order so that the slices are composed by consecutive coding blocks skipping those basic coding objects which are included in the at least one separate coding object . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | JP2002359852A Filed: 2002-03-22 Issued: 2002-12-13 画像予測復号化装置及び方法 (Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社 Sen Bun Chun, Mei Shen Shen, Thiow Keng Tan, メイ・シェン シェン, セン・ブン チュン, ケン・タン ティオ |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size (量子化ステップサイズ) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
JP2002359852A CLAIM 1 【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と (second syntax) 、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size (量子化ステップサイズ) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
JP2002359852A CLAIM 1 【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と (second syntax) 、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | JPH11164305A Filed: 1998-02-25 Issued: 1999-06-18 動画像符号化方法、動画像符号化装置および動画像復号装置 (Original Assignee) Mitsubishi Electric Corp; 三菱電機株式会社 Kotaro Asai, Yoshimi Isu, Shunichi Sekiguchi, 芳美 井須, 光太郎 浅井, 俊一 関口 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (のクラス) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (示す複数) , the minimum size value indicating a size of the smallest partition for the coded unit . |
JPH11164305A CLAIM 8 【請求項8】 前記分割処理部は、 各領域に関し、その領域の画像の特性を示す複数 (minimum size value) の数値 の線形和をその領域のアクティビティとして算出するア クティビティ算出部と、 算出されたアクティビティとあらかじめ設定された基準 値を比較する分割判定部と、 を備え、比較の結果、アクティビティが基準値を上回る 領域がさらに小さな領域へ分割されることを特徴とする 請求項4、5のいずれかに記載の動画像符号化装置。 JPH11164305A CLAIM 12 【請求項12】 前記クラス識別部は、複数の領域にま たがる被写体構造に注目して各領域のクラス (rectangular partitions) を決定する ことを特徴とする請求項11に記載の動画像符号化装 置。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (のクラス) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (示す複数) , the minimum size value indicating a size of a smallest partition for the coded unit . |
JPH11164305A CLAIM 8 【請求項8】 前記分割処理部は、 各領域に関し、その領域の画像の特性を示す複数 (minimum size value) の数値 の線形和をその領域のアクティビティとして算出するア クティビティ算出部と、 算出されたアクティビティとあらかじめ設定された基準 値を比較する分割判定部と、 を備え、比較の結果、アクティビティが基準値を上回る 領域がさらに小さな領域へ分割されることを特徴とする 請求項4、5のいずれかに記載の動画像符号化装置。 JPH11164305A CLAIM 12 【請求項12】 前記クラス識別部は、複数の領域にま たがる被写体構造に注目して各領域のクラス (rectangular partitions) を決定する ことを特徴とする請求項11に記載の動画像符号化装 置。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5748789A Filed: 1996-10-31 Issued: 1998-05-05 Transparent block skipping in object-based video coding systems (Original Assignee) Microsoft Corp (Current Assignee) Microsoft Technology Licensing LLC Ming-Chieh Lee, Wei-ge Chen |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5748789A CLAIM 17 . In an object-based video coding method , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a transformation block partially covered by a first object , evaluating the shape of the first object to determine whether a sub-transformation block in the partially covered transformation block is transparent based on the shape of the first object , and if so , then skipping texture coding for the sub-transformation block and encoding a coded block pattern (block pattern) bit or bits only for non-transparent sub-transformation blocks in the partially covered transformation block ; and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a transformation block partially covered by the first object , evaluating the shape of the first object to determine whether a sub-transformation block in the partially covered transformation block is transparent based on the shape of the first object , and if so , then skipping texture decoding for the sub-transformation block . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (block pattern) ; generating syntax information (video coding) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5748789A CLAIM 1 . In an object-based video coding (generating syntax information) method , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a first object , evaluating the shape of the first object to determine whether a transformation block in the first object is transparent based on the shape for the first object , and if so , then skipping texture coding for the transformation block ; and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a first object , evaluating whether a transformation block in the first object is transparent based on the shape of the first object , and if so , then skipping texture decoding for the transformation block . US5748789A CLAIM 17 . In an object-based video coding method , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a transformation block partially covered by a first object , evaluating the shape of the first object to determine whether a sub-transformation block in the partially covered transformation block is transparent based on the shape of the first object , and if so , then skipping texture coding for the sub-transformation block and encoding a coded block pattern (block pattern) bit or bits only for non-transparent sub-transformation blocks in the partially covered transformation block ; and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a transformation block partially covered by the first object , evaluating the shape of the first object to determine whether a sub-transformation block in the partially covered transformation block is transparent based on the shape of the first object , and if so , then skipping texture decoding for the sub-transformation block . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US6125143A Filed: 1996-10-18 Issued: 2000-09-26 Picture encoding device and method thereof, picture decoding device and method thereof, and recording medium (Original Assignee) Sony Corp (Current Assignee) Ironworks Patents LLC Teruhiko Suzuki, Masami Ogata, Tak Yen Tong |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (mean value) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (mean value) value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US6125143A CLAIM 2 . A picture encoding device according to claim 1 , further comprising transform means for performing prescribed transform processing on said picture data using a plurality of said encoding block units and outputting transform coefficients (square partitions) , wherein said encoding means encodes said transform coefficients outputted by said transform means using said encoding block units . US6125143A CLAIM 21 . A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ; and mean value (maximum size, minimum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (mean value) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size (mean value) value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US6125143A CLAIM 2 . A picture encoding device according to claim 1 , further comprising transform means for performing prescribed transform processing on said picture data using a plurality of said encoding block units and outputting transform coefficients (square partitions) , wherein said encoding means encodes said transform coefficients outputted by said transform means using said encoding block units . US6125143A CLAIM 21 . A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ; and mean value (maximum size, minimum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5691770A Filed: 1996-04-24 Issued: 1997-11-25 Device and method for coding video pictures (Original Assignee) US Philips Corp (Current Assignee) US Philips Corp Gerrit J. Keesman, Willem M. J. Coene, Eduard W. Salomons |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (spatial frequency) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (same amplitude) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5691770A CLAIM 4 . A device as claimed in claim 1 , wherein the amplitude distribution is obtained by counting , for each spatial frequency (integer multiple) (i) , the number of times (h i , n) when a coefficient indicative of said spatial frequency assumes the same amplitude (rectangular partitions) (n) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple (spatial frequency) of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions (same amplitude) or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5691770A CLAIM 4 . A device as claimed in claim 1 , wherein the amplitude distribution is obtained by counting , for each spatial frequency (integer multiple) (i) , the number of times (h i , n) when a coefficient indicative of said spatial frequency assumes the same amplitude (rectangular partitions) (n) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | EP0727910A2 Filed: 1996-02-05 Issued: 1996-08-21 Temporal-spatial error concealment apparatus and method for video signal processors (Original Assignee) Thomson Multimedia SA (Current Assignee) Technicolor SA Max Chien, Wilson Kwok, Huifang Sun |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (bad block) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
EP0727910A2 CLAIM 14 A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (block pattern) ) , characterized by ; providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ; determining a dominant image gradient in blocks of pixels adjacent a bad block ; spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ; temporally predicting a substitute block of data for said bad block in response to said error signal ; converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ; converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ; forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ; inverse transforming said further block of coefficients to produce a substitute block of pixel data . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (bad block) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
EP0727910A2 CLAIM 14 A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (block pattern) ) , characterized by ; providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ; determining a dominant image gradient in blocks of pixels adjacent a bad block ; spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ; temporally predicting a substitute block of data for said bad block in response to said error signal ; converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ; converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ; forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ; inverse transforming said further block of coefficients to produce a substitute block of pixel data . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5576767A Filed: 1995-09-21 Issued: 1996-11-19 Interframe video encoding and decoding system (Original Assignee) Qualcomm Inc (Current Assignee) Qualcomm Inc Chong U. Lee, Donald Pian |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (additional reference) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5576767A CLAIM 4 . The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference (block pattern) blocks of data . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (additional reference) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5576767A CLAIM 4 . The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference (block pattern) blocks of data . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5768434A Filed: 1995-06-07 Issued: 1998-06-16 Quadtree-structured walsh transform coding (Original Assignee) National Semiconductor Corp (Current Assignee) National Semiconductor Corp Xiaonong Ran |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (second partition, first partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (two blocks) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (predetermined threshold value, absolute values) , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5768434A CLAIM 11 . The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ; I< ; M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , N and M are integers with N< ; M , and for each integer J with N< ; J< ; M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ; each block in the partition corresponding to the integer M consists of x 2 values , x being a power of two ; each block in the partition corresponding to an integer M-1 consists of 4*x 2 values ; and partitioning the two-dimensional array to form the quadtree partition , further comprises the steps of : performing a Walsh transform on each block of the partition corresponding to the integer M to derive y Walsh transform coefficients , where y is an integer less than or equal to x 2 ; determining an error for each block in the partition corresponding to the integer M-1 , wherein the error depends on a first set of 4*x 2 values generated by an inverse Walsh transform of a 2x-by-2x matrix having at most y non-zero coefficients , and a second set of 4*x 2 values generated by inverse Walsh transforms of four x-by-x matrices , each of the four x-by-x matrices containing the y Walsh transform coefficients derived for a block which is in the partition corresponding to M and is a subset of the block for which the error is being determined ; and including in the quadtree partition four blocks from the partition corresponding to the integer M , wherein a union of the four blocks equals a block having an error greater than a predetermined threshold value (minimum size value) . US5768434A CLAIM 14 . The method of claim 12 , wherein step (c) further comprises determining a sum of absolute values (minimum size value) of differences between values in the block of the selected partition and corresponding values generated by the inverse Walsh transform of the matrix having no more than x 2 non-zero coefficients . US5768434A CLAIM 16 . The method of claim 6 , wherein the step of generating signals which identify the blocks in the quadtree partition comprises the sequential steps of : providing a signal for each block which is in a first partition (square partitions) from a series of uniform partitions , wherein the signal for a block has a first value if the block is in the quadtree partition and has a second value otherwise ; and for each block in a second partition (square partitions) from the series of uniform partitions : providing a signal having the first value if the block in the quadtree partition ; and providing a signal having the second value if the block is not in the quadtree partition and is not a subset of a block in the quadtree partition . US5768434A CLAIM 17 . The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ; I< ; M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ; M , and wherein for each integer J with N< ; J< ; M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ; and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks , wherein no two blocks (block pattern) in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ; and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (second partition, first partition) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (two blocks) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value (predetermined threshold value, absolute values) , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5768434A CLAIM 11 . The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ; I< ; M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , N and M are integers with N< ; M , and for each integer J with N< ; J< ; M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ; each block in the partition corresponding to the integer M consists of x 2 values , x being a power of two ; each block in the partition corresponding to an integer M-1 consists of 4*x 2 values ; and partitioning the two-dimensional array to form the quadtree partition , further comprises the steps of : performing a Walsh transform on each block of the partition corresponding to the integer M to derive y Walsh transform coefficients , where y is an integer less than or equal to x 2 ; determining an error for each block in the partition corresponding to the integer M-1 , wherein the error depends on a first set of 4*x 2 values generated by an inverse Walsh transform of a 2x-by-2x matrix having at most y non-zero coefficients , and a second set of 4*x 2 values generated by inverse Walsh transforms of four x-by-x matrices , each of the four x-by-x matrices containing the y Walsh transform coefficients derived for a block which is in the partition corresponding to M and is a subset of the block for which the error is being determined ; and including in the quadtree partition four blocks from the partition corresponding to the integer M , wherein a union of the four blocks equals a block having an error greater than a predetermined threshold value (minimum size value) . US5768434A CLAIM 14 . The method of claim 12 , wherein step (c) further comprises determining a sum of absolute values (minimum size value) of differences between values in the block of the selected partition and corresponding values generated by the inverse Walsh transform of the matrix having no more than x 2 non-zero coefficients . US5768434A CLAIM 16 . The method of claim 6 , wherein the step of generating signals which identify the blocks in the quadtree partition comprises the sequential steps of : providing a signal for each block which is in a first partition (square partitions) from a series of uniform partitions , wherein the signal for a block has a first value if the block is in the quadtree partition and has a second value otherwise ; and for each block in a second partition (square partitions) from the series of uniform partitions : providing a signal having the first value if the block in the quadtree partition ; and providing a signal having the second value if the block is not in the quadtree partition and is not a subset of a block in the quadtree partition . US5768434A CLAIM 17 . The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ; I< ; M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ; M , and wherein for each integer J with N< ; J< ; M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ; and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks , wherein no two blocks (block pattern) in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ; and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5666461A Filed: 1995-05-30 Issued: 1997-09-09 High efficiency encoding and decoding of picture signals and recording medium containing same (Original Assignee) Sony Corp (Current Assignee) Sony Corp Katsuji Igarashi, Jun Yonemitsu, Yoichi Yagasaki, Yasushi Fujinami, Tomoyuki Sato, Motoki Kato, Teruhiko Suzuki |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (picture portion) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5666461A CLAIM 1 . A recording medium having data recorded thereon which represents : (i) an encoded picture signal having a plurality of macroblocks and including picture portion (video blocks) s having been frame-based and/or field-based type predictively encoded and frame-based and/or field-based type orthogonally transformed processed in which each of said picture portions includes at least one of said macroblocks and in which said encoded picture signal represents a sequence of pictures such that said sequence of pictures provides a substantially continuous video image when reproduced from said recording medium , and (ii) header information which includes (a) information identifying the type of predictive encoding utilized for each respective picture portion , (b) information identifying the type of orthogonal transformation processing utilized for each respective picture portion , and (c) macroblock address information indicating the respective location of each macroblock , in which the type of predictive encoding utilized for each respective picture portion is selected from among the frame-based type predictive encoding and field-based type predictive encoding and in which the type of orthogonal transformation processing is selected from among the frame-based type orthogonal transformation processing and field-based orthogonal transformation processing so as to provide a relatively high efficient encoding . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (picture portion) . |
US5666461A CLAIM 1 . A recording medium having data recorded thereon which represents : (i) an encoded picture signal having a plurality of macroblocks and including picture portion (video blocks) s having been frame-based and/or field-based type predictively encoded and frame-based and/or field-based type orthogonally transformed processed in which each of said picture portions includes at least one of said macroblocks and in which said encoded picture signal represents a sequence of pictures such that said sequence of pictures provides a substantially continuous video image when reproduced from said recording medium , and (ii) header information which includes (a) information identifying the type of predictive encoding utilized for each respective picture portion , (b) information identifying the type of orthogonal transformation processing utilized for each respective picture portion , and (c) macroblock address information indicating the respective location of each macroblock , in which the type of predictive encoding utilized for each respective picture portion is selected from among the frame-based type predictive encoding and field-based type predictive encoding and in which the type of orthogonal transformation processing is selected from among the frame-based type orthogonal transformation processing and field-based orthogonal transformation processing so as to provide a relatively high efficient encoding . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (picture portion) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5666461A CLAIM 1 . A recording medium having data recorded thereon which represents : (i) an encoded picture signal having a plurality of macroblocks and including picture portion (video blocks) s having been frame-based and/or field-based type predictively encoded and frame-based and/or field-based type orthogonally transformed processed in which each of said picture portions includes at least one of said macroblocks and in which said encoded picture signal represents a sequence of pictures such that said sequence of pictures provides a substantially continuous video image when reproduced from said recording medium , and (ii) header information which includes (a) information identifying the type of predictive encoding utilized for each respective picture portion , (b) information identifying the type of orthogonal transformation processing utilized for each respective picture portion , and (c) macroblock address information indicating the respective location of each macroblock , in which the type of predictive encoding utilized for each respective picture portion is selected from among the frame-based type predictive encoding and field-based type predictive encoding and in which the type of orthogonal transformation processing is selected from among the frame-based type orthogonal transformation processing and field-based orthogonal transformation processing so as to provide a relatively high efficient encoding . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (picture portion) . |
US5666461A CLAIM 1 . A recording medium having data recorded thereon which represents : (i) an encoded picture signal having a plurality of macroblocks and including picture portion (video blocks) s having been frame-based and/or field-based type predictively encoded and frame-based and/or field-based type orthogonally transformed processed in which each of said picture portions includes at least one of said macroblocks and in which said encoded picture signal represents a sequence of pictures such that said sequence of pictures provides a substantially continuous video image when reproduced from said recording medium , and (ii) header information which includes (a) information identifying the type of predictive encoding utilized for each respective picture portion , (b) information identifying the type of orthogonal transformation processing utilized for each respective picture portion , and (c) macroblock address information indicating the respective location of each macroblock , in which the type of predictive encoding utilized for each respective picture portion is selected from among the frame-based type predictive encoding and field-based type predictive encoding and in which the type of orthogonal transformation processing is selected from among the frame-based type orthogonal transformation processing and field-based orthogonal transformation processing so as to provide a relatively high efficient encoding . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5793897A Filed: 1994-12-16 Issued: 1998-08-11 Adaptive variable-length coding and decoding methods for image data (Original Assignee) Samsung Electronics Co Ltd (Current Assignee) Samsung Electronics Co Ltd Jae-moon Jo, Je-chang Jeong |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (length coding table) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5793897A CLAIM 1 . An adaptive variable-length coding method whereby quantized orthogonal transform coefficients (square partitions) are scanned in a zigzag pattern , are modified into run , level data and then are variable-length coded in a coding system for image data , said method comprising the steps of : setting a plurality of variable-length coding table (block pattern) s having different patterns of a regular region and an escape region according to statistical characteristics of said run , level data ; selecting one of said plurality of variable-length coding tables according to intra/inter mode information of the currently processed block , zigzag scanning position and quantization step size ; and variable-length coding the orthogonal transform coefficients according to said selected variable-length coding table , wherein said selecting step has the selecting range of a plurality of variable-length coding tables having different patterns of a regular region and an escape region according to said intra/inter mode information of the currently Processed block . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (length coding table) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5793897A CLAIM 1 . An adaptive variable-length coding method whereby quantized orthogonal transform coefficients (square partitions) are scanned in a zigzag pattern , are modified into run , level data and then are variable-length coded in a coding system for image data , said method comprising the steps of : setting a plurality of variable-length coding table (block pattern) s having different patterns of a regular region and an escape region according to statistical characteristics of said run , level data ; selecting one of said plurality of variable-length coding tables according to intra/inter mode information of the currently processed block , zigzag scanning position and quantization step size ; and variable-length coding the orthogonal transform coefficients according to said selected variable-length coding table , wherein said selecting step has the selecting range of a plurality of variable-length coding tables having different patterns of a regular region and an escape region according to said intra/inter mode information of the currently Processed block . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5623311A Filed: 1994-10-28 Issued: 1997-04-22 MPEG video decoder having a high bandwidth memory (Original Assignee) Panasonic Corp of North America (Current Assignee) Panasonic Corp of North America Larry Phillips, Shuji Inoue, Edwin R. Meyer |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (chrominance values) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (luminance component) representing a minimum size value (chrominance values) , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5623311A CLAIM 1 . Apparatus for decoding a data stream including a plurality of variable length encoded data values representing an image where the data stream represents video signals having luminance component (second syntax element) s and chrominance components , the apparatus comprising : a memory for storing image data where the memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation to the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow image data representing corresponding chrominance and luminance components to be accessed concurrently using the first and second channels , respectively , and to allow image data representing first and second contiguous luminance components to be accessed concurrently using the first and second channels , respectively ; means for receiving the data stream and for storing the data stream into a buffer area of the memory via the single memory port ; decoding means including : (a) means for fetching the variable length encoded data stream from the memory via the single memory port ; (b) means for processing the fetched variable length encoded data stream to produce first decoded values ; (c) means for fetching reference image data from the memory via the single memory port ; (d) means for combining the fetched reference image data with the first decoded values to produce second decoded values ; and (e) means for storing the second decoded values in the memory via the single memory port ; and means for fetching image data from the memory , via the single memory port , for display . US5623311A CLAIM 2 . Apparatus for decoding a data stream including a plurality of variable length encoded data values representing an image where the data stream represents video signals having luminance components and chrominance components , the apparatus comprising : a memory for storing image data where the memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation to the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow components of the image data to be accessed concurrently using the first and second channels ; means for receiving the data stream and for storing the data stream into a buffer area of the memory via the single memory port ; decoding means including : (a) means for fetching the variable length encoded data stream from the memory via the single memory port ; (b) means for (1) processing the fetched variable length encoded data stream to produce first decoded values and (2) generating luminance values and corresponding chrominance values (four square partitions, minimum size value) as the first decoded values ; (c) means for fetching reference image data from the memory via the single memory port ; (d) means for combining the fetched reference image data with the first decoded values to produce second decoded values ; and (e) means (1) for storing the second decoded values in the memory via the single memory port and (2) for storing luminance components of the second decoded values using one of the first and second channels and storing corresponding chrominance components of the second decoded values using the other one of the first and second channels ; and means (1) for fetching image data from the memory , via the single memory port , for display and (2) for fetching luminance components of the image data using one of the first and second channels and for fetching the corresponding chrominance components of the image data using the other one of the first and second channels . US5623311A CLAIM 5 . A circuit for decoding video signals encoded according to the MPEG-2 standard , consisting essentially of : an image memory for storing image data where the image memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation which uses the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow respectively different components of the image data to be accessed concurrently using the first and second channels ; a decoding processor including : a variable length decoder coupled to the image memory via the single memory port ; an inverse quantization processor ; an inverse discrete cosine transform (video blocks) processor ; and a motion compensation processor , coupled to the image memory via the single memory port ; and a block to raster converter , coupled to the image memory via the single memory port . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform) . |
US5623311A CLAIM 5 . A circuit for decoding video signals encoded according to the MPEG-2 standard , consisting essentially of : an image memory for storing image data where the image memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation which uses the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow respectively different components of the image data to be accessed concurrently using the first and second channels ; a decoding processor including : a variable length decoder coupled to the image memory via the single memory port ; an inverse quantization processor ; an inverse discrete cosine transform (video blocks) processor ; and a motion compensation processor , coupled to the image memory via the single memory port ; and a block to raster converter , coupled to the image memory via the single memory port . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (inverse discrete cosine transform) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions (chrominance values) , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element (luminance component) representing a minimum size value (chrominance values) , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5623311A CLAIM 1 . Apparatus for decoding a data stream including a plurality of variable length encoded data values representing an image where the data stream represents video signals having luminance component (second syntax element) s and chrominance components , the apparatus comprising : a memory for storing image data where the memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation to the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow image data representing corresponding chrominance and luminance components to be accessed concurrently using the first and second channels , respectively , and to allow image data representing first and second contiguous luminance components to be accessed concurrently using the first and second channels , respectively ; means for receiving the data stream and for storing the data stream into a buffer area of the memory via the single memory port ; decoding means including : (a) means for fetching the variable length encoded data stream from the memory via the single memory port ; (b) means for processing the fetched variable length encoded data stream to produce first decoded values ; (c) means for fetching reference image data from the memory via the single memory port ; (d) means for combining the fetched reference image data with the first decoded values to produce second decoded values ; and (e) means for storing the second decoded values in the memory via the single memory port ; and means for fetching image data from the memory , via the single memory port , for display . US5623311A CLAIM 2 . Apparatus for decoding a data stream including a plurality of variable length encoded data values representing an image where the data stream represents video signals having luminance components and chrominance components , the apparatus comprising : a memory for storing image data where the memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation to the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow components of the image data to be accessed concurrently using the first and second channels ; means for receiving the data stream and for storing the data stream into a buffer area of the memory via the single memory port ; decoding means including : (a) means for fetching the variable length encoded data stream from the memory via the single memory port ; (b) means for (1) processing the fetched variable length encoded data stream to produce first decoded values and (2) generating luminance values and corresponding chrominance values (four square partitions, minimum size value) as the first decoded values ; (c) means for fetching reference image data from the memory via the single memory port ; (d) means for combining the fetched reference image data with the first decoded values to produce second decoded values ; and (e) means (1) for storing the second decoded values in the memory via the single memory port and (2) for storing luminance components of the second decoded values using one of the first and second channels and storing corresponding chrominance components of the second decoded values using the other one of the first and second channels ; and means (1) for fetching image data from the memory , via the single memory port , for display and (2) for fetching luminance components of the image data using one of the first and second channels and for fetching the corresponding chrominance components of the image data using the other one of the first and second channels . US5623311A CLAIM 5 . A circuit for decoding video signals encoded according to the MPEG-2 standard , consisting essentially of : an image memory for storing image data where the image memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation which uses the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow respectively different components of the image data to be accessed concurrently using the first and second channels ; a decoding processor including : a variable length decoder coupled to the image memory via the single memory port ; an inverse quantization processor ; an inverse discrete cosine transform (video blocks) processor ; and a motion compensation processor , coupled to the image memory via the single memory port ; and a block to raster converter , coupled to the image memory via the single memory port . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (inverse discrete cosine transform) . |
US5623311A CLAIM 5 . A circuit for decoding video signals encoded according to the MPEG-2 standard , consisting essentially of : an image memory for storing image data where the image memory has a single memory port which includes first and second channels , arranged in parallel , wherein each memory operation which uses the single memory port concurrently transfers data via the first and second channels and the image data is arranged in the memory to allow respectively different components of the image data to be accessed concurrently using the first and second channels ; a decoding processor including : a variable length decoder coupled to the image memory via the single memory port ; an inverse quantization processor ; an inverse discrete cosine transform (video blocks) processor ; and a motion compensation processor , coupled to the image memory via the single memory port ; and a block to raster converter , coupled to the image memory via the single memory port . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5594504A Filed: 1994-07-06 Issued: 1997-01-14 Predictive video coding using a motion vector updating routine (Original Assignee) Nokia of America Corp (Current Assignee) Nokia of America Corp Touradj Ebrahimi |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (motion estimation unit) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (value z) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5594504A CLAIM 3 . The method of claim 2 wherein the initial motion vector assigned to each smaller region in the first segmentation level has the value z (square partitions, four square partitions) ero . US5594504A CLAIM 4 . The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ; (b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ; and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level . US5594504A CLAIM 19 . A system for performing motion field estimation and coding of a video image , said system comprising : a motion estimation unit (video blocks) for refining a motion vector assigned to a block of pixels in a current frame by assigning to the block of pixels a best motion vector selected from among a plurality of candidate motion vectors , wherein said plurality of candidate motion vectors includes an initial motion vector assigned to said block of pixels , a matched motion vector obtained by performing a block matching technique for said block of pixels , and motion vectors of regions neighboring said block of pixels , said motion estimation unit further computing a smallest matching error from among the plurality of candidate vectors and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions ; a merge and label unit for merging adjacent regions of the current frame that have similar motion vectors to form merged regions and for assigning to each pixel in the current frame a region label ; a first region label memory unit for storing a motion vector associated with each merged region ; a second region label memory unit for storing the region label assigned to each pixel in the current frame ; a region shape representation unit for forming a set of contour pixels that defines the merged regions ; a contour coding unit for encoding the set of contour pixels ; and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the system . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion estimation unit) . |
US5594504A CLAIM 19 . A system for performing motion field estimation and coding of a video image , said system comprising : a motion estimation unit (video blocks) for refining a motion vector assigned to a block of pixels in a current frame by assigning to the block of pixels a best motion vector selected from among a plurality of candidate motion vectors , wherein said plurality of candidate motion vectors includes an initial motion vector assigned to said block of pixels , a matched motion vector obtained by performing a block matching technique for said block of pixels , and motion vectors of regions neighboring said block of pixels , said motion estimation unit further computing a smallest matching error from among the plurality of candidate vectors and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions ; a merge and label unit for merging adjacent regions of the current frame that have similar motion vectors to form merged regions and for assigning to each pixel in the current frame a region label ; a first region label memory unit for storing a motion vector associated with each merged region ; a second region label memory unit for storing the region label assigned to each pixel in the current frame ; a region shape representation unit for forming a set of contour pixels that defines the merged regions ; a contour coding unit for encoding the set of contour pixels ; and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the system . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (motion estimation unit) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (value z) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (predetermined shape) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5594504A CLAIM 3 . The method of claim 2 wherein the initial motion vector assigned to each smaller region in the first segmentation level has the value z (square partitions, four square partitions) ero . US5594504A CLAIM 4 . The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ; (b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ; and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level . US5594504A CLAIM 19 . A system for performing motion field estimation and coding of a video image , said system comprising : a motion estimation unit (video blocks) for refining a motion vector assigned to a block of pixels in a current frame by assigning to the block of pixels a best motion vector selected from among a plurality of candidate motion vectors , wherein said plurality of candidate motion vectors includes an initial motion vector assigned to said block of pixels , a matched motion vector obtained by performing a block matching technique for said block of pixels , and motion vectors of regions neighboring said block of pixels , said motion estimation unit further computing a smallest matching error from among the plurality of candidate vectors and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions ; a merge and label unit for merging adjacent regions of the current frame that have similar motion vectors to form merged regions and for assigning to each pixel in the current frame a region label ; a first region label memory unit for storing a motion vector associated with each merged region ; a second region label memory unit for storing the region label assigned to each pixel in the current frame ; a region shape representation unit for forming a set of contour pixels that defines the merged regions ; a contour coding unit for encoding the set of contour pixels ; and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the system . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (motion estimation unit) . |
US5594504A CLAIM 19 . A system for performing motion field estimation and coding of a video image , said system comprising : a motion estimation unit (video blocks) for refining a motion vector assigned to a block of pixels in a current frame by assigning to the block of pixels a best motion vector selected from among a plurality of candidate motion vectors , wherein said plurality of candidate motion vectors includes an initial motion vector assigned to said block of pixels , a matched motion vector obtained by performing a block matching technique for said block of pixels , and motion vectors of regions neighboring said block of pixels , said motion estimation unit further computing a smallest matching error from among the plurality of candidate vectors and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions ; a merge and label unit for merging adjacent regions of the current frame that have similar motion vectors to form merged regions and for assigning to each pixel in the current frame a region label ; a first region label memory unit for storing a motion vector associated with each merged region ; a second region label memory unit for storing the region label assigned to each pixel in the current frame ; a region shape representation unit for forming a set of contour pixels that defines the merged regions ; a contour coding unit for encoding the set of contour pixels ; and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the system . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5422676A Filed: 1993-10-22 Issued: 1995-06-06 System for coding an image representative signal (Original Assignee) Deutsche Thomson Brandt GmbH Carsten Herpel, Heinz-Werner Keesen |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second blocks) ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5422676A CLAIM 1 . A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ; for static images , transforming interlaced scanned pixel blocks with an original line structure ; and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ; and (b) transforming said first and second blocks (block pattern) . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern (second blocks) ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5422676A CLAIM 1 . A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ; for static images , transforming interlaced scanned pixel blocks with an original line structure ; and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ; and (b) transforming said first and second blocks (block pattern) . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | US5091782A Filed: 1990-04-09 Issued: 1992-02-25 Apparatus and method for adaptively compressing successive blocks of digital video (Original Assignee) Arris Technology Inc (Current Assignee) CIF LICENSING LLC Edward A. Krause, Woo H. Paik |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (transforming means) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
US5091782A CLAIM 8 . Apparatus in accordance with claim 1 wherein : said first and second compressing means produce and quantize respective first and second arrays of transform coefficients (square partitions) for the set of pixel data ; and said error evaluating means determines the error between the quantized transform coefficients and the unquantized transform coefficients of each array . US5091782A CLAIM 25 . Apparatus in accordance with claim 24 wherein the inverse transforming means (video blocks) is shared by said first and second means . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (transforming means) . |
US5091782A CLAIM 25 . Apparatus in accordance with claim 24 wherein the inverse transforming means (video blocks) is shared by said first and second means . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (transforming means) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions (transform coefficients) can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
US5091782A CLAIM 8 . Apparatus in accordance with claim 1 wherein : said first and second compressing means produce and quantize respective first and second arrays of transform coefficients (square partitions) for the set of pixel data ; and said error evaluating means determines the error between the quantized transform coefficients and the unquantized transform coefficients of each array . US5091782A CLAIM 25 . Apparatus in accordance with claim 24 wherein the inverse transforming means (video blocks) is shared by said first and second means . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (transforming means) . |
US5091782A CLAIM 25 . Apparatus in accordance with claim 24 wherein the inverse transforming means (video blocks) is shared by said first and second means . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | EP1623577A1 Filed: 2004-05-04 Issued: 2006-02-08 Encoding of video information using block based adaptive scan order (Original Assignee) Koninklijke Philips NV (Current Assignee) Koninklijke Philips NV Lambertus A. Van Eggelen |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks (preceding image) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (corresponding data blocks) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
EP1623577A1 CLAIM 2 . A method according to Claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ; spatial scaling aspect ratio of one or more image frames present in the video information ; - pulldown material being present in the data of one or more of the image frames ; one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ; a degree of temporal motion occurring in a series of the image frames ; and statistical data relating to earlier selected scanning routes and their associated data compression performance . EP1623577A1 CLAIM 3 . A method according to Claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (preceding image) . |
EP1623577A1 CLAIM 2 . A method according to Claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ; spatial scaling aspect ratio of one or more image frames present in the video information ; - pulldown material being present in the data of one or more of the image frames ; one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ; a degree of temporal motion occurring in a series of the image frames ; and statistical data relating to earlier selected scanning routes and their associated data compression performance . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks (preceding image) of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (corresponding data blocks) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
EP1623577A1 CLAIM 2 . A method according to Claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ; spatial scaling aspect ratio of one or more image frames present in the video information ; - pulldown material being present in the data of one or more of the image frames ; one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ; a degree of temporal motion occurring in a series of the image frames ; and statistical data relating to earlier selected scanning routes and their associated data compression performance . EP1623577A1 CLAIM 3 . A method according to Claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) . |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information comprises a fixed-length code corresponding to the size of the plurality of video blocks (preceding image) . |
EP1623577A1 CLAIM 2 . A method according to Claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ; spatial scaling aspect ratio of one or more image frames present in the video information ; - pulldown material being present in the data of one or more of the image frames ; one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ; a degree of temporal motion occurring in a series of the image frames ; and statistical data relating to earlier selected scanning routes and their associated data compression performance . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | JP3910594B2 Filed: 2004-02-20 Issued: 2007-04-25 画像符号化装置 (Original Assignee) 三菱電機株式会社 芳美 井須, 光太郎 浅井, 博文 西川, 由里 長谷川, 俊一 関口, 慎一 黒田 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (ビットストリーム) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
JP3910594B2 CLAIM 1 画像信号を H.263符号化方式 で符号化し 、H.263符号化方式の 第1のヘッダ情報を有する第1の符号化ビットストリーム (syntax information) を生成する符号化手段と (second syntax) 、 上記第1のヘッダ情報の前に、 MPEG4符号化方式 で符号化された第2の符号化ビットストリームと互換性を確保するための 第2の ヘッダ情報を多重化するヘッダ情報多重化手段とを備え、 上記ヘッダ情報多重化手段は、互換性を確保するための 上記第2の ヘッダ情報として、 MPEG4符号化方式 のビデオ・オブジェクト・スタート・コードと MPEG4符号化方式 のビデオ・オブジェクト識別番号含む 上記第2の ヘッダ情報を多重化することを特徴とする画像符号化装置。 |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (ビットストリーム) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
JP3910594B2 CLAIM 1 画像信号を H.263符号化方式 で符号化し 、H.263符号化方式の 第1のヘッダ情報を有する第1の符号化ビットストリーム (syntax information) を生成する符号化手段と、 上記第1のヘッダ情報の前に、 MPEG4符号化方式 で符号化された第2の符号化ビットストリームと互換性を確保するための 第2の ヘッダ情報を多重化するヘッダ情報多重化手段とを備え、 上記ヘッダ情報多重化手段は、互換性を確保するための 上記第2の ヘッダ情報として、 MPEG4符号化方式 のビデオ・オブジェクト・スタート・コードと MPEG4符号化方式 のビデオ・オブジェクト識別番号含む 上記第2の ヘッダ情報を多重化することを特徴とする画像符号化装置。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (ビットストリーム) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (手段と) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
JP3910594B2 CLAIM 1 画像信号を H.263符号化方式 で符号化し 、H.263符号化方式の 第1のヘッダ情報を有する第1の符号化ビットストリーム (syntax information) を生成する符号化手段と (second syntax) 、 上記第1のヘッダ情報の前に、 MPEG4符号化方式 で符号化された第2の符号化ビットストリームと互換性を確保するための 第2の ヘッダ情報を多重化するヘッダ情報多重化手段とを備え、 上記ヘッダ情報多重化手段は、互換性を確保するための 上記第2の ヘッダ情報として、 MPEG4符号化方式 のビデオ・オブジェクト・スタート・コードと MPEG4符号化方式 のビデオ・オブジェクト識別番号含む 上記第2の ヘッダ情報を多重化することを特徴とする画像符号化装置。 |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (ビットストリーム) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
JP3910594B2 CLAIM 1 画像信号を H.263符号化方式 で符号化し 、H.263符号化方式の 第1のヘッダ情報を有する第1の符号化ビットストリーム (syntax information) を生成する符号化手段と、 上記第1のヘッダ情報の前に、 MPEG4符号化方式 で符号化された第2の符号化ビットストリームと互換性を確保するための 第2の ヘッダ情報を多重化するヘッダ情報多重化手段とを備え、 上記ヘッダ情報多重化手段は、互換性を確保するための 上記第2の ヘッダ情報として、 MPEG4符号化方式 のビデオ・オブジェクト・スタート・コードと MPEG4符号化方式 のビデオ・オブジェクト識別番号含む 上記第2の ヘッダ情報を多重化することを特徴とする画像符号化装置。 |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | GB2382940A Filed: 2001-11-27 Issued: 2003-06-11 Encoding objects and background blocks (Original Assignee) Nokia Oyj (Current Assignee) Nokia Oyj Miska Hannuksela, Ye-Kui Wang |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (coding elements) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
GB2382940A CLAIM 1 . A method of video encoding (second syntax) comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a 5 predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ; determining at least one shape within a picture ; selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; 10 determining as at least one separate coding object the selected at least one subset of the coding blocks ; determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; 15 encoding the at least one separate coding object ; and encoding as one coding object the background object . GB2382940A CLAIM 13 . A video encoder (second syntax element) comprising : means for dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding 5 elements ; means for determining at least one shape within a picture ; means for selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; means for determining as at least one separate coding object the selected 10 at least one subset of the coding blocks ; means for determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; means for encoding the at least one separate coding object ; and 15 means for encoding as one coding object the background object . |
EP2347591B1 CLAIM 2 The apparatus of claim 1 , wherein the syntax information (coding elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
GB2382940A CLAIM 1 . A method of video encoding comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a 5 predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ; determining at least one shape within a picture ; selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; 10 determining as at least one separate coding object the selected at least one subset of the coding blocks ; determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; 15 encoding the at least one separate coding object ; and encoding as one coding object the background object . |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information (coding elements) for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax (video encoding) element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
GB2382940A CLAIM 1 . A method of video encoding (second syntax) comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a 5 predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ; determining at least one shape within a picture ; selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; 10 determining as at least one separate coding object the selected at least one subset of the coding blocks ; determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; 15 encoding the at least one separate coding object ; and encoding as one coding object the background object . GB2382940A CLAIM 13 . A video encoder (second syntax element) comprising : means for dividing a picture into a set of regular shaped coding blocks having a predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding 5 elements ; means for determining at least one shape within a picture ; means for selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; means for determining as at least one separate coding object the selected 10 at least one subset of the coding blocks ; means for determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; means for encoding the at least one separate coding object ; and 15 means for encoding as one coding object the background object . GB2382940A CLAIM 17 . A video signal comprising : a plurality of compressed video coding (generating syntax information) blocks corresponding to at least one 5 separate coding object corresponding to a part of a video encoded picture , the part of the video encoded picture having a variable size and position ; an identifier for each of the at least one separate coding object ; position and size information of the at least one separate coding object ; and 10 a plurality of compressed video coding blocks corresponding to a background object that corresponds to a set of the coding blocks that corresponds to the video encoded picture excluding the at least one separate coding object . cat . ' ; . 5 . , : - a . L |
EP2347591B1 CLAIM 5 The method of claim 4 , wherein the syntax information (coding elements) comprises a fixed-length code corresponding to the size of the plurality of video blocks . |
GB2382940A CLAIM 1 . A method of video encoding comprising the steps of : dividing a picture into a set of regular shaped coding blocks having a 5 predetermined alignment in relation to the area of the picture , each coding block corresponding to at least one group of elementary coding elements (syntax information) ; determining at least one shape within a picture ; selecting at least one subset of the coding blocks defining at least one area covering the at least one determined shape ; 10 determining as at least one separate coding object the selected at least one subset of the coding blocks ; determining as a background object the subset of the coding blocks that corresponds to the part of the picture that excludes the at least one separate coding object ; 15 encoding the at least one separate coding object ; and encoding as one coding object the background object . |
EP2347591B1 Filed: 2008-10-03 Issued: 2020-04-08 Video coding with large macroblocks (Original Assignee) Velos Media International Ltd (Current Assignee) Velos Media International Ltd Peisong Chen, Yan Ye, Marta Karczewicz | JPH082106B2 Filed: 1986-11-10 Issued: 1996-01-10 動画像信号のハイブリツド符号化方式 (Original Assignee) 国際電信電話株式会社; 富士通株式会社; 日本電信電話株式会社; 日本電気株式会社 洋一 加藤, 睦 太田, 康宏 小杉, 好律 羽鳥 |
---|---|
EP2347591B1 CLAIM 1 An apparatus comprising : a memory configured to store a coded unit , and a processor , in communication with the memory , configured to : encode the coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generate syntax information (変換部) for the coded unit wherein the syntax information includes : a first syntax element representing a maximum size (ブロックサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of the smallest partition for the coded unit . |
JPH082106B2 CLAIM 2 【請求項2】前記有効/無効判定部は、有効/無効の判 定を直交変換を行なうブロックサイズ (maximum size, maximum size value) より小さいブロッ クサイズのサブブロックに分割し、サブブロック単位で 評価量Smを計算して閾値T 1 と比較し、当該ブロック中に 1つでも有効なサブブロックがある場合には当該ブロッ クを有効ブロック,その他の場合には無効ブロックと判 定することを特徴とする特許請求の範囲第1項記載の動 画像信号のハイブリッド符号化方式。 JPH082106B2 CLAIM 6 【請求項6】前記有効/無効判定部は有効サブブロック が当該ブロック内の1個或いは , 2個である時に後続の直 交変換部 (generate syntax information) において1/2M×1/2N , 1/2M×N又はM×1/2Nの 直交変換を行なうように指示情報を送出することを特徴 とする特許請求の範囲第4項記載の動画像信号のハイブ リッド符号化方式。 |
EP2347591B1 CLAIM 4 A method comprising : encoding a coded unit comprising a plurality of equally sized , square shaped video blocks of size NxN , where N is an integer multiple of 16 ; wherein each of the plurality of video blocks can be partitioned into partitions using hierarchical partitioning with one or more levels , wherein , at each hierarchical partitioning level , square partitions can be further partitioned into two equally sized rectangular partitions or into four square partitions , and the encoding of each of the plurality of video blocks uses a hierarchical coded block pattern ; generating syntax information for the coded unit , wherein the syntax information includes : a first syntax element representing a maximum size (ブロックサイズ) value , the maximum size value indicating the size of the plurality of video blocks in the coded unit ; a second syntax element representing a minimum size value , the minimum size value indicating a size of a smallest partition for the coded unit . |
JPH082106B2 CLAIM 2 【請求項2】前記有効/無効判定部は、有効/無効の判 定を直交変換を行なうブロックサイズ (maximum size, maximum size value) より小さいブロッ クサイズのサブブロックに分割し、サブブロック単位で 評価量Smを計算して閾値T 1 と比較し、当該ブロック中に 1つでも有効なサブブロックがある場合には当該ブロッ クを有効ブロック,その他の場合には無効ブロックと判 定することを特徴とする特許請求の範囲第1項記載の動 画像信号のハイブリッド符号化方式。 |