Purpose: Invalidity Analysis


Patent: US9930365B2
Filed: 2008-10-03
Issued: 2018-03-27
Patent Holder: (Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC
Inventor(s): Peisong Chen, Marta Karczewicz, Yan Ye

Title: Video coding with large macroblocks

Abstract: Techniques are described for encoding and decoding digital video data using macroblocks that are larger than the macroblocks prescribed by conventional video encoding and decoding standards. For example, the techniques include encoding and decoding a video stream using macroblocks comprising greater than 16×16 pixels. In one example, an apparatus includes a video encoder configured to encode a coded unit comprising a plurality of video blocks, wherein at least one of the plurality of video blocks comprises a size of more than 16×16 pixels and to generate syntax information for the coded unit that includes a maximum size value, wherein the maximum size value indicates a size of a largest one of the plurality of video blocks in the coded unit. The syntax information may also include a minimum size value. In this manner, the encoder may indicate to a decoder the proper syntax decoder to apply to the coded unit.




Disclaimer: The promise of Apex Standards Pseudo Claim Charting (PCC) is not to replace expert opinion but to provide due diligence and transparency prior to high precision charting. PCC conducts aggressive mapping (based on Broadest Reasonable, Ordinary or Customary Interpretation and Multilingual Translation) between a target patent's claim elements and other documents (potential technical standard specification or prior arts in the same or across different jurisdictions), therefore allowing for a top-down, apriori evaluation, with which, stakeholders can assess standard essentiality (potential strengths) or invalidity (potential weaknesses) quickly and effectively before making complex, high-value decisions. PCC is designed to relieve initial burden of proof via an exhaustive listing of contextual semantic mapping as potential building blocks towards a litigation-ready work product. Stakeholders may then use the mapping to modify upon shortlisted PCC or identify other relevant materials in order to formulate strategy and achieve further purposes.

Click on references to view corresponding claim charts.


GroundReferencesOwner of the ReferenceTitleSemantic MappingChallenged Claims
12345678910111214151617181920
1US20080192824A1

(Chong Soon Lim, 2008)
(Original Assignee) Panasonic Corp     

(Current Assignee)
Sovereign Peak Ventures LLC
Video coding method and video coding apparatus intra-prediction mode decoding tool
encoding mode coding method
XXXXXX
2JP2008219870A

(Teo Han Boon, 2008)
(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     動画像符号化方法及び動画像符号化装置 quantization parameter modification value 前記予測ブロック
minimum size, maximum size 小さいサイズ, 大きいサイズ
mobile device 大きい場合
processor to decode one マトリクス
second syntax 手段と
encoding mode comprises one ツール
XXXXXXXXXXX
3WO2008027192A2

(Oscar Divorra Escoda, 2008)
(Original Assignee) Thomson Licensing     Methods and apparatus for reduced resolution partitioning second syntax video encoding
current block current block
quantization parameter modification value slice level
video data video data
XXXXXXXXXXXX
4US20080049834A1

(Thomas Holcomb, 2008)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Sub-block transform coding of prediction residuals video blocks motion prediction
second syntax video encoding
second syntax element video encoder
XXX
5WO2007117711A2

(Cheng Chang, 2007)
(Original Assignee) Microsoft Corporation     Dynamic selection of motion estimation search ranges and extended motion vector ranges first syntax prediction residuals
second syntax element vertical components
second syntax entropy encoding
maximum size current picture
XXXXXX
6WO2007115126A1

(Tao Tian, 2007)
(Original Assignee) Qualcomm Incorporated     Adaptive encoder-assisted frame rate up conversion second syntax video encoding
encoding mode coding method, third mode
video data video data
decoding video data zero value
XXXXXX
7US20070297518A1

(Woo-jin Han, 2007)
(Original Assignee) Samsung Electronics Co Ltd     

(Current Assignee)
Samsung Electronics Co Ltd
Flag encoding method, flag decoding method, and apparatus thereof minimum size minimum size
encoding mode bit strings
video blocks flag value
XXXXXXXXX
8US20070160303A1

(Onur Guleryuz, 2007)
(Original Assignee) Docomo Communications Labs USA Inc     

(Current Assignee)
NTT Docomo Inc
Geometrical image representation and compression third syntax element reconstructed image
sub -block prediction unit
XXX
9US20070230563A1

(Tao Tian, 2007)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Adaptive encoder-assisted frame rate up conversion second syntax video encoding
encoding mode coding method, third mode
video data video data
decoding video data zero value
XXXXXX
10US20070171982A1

(Jens Meggers, 2007)
(Original Assignee) WIMAXIMUM LLC     

(Current Assignee)
Nordic Interactive Technologies LLC ; WIMAXIMUM LLC
Methods for encoding digital video for decoding on low performance devices video data, video blocks frame compression, memory block
current block more blocks
XXXXXXXXX
11US20080002770A1

(Kemal Ugur, 2008)
(Original Assignee) Nokia Oyj     

(Current Assignee)
Nokia Oyj
Methods, apparatus, and a computer program product for providing a fast inter mode decision for video encoding in resource constrained devices readable storage medium readable storage medium
video blocks first encoding mode
sub partitions first partition
XX
12WO2006069297A1

(Yan Ye, 2006)
(Original Assignee) Qualcomm Incorporated     Temporal estimation of a motion vector for video communications top box specified amount
mobile device said object
broadcast receiver device said memory
X
13US20060120450A1

(Woo-jin Han, 2006)
(Original Assignee) Samsung Electronics Co Ltd     

(Current Assignee)
Samsung Electronics Co Ltd
Method and apparatus for multi-layered video encoding and decoding intra-prediction mode, video blocks performing intra-prediction, intra-prediction mode
first syntax prediction residuals
second syntax element video encoder
XXXX
14US20080025391A1

(Peter Amon, 2008)
(Original Assignee) Siemens AG     

(Current Assignee)
Siemens AG
Transcoding Method And Device first syntax, third syntax syntax elements, network element
second syntax video encoding
encoding mode coding method
decoding video data second video
mobile device one second
XXXXXXX
15US20070030903A1

(Guy Cote, 2007)
(Original Assignee) LSI Corp     

(Current Assignee)
Avago Technologies International Sales Pte Ltd
Method and apparatus for H.264 to MPEG-2 video transcoding decoding video data discrete cosine
encoding mode least number
XXXXXX
16US20060153297A1

(Jill Boyce, 2006)
(Original Assignee) Thomson Licensing SAS     

(Current Assignee)
InterDigital Madison Patent Holdings Inc
Mixed inter/intra video coding of macroblock partitions encoding mode coding method, encoding mode
decoding video data comprises i, zero value
video data video data
XXXXXX
17US20060126740A1

(Teng Lin, 2006)
(Original Assignee) Micronas USA Inc     

(Current Assignee)
TDK Micronas GmbH
Shared pipeline architecture for motion vector prediction and residual decoding first syntax element, second syntax element motion vector differences
sub -block intra prediction mode
starting size backward reference
first syntax data state
video blocks d log
XXX
18JP2006304107A

(Katsuyuki Ozaki, 2006)
(Original Assignee) Ntt Electornics Corp; Nttエレクトロニクス株式会社     符号化装置、及び、この符号化装置に適用されるプログラム quantization parameter modification value 前記予測ブロック
partitioning comprises 前記予測モード
XXXXXX
19US20060203905A1

(Shih-Chang Hsia, 2006)
(Original Assignee) National Kaohsiung First University of Science and Technology     

(Current Assignee)
National Kaohsiung First University of Science and Technology
Video coding system minimum size, third syntax element horizontal resolution
maximum size processing speed
current block reference blocks
second syntax element video encoder
sub -block block address
thereon instructions frame memory
starting size memory cell
XXXXXXXXXXXX
20US20060133495A1

(Yan Ye, 2006)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Temporal error concealment for video communications top box specified amount
mobile device said object
broadcast receiver device said memory
X
21WO2005043310A2

(Ramaswamy Murali, 2005)
(Original Assignee) Qualcomm Incorporated     Method and apparatus for seamlessly switching reception between multimedia streams in a wireless communication system receiver device, broadcast receiver device second time interval
decoding video data comprises i, first base
XXX
22US20050101319A1

(Ramaswamy Murali, 2005)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Method and apparatus for seamlessly switching reception between multimedia streams in a wireless communication system receiver device, broadcast receiver device second time interval
decoding video data comprises i, first base
XXX
23US20050213831A1

(Mihaela van der Schaar, 2005)
(Original Assignee) Koninklijke Philips NV     

(Current Assignee)
Koninklijke Philips NV
Method and system for encoding fractional bitplanes current block pixel blocks
readable storage medium said blocks
XXXXXXXXX
24US20050025246A1

(Thomas Holcomb, 2005)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Decoding jointly coded transform type and subblock pattern information first syntax, third syntax syntax elements
video blocks, decoding video data first VLC
XXX
25US20050053292A1

(Kunal Mukerjee, 2005)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Advanced bi-directional predictive coding of interlaced video current block second motion vector predictor
encoding mode interlaced B-field
second syntax element video encoder
XXXXXXXXXXXX
26CN1810037A

(尹鹏, 2006)
(Original Assignee) 汤姆森许可贸易公司     帧间的快速模式确定编码 video blocks 一种视频编码方法, 针对帧
minimum size 尺寸的
XXXXXX
27US20070053436A1

(Lambertus Van Eggelen, 2007)
(Original Assignee) Koninklijke Philips NV     

(Current Assignee)
Koninklijke Philips NV
Encoding video information using block based adaptive scan order maximum size corresponding data blocks
video blocks preceding image
decoding video data discrete cosine
XXXXXX
28CN1723706A

(吉尔·M·博伊斯, 2006)
(Original Assignee) 汤姆森特许公司     宏块分割部分的混合块间/块内视频编码 video data 的视频数据
first syntax 多个分
XXX
29JP2004112825A

(Tihao Chiang, 2004)
(Original Assignee) Sarnoff Corp; Sharp Corp; サーノフ コーポレイション; シャープ株式会社     ビデオ画像符号化方法 maximum size ブロックサイズ
third syntax, third syntax element ブロック分
XXXXXX
30US7397857B2

(Alexander Romanowski, 2008)
(Original Assignee) Robert Bosch GmbH     

(Current Assignee)
Robert Bosch GmbH
Method for transformation-coding full motion image sequences video blocks estimating motion
video data image signal
XXX
31US7426308B2

(Pohsiang Hsu, 2008)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Intraframe and interframe interlace coding and decoding current block intra-coded blocks
inter-prediction mode interlaced video
decoding video data comprises i
top box column A
XXXXXXXXX
32US7289674B2

(Marta Karczewicz, 2007)
(Original Assignee) Nokia Oyj     

(Current Assignee)
Nokia Technologies Oy
Spatial prediction based intra coding readable storage medium readable storage medium
current block second blocks
XXXXXXXXX
33US7162091B2

(Albert S. Wang, 2007)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Intra compression of pixel blocks using predicted mean encoding mode comprises one frame prediction
current block pixel blocks
XXXXXXXXX
34US7181072B2

(Albert S. Wang, 2007)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Intra compression of pixel blocks using predicted mean third syntax different video
starting size same frame
XXX
35US20030123738A1

(Per Frojdh, 2003)
(Original Assignee) Telefonaktiebolaget LM Ericsson AB     

(Current Assignee)
Telefonaktiebolaget LM Ericsson AB
Global motion compensation for video pictures current block pixel blocks
video data video data
XXXXXXXXX
36US20030229719A1

(Eiji Iwata, 2003)
(Original Assignee) Sony Interactive Entertainment Inc     

(Current Assignee)
Sony Interactive Entertainment Inc
System and method for data compression mobile device said object
video data video data
XXXX
37JP2003319394A

(Kuhn Peter, 2003)
(Original Assignee) Sony Corp; ソニー株式会社     符号化装置および方法、復号装置および方法、記録媒体、並びにプログラム partitioning comprises 前記予測モード
second syntax 手段と
intra-prediction mode 参照画
XXXX
38US6959116B2

(Ugur Sezer, 2005)
(Original Assignee) EMC Corp     

(Current Assignee)
EMC Corp
Largest magnitude indices selection for (run, level) encoding of a block coded picture second syntax entropy encoding
decoding video data discrete cosine
XXX
39US6529634B1

(Kadayam Thyagarajan, 2003)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Contrast sensitive variance based adaptive block size DCT image compression current block minimum block size, divided blocks
video blocks transforming means, main data
quantization parameter modification value Huffman code
readable storage medium said blocks
encoding mode comprises one input block
decoding video data zero value
processor to decode one th column
XXXXXXXXXXXX
40US6633611B2

(Shunichi Sekiguchi, 2003)
(Original Assignee) Mitsubishi Electric Corp     

(Current Assignee)
Mitsubishi Electric Corp
Method and apparatus for region-based moving image encoding and decoding video blocks prediction error signal
video data image signal
mobile device said object
XXXX
41US5748116A

(Charles K. Chui, 1998)
(Original Assignee) Teralogic Inc     

(Current Assignee)
Qualcomm Inc ; CSR Technology Inc
System and method for nested split coding of sparse data sets broadcast receiver device said memory
decoding video data zero value
XXX
42US6084908A

(Tihao Chiang, 2000)
(Original Assignee) Sharp Corp; Sarnoff Corp     

(Current Assignee)
Sharp Corp ; MediaTek Inc
Apparatus and method for quadtree based variable block size motion estimation sub -block quadtree structure
decoding video data discrete cosine
thereon instructions control module
readable storage medium said blocks
XXX
43US6125143A

(Teruhiko Suzuki, 2000)
(Original Assignee) Sony Corp     

(Current Assignee)
Ironworks Patents LLC
Picture encoding device and method thereof, picture decoding device and method thereof, and recording medium encoding mode coding method
minimum size, maximum size mean value
XXXXXXXXX
44US5956088A

(Paul Shen, 1999)
(Original Assignee) Imedia Corp     

(Current Assignee)
Terayon Corp ; Google Technology Holdings LLC
Method and apparatus for modifying encoded digital video for improved channel utilization second syntax store packets
video data flow channels
XXX
45US5107345A

(Chong U. Lee, 1992)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Adaptive block size image compression method and system video blocks, decoding video data inverse discrete cosine transform, compressed image
encoding mode comprises one receiving said one
video data image signal
quantization parameter modification value Huffman code
readable storage medium code values
broadcast receiver device said memory
XXXXXXX
46WO2008117158A1

(Jani Lainema, 2008)
(Original Assignee) Nokia Corporation; Nokia Inc.     Method and system for motion vector predictions current block second motion vector predictor, second blocks
readable storage medium readable storage medium
video data, video blocks encoded video signal, motion prediction
broadcast receiver device said memory
XXXXXXXXXX
47CN101198064A

(胡瑞敏, 2008)
(Original Assignee) 武汉大学     一种分辨率分层技术中的运动矢量预测方法 encoding mode comprises one 搜索运动矢量, 方法为
inter-prediction mode 量预测模式
XXX
48WO2008054688A1

(Debargha Mukherjee, 2008)
(Original Assignee) Hewlett-Packard Development Company, L.P.     Method for decomposing a video sequence frame second syntax different motion vectors
readable storage medium readable storage medium
sub partitions edge partition
XXX
49WO2008002881A2

(Harinath Garudadri, 2008)
(Original Assignee) Qualcomm Incorporated     Reduction of errors during computation of inverse discrete cosine transform video blocks, decoding video data inverse discrete cosine transform, midpoint bias value
partitioning comprises point representations
XXX
50KR100842558B1

(김봉곤, 2008)
(Original Assignee) 삼성전자주식회사; 성균관대학교산학협력단     동영상 부호화를 위한 블록 모드 결정 방법 및 그 장치 encoding mode comprises one 프레임들의
inter-prediction mode 블록을
XXX
51WO2007081908A1

(Peng Yin, 2007)
(Original Assignee) Thomson Licensing     Method and apparatus for providing reduced resolution update mode for multi-view video coding second syntax video encoding
encoding mode coding method
XXXXXX
52JP2008092137A

(Yuji Okuda, 2008)
(Original Assignee) Toshiba Corp; 株式会社東芝     画像符号化装置及び画像符号化方法 third syntax 基づく第1
intra-prediction mode 参照画
XXXX
53EP1811786A1

(Liming Wang, 2007)
(Original Assignee) Sony Corp     

(Current Assignee)
Sony Corp
Encoding apparatus, encoding method and program thereof, and decoding apparatus, decoding method and program thereof video blocks motion prediction
encoding mode coding method
XXXXXX
54CN101213842A

(王祥林, 2008)
(Original Assignee) 诺基亚公司     用于使用运动补偿时域滤波的视频编码中的更新步骤的方法和装置 video data 的视频数据
sub -block 块上执行
encoding mode 关于参
XXXXXX
55CN101189641A

(I·戈里, 2008)
(Original Assignee) 布雷克成像有限公司     编码数字图像的像素或体素的方法及处理数字图像的方法 intra-prediction mode, encoding mode comprises one 来确定的
minimum size 数值数据
XXXXXXX
56KR20060027795A

(요제프 카롤리, 2006)
(Original Assignee) 에이엠티 어드밴스트 멀티미디어 테그놀로지 에이비     하이브리드 비디오 압축 방법 sub -block 서브-블록의, 블록들을
encoding mode comprises one 프레임들의, 편차값을
XXX
57EP1655967A1

(Kim Wooshik, 2006)
(Original Assignee) Samsung Electronics Co Ltd     

(Current Assignee)
Samsung Electronics Co Ltd
Method and apparatus for encoding and decoding image data sub -block prediction unit
current block current block
XXXXXXXXX
58US20050135484A1

(Yung-Lyul Lee, 2005)
(Original Assignee) Samsung Electronics Co Ltd; Sejong University Industry-Academy Cooperation Group     

(Current Assignee)
Samsung Electronics Co Ltd ; Sejong University Industry-Academy Cooperation Group
Method of encoding mode determination, method of motion estimation and encoding apparatus maximum size current picture
decoding video data discrete cosine
current block identical size, divided blocks
minimum size, starting size other ones
XXXXXXXXXXXX
59CN1857001A

(P·托思, 2006)
(Original Assignee) Amt先进多媒体科技公司     混合视频压缩方法 video data 的视频数据
video blocks 块进行编码
sub -block 块上执行
minimum size 的尺寸, 尺寸的
XXXXXX
60WO2004104930A2

(Peter Toth, 2004)
(Original Assignee) Amt Advanced Multimedia Technology Ab     Hybrid video compression method video data, decoding video data encoded video data, discrete cosine
first syntax processing element
current block individual blocks, reference blocks
intra-prediction mode, inter-prediction mode prediction modes
readable storage, readable storage medium function value
second syntax, third syntax data elements
video blocks given block
XXXXXXXXXX
61JP2004254327A

(Yoshiichiro Kashiwagi, 2004)
(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     画像符号化方法および画像復号化方法 maximum size ブロックサイズ
processor to decode one マトリクス, 逆変換
second syntax 手段と
XXXXXXX
62US20040234144A1

(Kazuo Sugimoto, 2004)
(Original Assignee) NTT Docomo Inc     

(Current Assignee)
NTT Docomo Inc
Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program maximum size predetermined shape
encoding mode coding method
XXXXXXXXX
63US7170937B2

(Minhua Zhou, 2007)
(Original Assignee) Texas Instruments Inc     

(Current Assignee)
Texas Instruments Inc
Complexity-scalable intra-frame prediction technique first syntax, first syntax element prediction direction
current block current block
XXXXXXXXX
64CN1457196A

(薛金柱, 2003)
(Original Assignee) 北京工业大学     基于时空域相关性运动矢量预测的视频编码方法 video data 的视频数据
intra-prediction mode 编码标准
encoding mode comprises one 方法为
XXXX
65EP1478190A1

(Satoru c/o NTT DoCoMo Inc. ADACHI, 2004)
(Original Assignee) NTT Docomo Inc     

(Current Assignee)
NTT Docomo Inc
Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program maximum size predetermined shape
encoding mode coding method
XXXXXXXXX
66CN1647541A

(J·莱恩马, 2005)
(Original Assignee) 诺基亚有限公司     用于在视频编码系统中指示量化器参数的方法与设备 inter-prediction mode 的解码方法
first syntax 多个分
XXX
67US20030202594A1

(Jani Lainema, 2003)
(Original Assignee) Nokia Oyj     

(Current Assignee)
Nokia Technologies Oy
Method for coding motion in a video sequence second syntax element video encoder
decoding video data coded region
XXX
68EP1876838A2

(Byeong Moon Jeon, 2008)
(Original Assignee) LG Electronics Inc     

(Current Assignee)
LG Electronics Inc
Method for block prediction of a bi-predictive picture in direct mode starting size backward reference
current block current block
XXXXXXXXX
69JP2002359852A

(Sen Bun Chun, 2002)
(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     画像予測復号化装置及び方法 minimum size, starting size 量子化ステップサイズ
second syntax 手段と
XXXXXX
70JP2001036912A

(Pierre Ruellou, 2001)
(Original Assignee) Thomson Multimedia Sa; トムソン マルチメデイア ソシエテ アノニム     画像圧縮方法 intra-prediction mode デフォルト, 予測モード
receiver device 割り当てら
XXXXX
71KR20000053028A

(비알쟝-프랑수아, 2000)
(Original Assignee) 브릭크만 게오르그; 톰슨 멀티미디어     움직임 보상을 이용한 예측 코딩 방법 및 장치 sub -block 블록들을, 블록간
intra-prediction mode, encoding mode weighting
inter-prediction mode 블록을, 그룹화
XXXXXX
72JP2000059769A

(Yung-Lyul Lee, 2000)
(Original Assignee) Korea Advanced Inst Of Sci Technol; Samsung Electronics Co Ltd; 三星電子株式会社; 韓國科學技術院     映像デ―タル―プフィルタリング方法及び装置 intra-prediction mode 予測モード
third syntax 前記一つ
XXXX
73US6148109A

(Choong Seng Boon, 2000)
(Original Assignee) Panasonic Corp     

(Current Assignee)
Sun Patent Trust
Image predictive coding method encoding mode coding method
video data image signal
XXXXXX
74US6094225A

(Seok-Won Han, 2000)
(Original Assignee) Daewoo Electronics Co Ltd     

(Current Assignee)
Quarterhill Inc
Method and apparatus for encoding mode signals for use in a binary shape coder current block reference blocks
encoding mode encoding mode
XXXXXXXXXXXX
75JPH10191324A

(Taisuke Matsumoto, 1998)
(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     画像符号化方法および画像符号化装置並びに画像符号化プログラムを記録した記録媒体 inter-prediction mode 符号化/復号
third syntax, third syntax element ブロック分
second syntax 手段と
XXX
76WO9803018A2

(Antonius Adrianus Cornelis Maria Kalker, 1998)
(Original Assignee) Philips Electronics N.V.; Philips Norden Ab     Transmission and reception of encoded video pictures video data, video blocks encoded video signal
quantization parameter modification value Huffman codes
XXXXXX
77US5748789A

(Ming-Chieh Lee, 1998)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Transparent block skipping in object-based video coding systems encoding mode coding method
video data video data
XXXXXX
78US5959673A

(Ming-Chieh Lee, 1999)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Transform coding of dense motion vector fields for frame and object based video coding applications receiver device two dimensional array
video blocks lossy compression
decoding video data discrete cosine, second video
XXX
79US5970173A

(Ming-Chieh Lee, 1999)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Image compression and affine transformation for image motion compensation video data affine transformation
minimum size singular value
XXXXXX
80EP0727910A2

(Max Chien, 1996)
(Original Assignee) Thomson Multimedia SA     

(Current Assignee)
Technicolor SA
Temporal-spatial error concealment apparatus and method for video signal processors readable storage medium said blocks
video data video data
broadcast receiver device image area
current block bad block
XXXXXXXXXX
81US5692063A

(Ming-Chieh Lee, 1997)
(Original Assignee) Microsoft Corp     

(Current Assignee)
Microsoft Technology Licensing LLC
Method and system for unrestricted motion estimation for video first syntax, third syntax horizontal boundary, vertical boundary
decoding video data second video
mobile device said object
XXXX
82US5576767A

(Chong U. Lee, 1996)
(Original Assignee) Qualcomm Inc     

(Current Assignee)
Qualcomm Inc
Interframe video encoding and decoding system current block reference blocks
readable storage medium said blocks
XXXXXXXXX
83US5768434A

(Xiaonong Ran, 1998)
(Original Assignee) National Semiconductor Corp     

(Current Assignee)
National Semiconductor Corp
Quadtree-structured walsh transform coding receiver device two dimensional array
sub partitions first partition
current block more blocks
XXXXXXXXXX
84US5666461A

(Katsuji Igarashi, 1997)
(Original Assignee) Sony Corp     

(Current Assignee)
Sony Corp
High efficiency encoding and decoding of picture signals and recording medium containing same video blocks picture portion
sub -block block address
X
85US5539466A

(Katsuji Igarashi, 1996)
(Original Assignee) Sony Corp     

(Current Assignee)
Sony Corp
Efficient coding apparatus for picture signal and decoding apparatus therefor video blocks first encoding mode
encoding mode coding method
XXXXXX
86US5594504A

(Touradj Ebrahimi, 1997)
(Original Assignee) Nokia of America Corp     

(Current Assignee)
Nokia of America Corp
Predictive video coding using a motion vector updating routine video blocks motion estimation unit
maximum size predetermined shape
encoding mode comprises one frame prediction
processor to decode one initial vector
top box control unit
thereon instructions frame memory
XXXXXXX
87JPH07274176A

(Shinichiro Koto, 1995)
(Original Assignee) Toshiba Corp; 株式会社東芝     動画像伝送装置 video data 前記送信
second syntax 手段と
intra-prediction mode 参照画
XXXX
88EP0651574A1

(Motoki Sony Corporation Kato, 1995)
(Original Assignee) Sony Corp     

(Current Assignee)
Sony Corp
Method and apparatus for coding/decoding motion vector, and method and apparatus for coding/decoding image signal encoding mode comprises one frame prediction
encoding mode coding method
XXXXXX
89EP0614318A2

(Kenshi C/O Intellectual Prop. Div. Dachiku, 1994)
(Original Assignee) Toshiba Corp     

(Current Assignee)
Toshiba Corp
Video encoder, video decoder, and video motion estimation apparatus video blocks transforming means, motion prediction
second syntax video encoding
thereon instructions frame memory
XXX
90EP0607042A2

(Wilson Kwok, 1994)
(Original Assignee) RCA Licensing Corp     

(Current Assignee)
RCA Licensing Corp
Error concealment apparatus for video signal processors minimum size, maximum size background pixel, mean value
current block bad block
XXXXXXXXXXXX
91US5461421A

(Heon H. Moon, 1995)
(Original Assignee) Samsung Electronics Co Ltd     

(Current Assignee)
Samsung Electronics Co Ltd
Encoding and decoding method and apparatus thereof video blocks, decoding video data inverse discrete cosine transform, respective frame
encoding mode coding method
encoding mode comprises one input block
XXXXXX
92US5422676A

(Carsten Herpel, 1995)
(Original Assignee) Deutsche Thomson Brandt GmbH     System for coding an image representative signal current block second blocks
encoding mode coding method
video data image signal
readable storage medium said blocks
XXXXXXXXXXXX
93EP0588653A2

(Kimitaka C/O Fujitsu Limited Murashita, 1994)
(Original Assignee) Fujitsu Ltd     

(Current Assignee)
Fujitsu Ltd
Image data coding and restoring method and appatatus for coding and restoring the same video data image changes
mobile device, decoding video data then output
XXXX
94US5448297A

(Adnan Alattar, 1995)
(Original Assignee) Intel Corp     

(Current Assignee)
Intel Corp
Method and system for encoding images using skip blocks starting size horizontal dimension
video data image signal
XXX
95US5467086A

(Jechang Jeong, 1995)
(Original Assignee) Samsung Electronics Co Ltd     

(Current Assignee)
Samsung Electronics Co Ltd
Apparatus and method of coding/decoding video data encoding mode coding method
maximum size, thereon instructions said switch
XXXXXXXXX
96US5442400A

(Huifang Sun, 1995)
(Original Assignee) RCA Licensing Corp     

(Current Assignee)
RCA Licensing Corp
Error concealment apparatus for MPEG-like video data video blocks respective frame
readable storage medium said blocks
sub partitions other frame
broadcast receiver device said memory, image area
video data video data
decoding video data zero value
XXXX
97USRE35158E

(Kenji Sugiyama, 1996)
(Original Assignee) Victor Company of Japan Ltd     

(Current Assignee)
Victor Company of Japan Ltd
Apparatus for adaptive inter-frame predictive encoding of video signal intra-prediction mode, inter-prediction mode prediction modes
thereon instructions frame memory
readable storage medium said blocks
XXXX
98US5347308A

(Lucas H. Y. Wai, 1994)
(Original Assignee) Panasonic Corp     

(Current Assignee)
Panasonic Corp
Adaptive coding method for interlaced scan digital video sequences video blocks motion prediction
decoding video data discrete cosine
XX
99US5227878A

(Atul Puri, 1993)
(Original Assignee) Nokia Bell Labs     

(Current Assignee)
AT&T Corp ; Multimedia Patent Trust
Adaptive coding and decoding of frames and fields of video decoding video data discrete cosine
intra-prediction mode stored value
XXXX
100US5155594A

(Jeffrey Bernstein, 1992)
(Original Assignee) PictureTel Corp     

(Current Assignee)
Polycom Inc
Hierarchical encoding method and apparatus employing background references for efficiently communicating image sequences third syntax element reconstructed image
thereon instructions estimated number
XXX
101US5068724A

(Edward A. Krause, 1991)
(Original Assignee) Arris Technology Inc     

(Current Assignee)
CIF LICENSING LLC
Adaptive motion compensation for digital television receiver device digital television
encoding mode least number
maximum size, thereon instructions said switch
readable storage medium said blocks
video data video data
XXXXXXXXXX
102US5091782A

(Edward A. Krause, 1992)
(Original Assignee) Arris Technology Inc     

(Current Assignee)
CIF LICENSING LLC
Apparatus and method for adaptively compressing successive blocks of digital video video blocks transforming means
receiver device digital television
inter-prediction mode interlaced video
readable storage medium said blocks
video data video data
XXXX




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20080192824A1

Filed: 2008-02-06     Issued: 2008-08-14

Video coding method and video coding apparatus

(Original Assignee) Panasonic Corp     (Current Assignee) Sovereign Peak Ventures LLC

Chong Soon Lim, Viktor Wahadaniah, Teo Han Boon, Toshiyasu Sugio, Takahiro Nishi, Youji Shibahara, Kazuya Takagi
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (decoding tool) and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20080192824A1
CLAIM 1
. A video coding method (encoding mode) for coding a video on a per predetermined block basis using one or more predetermined quantization control variables , said video coding method comprising : calculating a correlation degree indicating a correlation between an original block in a current original picture to be coded and a prediction block in a previous uncompressed picture ;
changing a first predetermined quantization control variable to a second predetermined quantization control variable so as to reduce a quantization step size in the case where the correlation degree is greater than a predetermined threshold ;
and determining the quantization step size based on the second quantization control variable , and coding the original block on the determined quantization step size basis using a predetermined image coding tool .

US20080192824A1
CLAIM 12
. A video decoding method for decoding coded image data on a per predetermined block basis , said video decoding method comprising : extracting a quantization control variable from the coded image data ;
and determining a quantization step size based on the extracted quantization control variable , and decoding the coded image data coded on the determined quantization step basis using a predetermined image decoding tool (intra-prediction mode) .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US20080192824A1
CLAIM 1
. A video coding method (encoding mode) for coding a video on a per predetermined block basis using one or more predetermined quantization control variables , said video coding method comprising : calculating a correlation degree indicating a correlation between an original block in a current original picture to be coded and a prediction block in a previous uncompressed picture ;
changing a first predetermined quantization control variable to a second predetermined quantization control variable so as to reduce a quantization step size in the case where the correlation degree is greater than a predetermined threshold ;
and determining the quantization step size based on the second quantization control variable , and coding the original block on the determined quantization step size basis using a predetermined image coding tool .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (decoding tool) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20080192824A1
CLAIM 1
. A video coding method (encoding mode) for coding a video on a per predetermined block basis using one or more predetermined quantization control variables , said video coding method comprising : calculating a correlation degree indicating a correlation between an original block in a current original picture to be coded and a prediction block in a previous uncompressed picture ;
changing a first predetermined quantization control variable to a second predetermined quantization control variable so as to reduce a quantization step size in the case where the correlation degree is greater than a predetermined threshold ;
and determining the quantization step size based on the second quantization control variable , and coding the original block on the determined quantization step size basis using a predetermined image coding tool .

US20080192824A1
CLAIM 12
. A video decoding method for decoding coded image data on a per predetermined block basis , said video decoding method comprising : extracting a quantization control variable from the coded image data ;
and determining a quantization step size based on the extracted quantization control variable , and decoding the coded image data coded on the determined quantization step basis using a predetermined image decoding tool (intra-prediction mode) .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode (decoding tool) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US20080192824A1
CLAIM 1
. A video coding method (encoding mode) for coding a video on a per predetermined block basis using one or more predetermined quantization control variables , said video coding method comprising : calculating a correlation degree indicating a correlation between an original block in a current original picture to be coded and a prediction block in a previous uncompressed picture ;
changing a first predetermined quantization control variable to a second predetermined quantization control variable so as to reduce a quantization step size in the case where the correlation degree is greater than a predetermined threshold ;
and determining the quantization step size based on the second quantization control variable , and coding the original block on the determined quantization step size basis using a predetermined image coding tool .

US20080192824A1
CLAIM 12
. A video decoding method for decoding coded image data on a per predetermined block basis , said video decoding method comprising : extracting a quantization control variable from the coded image data ;
and determining a quantization step size based on the extracted quantization control variable , and decoding the coded image data coded on the determined quantization step basis using a predetermined image decoding tool (intra-prediction mode) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (decoding tool) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20080192824A1
CLAIM 1
. A video coding method (encoding mode) for coding a video on a per predetermined block basis using one or more predetermined quantization control variables , said video coding method comprising : calculating a correlation degree indicating a correlation between an original block in a current original picture to be coded and a prediction block in a previous uncompressed picture ;
changing a first predetermined quantization control variable to a second predetermined quantization control variable so as to reduce a quantization step size in the case where the correlation degree is greater than a predetermined threshold ;
and determining the quantization step size based on the second quantization control variable , and coding the original block on the determined quantization step size basis using a predetermined image coding tool .

US20080192824A1
CLAIM 12
. A video decoding method for decoding coded image data on a per predetermined block basis , said video decoding method comprising : extracting a quantization control variable from the coded image data ;
and determining a quantization step size based on the extracted quantization control variable , and decoding the coded image data coded on the determined quantization step basis using a predetermined image decoding tool (intra-prediction mode) .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US20080192824A1
CLAIM 1
. A video coding method (encoding mode) for coding a video on a per predetermined block basis using one or more predetermined quantization control variables , said video coding method comprising : calculating a correlation degree indicating a correlation between an original block in a current original picture to be coded and a prediction block in a previous uncompressed picture ;
changing a first predetermined quantization control variable to a second predetermined quantization control variable so as to reduce a quantization step size in the case where the correlation degree is greater than a predetermined threshold ;
and determining the quantization step size based on the second quantization control variable , and coding the original block on the determined quantization step size basis using a predetermined image coding tool .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JP2008219870A

Filed: 2008-01-28     Issued: 2008-09-18

動画像符号化方法及び動画像符号化装置

(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     

Teo Han Boon, Chong Soon Lim, Takahiro Nishi, Yoji Shibahara, Toshiyasu Sugio, Kazuya Takagi, Viktor Wahadaniah, ハン ブーン テオ, ワハダニア ビクター, チョン スーン リム, 敏康 杉尾, 陽司 柴原, 孝啓 西, 一也 高木
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (小さいサイズ, 大きいサイズ) of blocks of the sequence of pictures ;

decoding a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (小さいサイズ, 大きいサイズ) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size (小さいサイズ, 大きいサイズ) equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (ツール) of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JP2008219870A
CLAIM 1
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化方法であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出ステップと、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定ステップと、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツール (encoding mode comprises one) を用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化ステップと を含むことを特徴とする動画像符号化方法。

JP2008219870A
CLAIM 4
前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (minimum size, maximum size, starting size) 又は所定の小さいサイズ (minimum size, maximum size, starting size) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。

JP2008219870A
CLAIM 10
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出手段と (second syntax) 、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定手段と、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化手段と を備えることを特徴とする動画像符号化装置。

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (小さいサイズ, 大きいサイズ) and greater than the minimum size (小さいサイズ, 大きいサイズ) .
JP2008219870A
CLAIM 4
前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (minimum size, maximum size, starting size) 又は所定の小さいサイズ (minimum size, maximum size, starting size) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。

US9930365B2
CLAIM 6
. The method of claim 1 , further comprising receiving a quantization parameter modification value (前記予測ブロック) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
JP2008219870A
CLAIM 3
前記相関算出ステップは、 前記予測ブロック (quantization parameter modification value) について動きベクトルを算出するサブステップと、 前記原画ブロックのサンプル値、前記予測ブロックのサンプル値、及び算出された前記動きベクトルを用いて、相関を表わすコスト値を算出するサブステップとを含み、 前記変数決定ステップでは、算出された前記コスト値に基づいて前記量子化制御変数を変更する ことを特徴とする請求項1記載の動画像符号化方法。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (小さいサイズ, 大きいサイズ) of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (小さいサイズ, 大きいサイズ) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (小さいサイズ, 大きいサイズ) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (ツール) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JP2008219870A
CLAIM 1
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化方法であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出ステップと、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定ステップと、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツール (encoding mode comprises one) を用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化ステップと を含むことを特徴とする動画像符号化方法。

JP2008219870A
CLAIM 4
前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (minimum size, maximum size, starting size) 又は所定の小さいサイズ (minimum size, maximum size, starting size) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。

JP2008219870A
CLAIM 10
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出手段と (second syntax) 、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定手段と、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化手段と を備えることを特徴とする動画像符号化装置。

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (小さいサイズ, 大きいサイズ) and greater than the minimum size (小さいサイズ, 大きいサイズ) .
JP2008219870A
CLAIM 4
前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (minimum size, maximum size, starting size) 又は所定の小さいサイズ (minimum size, maximum size, starting size) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。

US9930365B2
CLAIM 12
. The device of claim 7 , wherein the processor is further configured to receive a quantization parameter modification value (前記予測ブロック) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
JP2008219870A
CLAIM 3
前記相関算出ステップは、 前記予測ブロック (quantization parameter modification value) について動きベクトルを算出するサブステップと、 前記原画ブロックのサンプル値、前記予測ブロックのサンプル値、及び算出された前記動きベクトルを用いて、相関を表わすコスト値を算出するサブステップとを含み、 前記変数決定ステップでは、算出された前記コスト値に基づいて前記量子化制御変数を変更する ことを特徴とする請求項1記載の動画像符号化方法。

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device (大きい場合) , a broadcast receiver device , or a set-top box .
JP2008219870A
CLAIM 1
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化方法であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出ステップと、 前記相関の度合いが所定の閾値より大きい場合 (mobile device) に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定ステップと、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化ステップと を含むことを特徴とする動画像符号化方法。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size (小さいサイズ, 大きいサイズ) of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (小さいサイズ, 大きいサイズ) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (小さいサイズ, 大きいサイズ) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (ツール) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JP2008219870A
CLAIM 1
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化方法であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出ステップと、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定ステップと、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツール (encoding mode comprises one) を用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化ステップと を含むことを特徴とする動画像符号化方法。

JP2008219870A
CLAIM 4
前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (minimum size, maximum size, starting size) 又は所定の小さいサイズ (minimum size, maximum size, starting size) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。

JP2008219870A
CLAIM 10
既定の量子化制御変数を用いて、所定のブロック単位で動画像を符号化する動画像符号化装置であって、 符号化対象の原画における原画ブロックと先行する非圧縮ピクチャにおける予測ブロックとの相関の度合いを算出する相関算出手段と (second syntax) 、 前記相関の度合いが所定の閾値より大きい場合に、量子化ステップサイズが小さくなるように前記量子化制御変数を変更する変数決定手段と、 変更された前記量子化制御変数に基づいて量子化ステップサイズを決定し、所定の画像符号化ツールを用いて、決定された前記量子化ステップサイズで前記原画ブロックを符号化する画像符号化手段と を備えることを特徴とする動画像符号化装置。

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (小さいサイズ, 大きいサイズ) and greater than the minimum size (小さいサイズ, 大きいサイズ) .
JP2008219870A
CLAIM 4
前記変数決定ステップは、 前記原画ブロックについて、既定の量子化パラメータを特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ (minimum size, maximum size, starting size) 又は所定の小さいサイズ (minimum size, maximum size, starting size) の何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化パラメータ値分だけ前記量子化パラメータを小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one (マトリクス) or more syntax elements representative of partitioning for the current block .
JP2008219870A
CLAIM 5
前記変数決定ステップは、 前記原画ブロックについて、既定の量子化マトリクス (processor to decode one) を特定するサブステップと、 前記原画ブロックに用いられる直交変換サイズを決定するサブステップと、 前記直交変換サイズが所定の大きいサイズ又は所定の小さいサイズの何れであるか否かを判定するサブステップと、 前記直交変換サイズが所定の大きいサイズであると判定された場合に、既定の第1デルタ量子化マトリクスの各係数分だけ前記量子化マトリクスの各係数を小さくするサブステップと を含むことを特徴とする請求項1記載の動画像符号化方法。

US9930365B2
CLAIM 20
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to receive a quantization parameter modification value (前記予測ブロック) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
JP2008219870A
CLAIM 3
前記相関算出ステップは、 前記予測ブロック (quantization parameter modification value) について動きベクトルを算出するサブステップと、 前記原画ブロックのサンプル値、前記予測ブロックのサンプル値、及び算出された前記動きベクトルを用いて、相関を表わすコスト値を算出するサブステップとを含み、 前記変数決定ステップでは、算出された前記コスト値に基づいて前記量子化制御変数を変更する ことを特徴とする請求項1記載の動画像符号化方法。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2008027192A2

Filed: 2007-08-16     Issued: 2008-03-06

Methods and apparatus for reduced resolution partitioning

(Original Assignee) Thomson Licensing     

Oscar Divorra Escoda, Peng Yin
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO2008027192A2
CLAIM 3
. The apparatus of claim 2 , wherein said encoder (300) uses a block size of an initial partition to generate a tree-based frame partition , the block size being any of available block sizes capable of being used for tree-based frame partitioning relating to a particular video encoding (second syntax) standard or recommendation .

WO2008027192A2
CLAIM 4
. The apparatus of claim 2 , wherein said encoder (300) uses the bottom- up tree joining to generate super-block partitions for the video data (video data) .

WO2008027192A2
CLAIM 13
. The apparatus of claim 9 , wherein in at least one block in a super- block , at least one of block coding modes , the local super-block related information and motion information is directly derived for a current block (current block) to be encoded in the super-block from at least one of the local super-block related information , block mode information and motion information , explicitly or implicitly embedded in at least one neighboring block .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (current block) .
WO2008027192A2
CLAIM 13
. The apparatus of claim 9 , wherein in at least one block in a super- block , at least one of block coding modes , the local super-block related information and motion information is directly derived for a current block (current block) to be encoded in the super-block from at least one of the local super-block related information , block mode information and motion information , explicitly or implicitly embedded in at least one neighboring block .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (current block) has a size of at least 64×64 pixels .
WO2008027192A2
CLAIM 13
. The apparatus of claim 9 , wherein in at least one block in a super- block , at least one of block coding modes , the local super-block related information and motion information is directly derived for a current block (current block) to be encoded in the super-block from at least one of the local super-block related information , block mode information and motion information , explicitly or implicitly embedded in at least one neighboring block .

US9930365B2
CLAIM 6
. The method of claim 1 , further comprising receiving a quantization parameter modification value (slice level) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
WO2008027192A2
CLAIM 8
. The apparatus of claim 7 , wherein said encoder (300) codes the high level syntax in at least one of a slice header level , a Supplemental Enhancement Information (SEI) level , a picture parameter set level , a sequence parameter set level , a network abstraction layer unit header level , a picture level , a slice level (quantization parameter modification value) , and a macroblock level .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO2008027192A2
CLAIM 3
. The apparatus of claim 2 , wherein said encoder (300) uses a block size of an initial partition to generate a tree-based frame partition , the block size being any of available block sizes capable of being used for tree-based frame partitioning relating to a particular video encoding (second syntax) standard or recommendation .

WO2008027192A2
CLAIM 4
. The apparatus of claim 2 , wherein said encoder (300) uses the bottom- up tree joining to generate super-block partitions for the video data (video data) .

WO2008027192A2
CLAIM 13
. The apparatus of claim 9 , wherein in at least one block in a super- block , at least one of block coding modes , the local super-block related information and motion information is directly derived for a current block (current block) to be encoded in the super-block from at least one of the local super-block related information , block mode information and motion information , explicitly or implicitly embedded in at least one neighboring block .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (current block) .
WO2008027192A2
CLAIM 13
. The apparatus of claim 9 , wherein in at least one block in a super- block , at least one of block coding modes , the local super-block related information and motion information is directly derived for a current block (current block) to be encoded in the super-block from at least one of the local super-block related information , block mode information and motion information , explicitly or implicitly embedded in at least one neighboring block .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (current block) has a size of at least 64×64 pixels .
WO2008027192A2
CLAIM 13
. The apparatus of claim 9 , wherein in at least one block in a super- block , at least one of block coding modes , the local super-block related information and motion information is directly derived for a current block (current block) to be encoded in the super-block from at least one of the local super-block related information , block mode information and motion information , explicitly or implicitly embedded in at least one neighboring block .

US9930365B2
CLAIM 12
. The device of claim 7 , wherein the processor is further configured to receive a quantization parameter modification value (slice level) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
WO2008027192A2
CLAIM 8
. The apparatus of claim 7 , wherein said encoder (300) codes the high level syntax in at least one of a slice header level , a Supplemental Enhancement Information (SEI) level , a picture parameter set level , a sequence parameter set level , a network abstraction layer unit header level , a picture level , a slice level (quantization parameter modification value) , and a macroblock level .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
WO2008027192A2
CLAIM 3
. The apparatus of claim 2 , wherein said encoder (300) uses a block size of an initial partition to generate a tree-based frame partition , the block size being any of available block sizes capable of being used for tree-based frame partitioning relating to a particular video encoding (second syntax) standard or recommendation .

WO2008027192A2
CLAIM 4
. The apparatus of claim 2 , wherein said encoder (300) uses the bottom- up tree joining to generate super-block partitions for the video data (video data) .

WO2008027192A2
CLAIM 13
. The apparatus of claim 9 , wherein in at least one block in a super- block , at least one of block coding modes , the local super-block related information and motion information is directly derived for a current block (current block) to be encoded in the super-block from at least one of the local super-block related information , block mode information and motion information , explicitly or implicitly embedded in at least one neighboring block .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (current block) .
WO2008027192A2
CLAIM 13
. The apparatus of claim 9 , wherein in at least one block in a super- block , at least one of block coding modes , the local super-block related information and motion information is directly derived for a current block (current block) to be encoded in the super-block from at least one of the local super-block related information , block mode information and motion information , explicitly or implicitly embedded in at least one neighboring block .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (current block) has a size of at least 64×64 pixels .
WO2008027192A2
CLAIM 13
. The apparatus of claim 9 , wherein in at least one block in a super- block , at least one of block coding modes , the local super-block related information and motion information is directly derived for a current block (current block) to be encoded in the super-block from at least one of the local super-block related information , block mode information and motion information , explicitly or implicitly embedded in at least one neighboring block .

US9930365B2
CLAIM 20
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to receive a quantization parameter modification value (slice level) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
WO2008027192A2
CLAIM 8
. The apparatus of claim 7 , wherein said encoder (300) codes the high level syntax in at least one of a slice header level , a Supplemental Enhancement Information (SEI) level , a picture parameter set level , a sequence parameter set level , a network abstraction layer unit header level , a picture level , a slice level (quantization parameter modification value) , and a macroblock level .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20080049834A1

Filed: 2007-08-03     Issued: 2008-02-28

Sub-block transform coding of prediction residuals

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Thomas Holcomb, Chih-Lung Lin
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20080049834A1
CLAIM 60
. A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 55 during video encoding (second syntax) , wherein the method further comprises outputting the one or more sub-block pattern codes .

US20080049834A1
CLAIM 62
. In a video encoder (second syntax element) , a computer-implemented method of processing one or more motion-predicted video frames , wherein the one or more motion-predicted video frames include plural blocks , the method comprising : using plural sub-block pattern codes during encoding of prediction residual data , wherein each of the plural sub-block pattern codes indicates presence or absence of information for plural sub-blocks of a corresponding block of the plural blocks ;
and outputting the plural sub-block pattern codes .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (motion prediction) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20080049834A1
CLAIM 56
. The method of claim 55 wherein the data for the one or more video frames comprises motion prediction (video blocks) residual data .

US20080049834A1
CLAIM 60
. A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 55 during video encoding (second syntax) , wherein the method further comprises outputting the one or more sub-block pattern codes .

US20080049834A1
CLAIM 62
. In a video encoder (second syntax element) , a computer-implemented method of processing one or more motion-predicted video frames , wherein the one or more motion-predicted video frames include plural blocks , the method comprising : using plural sub-block pattern codes during encoding of prediction residual data , wherein each of the plural sub-block pattern codes indicates presence or absence of information for plural sub-blocks of a corresponding block of the plural blocks ;
and outputting the plural sub-block pattern codes .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20080049834A1
CLAIM 60
. A computer-readable medium storing computer-executable instructions for causing the computer system to perform the method of claim 55 during video encoding (second syntax) , wherein the method further comprises outputting the one or more sub-block pattern codes .

US20080049834A1
CLAIM 62
. In a video encoder (second syntax element) , a computer-implemented method of processing one or more motion-predicted video frames , wherein the one or more motion-predicted video frames include plural blocks , the method comprising : using plural sub-block pattern codes during encoding of prediction residual data , wherein each of the plural sub-block pattern codes indicates presence or absence of information for plural sub-blocks of a corresponding block of the plural blocks ;
and outputting the plural sub-block pattern codes .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2007117711A2

Filed: 2007-04-09     Issued: 2007-10-18

Dynamic selection of motion estimation search ranges and extended motion vector ranges

(Original Assignee) Microsoft Corporation     

Cheng Chang, Chih-Lung Lin, Thomas W. Holcomb
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax (prediction residuals) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (entropy encoding) element (vertical components) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (current picture) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO2007117711A2
CLAIM 4
. The method of claim 1 further comprising : signaling a first motion vector range indicating the first search range ;
and signaling a second motion vector range indicating the second search range ;
whereby the signaling improves efficiency of entropy encoding (second syntax) of motion vector information from the first motion estimation and second motion estimation .

WO2007117711A2
CLAIM 16
. An encoder comprising : a motion estimator for performing motion estimation ;
a frequency transformer for transforming prediction residuals (first syntax) into transform coefficients ;
a quantizer for quantizing the transform coefficients ;
an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ;
and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size) is based at least in part upon motion vector distribution information for plural previous motion vectors .

WO2007117711A2
CLAIM 19
. The encoder of claim 16 wherein the motion estimation controller : tracks each of the plural previous motion vectors in a histogram based upon a sum of absolute values of horizontal and vertical components (second syntax element) ;
and using the histogram , counts how many of the plural previous motion vectors fall within each of plural intervals of a motion vector distribution .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (current picture) and greater than the minimum size .
WO2007117711A2
CLAIM 16
. An encoder comprising : a motion estimator for performing motion estimation ;
a frequency transformer for transforming prediction residuals into transform coefficients ;
a quantizer for quantizing the transform coefficients ;
an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ;
and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size) is based at least in part upon motion vector distribution information for plural previous motion vectors .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (prediction residuals) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (entropy encoding) element (vertical components) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (current picture) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO2007117711A2
CLAIM 4
. The method of claim 1 further comprising : signaling a first motion vector range indicating the first search range ;
and signaling a second motion vector range indicating the second search range ;
whereby the signaling improves efficiency of entropy encoding (second syntax) of motion vector information from the first motion estimation and second motion estimation .

WO2007117711A2
CLAIM 16
. An encoder comprising : a motion estimator for performing motion estimation ;
a frequency transformer for transforming prediction residuals (first syntax) into transform coefficients ;
a quantizer for quantizing the transform coefficients ;
an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ;
and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size) is based at least in part upon motion vector distribution information for plural previous motion vectors .

WO2007117711A2
CLAIM 19
. The encoder of claim 16 wherein the motion estimation controller : tracks each of the plural previous motion vectors in a histogram based upon a sum of absolute values of horizontal and vertical components (second syntax element) ;
and using the histogram , counts how many of the plural previous motion vectors fall within each of plural intervals of a motion vector distribution .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (current picture) and greater than the minimum size .
WO2007117711A2
CLAIM 16
. An encoder comprising : a motion estimator for performing motion estimation ;
a frequency transformer for transforming prediction residuals into transform coefficients ;
a quantizer for quantizing the transform coefficients ;
an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ;
and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size) is based at least in part upon motion vector distribution information for plural previous motion vectors .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (prediction residuals) element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (entropy encoding) element (vertical components) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (current picture) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
WO2007117711A2
CLAIM 4
. The method of claim 1 further comprising : signaling a first motion vector range indicating the first search range ;
and signaling a second motion vector range indicating the second search range ;
whereby the signaling improves efficiency of entropy encoding (second syntax) of motion vector information from the first motion estimation and second motion estimation .

WO2007117711A2
CLAIM 16
. An encoder comprising : a motion estimator for performing motion estimation ;
a frequency transformer for transforming prediction residuals (first syntax) into transform coefficients ;
a quantizer for quantizing the transform coefficients ;
an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ;
and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size) is based at least in part upon motion vector distribution information for plural previous motion vectors .

WO2007117711A2
CLAIM 19
. The encoder of claim 16 wherein the motion estimation controller : tracks each of the plural previous motion vectors in a histogram based upon a sum of absolute values of horizontal and vertical components (second syntax element) ;
and using the histogram , counts how many of the plural previous motion vectors fall within each of plural intervals of a motion vector distribution .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (current picture) and greater than the minimum size .
WO2007117711A2
CLAIM 16
. An encoder comprising : a motion estimator for performing motion estimation ;
a frequency transformer for transforming prediction residuals into transform coefficients ;
a quantizer for quantizing the transform coefficients ;
an entropy encoder for entropy encoding motion vector information and the quantized transform coefficients ;
and a motion estimation controller for selecting search ranges that constrain the motion estimation , wherein the selecting for a current picture (maximum size) is based at least in part upon motion vector distribution information for plural previous motion vectors .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2007115126A1

Filed: 2007-03-29     Issued: 2007-10-11

Adaptive encoder-assisted frame rate up conversion

(Original Assignee) Qualcomm Incorporated     

Tao Tian, Fang Shi, Vijayalakshmi R. Raveendran
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method, third mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO2007115126A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

WO2007115126A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value (decoding video data) to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

WO2007115126A1
CLAIM 13
. A processor for encoding digital video data (video data) , the processor being configured to interpolate a frame rate up-conversion (FRUC) video frame , encode at least a portion of a video frame using the FRUC frame as a reference , select one of a plurality of FRUC modes for the at least a portion of the encoded video frame , and adjust one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method, third mode) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
WO2007115126A1
CLAIM 1
. A digital video encoding method (encoding mode) comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

WO2007115126A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method, third mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO2007115126A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

WO2007115126A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value (decoding video data) to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

WO2007115126A1
CLAIM 13
. A processor for encoding digital video data (video data) , the processor being configured to interpolate a frame rate up-conversion (FRUC) video frame , encode at least a portion of a video frame using the FRUC frame as a reference , select one of a plurality of FRUC modes for the at least a portion of the encoded video frame , and adjust one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method, third mode) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
WO2007115126A1
CLAIM 1
. A digital video encoding method (encoding mode) comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

WO2007115126A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method, third mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
WO2007115126A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

WO2007115126A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

WO2007115126A1
CLAIM 13
. A processor for encoding digital video data (video data) , the processor being configured to interpolate a frame rate up-conversion (FRUC) video frame , encode at least a portion of a video frame using the FRUC frame as a reference , select one of a plurality of FRUC modes for the at least a portion of the encoded video frame , and adjust one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method, third mode) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
WO2007115126A1
CLAIM 1
. A digital video encoding method (encoding mode) comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

WO2007115126A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20070297518A1

Filed: 2007-03-22     Issued: 2007-12-27

Flag encoding method, flag decoding method, and apparatus thereof

(Original Assignee) Samsung Electronics Co Ltd     (Current Assignee) Samsung Electronics Co Ltd

Woo-jin Han, Bae-keun Lee, Tammy Lee, Kyo-hyuk Lee
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (minimum size) of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (bit strings) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20070297518A1
CLAIM 24
. The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) .

US20070297518A1
CLAIM 25
. An apparatus for decoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a skip-bit-reading unit which reads a skip bit from the input bitstream ;
a group-size-reading unit which reads out a group size from the input bitstream ;
and a flag-restoring unit which restores individual flags with respect to the plurality of blocks from bits as large as the group size among flag bit strings (encoding mode) included in the input bitstream according to the read skip bit .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (minimum size) .
US20070297518A1
CLAIM 24
. The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (bit strings) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US20070297518A1
CLAIM 25
. An apparatus for decoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a skip-bit-reading unit which reads a skip bit from the input bitstream ;
a group-size-reading unit which reads out a group size from the input bitstream ;
and a flag-restoring unit which restores individual flags with respect to the plurality of blocks from bits as large as the group size among flag bit strings (encoding mode) included in the input bitstream according to the read skip bit .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (flag value) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (minimum size) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (bit strings) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20070297518A1
CLAIM 1
. An apparatus for encoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a flag-assembling unit which collects flag value (video blocks) s allotted for the plurality of blocks and produces a flag bit string , based on a spatial correlation of the plurality of blocks ;
a maximum-run-determining unit which determines a maximum run of the flag bit string ;
and a converting unit which converts bits included in the flag bit string into a codeword with a size no greater than the maximum run by using a predetermined codeword table .

US20070297518A1
CLAIM 24
. The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) .

US20070297518A1
CLAIM 25
. An apparatus for decoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a skip-bit-reading unit which reads a skip bit from the input bitstream ;
a group-size-reading unit which reads out a group size from the input bitstream ;
and a flag-restoring unit which restores individual flags with respect to the plurality of blocks from bits as large as the group size among flag bit strings (encoding mode) included in the input bitstream according to the read skip bit .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (minimum size) .
US20070297518A1
CLAIM 24
. The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (bit strings) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US20070297518A1
CLAIM 25
. An apparatus for decoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a skip-bit-reading unit which reads a skip bit from the input bitstream ;
a group-size-reading unit which reads out a group size from the input bitstream ;
and a flag-restoring unit which restores individual flags with respect to the plurality of blocks from bits as large as the group size among flag bit strings (encoding mode) included in the input bitstream according to the read skip bit .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size (minimum size) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (bit strings) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20070297518A1
CLAIM 24
. The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) .

US20070297518A1
CLAIM 25
. An apparatus for decoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a skip-bit-reading unit which reads a skip bit from the input bitstream ;
a group-size-reading unit which reads out a group size from the input bitstream ;
and a flag-restoring unit which restores individual flags with respect to the plurality of blocks from bits as large as the group size among flag bit strings (encoding mode) included in the input bitstream according to the read skip bit .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (minimum size) .
US20070297518A1
CLAIM 24
. The apparatus for encoding a flag of claim 19 , further comprising a group-size-determining unit which determines the predetermined size of the group as a value with the minimum size (minimum size) .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (bit strings) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US20070297518A1
CLAIM 25
. An apparatus for decoding a flag used to code a video frame composed of a plurality of blocks , the apparatus comprising : a skip-bit-reading unit which reads a skip bit from the input bitstream ;
a group-size-reading unit which reads out a group size from the input bitstream ;
and a flag-restoring unit which restores individual flags with respect to the plurality of blocks from bits as large as the group size among flag bit strings (encoding mode) included in the input bitstream according to the read skip bit .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20070160303A1

Filed: 2006-12-20     Issued: 2007-07-12

Geometrical image representation and compression

(Original Assignee) Docomo Communications Labs USA Inc     (Current Assignee) NTT Docomo Inc

Onur Guleryuz, Arthur Cunha
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element (reconstructed image) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20070160303A1
CLAIM 38
. A decompressor comprising : a decoder to decode compressed bits to produce decoded data ;
an inverse predictor to perform inverse prediction on the decoded data using information specifying geometric flow , the inverse predictor producing a first plurality of coefficients ;
and an inverse wavelet transform to apply an inverse transform to the first plurality of coefficient to create reconstructed image (third syntax element) data .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element (reconstructed image) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20070160303A1
CLAIM 38
. A decompressor comprising : a decoder to decode compressed bits to produce decoded data ;
an inverse predictor to perform inverse prediction on the decoded data using information specifying geometric flow , the inverse predictor producing a first plurality of coefficients ;
and an inverse wavelet transform to apply an inverse transform to the first plurality of coefficient to create reconstructed image (third syntax element) data .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element (reconstructed image) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20070160303A1
CLAIM 38
. A decompressor comprising : a decoder to decode compressed bits to produce decoded data ;
an inverse predictor to perform inverse prediction on the decoded data using information specifying geometric flow , the inverse predictor producing a first plurality of coefficients ;
and an inverse wavelet transform to apply an inverse transform to the first plurality of coefficient to create reconstructed image (third syntax element) data .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20070230563A1

Filed: 2006-11-10     Issued: 2007-10-04

Adaptive encoder-assisted frame rate up conversion

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Tao Tian, Fang Shi, Vijayalakshmi R. Raveendran
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method, third mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20070230563A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US20070230563A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value (decoding video data) to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

US20070230563A1
CLAIM 13
. A processor for encoding digital video data (video data) , the processor being configured to interpolate a frame rate up-conversion (FRUC) video frame , encode at least a portion of a video frame using the FRUC frame as a reference , select one of a plurality of FRUC modes for the at least a portion of the encoded video frame , and adjust one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method, third mode) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US20070230563A1
CLAIM 1
. A digital video encoding method (encoding mode) comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US20070230563A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method, third mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20070230563A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US20070230563A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value (decoding video data) to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

US20070230563A1
CLAIM 13
. A processor for encoding digital video data (video data) , the processor being configured to interpolate a frame rate up-conversion (FRUC) video frame , encode at least a portion of a video frame using the FRUC frame as a reference , select one of a plurality of FRUC modes for the at least a portion of the encoded video frame , and adjust one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method, third mode) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US20070230563A1
CLAIM 1
. A digital video encoding method (encoding mode) comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US20070230563A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method, third mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20070230563A1
CLAIM 1
. A digital video encoding (second syntax) method comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US20070230563A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .

US20070230563A1
CLAIM 13
. A processor for encoding digital video data (video data) , the processor being configured to interpolate a frame rate up-conversion (FRUC) video frame , encode at least a portion of a video frame using the FRUC frame as a reference , select one of a plurality of FRUC modes for the at least a portion of the encoded video frame , and adjust one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method, third mode) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US20070230563A1
CLAIM 1
. A digital video encoding method (encoding mode) comprising : interpolating a frame rate up-conversion (FRUC) video frame ;
encoding at least a portion of a video frame using the FRUC frame as a reference ;
selecting one of a plurality of FRUC modes for the at least a portion of the encoded video frame ;
and adjusting one or more parameters for the at least a portion of the encoded video frame to indicate the selected FRUC mode .

US20070230563A1
CLAIM 4
. The method of claim 3 , further comprising setting the CBP parameter to zero and the MV parameter to zero to indicate a first mode , setting the CBP parameter to zero and the MV parameter to a nonzero value to indicate a second mode , setting the CBP parameter to a nonzero value and the MV parameter to zero to indicate a third mode (encoding mode) , and setting the CBP parameter to a nonzero value and the MV parameter to a nonzero value to indicate a fourth mode .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20070171982A1

Filed: 2006-08-31     Issued: 2007-07-26

Methods for encoding digital video for decoding on low performance devices

(Original Assignee) WIMAXIMUM LLC     (Current Assignee) Nordic Interactive Technologies LLC ; WIMAXIMUM LLC

Jens Meggers
US9930365B2
CLAIM 1
. A method of decoding video data (frame compression, memory block) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (more blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20070171982A1
CLAIM 25
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding a video stream by : receiving a current video frame of the video stream ;
subdividing the current video frame into uniform blocks of a selected block size , and comparing the blocks to corresponding blocks of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ;
optionally repeating said subdividing or said comparing , or combinations thereof , using at least one additional block size , wherein the current video frame is capable of being separately subdivided or compared , or combinations thereof , to the preceding video frame using multiple block sizes ;
selecting from the multiple block sizes a uniform block size that is capable of producing a relatively better video frame compression (video data, video blocks) result ;
and encoding the video frame in the video stream with the block size selected by said selecting .

US20070171982A1
CLAIM 28
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding video by : dividing a video frame into blocks according to a uniform or nearly uniform block size ;
identifying changed blocks by comparing one or more blocks (current block) to a corresponding block in a decoded version of a previous frame ;
copying changed blocks into a contiguous or nearly contiguous block of memory ;
and adding a preamble block .

US20070171982A1
CLAIM 34
. Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video data, video blocks) using an LZW type compression algorithm .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (more blocks) .
US20070171982A1
CLAIM 28
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding video by : dividing a video frame into blocks according to a uniform or nearly uniform block size ;
identifying changed blocks by comparing one or more blocks (current block) to a corresponding block in a decoded version of a previous frame ;
copying changed blocks into a contiguous or nearly contiguous block of memory ;
and adding a preamble block .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (more blocks) has a size of at least 64×64 pixels .
US20070171982A1
CLAIM 28
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding video by : dividing a video frame into blocks according to a uniform or nearly uniform block size ;
identifying changed blocks by comparing one or more blocks (current block) to a corresponding block in a decoded version of a previous frame ;
copying changed blocks into a contiguous or nearly contiguous block of memory ;
and adding a preamble block .

US9930365B2
CLAIM 7
. A device for decoding video data (frame compression, memory block) , the device comprising : a memory configured to store decoded video blocks (frame compression, memory block) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (more blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20070171982A1
CLAIM 25
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding a video stream by : receiving a current video frame of the video stream ;
subdividing the current video frame into uniform blocks of a selected block size , and comparing the blocks to corresponding blocks of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ;
optionally repeating said subdividing or said comparing , or combinations thereof , using at least one additional block size , wherein the current video frame is capable of being separately subdivided or compared , or combinations thereof , to the preceding video frame using multiple block sizes ;
selecting from the multiple block sizes a uniform block size that is capable of producing a relatively better video frame compression (video data, video blocks) result ;
and encoding the video frame in the video stream with the block size selected by said selecting .

US20070171982A1
CLAIM 28
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding video by : dividing a video frame into blocks according to a uniform or nearly uniform block size ;
identifying changed blocks by comparing one or more blocks (current block) to a corresponding block in a decoded version of a previous frame ;
copying changed blocks into a contiguous or nearly contiguous block of memory ;
and adding a preamble block .

US20070171982A1
CLAIM 34
. Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video data, video blocks) using an LZW type compression algorithm .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (more blocks) .
US20070171982A1
CLAIM 28
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding video by : dividing a video frame into blocks according to a uniform or nearly uniform block size ;
identifying changed blocks by comparing one or more blocks (current block) to a corresponding block in a decoded version of a previous frame ;
copying changed blocks into a contiguous or nearly contiguous block of memory ;
and adding a preamble block .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (more blocks) has a size of at least 64×64 pixels .
US20070171982A1
CLAIM 28
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding video by : dividing a video frame into blocks according to a uniform or nearly uniform block size ;
identifying changed blocks by comparing one or more blocks (current block) to a corresponding block in a decoded version of a previous frame ;
copying changed blocks into a contiguous or nearly contiguous block of memory ;
and adding a preamble block .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (frame compression, memory block) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (more blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20070171982A1
CLAIM 25
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding a video stream by : receiving a current video frame of the video stream ;
subdividing the current video frame into uniform blocks of a selected block size , and comparing the blocks to corresponding blocks of a decoded version of a preceding video frame to determine whether any of the blocks of the current video frame are sufficiently unchanged from the preceding frame to be discarded ;
optionally repeating said subdividing or said comparing , or combinations thereof , using at least one additional block size , wherein the current video frame is capable of being separately subdivided or compared , or combinations thereof , to the preceding video frame using multiple block sizes ;
selecting from the multiple block sizes a uniform block size that is capable of producing a relatively better video frame compression (video data, video blocks) result ;
and encoding the video frame in the video stream with the block size selected by said selecting .

US20070171982A1
CLAIM 28
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding video by : dividing a video frame into blocks according to a uniform or nearly uniform block size ;
identifying changed blocks by comparing one or more blocks (current block) to a corresponding block in a decoded version of a previous frame ;
copying changed blocks into a contiguous or nearly contiguous block of memory ;
and adding a preamble block .

US20070171982A1
CLAIM 34
. Article of manufacture as claimed in claim 28 , wherein the instructions , if executed , further result in encoding video by compressing at least in part the contiguous or nearly contiguous memory block (video data, video blocks) using an LZW type compression algorithm .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (more blocks) .
US20070171982A1
CLAIM 28
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding video by : dividing a video frame into blocks according to a uniform or nearly uniform block size ;
identifying changed blocks by comparing one or more blocks (current block) to a corresponding block in a decoded version of a previous frame ;
copying changed blocks into a contiguous or nearly contiguous block of memory ;
and adding a preamble block .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (more blocks) has a size of at least 64×64 pixels .
US20070171982A1
CLAIM 28
. An article of manufacture comprising a storage medium having instructions stored thereon that , if executed , result in encoding video by : dividing a video frame into blocks according to a uniform or nearly uniform block size ;
identifying changed blocks by comparing one or more blocks (current block) to a corresponding block in a decoded version of a previous frame ;
copying changed blocks into a contiguous or nearly contiguous block of memory ;
and adding a preamble block .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20080002770A1

Filed: 2006-06-30     Issued: 2008-01-03

Methods, apparatus, and a computer program product for providing a fast inter mode decision for video encoding in resource constrained devices

(Original Assignee) Nokia Oyj     (Current Assignee) Nokia Oyj

Kemal Ugur, Jani Lainema
US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (first encoding mode) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20080002770A1
CLAIM 1
. A method of selecting a mode for encoding a macroblock using motion compensated prediction , the method comprising : extracting at least one motion vector from at least one macroblock of a video frame , the at least one macroblock comprising a first plurality of inter modes having a plurality of block sizes ;
generating at least one prediction for the macroblock based on the at least one motion vector by analyzing a reference frame ;
and comparing a distortion value to a first predetermined threshold and selecting a first encoding mode (video blocks) among first and second encoding modes without evaluating the second encoding mode based upon the comparison of the distortion value to the first predetermined threshold .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions (first partition) .
US20080002770A1
CLAIM 7
. A method according to claim 4 , further comprising : determining whether the summation of a first distortion value and a second distortion value exceeds a fourth predetermined threshold , wherein the first distortion value and the second distortion value correspond to a first partition (sub partitions) of the at least one macroblock when the at least one macroblock consists of a second block size among the plurality of block sizes ;
estimating the motion corresponding to the first partition when the summation of the first distortion value and the second distortion value exceeds the fourth predetermined threshold ;
and using the at least one motion vector extracted from the at least one macroblock , when the at least one macroblock consists of the first block size among the plurality of block sizes , as a motion vector corresponding to the first partition when the summation of the first distortion value and the second distortion value is less than the fourth predetermined threshold .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2006069297A1

Filed: 2005-12-22     Issued: 2006-06-29

Temporal estimation of a motion vector for video communications

(Original Assignee) Qualcomm Incorporated     

Yan Ye, Gokce Dane, Yen-Chi Lee, Ming-Chang Tsai, Nien-Chung Feng, Karl Ni
US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device (said object) , a broadcast receiver device (said memory) , or a set-top box .
WO2006069297A1
CLAIM 10
. The method of Claim 1 wherein a motion vector associated with an object that is contained in a first macroblock in said first frame at the same position as a second macroblock in said second frame is included in said set provided that in said second frame said object (mobile device) overlaps said second macroblock by a specified amount .

WO2006069297A1
CLAIM 28
. A device comprising : a microprocessor ;
and a memory unit coupled to said microprocessor , said memory (broadcast receiver device) unit containing instructions that when executed by said microprocessor implement a method for processing video data , said method comprising : selecting a set of motion vectors from a first plurality of motion vectors associated with a first plurality of macroblocks in a first frame of said video data and from a second plurality of motion vectors associated with a second plurality of macroblocks in a second frame of said video data ;
determining a statistical measure of said set of motion vectors , said statistical measure defining a motion vector for a macroblock of interest in said second plurality of macroblocks ;
and applying said motion vector to said macroblock of interest .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20060120450A1

Filed: 2005-12-01     Issued: 2006-06-08

Method and apparatus for multi-layered video encoding and decoding

(Original Assignee) Samsung Electronics Co Ltd     (Current Assignee) Samsung Electronics Co Ltd

Woo-jin Han, Sang-Chang Cha, Ho-Jin Ha
US9930365B2
CLAIM 1
. A method of decoding video data (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) , the method comprising : decoding a first syntax (prediction residuals) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) and an inter-prediction mode (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20060120450A1
CLAIM 1
. A method for encoding video based on a multi-layer structure , comprising : performing intra-prediction (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
performing prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
selecting one of the two prediction residuals (first syntax) that offers higher coding efficiency ;
and encoding the selected prediction residual .

US20060120450A1
CLAIM 2
. The method of claim 1 , wherein the intra-prediction is performed according to 8 directional intra-prediction mode (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) s .

US20060120450A1
CLAIM 10
. The method of claim 9 , wherein the generating of the residual image from the texture data comprises i (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) nversely quantizing the texture data and performing inverse spatial transform on the inversely quantized result .

US20060120450A1
CLAIM 20
. A video encoder (second syntax element) comprising : a unit configured to perform intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
a unit configured to perform prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
a unit configured to select one of the two prediction residuals that offers higher coding efficiency ;
and a unit configured to encode the selected prediction residual .

US9930365B2
CLAIM 7
. A device for decoding video data (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) , the device comprising : a memory configured to store decoded video blocks (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (prediction residuals) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) and an inter-prediction mode (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20060120450A1
CLAIM 1
. A method for encoding video based on a multi-layer structure , comprising : performing intra-prediction (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
performing prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
selecting one of the two prediction residuals (first syntax) that offers higher coding efficiency ;
and encoding the selected prediction residual .

US20060120450A1
CLAIM 2
. The method of claim 1 , wherein the intra-prediction is performed according to 8 directional intra-prediction mode (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) s .

US20060120450A1
CLAIM 10
. The method of claim 9 , wherein the generating of the residual image from the texture data comprises i (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) nversely quantizing the texture data and performing inverse spatial transform on the inversely quantized result .

US20060120450A1
CLAIM 20
. A video encoder (second syntax element) comprising : a unit configured to perform intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
a unit configured to perform prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
a unit configured to select one of the two prediction residuals that offers higher coding efficiency ;
and a unit configured to encode the selected prediction residual .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US20060120450A1
CLAIM 1
. A method for encoding video based on a multi-layer structure , comprising : performing intra-prediction (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
performing prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
selecting one of the two prediction residuals that offers higher coding efficiency ;
and encoding the selected prediction residual .

US20060120450A1
CLAIM 2
. The method of claim 1 , wherein the intra-prediction is performed according to 8 directional intra-prediction mode (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) s .

US20060120450A1
CLAIM 10
. The method of claim 9 , wherein the generating of the residual image from the texture data comprises i (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) nversely quantizing the texture data and performing inverse spatial transform on the inversely quantized result .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (prediction residuals) element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) and an inter-prediction mode (performing intra-prediction, intra-prediction mode, prediction modes, comprises i) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20060120450A1
CLAIM 1
. A method for encoding video based on a multi-layer structure , comprising : performing intra-prediction (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
performing prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
selecting one of the two prediction residuals (first syntax) that offers higher coding efficiency ;
and encoding the selected prediction residual .

US20060120450A1
CLAIM 2
. The method of claim 1 , wherein the intra-prediction is performed according to 8 directional intra-prediction mode (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) s .

US20060120450A1
CLAIM 10
. The method of claim 9 , wherein the generating of the residual image from the texture data comprises i (intra-prediction mode, video blocks, decoding video data, inter-prediction mode) nversely quantizing the texture data and performing inverse spatial transform on the inversely quantized result .

US20060120450A1
CLAIM 20
. A video encoder (second syntax element) comprising : a unit configured to perform intra-prediction on a current intra-block using images of neighboring intra-blocks of the current intra-block to obtain a prediction residual ;
a unit configured to perform prediction on the current intra-block using an image of a lower layer region corresponding to the current intra-block to obtain a prediction residual ;
a unit configured to select one of the two prediction residuals that offers higher coding efficiency ;
and a unit configured to encode the selected prediction residual .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20080025391A1

Filed: 2005-11-09     Issued: 2008-01-31

Transcoding Method And Device

(Original Assignee) Siemens AG     (Current Assignee) Siemens AG

Peter Amon, Jurgen Pandel
US9930365B2
CLAIM 1
. A method of decoding video data (second video) , the method comprising : decoding a first syntax (syntax elements, network element) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (video encoding) element (syntax elements, network element) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax (syntax elements, network element) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20080025391A1
CLAIM 11
. A method for transcoding an encoded digital video input signal having a base signal and at least one differential extension signal , comprising : determining a first piece of page data and a first piece of transform coefficient data from the base signal ;
determining at least one second piece of page data and at least one second piece of transform coefficient data respectively from at least one differential extension signal ;
generating a third piece of page data from the first piece of page data and the at least one second piece of page data ;
generating a third piece of transform coefficient data by weighted addition from the first and at least one second piece of transform coefficient data , the first , second and third pieces of transform coefficient data being based on an identical transform coefficient encoding method (encoding mode) ;
and creating an encoded digital video output signal from the third piece of page data and the third piece of transform coefficient data .

US20080025391A1
CLAIM 13
. The method as claimed in claim 12 , wherein the third piece of page data and the third piece of transform coefficient data of the encoded digital video output signal are encoded in accordance with a first video encoding (second syntax) standard and the first and second piece of page data and the first and second piece of transform coefficient data are based on a second video encoding standard .

US20080025391A1
CLAIM 15
. The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, third syntax, first syntax element, second syntax element, third syntax element) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data .

US20080025391A1
CLAIM 28
. A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, third syntax, first syntax element, second syntax element, third syntax element) .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US20080025391A1
CLAIM 11
. A method for transcoding an encoded digital video input signal having a base signal and at least one differential extension signal , comprising : determining a first piece of page data and a first piece of transform coefficient data from the base signal ;
determining at least one second piece of page data and at least one second piece of transform coefficient data respectively from at least one differential extension signal ;
generating a third piece of page data from the first piece of page data and the at least one second piece of page data ;
generating a third piece of transform coefficient data by weighted addition from the first and at least one second piece of transform coefficient data , the first , second and third pieces of transform coefficient data being based on an identical transform coefficient encoding method (encoding mode) ;
and creating an encoded digital video output signal from the third piece of page data and the third piece of transform coefficient data .

US9930365B2
CLAIM 7
. A device for decoding video data (second video) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (syntax elements, network element) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element (syntax elements, network element) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (syntax elements, network element) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20080025391A1
CLAIM 11
. A method for transcoding an encoded digital video input signal having a base signal and at least one differential extension signal , comprising : determining a first piece of page data and a first piece of transform coefficient data from the base signal ;
determining at least one second piece of page data and at least one second piece of transform coefficient data respectively from at least one differential extension signal ;
generating a third piece of page data from the first piece of page data and the at least one second piece of page data ;
generating a third piece of transform coefficient data by weighted addition from the first and at least one second piece of transform coefficient data , the first , second and third pieces of transform coefficient data being based on an identical transform coefficient encoding method (encoding mode) ;
and creating an encoded digital video output signal from the third piece of page data and the third piece of transform coefficient data .

US20080025391A1
CLAIM 13
. The method as claimed in claim 12 , wherein the third piece of page data and the third piece of transform coefficient data of the encoded digital video output signal are encoded in accordance with a first video encoding (second syntax) standard and the first and second piece of page data and the first and second piece of transform coefficient data are based on a second video encoding standard .

US20080025391A1
CLAIM 15
. The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, third syntax, first syntax element, second syntax element, third syntax element) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data .

US20080025391A1
CLAIM 28
. A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, third syntax, first syntax element, second syntax element, third syntax element) .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US20080025391A1
CLAIM 11
. A method for transcoding an encoded digital video input signal having a base signal and at least one differential extension signal , comprising : determining a first piece of page data and a first piece of transform coefficient data from the base signal ;
determining at least one second piece of page data and at least one second piece of transform coefficient data respectively from at least one differential extension signal ;
generating a third piece of page data from the first piece of page data and the at least one second piece of page data ;
generating a third piece of transform coefficient data by weighted addition from the first and at least one second piece of transform coefficient data , the first , second and third pieces of transform coefficient data being based on an identical transform coefficient encoding method (encoding mode) ;
and creating an encoded digital video output signal from the third piece of page data and the third piece of transform coefficient data .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device (one second) , a broadcast receiver device , or a set-top box .
US20080025391A1
CLAIM 11
. A method for transcoding an encoded digital video input signal having a base signal and at least one differential extension signal , comprising : determining a first piece of page data and a first piece of transform coefficient data from the base signal ;
determining at least one second (mobile device) piece of page data and at least one second piece of transform coefficient data respectively from at least one differential extension signal ;
generating a third piece of page data from the first piece of page data and the at least one second piece of page data ;
generating a third piece of transform coefficient data by weighted addition from the first and at least one second piece of transform coefficient data , the first , second and third pieces of transform coefficient data being based on an identical transform coefficient encoding method ;
and creating an encoded digital video output signal from the third piece of page data and the third piece of transform coefficient data .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (syntax elements, network element) element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element (syntax elements, network element) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (syntax elements, network element) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20080025391A1
CLAIM 11
. A method for transcoding an encoded digital video input signal having a base signal and at least one differential extension signal , comprising : determining a first piece of page data and a first piece of transform coefficient data from the base signal ;
determining at least one second piece of page data and at least one second piece of transform coefficient data respectively from at least one differential extension signal ;
generating a third piece of page data from the first piece of page data and the at least one second piece of page data ;
generating a third piece of transform coefficient data by weighted addition from the first and at least one second piece of transform coefficient data , the first , second and third pieces of transform coefficient data being based on an identical transform coefficient encoding method (encoding mode) ;
and creating an encoded digital video output signal from the third piece of page data and the third piece of transform coefficient data .

US20080025391A1
CLAIM 13
. The method as claimed in claim 12 , wherein the third piece of page data and the third piece of transform coefficient data of the encoded digital video output signal are encoded in accordance with a first video encoding (second syntax) standard and the first and second piece of page data and the first and second piece of transform coefficient data are based on a second video encoding standard .

US20080025391A1
CLAIM 15
. The method as claimed in claim 13 , wherein a first part of syntax elements (first syntax, third syntax, first syntax element, second syntax element, third syntax element) of the respective third piece of page data is taken from the respective second piece of page data , and a second part of syntax elements of the respective third piece of page data is generated from a combination of the first and the respective second pieces of page data .

US20080025391A1
CLAIM 28
. A transcoding device as claimed in claim 27 , wherein said transcoding device is integrated into one of a terminal and a network element (first syntax, third syntax, first syntax element, second syntax element, third syntax element) .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US20080025391A1
CLAIM 11
. A method for transcoding an encoded digital video input signal having a base signal and at least one differential extension signal , comprising : determining a first piece of page data and a first piece of transform coefficient data from the base signal ;
determining at least one second piece of page data and at least one second piece of transform coefficient data respectively from at least one differential extension signal ;
generating a third piece of page data from the first piece of page data and the at least one second piece of page data ;
generating a third piece of transform coefficient data by weighted addition from the first and at least one second piece of transform coefficient data , the first , second and third pieces of transform coefficient data being based on an identical transform coefficient encoding method (encoding mode) ;
and creating an encoded digital video output signal from the third piece of page data and the third piece of transform coefficient data .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20070030903A1

Filed: 2005-08-05     Issued: 2007-02-08

Method and apparatus for H.264 to MPEG-2 video transcoding

(Original Assignee) LSI Corp     (Current Assignee) Avago Technologies International Sales Pte Ltd

Guy Cote, Lowell Winger
US9930365B2
CLAIM 1
. A method of decoding video data (discrete cosine) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (least number) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20070030903A1
CLAIM 6
. The method according to claim 1 , further comprising the step of : performing an MPEG-2 discrete cosine (decoding video data) transform scan decision to generate a discrete cosine transform indicator , wherein use of (i) said MPEG-2 field mode coding and (ii) said MPEG-2 frame mode coding is further determined by said discrete cosine transform indicator .

US20070030903A1
CLAIM 12
. The method according to claim 11 , further comprising the step of : adjusting an MPEG-2 group of pictures to utilize a least number (encoding mode) of lowest priority frames among said MPEG-2 reference frames .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (least number) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US20070030903A1
CLAIM 12
. The method according to claim 11 , further comprising the step of : adjusting an MPEG-2 group of pictures to utilize a least number (encoding mode) of lowest priority frames among said MPEG-2 reference frames .

US9930365B2
CLAIM 7
. A device for decoding video data (discrete cosine) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (least number) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20070030903A1
CLAIM 6
. The method according to claim 1 , further comprising the step of : performing an MPEG-2 discrete cosine (decoding video data) transform scan decision to generate a discrete cosine transform indicator , wherein use of (i) said MPEG-2 field mode coding and (ii) said MPEG-2 frame mode coding is further determined by said discrete cosine transform indicator .

US20070030903A1
CLAIM 12
. The method according to claim 11 , further comprising the step of : adjusting an MPEG-2 group of pictures to utilize a least number (encoding mode) of lowest priority frames among said MPEG-2 reference frames .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (least number) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US20070030903A1
CLAIM 12
. The method according to claim 11 , further comprising the step of : adjusting an MPEG-2 group of pictures to utilize a least number (encoding mode) of lowest priority frames among said MPEG-2 reference frames .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (least number) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20070030903A1
CLAIM 12
. The method according to claim 11 , further comprising the step of : adjusting an MPEG-2 group of pictures to utilize a least number (encoding mode) of lowest priority frames among said MPEG-2 reference frames .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (least number) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US20070030903A1
CLAIM 12
. The method according to claim 11 , further comprising the step of : adjusting an MPEG-2 group of pictures to utilize a least number (encoding mode) of lowest priority frames among said MPEG-2 reference frames .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20060153297A1

Filed: 2005-07-07     Issued: 2006-07-13

Mixed inter/intra video coding of macroblock partitions

(Original Assignee) Thomson Licensing SAS     (Current Assignee) InterDigital Madison Patent Holdings Inc

Jill Boyce
US9930365B2
CLAIM 1
. A method of decoding video data (video data) (comprises i, zero value) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method, encoding mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20060153297A1
CLAIM 2
. A method as defined in claim 1 wherein said macroblock comprises video data (video data) in compliance with the Joint Video Team (“JVT”) standard .

US20060153297A1
CLAIM 4
. A method as defined in claim 1 wherein said intra-coding comprises non-predictive intra-coding performed within a weighted prediction encoding mode (encoding mode) by using a weighting factor of zero with a weighted prediction tool from at least one of the Main and Extended profiles of the JVT standard .

US20060153297A1
CLAIM 6
. A method as defined in claim 1 wherein the inter-coded at least one of said plurality of partitions has a reference picture index associated with a non-zero value (decoding video data) d weighting factor .

US20060153297A1
CLAIM 7
. A method as defined in claim 6 , further comprising deciding between inter-coding and non-predictive intra-coding of a partition in response to a measure of cost for each coding method (encoding mode) .

US20060153297A1
CLAIM 19
. A video encoder as defined in claim 16 wherein said intra-coding means comprises i (decoding video data) ndexing means for providing a reference picture index that is associated with a weighting factor of zero .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method, encoding mode) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US20060153297A1
CLAIM 4
. A method as defined in claim 1 wherein said intra-coding comprises non-predictive intra-coding performed within a weighted prediction encoding mode (encoding mode) by using a weighting factor of zero with a weighted prediction tool from at least one of the Main and Extended profiles of the JVT standard .

US20060153297A1
CLAIM 7
. A method as defined in claim 6 , further comprising deciding between inter-coding and non-predictive intra-coding of a partition in response to a measure of cost for each coding method (encoding mode) .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) (comprises i, zero value) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method, encoding mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20060153297A1
CLAIM 2
. A method as defined in claim 1 wherein said macroblock comprises video data (video data) in compliance with the Joint Video Team (“JVT”) standard .

US20060153297A1
CLAIM 4
. A method as defined in claim 1 wherein said intra-coding comprises non-predictive intra-coding performed within a weighted prediction encoding mode (encoding mode) by using a weighting factor of zero with a weighted prediction tool from at least one of the Main and Extended profiles of the JVT standard .

US20060153297A1
CLAIM 6
. A method as defined in claim 1 wherein the inter-coded at least one of said plurality of partitions has a reference picture index associated with a non-zero value (decoding video data) d weighting factor .

US20060153297A1
CLAIM 7
. A method as defined in claim 6 , further comprising deciding between inter-coding and non-predictive intra-coding of a partition in response to a measure of cost for each coding method (encoding mode) .

US20060153297A1
CLAIM 19
. A video encoder as defined in claim 16 wherein said intra-coding means comprises i (decoding video data) ndexing means for providing a reference picture index that is associated with a weighting factor of zero .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method, encoding mode) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US20060153297A1
CLAIM 4
. A method as defined in claim 1 wherein said intra-coding comprises non-predictive intra-coding performed within a weighted prediction encoding mode (encoding mode) by using a weighting factor of zero with a weighted prediction tool from at least one of the Main and Extended profiles of the JVT standard .

US20060153297A1
CLAIM 7
. A method as defined in claim 6 , further comprising deciding between inter-coding and non-predictive intra-coding of a partition in response to a measure of cost for each coding method (encoding mode) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method, encoding mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20060153297A1
CLAIM 2
. A method as defined in claim 1 wherein said macroblock comprises video data (video data) in compliance with the Joint Video Team (“JVT”) standard .

US20060153297A1
CLAIM 4
. A method as defined in claim 1 wherein said intra-coding comprises non-predictive intra-coding performed within a weighted prediction encoding mode (encoding mode) by using a weighting factor of zero with a weighted prediction tool from at least one of the Main and Extended profiles of the JVT standard .

US20060153297A1
CLAIM 7
. A method as defined in claim 6 , further comprising deciding between inter-coding and non-predictive intra-coding of a partition in response to a measure of cost for each coding method (encoding mode) .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method, encoding mode) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US20060153297A1
CLAIM 4
. A method as defined in claim 1 wherein said intra-coding comprises non-predictive intra-coding performed within a weighted prediction encoding mode (encoding mode) by using a weighting factor of zero with a weighted prediction tool from at least one of the Main and Extended profiles of the JVT standard .

US20060153297A1
CLAIM 7
. A method as defined in claim 6 , further comprising deciding between inter-coding and non-predictive intra-coding of a partition in response to a measure of cost for each coding method (encoding mode) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20060126740A1

Filed: 2005-05-25     Issued: 2006-06-15

Shared pipeline architecture for motion vector prediction and residual decoding

(Original Assignee) Micronas USA Inc     (Current Assignee) TDK Micronas GmbH

Teng Lin, Weimin Zeng
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax (data state) element (motion vector differences) associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element (motion vector differences) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size (backward reference) equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element (motion vector differences) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20060126740A1
CLAIM 7
. The system of claim 5 wherein N=24 , and the N-bits designate current macro block properties including at least one of : intra prediction , inter prediction , skip mode , raw data mode , macro block ID , slice ID , direct mode , macro block split mode , sub block split mode which can be down to 4×4 for luma and 2×2 for chroma and all corresponding intra prediction flag and motion vector differences (first syntax element, second syntax element, third syntax element) on X and Y direction , picture reference index for forward and backward predictions , CABAC bit map and level , and CALVC run level information .

US20060126740A1
CLAIM 9
. The system of claim 1 wherein the first stage includes a PCM raw data state (first syntax) machine for carrying put a PCM raw data mode .

US20060126740A1
CLAIM 17
. The system of claim 1 further comprising : a fractional interpolation block and in-line loop filter (FIB/ILF) memory interfaced to the fifth stage via a 128-bit bus , wherein for every macro block , there are 18 data beats of 128 bits that contain macro block properties and expanded motion vector information of luma and chroma for forward and backward reference (starting size) ;
a DSP macro block header/data memory interfaced to the fifth stage via the 128-bit bus , wherein for every 16×16 macro block , the DSP macro block header/data memory has 1 data beat of 128 bits for a header that includes information for macro block properties and 48 data beats of 128 bits for coefficients ;
and a dual channel write memory interfaced to the fifth stage via the 128-bit bus , wherein write channel 1 of the memory is for writing even row macro blocks of frame pictures and top fields of field pictures , while write channel 2 of the memory is for writing odd row macro blocks of frame pictures and bottom fields of field pictures .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (d log) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (data state) element (motion vector differences) associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (motion vector differences) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (backward reference) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element (motion vector differences) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20060126740A1
CLAIM 3
. The system of claim 1 wherein B-type picture decoding has a dual channel read write structure , and log (video blocks) ic index ID searching is performed on a block by block basis .

US20060126740A1
CLAIM 7
. The system of claim 5 wherein N=24 , and the N-bits designate current macro block properties including at least one of : intra prediction , inter prediction , skip mode , raw data mode , macro block ID , slice ID , direct mode , macro block split mode , sub block split mode which can be down to 4×4 for luma and 2×2 for chroma and all corresponding intra prediction flag and motion vector differences (first syntax element, second syntax element, third syntax element) on X and Y direction , picture reference index for forward and backward predictions , CABAC bit map and level , and CALVC run level information .

US20060126740A1
CLAIM 9
. The system of claim 1 wherein the first stage includes a PCM raw data state (first syntax) machine for carrying put a PCM raw data mode .

US20060126740A1
CLAIM 17
. The system of claim 1 further comprising : a fractional interpolation block and in-line loop filter (FIB/ILF) memory interfaced to the fifth stage via a 128-bit bus , wherein for every macro block , there are 18 data beats of 128 bits that contain macro block properties and expanded motion vector information of luma and chroma for forward and backward reference (starting size) ;
a DSP macro block header/data memory interfaced to the fifth stage via the 128-bit bus , wherein for every 16×16 macro block , the DSP macro block header/data memory has 1 data beat of 128 bits for a header that includes information for macro block properties and 48 data beats of 128 bits for coefficients ;
and a dual channel write memory interfaced to the fifth stage via the 128-bit bus , wherein write channel 1 of the memory is for writing even row macro blocks of frame pictures and top fields of field pictures , while write channel 2 of the memory is for writing odd row macro blocks of frame pictures and bottom fields of field pictures .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (data state) element (motion vector differences) associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (motion vector differences) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (backward reference) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element (motion vector differences) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20060126740A1
CLAIM 7
. The system of claim 5 wherein N=24 , and the N-bits designate current macro block properties including at least one of : intra prediction , inter prediction , skip mode , raw data mode , macro block ID , slice ID , direct mode , macro block split mode , sub block split mode which can be down to 4×4 for luma and 2×2 for chroma and all corresponding intra prediction flag and motion vector differences (first syntax element, second syntax element, third syntax element) on X and Y direction , picture reference index for forward and backward predictions , CABAC bit map and level , and CALVC run level information .

US20060126740A1
CLAIM 9
. The system of claim 1 wherein the first stage includes a PCM raw data state (first syntax) machine for carrying put a PCM raw data mode .

US20060126740A1
CLAIM 17
. The system of claim 1 further comprising : a fractional interpolation block and in-line loop filter (FIB/ILF) memory interfaced to the fifth stage via a 128-bit bus , wherein for every macro block , there are 18 data beats of 128 bits that contain macro block properties and expanded motion vector information of luma and chroma for forward and backward reference (starting size) ;
a DSP macro block header/data memory interfaced to the fifth stage via the 128-bit bus , wherein for every 16×16 macro block , the DSP macro block header/data memory has 1 data beat of 128 bits for a header that includes information for macro block properties and 48 data beats of 128 bits for coefficients ;
and a dual channel write memory interfaced to the fifth stage via the 128-bit bus , wherein write channel 1 of the memory is for writing even row macro blocks of frame pictures and top fields of field pictures , while write channel 2 of the memory is for writing odd row macro blocks of frame pictures and bottom fields of field pictures .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JP2006304107A

Filed: 2005-04-22     Issued: 2006-11-02

符号化装置、及び、この符号化装置に適用されるプログラム

(Original Assignee) Ntt Electornics Corp; Nttエレクトロニクス株式会社     

Katsuyuki Ozaki, Mihoko Usui, 克之 尾崎, 美穂子 薄井
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises (前記予測モード) determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JP2006304107A
CLAIM 2
前記算出部は、前記予測誤差の絶対値の総和である予測誤差総和を予測モード毎に算出し、 前記予測モード (partitioning comprises) 選択部は、前記第1予測モード及び前記第2予測モードに加えて、前記算出部によって算出された前記予測誤差総和が最小となる予測モードである第3予測モードにRate−Distortion最適化法を適用して、前記符号化対象ブロックを符号化する際に適用する予測モードを選択することを特徴とする請求項1に記載の符号化装置。

US9930365B2
CLAIM 6
. The method of claim 1 , further comprising receiving a quantization parameter modification value (前記予測ブロック) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
JP2006304107A
CLAIM 1
符号化の対象となる符号化対象画像に含まれる画素を用いて、前記符号化対象画像に含まれるブロックである符号化対象ブロックを符号化する符号化装置であって、 複数種類の予測モードに従って、前記符号化対象画像に含まれる画素に基づいて予測ブロックを生成する予測ブロック生成部と、 前記予測ブロック (quantization parameter modification value) 生成部によって生成された前記予測ブロックと前記符号化対象ブロックとの差分である予測誤差を直交変換し、直交変換して得られた値の絶対値の総和である直交変換値総和を予測モード毎に算出する算出部と、 前記符号化対象画像内で前記符号化対象ブロックに隣接するブロックである隣接ブロックを符号化した際に適用された予測モードによって特定される第1予測モードと、前記算出部によって算出された前記直交変換値総和が最小となる予測モードである第2予測モードとにRate−Distortion最適化法を適用して、前記符号化対象ブロックを符号化する際に適用する予測モードを選択する予測モード選択部とを備えることを特徴とする符号化装置。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises (前記予測モード) determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JP2006304107A
CLAIM 2
前記算出部は、前記予測誤差の絶対値の総和である予測誤差総和を予測モード毎に算出し、 前記予測モード (partitioning comprises) 選択部は、前記第1予測モード及び前記第2予測モードに加えて、前記算出部によって算出された前記予測誤差総和が最小となる予測モードである第3予測モードにRate−Distortion最適化法を適用して、前記符号化対象ブロックを符号化する際に適用する予測モードを選択することを特徴とする請求項1に記載の符号化装置。

US9930365B2
CLAIM 12
. The device of claim 7 , wherein the processor is further configured to receive a quantization parameter modification value (前記予測ブロック) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
JP2006304107A
CLAIM 1
符号化の対象となる符号化対象画像に含まれる画素を用いて、前記符号化対象画像に含まれるブロックである符号化対象ブロックを符号化する符号化装置であって、 複数種類の予測モードに従って、前記符号化対象画像に含まれる画素に基づいて予測ブロックを生成する予測ブロック生成部と、 前記予測ブロック (quantization parameter modification value) 生成部によって生成された前記予測ブロックと前記符号化対象ブロックとの差分である予測誤差を直交変換し、直交変換して得られた値の絶対値の総和である直交変換値総和を予測モード毎に算出する算出部と、 前記符号化対象画像内で前記符号化対象ブロックに隣接するブロックである隣接ブロックを符号化した際に適用された予測モードによって特定される第1予測モードと、前記算出部によって算出された前記直交変換値総和が最小となる予測モードである第2予測モードとにRate−Distortion最適化法を適用して、前記符号化対象ブロックを符号化する際に適用する予測モードを選択する予測モード選択部とを備えることを特徴とする符号化装置。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises (前記予測モード) determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JP2006304107A
CLAIM 2
前記算出部は、前記予測誤差の絶対値の総和である予測誤差総和を予測モード毎に算出し、 前記予測モード (partitioning comprises) 選択部は、前記第1予測モード及び前記第2予測モードに加えて、前記算出部によって算出された前記予測誤差総和が最小となる予測モードである第3予測モードにRate−Distortion最適化法を適用して、前記符号化対象ブロックを符号化する際に適用する予測モードを選択することを特徴とする請求項1に記載の符号化装置。

US9930365B2
CLAIM 20
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to receive a quantization parameter modification value (前記予測ブロック) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
JP2006304107A
CLAIM 1
符号化の対象となる符号化対象画像に含まれる画素を用いて、前記符号化対象画像に含まれるブロックである符号化対象ブロックを符号化する符号化装置であって、 複数種類の予測モードに従って、前記符号化対象画像に含まれる画素に基づいて予測ブロックを生成する予測ブロック生成部と、 前記予測ブロック (quantization parameter modification value) 生成部によって生成された前記予測ブロックと前記符号化対象ブロックとの差分である予測誤差を直交変換し、直交変換して得られた値の絶対値の総和である直交変換値総和を予測モード毎に算出する算出部と、 前記符号化対象画像内で前記符号化対象ブロックに隣接するブロックである隣接ブロックを符号化した際に適用された予測モードによって特定される第1予測モードと、前記算出部によって算出された前記直交変換値総和が最小となる予測モードである第2予測モードとにRate−Distortion最適化法を適用して、前記符号化対象ブロックを符号化する際に適用する予測モードを選択する予測モード選択部とを備えることを特徴とする符号化装置。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20060203905A1

Filed: 2005-03-14     Issued: 2006-09-14

Video coding system

(Original Assignee) National Kaohsiung First University of Science and Technology     (Current Assignee) National Kaohsiung First University of Science and Technology

Shih-Chang Hsia
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (horizontal resolution) of blocks of the sequence of pictures ;

decoding a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (processing speed) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (reference blocks) of a plurality of blocks of the sequence of pictures has a starting size (memory cell) equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element (horizontal resolution) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20060203905A1
CLAIM 2
. One macro block can be split into four sub-blocks . The address increases by one when one pixel inputs . And the address increases by H values when the block position changes to the next column . Successively , the address of bottom blocks (3 rd and 4 th SB) is equal to the sum of the top block address and 8 H . The encoder performs motion estimation by the function which searches the best matching between the current and the reference blocks (current block) of frame memory . The searching memory address can be determined by MB (My , Mx) Addr =MB (p , q) Addr +My×H+Mx , where Mx and My are the searching vector in the horizontal and vertical directions respectively .

US20060203905A1
CLAIM 5
. A video coding system , comprising A memory structure having the pseudo address decoder and internal storage cells capable of separately being implemented ;
the sizes of pseudo address decoder and internal storage cells fitting in with the actual frame size ;
the used lines decoded only for the internal cell access . The frame size is H×V , and the n and m addressing lines are individually decoded to H lines and V lines rather than 2 n+m decoding lines . The memory address has (n+m) pins , but only H and V decoding lines are implemented to access internal cells . The practical memory cell (starting size) s are implemented to meet the real frame size . (2 n −H)+(2 m −V) address decoding circuits and 2 n+m −H×V internal cells can be saved while inputting 2 m −V and 2 n −H pseudo address lines . 2 n+m −H×V space is a pseudo plane that doesn' ;
t require to be implemented . The pseudo address decoding is suitable for non-2 n ×2 m video format in claim 3 . Change a non-2 n ×2 m video format to 2 n ×2 m video format with pseudo address decoding .

US20060203905A1
CLAIM 6
. A video coding system with the apparatus for interface to apply the new memory addressing , comprising The memory addressing control , address decoder and internal storage cell capable of being merged into one body as a memory core to implement full video encoder (second syntax element) ;
the system includes six address generators (AG 1 ˜AG 6) ;
the internal storage cell being consisted with input memory M 1 and frame memory M 2 .

US20060203905A1
CLAIM 8
. The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, third syntax element) . Two banks are executed with interlaced operations for real-time data access .

US20060203905A1
CLAIM 20
. The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (processing speed) and greater than the minimum size (horizontal resolution) .
US20060203905A1
CLAIM 8
. The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, third syntax element) . Two banks are executed with interlaced operations for real-time data access .

US20060203905A1
CLAIM 20
. The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (reference blocks) .
US20060203905A1
CLAIM 2
. One macro block can be split into four sub-blocks . The address increases by one when one pixel inputs . And the address increases by H values when the block position changes to the next column . Successively , the address of bottom blocks (3 rd and 4 th SB) is equal to the sum of the top block address and 8 H . The encoder performs motion estimation by the function which searches the best matching between the current and the reference blocks (current block) of frame memory . The searching memory address can be determined by MB (My , Mx) Addr =MB (p , q) Addr +My×H+Mx , where Mx and My are the searching vector in the horizontal and vertical directions respectively .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (reference blocks) has a size of at least 64×64 pixels .
US20060203905A1
CLAIM 2
. One macro block can be split into four sub-blocks . The address increases by one when one pixel inputs . And the address increases by H values when the block position changes to the next column . Successively , the address of bottom blocks (3 rd and 4 th SB) is equal to the sum of the top block address and 8 H . The encoder performs motion estimation by the function which searches the best matching between the current and the reference blocks (current block) of frame memory . The searching memory address can be determined by MB (My , Mx) Addr =MB (p , q) Addr +My×H+Mx , where Mx and My are the searching vector in the horizontal and vertical directions respectively .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (horizontal resolution) of blocks of the sequence of pictures ;

decode a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (processing speed) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (reference blocks) of a plurality of blocks of the sequence of pictures has a starting size (memory cell) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element (horizontal resolution) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20060203905A1
CLAIM 2
. One macro block can be split into four sub-blocks . The address increases by one when one pixel inputs . And the address increases by H values when the block position changes to the next column . Successively , the address of bottom blocks (3 rd and 4 th SB) is equal to the sum of the top block address and 8 H . The encoder performs motion estimation by the function which searches the best matching between the current and the reference blocks (current block) of frame memory . The searching memory address can be determined by MB (My , Mx) Addr =MB (p , q) Addr +My×H+Mx , where Mx and My are the searching vector in the horizontal and vertical directions respectively .

US20060203905A1
CLAIM 5
. A video coding system , comprising A memory structure having the pseudo address decoder and internal storage cells capable of separately being implemented ;
the sizes of pseudo address decoder and internal storage cells fitting in with the actual frame size ;
the used lines decoded only for the internal cell access . The frame size is H×V , and the n and m addressing lines are individually decoded to H lines and V lines rather than 2 n+m decoding lines . The memory address has (n+m) pins , but only H and V decoding lines are implemented to access internal cells . The practical memory cell (starting size) s are implemented to meet the real frame size . (2 n −H)+(2 m −V) address decoding circuits and 2 n+m −H×V internal cells can be saved while inputting 2 m −V and 2 n −H pseudo address lines . 2 n+m −H×V space is a pseudo plane that doesn' ;
t require to be implemented . The pseudo address decoding is suitable for non-2 n ×2 m video format in claim 3 . Change a non-2 n ×2 m video format to 2 n ×2 m video format with pseudo address decoding .

US20060203905A1
CLAIM 6
. A video coding system with the apparatus for interface to apply the new memory addressing , comprising The memory addressing control , address decoder and internal storage cell capable of being merged into one body as a memory core to implement full video encoder (second syntax element) ;
the system includes six address generators (AG 1 ˜AG 6) ;
the internal storage cell being consisted with input memory M 1 and frame memory M 2 .

US20060203905A1
CLAIM 8
. The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, third syntax element) . Two banks are executed with interlaced operations for real-time data access .

US20060203905A1
CLAIM 20
. The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (processing speed) and greater than the minimum size (horizontal resolution) .
US20060203905A1
CLAIM 8
. The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, third syntax element) . Two banks are executed with interlaced operations for real-time data access .

US20060203905A1
CLAIM 20
. The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (reference blocks) .
US20060203905A1
CLAIM 2
. One macro block can be split into four sub-blocks . The address increases by one when one pixel inputs . And the address increases by H values when the block position changes to the next column . Successively , the address of bottom blocks (3 rd and 4 th SB) is equal to the sum of the top block address and 8 H . The encoder performs motion estimation by the function which searches the best matching between the current and the reference blocks (current block) of frame memory . The searching memory address can be determined by MB (My , Mx) Addr =MB (p , q) Addr +My×H+Mx , where Mx and My are the searching vector in the horizontal and vertical directions respectively .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (reference blocks) has a size of at least 64×64 pixels .
US20060203905A1
CLAIM 2
. One macro block can be split into four sub-blocks . The address increases by one when one pixel inputs . And the address increases by H values when the block position changes to the next column . Successively , the address of bottom blocks (3 rd and 4 th SB) is equal to the sum of the top block address and 8 H . The encoder performs motion estimation by the function which searches the best matching between the current and the reference blocks (current block) of frame memory . The searching memory address can be determined by MB (My , Mx) Addr =MB (p , q) Addr +My×H+Mx , where Mx and My are the searching vector in the horizontal and vertical directions respectively .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions (frame memory) that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size (horizontal resolution) of blocks of the sequence of pictures ;

decode a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (processing speed) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (reference blocks) of a plurality of blocks of the sequence of pictures has a starting size (memory cell) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element (horizontal resolution) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20060203905A1
CLAIM 1
. A video coding system , comprising a memory addressing method partitioning the frame data into uniform blocks , and the frame memory (thereon instructions) accessed with block-by-block ;
a hierarchical layer processing employed with macro-blocks and sub-blocks and the starting and ending code of each layer capable of directly addressing the range of memory . some macro-blocks are possibly skipped during inter frame coding , the macro block address increases by 16×S , where S denotes the number of macro-block skipped .

US20060203905A1
CLAIM 2
. One macro block can be split into four sub-blocks . The address increases by one when one pixel inputs . And the address increases by H values when the block position changes to the next column . Successively , the address of bottom blocks (3 rd and 4 th SB) is equal to the sum of the top block address and 8 H . The encoder performs motion estimation by the function which searches the best matching between the current and the reference blocks (current block) of frame memory . The searching memory address can be determined by MB (My , Mx) Addr =MB (p , q) Addr +My×H+Mx , where Mx and My are the searching vector in the horizontal and vertical directions respectively .

US20060203905A1
CLAIM 5
. A video coding system , comprising A memory structure having the pseudo address decoder and internal storage cells capable of separately being implemented ;
the sizes of pseudo address decoder and internal storage cells fitting in with the actual frame size ;
the used lines decoded only for the internal cell access . The frame size is H×V , and the n and m addressing lines are individually decoded to H lines and V lines rather than 2 n+m decoding lines . The memory address has (n+m) pins , but only H and V decoding lines are implemented to access internal cells . The practical memory cell (starting size) s are implemented to meet the real frame size . (2 n −H)+(2 m −V) address decoding circuits and 2 n+m −H×V internal cells can be saved while inputting 2 m −V and 2 n −H pseudo address lines . 2 n+m −H×V space is a pseudo plane that doesn' ;
t require to be implemented . The pseudo address decoding is suitable for non-2 n ×2 m video format in claim 3 . Change a non-2 n ×2 m video format to 2 n ×2 m video format with pseudo address decoding .

US20060203905A1
CLAIM 6
. A video coding system with the apparatus for interface to apply the new memory addressing , comprising The memory addressing control , address decoder and internal storage cell capable of being merged into one body as a memory core to implement full video encoder (second syntax element) ;
the system includes six address generators (AG 1 ˜AG 6) ;
the internal storage cell being consisted with input memory M 1 and frame memory M 2 .

US20060203905A1
CLAIM 8
. The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, third syntax element) . Two banks are executed with interlaced operations for real-time data access .

US20060203905A1
CLAIM 20
. The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (processing speed) and greater than the minimum size (horizontal resolution) .
US20060203905A1
CLAIM 8
. The video coding system as claimed in claim 6 , further comprising two kinds of memory used , one is the input memory M 1 , and the other is the frame memory M 2 . The input memory as buffer function is required for block-based processing . The ports of M 1 memory contain one input and two outputs . The output-i is for DCT transformation and the output- 2 is for motion estimation . The “write” address AG 1 is used for storing the pixel input , and “read” address AG 2 for reading the current processing pixel . For the real-time requirement , M 1 memory is split into two banks , one for input and the other for output . As the size of macro-block is 16×16 , each bank needs 16×H words , where H is the horizontal resolution (minimum size, third syntax element) . Two banks are executed with interlaced operations for real-time data access .

US20060203905A1
CLAIM 20
. The comparison can be applied on motion estimation to find the motion vector . Also , applied on the fast computing such data sorting , data searching , pattern comparison and pattern recognition . The circuit can compare m-data in parallel processing . The processing speed (maximum size) is very fast and it is suitable complex comparison system , such as biological technology .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (reference blocks) .
US20060203905A1
CLAIM 2
. One macro block can be split into four sub-blocks . The address increases by one when one pixel inputs . And the address increases by H values when the block position changes to the next column . Successively , the address of bottom blocks (3 rd and 4 th SB) is equal to the sum of the top block address and 8 H . The encoder performs motion estimation by the function which searches the best matching between the current and the reference blocks (current block) of frame memory . The searching memory address can be determined by MB (My , Mx) Addr =MB (p , q) Addr +My×H+Mx , where Mx and My are the searching vector in the horizontal and vertical directions respectively .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (reference blocks) has a size of at least 64×64 pixels .
US20060203905A1
CLAIM 2
. One macro block can be split into four sub-blocks . The address increases by one when one pixel inputs . And the address increases by H values when the block position changes to the next column . Successively , the address of bottom blocks (3 rd and 4 th SB) is equal to the sum of the top block address and 8 H . The encoder performs motion estimation by the function which searches the best matching between the current and the reference blocks (current block) of frame memory . The searching memory address can be determined by MB (My , Mx) Addr =MB (p , q) Addr +My×H+Mx , where Mx and My are the searching vector in the horizontal and vertical directions respectively .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20060133495A1

Filed: 2004-12-22     Issued: 2006-06-22

Temporal error concealment for video communications

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Yan Ye, Gokce Dane, Yen-Chi Lee, Ming-Chang Tsai, Nien-Chung Feng, Karl Ni
US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device (said object) , a broadcast receiver device (said memory) , or a set-top box .
US20060133495A1
CLAIM 10
. The method of claim 1 wherein a motion vector associated with an object that is contained in a first macroblock in said first frame at the same position as a second macroblock in said second frame is included in said set provided that in said second frame said object (mobile device) overlaps said second macroblock by a specified amount .

US20060133495A1
CLAIM 28
. A device comprising : a microprocessor ;
and a memory unit coupled to said microprocessor , said memory (broadcast receiver device) unit containing instructions that when executed by said microprocessor implement a method for processing video data , said method comprising : selecting a set of motion vectors from a first plurality of motion vectors associated with a first plurality of macroblocks in a first frame of said video data and from a second plurality of motion vectors associated with a second plurality of macroblocks in a second frame of said video data ;
determining a statistical measure of said set of motion vectors , said statistical measure defining a motion vector for a macroblock of interest in said second plurality of macroblocks ;
and applying said motion vector to said macroblock of interest .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2005043310A2

Filed: 2004-10-22     Issued: 2005-05-12

Method and apparatus for seamlessly switching reception between multimedia streams in a wireless communication system

(Original Assignee) Qualcomm Incorporated     

Ramaswamy Murali, Kent G. Walker, Rajiv Vijayan
US9930365B2
CLAIM 1
. A method of decoding video data (comprises i, first base) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO2005043310A2
CLAIM 19
. A method of receiving multimedia programs in a wireless communication system , comprising : decoding a first base (decoding video data) stream and a first enhancement stream for a first program in a first time interval , the first base sfream carrying base information for the first program and the first enhancement stream carrying additional information for the first program ;
decoding the first base stream for the first program and a second base stream for a second program in a second time interval after the first time interval ;
and decoding the second base stream and a second enhancement stream for the second program in a third time interval after the second time interval .

WO2005043310A2
CLAIM 38
. The method of claim 28 , wherein the initiating the decoding of the at least one portion of the first program comprises i (decoding video data) nitiating decoding of an audio portion of the first program prior to user selection of the first program .

US9930365B2
CLAIM 7
. A device for decoding video data (comprises i, first base) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO2005043310A2
CLAIM 19
. A method of receiving multimedia programs in a wireless communication system , comprising : decoding a first base (decoding video data) stream and a first enhancement stream for a first program in a first time interval , the first base sfream carrying base information for the first program and the first enhancement stream carrying additional information for the first program ;
decoding the first base stream for the first program and a second base stream for a second program in a second time interval after the first time interval ;
and decoding the second base stream and a second enhancement stream for the second program in a third time interval after the second time interval .

WO2005043310A2
CLAIM 38
. The method of claim 28 , wherein the initiating the decoding of the at least one portion of the first program comprises i (decoding video data) nitiating decoding of an audio portion of the first program prior to user selection of the first program .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (second time interval) , or a set-top box .
WO2005043310A2
CLAIM 19
. A method of receiving multimedia programs in a wireless communication system , comprising : decoding a first base stream and a first enhancement stream for a first program in a first time interval , the first base sfream carrying base information for the first program and the first enhancement stream carrying additional information for the first program ;
decoding the first base stream for the first program and a second base stream for a second program in a second time interval (receiver device, broadcast receiver device) after the first time interval ;
and decoding the second base stream and a second enhancement stream for the second program in a third time interval after the second time interval .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20050101319A1

Filed: 2004-10-18     Issued: 2005-05-12

Method and apparatus for seamlessly switching reception between multimedia streams in a wireless communication system

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Ramaswamy Murali, Gordon Walker, Rajiv Vijayan
US9930365B2
CLAIM 1
. A method of decoding video data (comprises i, first base) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20050101319A1
CLAIM 19
. A method of receiving multimedia programs in a wireless communication system , comprising : decoding a first base (decoding video data) stream and a first enhancement stream for a first program in a first time interval , the first base stream carrying base information for the first program and the first enhancement stream carrying additional information for the first program ;
decoding the first base stream for the first program and a second base stream for a second program in a second time interval after the first time interval ;
and decoding the second base stream and a second enhancement stream for the second program in a third time interval after the second time interval .

US20050101319A1
CLAIM 38
. The method of claim 28 , wherein the initiating the decoding of the at least one portion of the first program comprises i (decoding video data) nitiating decoding of an audio portion of the first program prior to user selection of the first program .

US9930365B2
CLAIM 7
. A device for decoding video data (comprises i, first base) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20050101319A1
CLAIM 19
. A method of receiving multimedia programs in a wireless communication system , comprising : decoding a first base (decoding video data) stream and a first enhancement stream for a first program in a first time interval , the first base stream carrying base information for the first program and the first enhancement stream carrying additional information for the first program ;
decoding the first base stream for the first program and a second base stream for a second program in a second time interval after the first time interval ;
and decoding the second base stream and a second enhancement stream for the second program in a third time interval after the second time interval .

US20050101319A1
CLAIM 38
. The method of claim 28 , wherein the initiating the decoding of the at least one portion of the first program comprises i (decoding video data) nitiating decoding of an audio portion of the first program prior to user selection of the first program .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (second time interval) , or a set-top box .
US20050101319A1
CLAIM 19
. A method of receiving multimedia programs in a wireless communication system , comprising : decoding a first base stream and a first enhancement stream for a first program in a first time interval , the first base stream carrying base information for the first program and the first enhancement stream carrying additional information for the first program ;
decoding the first base stream for the first program and a second base stream for a second program in a second time interval (receiver device, broadcast receiver device) after the first time interval ;
and decoding the second base stream and a second enhancement stream for the second program in a third time interval after the second time interval .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20050213831A1

Filed: 2004-09-01     Issued: 2005-09-29

Method and system for encoding fractional bitplanes

(Original Assignee) Koninklijke Philips NV     (Current Assignee) Koninklijke Philips NV

Mihaela van der Schaar
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (pixel blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20050213831A1
CLAIM 1
. In a layered encoding system having at least one layer comprising a plurality of sub-layers , a method for encoding a video image (200) , composed of a plurality of pixel blocks (current block) , containing at least one area determined to be significant (210) within a corresponding sub-layer (272 , 274 , 276) , said method comprising the steps of : a . associating a level of significance with each block of a known size (250 , 252) within said at least one significant area (210) ;
b . associating a level of significance with each of at least one successively larger blocks (222 , 244) dependent upon said level of significance of at least one of said blocks (250 , 252) of a known size contained within said successively larger block (222 , 244) ;
and c . mapping each of said associated levels of significance .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (pixel blocks) .
US20050213831A1
CLAIM 1
. In a layered encoding system having at least one layer comprising a plurality of sub-layers , a method for encoding a video image (200) , composed of a plurality of pixel blocks (current block) , containing at least one area determined to be significant (210) within a corresponding sub-layer (272 , 274 , 276) , said method comprising the steps of : a . associating a level of significance with each block of a known size (250 , 252) within said at least one significant area (210) ;
b . associating a level of significance with each of at least one successively larger blocks (222 , 244) dependent upon said level of significance of at least one of said blocks (250 , 252) of a known size contained within said successively larger block (222 , 244) ;
and c . mapping each of said associated levels of significance .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (pixel blocks) has a size of at least 64×64 pixels .
US20050213831A1
CLAIM 1
. In a layered encoding system having at least one layer comprising a plurality of sub-layers , a method for encoding a video image (200) , composed of a plurality of pixel blocks (current block) , containing at least one area determined to be significant (210) within a corresponding sub-layer (272 , 274 , 276) , said method comprising the steps of : a . associating a level of significance with each block of a known size (250 , 252) within said at least one significant area (210) ;
b . associating a level of significance with each of at least one successively larger blocks (222 , 244) dependent upon said level of significance of at least one of said blocks (250 , 252) of a known size contained within said successively larger block (222 , 244) ;
and c . mapping each of said associated levels of significance .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (pixel blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20050213831A1
CLAIM 1
. In a layered encoding system having at least one layer comprising a plurality of sub-layers , a method for encoding a video image (200) , composed of a plurality of pixel blocks (current block) , containing at least one area determined to be significant (210) within a corresponding sub-layer (272 , 274 , 276) , said method comprising the steps of : a . associating a level of significance with each block of a known size (250 , 252) within said at least one significant area (210) ;
b . associating a level of significance with each of at least one successively larger blocks (222 , 244) dependent upon said level of significance of at least one of said blocks (250 , 252) of a known size contained within said successively larger block (222 , 244) ;
and c . mapping each of said associated levels of significance .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (pixel blocks) .
US20050213831A1
CLAIM 1
. In a layered encoding system having at least one layer comprising a plurality of sub-layers , a method for encoding a video image (200) , composed of a plurality of pixel blocks (current block) , containing at least one area determined to be significant (210) within a corresponding sub-layer (272 , 274 , 276) , said method comprising the steps of : a . associating a level of significance with each block of a known size (250 , 252) within said at least one significant area (210) ;
b . associating a level of significance with each of at least one successively larger blocks (222 , 244) dependent upon said level of significance of at least one of said blocks (250 , 252) of a known size contained within said successively larger block (222 , 244) ;
and c . mapping each of said associated levels of significance .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (pixel blocks) has a size of at least 64×64 pixels .
US20050213831A1
CLAIM 1
. In a layered encoding system having at least one layer comprising a plurality of sub-layers , a method for encoding a video image (200) , composed of a plurality of pixel blocks (current block) , containing at least one area determined to be significant (210) within a corresponding sub-layer (272 , 274 , 276) , said method comprising the steps of : a . associating a level of significance with each block of a known size (250 , 252) within said at least one significant area (210) ;
b . associating a level of significance with each of at least one successively larger blocks (222 , 244) dependent upon said level of significance of at least one of said blocks (250 , 252) of a known size contained within said successively larger block (222 , 244) ;
and c . mapping each of said associated levels of significance .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (pixel blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20050213831A1
CLAIM 1
. In a layered encoding system having at least one layer comprising a plurality of sub-layers , a method for encoding a video image (200) , composed of a plurality of pixel blocks (current block) , containing at least one area determined to be significant (210) within a corresponding sub-layer (272 , 274 , 276) , said method comprising the steps of : a . associating a level of significance with each block of a known size (250 , 252) within said at least one significant area (210) ;
b . associating a level of significance with each of at least one successively larger blocks (222 , 244) dependent upon said level of significance of at least one of said blocks (250 , 252) of a known size contained within said successively larger block (222 , 244) ;
and c . mapping each of said associated levels of significance .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (pixel blocks) .
US20050213831A1
CLAIM 1
. In a layered encoding system having at least one layer comprising a plurality of sub-layers , a method for encoding a video image (200) , composed of a plurality of pixel blocks (current block) , containing at least one area determined to be significant (210) within a corresponding sub-layer (272 , 274 , 276) , said method comprising the steps of : a . associating a level of significance with each block of a known size (250 , 252) within said at least one significant area (210) ;
b . associating a level of significance with each of at least one successively larger blocks (222 , 244) dependent upon said level of significance of at least one of said blocks (250 , 252) of a known size contained within said successively larger block (222 , 244) ;
and c . mapping each of said associated levels of significance .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (pixel blocks) has a size of at least 64×64 pixels .
US20050213831A1
CLAIM 1
. In a layered encoding system having at least one layer comprising a plurality of sub-layers , a method for encoding a video image (200) , composed of a plurality of pixel blocks (current block) , containing at least one area determined to be significant (210) within a corresponding sub-layer (272 , 274 , 276) , said method comprising the steps of : a . associating a level of significance with each block of a known size (250 , 252) within said at least one significant area (210) ;
b . associating a level of significance with each of at least one successively larger blocks (222 , 244) dependent upon said level of significance of at least one of said blocks (250 , 252) of a known size contained within said successively larger block (222 , 244) ;
and c . mapping each of said associated levels of significance .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20050025246A1

Filed: 2004-07-16     Issued: 2005-02-03

Decoding jointly coded transform type and subblock pattern information

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Thomas Holcomb
US9930365B2
CLAIM 1
. A method of decoding video data (first VLC) , the method comprising : decoding a first syntax (syntax elements) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element (syntax elements) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax (syntax elements) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20050025246A1
CLAIM 14
. The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, third syntax, first syntax element, second syntax element, third syntax element) .

US20050025246A1
CLAIM 17
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks, decoding video data) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ;
decoding the first VLC ;
receiving one or more other VLCs signaled at block level ;
and decoding each of the one or more other VLCs .

US9930365B2
CLAIM 7
. A device for decoding video data (first VLC) , the device comprising : a memory configured to store decoded video blocks (first VLC) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (syntax elements) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (syntax elements) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (syntax elements) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20050025246A1
CLAIM 14
. The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, third syntax, first syntax element, second syntax element, third syntax element) .

US20050025246A1
CLAIM 17
. A computer-readable medium storing computer-executable instructions for causing a computer system programmed thereby to perform a method comprising : receiving a first VLC (video blocks, decoding video data) signaled at macroblock level , wherein the first VLC jointly represents a transform type signal level , a transform type and a subblock pattern ;
decoding the first VLC ;
receiving one or more other VLCs signaled at block level ;
and decoding each of the one or more other VLCs .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (syntax elements) element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (syntax elements) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (syntax elements) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20050025246A1
CLAIM 14
. The method of claim 13 wherein the quantization level is derived from one or more picture-level syntax elements (first syntax, third syntax, first syntax element, second syntax element, third syntax element) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20050053292A1

Filed: 2004-06-29     Issued: 2005-03-10

Advanced bi-directional predictive coding of interlaced video

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Kunal Mukerjee, Thomas Holcomb
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (second motion vector predictor) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (interlaced B-field) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (current block) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US20050053292A1
CLAIM 11
. The method of claim 1 wherein the interlaced bi-directionally predicted picture is an interlaced B-field (encoding mode) , and wherein the first motion vector is for either forward or backward prediction for the entire current macroblock .

US20050053292A1
CLAIM 28
. A computer-readable medium having stored thereon computer-executable instructions for performing the method of claim 16 in a video encoder (second syntax element) .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (second motion vector predictor) .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (current block) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (interlaced B-field) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US20050053292A1
CLAIM 11
. The method of claim 1 wherein the interlaced bi-directionally predicted picture is an interlaced B-field (encoding mode) , and wherein the first motion vector is for either forward or backward prediction for the entire current macroblock .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (second motion vector predictor) has a size of at least 64×64 pixels .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (current block) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (second motion vector predictor) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (interlaced B-field) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (current block) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US20050053292A1
CLAIM 11
. The method of claim 1 wherein the interlaced bi-directionally predicted picture is an interlaced B-field (encoding mode) , and wherein the first motion vector is for either forward or backward prediction for the entire current macroblock .

US20050053292A1
CLAIM 28
. A computer-readable medium having stored thereon computer-executable instructions for performing the method of claim 16 in a video encoder (second syntax element) .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (second motion vector predictor) .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (current block) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (interlaced B-field) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US20050053292A1
CLAIM 11
. The method of claim 1 wherein the interlaced bi-directionally predicted picture is an interlaced B-field (encoding mode) , and wherein the first motion vector is for either forward or backward prediction for the entire current macroblock .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (second motion vector predictor) has a size of at least 64×64 pixels .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (current block) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (second motion vector predictor) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (interlaced B-field) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (current block) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US20050053292A1
CLAIM 11
. The method of claim 1 wherein the interlaced bi-directionally predicted picture is an interlaced B-field (encoding mode) , and wherein the first motion vector is for either forward or backward prediction for the entire current macroblock .

US20050053292A1
CLAIM 28
. A computer-readable medium having stored thereon computer-executable instructions for performing the method of claim 16 in a video encoder (second syntax element) .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (second motion vector predictor) .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (current block) based at least in part on one or more of the second set of plural motion vector predictor candidates .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (interlaced B-field) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US20050053292A1
CLAIM 11
. The method of claim 1 wherein the interlaced bi-directionally predicted picture is an interlaced B-field (encoding mode) , and wherein the first motion vector is for either forward or backward prediction for the entire current macroblock .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (second motion vector predictor) has a size of at least 64×64 pixels .
US20050053292A1
CLAIM 8
. The method of claim 1 further comprising , for a second motion vector for the current macroblock of the interlaced bi-directionally predicted picture : selecting a second set of plural motion vector predictor candidates from a second motion vector buffer , wherein the second motion vector buffer is for a second prediction direction opposite the first prediction direction ;
and calculating a second motion vector predictor (current block) based at least in part on one or more of the second set of plural motion vector predictor candidates .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
CN1810037A

Filed: 2004-06-17     Issued: 2006-07-26

帧间的快速模式确定编码

(Original Assignee) 汤姆森许可贸易公司     

尹鹏, 亚历山德罗斯·迈克尔·图拉普斯, 吉尔·麦克唐纳·布瓦斯
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (尺寸的) of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
CN1810037A
CLAIM 7
. 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (尺寸的) .
CN1810037A
CLAIM 7
. 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (一种视频编码方法, 针对帧) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (尺寸的) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
CN1810037A
CLAIM 1
. 一种视频编码方法 (video blocks) ,用于选择帧间编码的帧的当前宏块的模式,该方法包括以下步骤的至少一个:针对宏块模式的子集来检查第一模式,响应检查的第一模式的运动矢量信息,选择性地检查其它模式,并且响应检查的模式,选择针对当前宏块的模式;检查至少一个相邻宏块的宏块模式,并且响应所述至少一个检查的相邻宏块的宏块模式,选择针对当前宏块的模式;检查宏块模式的子集的成本,如果检查的成本满足预设标准,进一步只检查帧内编码模式,并且响应检查的模式来选择针对当前宏块的模式;以及响应检查的宏块模式来调整及早停止阈值,并且如果满足调整的及早停止阈值,则响应检查的宏块模式来选择针对当前宏块的模式。

CN1810037A
CLAIM 7
. 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。

CN1810037A
CLAIM 10
. 根据权利要求1所述的视频编码方法,还包括:最初针对帧 (video blocks) 间模式和帧内模式的子集执行模式检查;响应模式检查,计算复杂度测量;以及使用复杂度测量来确定是否应该执行其它的帧间模式或帧内模式。

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (尺寸的) .
CN1810037A
CLAIM 7
. 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size (尺寸的) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
CN1810037A
CLAIM 7
. 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (尺寸的) .
CN1810037A
CLAIM 7
. 根据权利要求1所述的视频编码方法,还包括:最初只针对可能块尺寸的 (minimum size) 子集执行运动估计;以及使用运动信息来确定是否需要针对其它块尺寸来执行其它的运动估计或复杂度测量。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20070053436A1

Filed: 2004-05-04     Issued: 2007-03-08

Encoding video information using block based adaptive scan order

(Original Assignee) Koninklijke Philips NV     (Current Assignee) Koninklijke Philips NV

Lambertus Van Eggelen
US9930365B2
CLAIM 1
. A method of decoding video data (discrete cosine) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (corresponding data blocks) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20070053436A1
CLAIM 3
. A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .

US20070053436A1
CLAIM 6
. A method according to claim 1 , wherein transformation of data of each macro block into a corresponding coefficient data block recording at least spatial information present in its associated data block in step (c) is implemented using a discrete cosine (decoding video data) transform .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (corresponding data blocks) and greater than the minimum size .
US20070053436A1
CLAIM 3
. A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .

US9930365B2
CLAIM 7
. A device for decoding video data (discrete cosine) , the device comprising : a memory configured to store decoded video blocks (preceding image) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (corresponding data blocks) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20070053436A1
CLAIM 2
. A method according to claim 1 , wherein a determination of the asymmetry in each coefficient block controlling the scanning route in step (d) is dependent upon at least one of : utilization of frame interlacing in the input video information ;
spatial scaling aspect ratio of one or more image frames present in the video information ;
pulldown material being present in the data of one or more of the image frames ;
one or more scanning routes utilized for processing preceding image (video blocks) frames in the video information ;
a degree of temporal motion occurring in a series of the image frames ;
and statistical data relating to earlier selected scanning routes and their associated data compression performance .

US20070053436A1
CLAIM 3
. A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .

US20070053436A1
CLAIM 6
. A method according to claim 1 , wherein transformation of data of each macro block into a corresponding coefficient data block recording at least spatial information present in its associated data block in step (c) is implemented using a discrete cosine (decoding video data) transform .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (corresponding data blocks) and greater than the minimum size .
US20070053436A1
CLAIM 3
. A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (corresponding data blocks) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20070053436A1
CLAIM 3
. A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (corresponding data blocks) and greater than the minimum size .
US20070053436A1
CLAIM 3
. A method according to claim 1 , wherein field and frame macro modes of operation are provided in step (b) , the field macro mode being operable to mutually isolate interlaced image frame line information according to their associated temporal instances to generate corresponding data blocks (maximum size) for transformation in step (c) , and the frame macro mode being operable to maintain spatial correspondence between each image frame and its associated data blocks to generate corresponding data macro blocks for transformation in step (c) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
CN1723706A

Filed: 2004-01-06     Issued: 2006-01-18

宏块分割部分的混合块间/块内视频编码

(Original Assignee) 汤姆森特许公司     

吉尔·M·博伊斯
US9930365B2
CLAIM 1
. A method of decoding video data (的视频数据) , the method comprising : decoding a first syntax (多个分) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
CN1723706A
CLAIM 1
. 一种用于编码具有多个分 (first syntax) 割部分的宏块的方法(400),所述方法包括:块间编码所述多个分割部分中的至少一个(426);和块内编码所述多个分割部分中的至少第二个(428),其中所述块内编码包括提供与零加权系数相关联的参考图形索引。

CN1723706A
CLAIM 2
. 如权利要求1所述的方法,其中所述宏块包括符合联合视频组(“JVT”)标准的视频数据 (video data)

US9930365B2
CLAIM 7
. A device for decoding video data (的视频数据) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (多个分) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
CN1723706A
CLAIM 1
. 一种用于编码具有多个分 (first syntax) 割部分的宏块的方法(400),所述方法包括:块间编码所述多个分割部分中的至少一个(426);和块内编码所述多个分割部分中的至少第二个(428),其中所述块内编码包括提供与零加权系数相关联的参考图形索引。

CN1723706A
CLAIM 2
. 如权利要求1所述的方法,其中所述宏块包括符合联合视频组(“JVT”)标准的视频数据 (video data)

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (多个分) element associated with a sequence of pictures of video data (的视频数据) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
CN1723706A
CLAIM 1
. 一种用于编码具有多个分 (first syntax) 割部分的宏块的方法(400),所述方法包括:块间编码所述多个分割部分中的至少一个(426);和块内编码所述多个分割部分中的至少第二个(428),其中所述块内编码包括提供与零加权系数相关联的参考图形索引。

CN1723706A
CLAIM 2
. 如权利要求1所述的方法,其中所述宏块包括符合联合视频组(“JVT”)标准的视频数据 (video data)




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JP2004112825A

Filed: 2003-11-04     Issued: 2004-04-08

ビデオ画像符号化方法

(Original Assignee) Sarnoff Corp; Sharp Corp; サーノフ コーポレイション; シャープ株式会社     

Tihao Chiang, Jungwoo Lee, Zixiang Xiong, Yaqin Zhang, ザン,ヤチン, ション,ジシャン, チャン,ティーハオ, リー,ジャンウー
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (ブロックサイズ) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax (ブロック分) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JP2004112825A
CLAIM 1
 ビデオ画像を可変サイズのブロックに分割するブロック分 (third syntax, third syntax element) 割ステップと、前記ブロックに対応した動きベクトルによって動き補償処理を行うステップとを備え、前記ブロック分割ステップは可変にできるブロックサイズ (maximum size) の種類を制限し、ブロックの分割構造を伝送する際のビットを削減することを特徴とするビデオ画像符号化方法。

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (ブロックサイズ) and greater than the minimum size .
JP2004112825A
CLAIM 1
 ビデオ画像を可変サイズのブロックに分割するブロック分割ステップと、前記ブロックに対応した動きベクトルによって動き補償処理を行うステップとを備え、前記ブロック分割ステップは可変にできるブロックサイズ (maximum size) の種類を制限し、ブロックの分割構造を伝送する際のビットを削減することを特徴とするビデオ画像符号化方法。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (ブロックサイズ) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (ブロック分) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JP2004112825A
CLAIM 1
 ビデオ画像を可変サイズのブロックに分割するブロック分 (third syntax, third syntax element) 割ステップと、前記ブロックに対応した動きベクトルによって動き補償処理を行うステップとを備え、前記ブロック分割ステップは可変にできるブロックサイズ (maximum size) の種類を制限し、ブロックの分割構造を伝送する際のビットを削減することを特徴とするビデオ画像符号化方法。

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (ブロックサイズ) and greater than the minimum size .
JP2004112825A
CLAIM 1
 ビデオ画像を可変サイズのブロックに分割するブロック分割ステップと、前記ブロックに対応した動きベクトルによって動き補償処理を行うステップとを備え、前記ブロック分割ステップは可変にできるブロックサイズ (maximum size) の種類を制限し、ブロックの分割構造を伝送する際のビットを削減することを特徴とするビデオ画像符号化方法。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (ブロックサイズ) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (ブロック分) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JP2004112825A
CLAIM 1
 ビデオ画像を可変サイズのブロックに分割するブロック分 (third syntax, third syntax element) 割ステップと、前記ブロックに対応した動きベクトルによって動き補償処理を行うステップとを備え、前記ブロック分割ステップは可変にできるブロックサイズ (maximum size) の種類を制限し、ブロックの分割構造を伝送する際のビットを削減することを特徴とするビデオ画像符号化方法。

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (ブロックサイズ) and greater than the minimum size .
JP2004112825A
CLAIM 1
 ビデオ画像を可変サイズのブロックに分割するブロック分割ステップと、前記ブロックに対応した動きベクトルによって動き補償処理を行うステップとを備え、前記ブロック分割ステップは可変にできるブロックサイズ (maximum size) の種類を制限し、ブロックの分割構造を伝送する際のビットを削減することを特徴とするビデオ画像符号化方法。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US7397857B2

Filed: 2003-10-07     Issued: 2008-07-08

Method for transformation-coding full motion image sequences

(Original Assignee) Robert Bosch GmbH     (Current Assignee) Robert Bosch GmbH

Alexander Romanowski, Sven Bauer, Peter Siepen, Mathias Wien, Thomas Wedi
US9930365B2
CLAIM 1
. A method of decoding video data (image signal) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US7397857B2
CLAIM 1
. A method for transformation encoding of moving-image sequences , comprising the following steps : estimating motion vectors block-by-block between a reference image signal (video data) and an actual image signal from the image sequence , with which said motion vector compensation is carried out , wherein individual frames of the moving image sequence are divided into macroblocks , whereby for each macroblock the block size of a macroblock , square sub-blocks of a macroblock , or rectangle sub-blocks of a macroblock are adaptively used for motion compensation , and transformation-encoding the residual prediction error with the following measure : the block size of the transformation encoding is coupled to the block size used in each case for the motion compensation , whereby a maximum portion of the prediction error is commonly transformed , wherein the block transformation is selected for each macroblock that has the same block size as the block size used for the motion compensation .

US9930365B2
CLAIM 7
. A device for decoding video data (image signal) , the device comprising : a memory configured to store decoded video blocks (estimating motion) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US7397857B2
CLAIM 1
. A method for transformation encoding of moving-image sequences , comprising the following steps : estimating motion (video blocks) vectors block-by-block between a reference image signal (video data) and an actual image signal from the image sequence , with which said motion vector compensation is carried out , wherein individual frames of the moving image sequence are divided into macroblocks , whereby for each macroblock the block size of a macroblock , square sub-blocks of a macroblock , or rectangle sub-blocks of a macroblock are adaptively used for motion compensation , and transformation-encoding the residual prediction error with the following measure : the block size of the transformation encoding is coupled to the block size used in each case for the motion compensation , whereby a maximum portion of the prediction error is commonly transformed , wherein the block transformation is selected for each macroblock that has the same block size as the block size used for the motion compensation .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (image signal) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US7397857B2
CLAIM 1
. A method for transformation encoding of moving-image sequences , comprising the following steps : estimating motion vectors block-by-block between a reference image signal (video data) and an actual image signal from the image sequence , with which said motion vector compensation is carried out , wherein individual frames of the moving image sequence are divided into macroblocks , whereby for each macroblock the block size of a macroblock , square sub-blocks of a macroblock , or rectangle sub-blocks of a macroblock are adaptively used for motion compensation , and transformation-encoding the residual prediction error with the following measure : the block size of the transformation encoding is coupled to the block size used in each case for the motion compensation , whereby a maximum portion of the prediction error is commonly transformed , wherein the block transformation is selected for each macroblock that has the same block size as the block size used for the motion compensation .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US7426308B2

Filed: 2003-07-18     Issued: 2008-09-16

Intraframe and interframe interlace coding and decoding

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Pohsiang Hsu, Bruce Chih-Lung Lin, Thomas W. Holcomb, Kunal Mukerjee, Sridhar Srinivasan
US9930365B2
CLAIM 1
. A method of decoding video data (comprises i) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (intra-coded blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (interlaced video) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US7426308B2
CLAIM 1
. A method of decoding a field-coded macroblock comprising one or more intra-coded blocks (current block) , the method comprising : finding a DC differential for a current intra-coded block in the macroblock ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and a top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block , and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises ;
if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential ;
and using the DC value in reconstruction of the current block .

US7426308B2
CLAIM 9
. One or more computer-readable media having stored thereon computer-executable instructions for causing one or more computers to perform a method comprising : finding a DC differential for a current intra-coded block in a current macroblock of an interlaced video (inter-prediction mode) frame ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises i (decoding video data) f DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (intra-coded blocks) .
US7426308B2
CLAIM 1
. A method of decoding a field-coded macroblock comprising one or more intra-coded blocks (current block) , the method comprising : finding a DC differential for a current intra-coded block in the macroblock ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and a top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block , and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises ;
if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential ;
and using the DC value in reconstruction of the current block .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (intra-coded blocks) has a size of at least 64×64 pixels .
US7426308B2
CLAIM 1
. A method of decoding a field-coded macroblock comprising one or more intra-coded blocks (current block) , the method comprising : finding a DC differential for a current intra-coded block in the macroblock ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and a top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block , and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises ;
if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential ;
and using the DC value in reconstruction of the current block .

US9930365B2
CLAIM 7
. A device for decoding video data (comprises i) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (intra-coded blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (interlaced video) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US7426308B2
CLAIM 1
. A method of decoding a field-coded macroblock comprising one or more intra-coded blocks (current block) , the method comprising : finding a DC differential for a current intra-coded block in the macroblock ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and a top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block , and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises ;
if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential ;
and using the DC value in reconstruction of the current block .

US7426308B2
CLAIM 9
. One or more computer-readable media having stored thereon computer-executable instructions for causing one or more computers to perform a method comprising : finding a DC differential for a current intra-coded block in a current macroblock of an interlaced video (inter-prediction mode) frame ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises i (decoding video data) f DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (intra-coded blocks) .
US7426308B2
CLAIM 1
. A method of decoding a field-coded macroblock comprising one or more intra-coded blocks (current block) , the method comprising : finding a DC differential for a current intra-coded block in the macroblock ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and a top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block , and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises ;
if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential ;
and using the DC value in reconstruction of the current block .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (intra-coded blocks) has a size of at least 64×64 pixels .
US7426308B2
CLAIM 1
. A method of decoding a field-coded macroblock comprising one or more intra-coded blocks (current block) , the method comprising : finding a DC differential for a current intra-coded block in the macroblock ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and a top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block , and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises ;
if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential ;
and using the DC value in reconstruction of the current block .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (intra-coded blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (interlaced video) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US7426308B2
CLAIM 1
. A method of decoding a field-coded macroblock comprising one or more intra-coded blocks (current block) , the method comprising : finding a DC differential for a current intra-coded block in the macroblock ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and a top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block , and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises ;
if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential ;
and using the DC value in reconstruction of the current block .

US7426308B2
CLAIM 9
. One or more computer-readable media having stored thereon computer-executable instructions for causing one or more computers to perform a method comprising : finding a DC differential for a current intra-coded block in a current macroblock of an interlaced video (inter-prediction mode) frame ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (intra-coded blocks) .
US7426308B2
CLAIM 1
. A method of decoding a field-coded macroblock comprising one or more intra-coded blocks (current block) , the method comprising : finding a DC differential for a current intra-coded block in the macroblock ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and a top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block , and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises ;
if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential ;
and using the DC value in reconstruction of the current block .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (intra-coded blocks) has a size of at least 64×64 pixels .
US7426308B2
CLAIM 1
. A method of decoding a field-coded macroblock comprising one or more intra-coded blocks (current block) , the method comprising : finding a DC differential for a current intra-coded block in the macroblock ;
selecting a DC predictor from among plural candidate blocks according to a set of DC predictor selection rules , wherein the plural candidate blocks include a top block above the current block , a left block left of the current block , and a top-left block above and to the left of the current block , and wherein the set of DC predictor selection rules indicates the selected DC predictor as follows : if the top-left block , the left block , and the top block are intra-coded , the selected DC predictor depends on respective DC values of the top-left block , the left block , and the top block ;
if the top-left block is not intra-coded but the left block and the top block are intra-coded , the selected DC predictor depends on the respective DC values of the top block and the left block ;
if the left block is not intra-coded but the top block is intra-coded , the selected DC predictor is the DC value of the top block ;
if the top block is not intra-coded but the left block is intra-coded , the selected DC predictor is the DC value of the left block ;
if the left block is not intra-coded and the top block is not intra-coded , DC prediction is skipped for the current block ;
obtaining a DC value for the current block , wherein the obtaining comprises ;
if DC prediction is skipped for the current block , using the DC differential as the DC value for the current block ;
and if DC prediction is used for the current block , adding the selected DC predictor to the DC differential ;
and using the DC value in reconstruction of the current block .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US7289674B2

Filed: 2003-06-10     Issued: 2007-10-30

Spatial prediction based intra coding

(Original Assignee) Nokia Oyj     (Current Assignee) Nokia Technologies Oy

Marta Karczewicz
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (second blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (current block) .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (second blocks) .
US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (current block) .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (second blocks) has a size of at least 64×64 pixels .
US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (current block) .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (second blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (current block) .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (second blocks) .
US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (current block) .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (second blocks) has a size of at least 64×64 pixels .
US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (current block) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (second blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (current block) .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (second blocks) .
US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (current block) .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (second blocks) has a size of at least 64×64 pixels .
US7289674B2
CLAIM 11
. The method of claim 10 , wherein the mapping of the complementary prediction mode of the block is carried out by a mirroring function mirroring the first and second blocks (current block) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US7162091B2

Filed: 2003-03-31     Issued: 2007-01-09

Intra compression of pixel blocks using predicted mean

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Albert S. Wang, Sanjeev Mehrotra
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (pixel blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (frame prediction) of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US7162091B2
CLAIM 1
. In a computer system , a computer-implemented method of decoding one or more video frames in a video sequence , the method comprising : for a plurality of blocks of pixels in a current video frame of the video sequence , selecting between multiple types , the multiple types including a first type and a second type ;
if the selected type is the first type , decoding the plurality of blocks using inter-frame prediction (encoding mode comprises one) with motion compensation , including computing one or more predictors for the plurality of blocks relative to a reference video frame ;
and if the selected type is the second type , decoding the plurality of blocks using intra-frame spatial prediction of pixel values , including for each block in the plurality of blocks : obtaining one or more pixel values for spatial prediction of pixel values of the block , wherein the one or more obtained pixel values for at least one of the plurality of blocks are from spatially adjacent pixels wherein : if the block is the top left block of the current video frame , the one or more obtained pixel values consist of a gray value , otherwise , if the block is in the top row of the current video frame , the spatially adjacent pixels consist of pixels immediately left of the block , otherwise , if the block is in the left column of the current video frame , the spatially adjacent pixels consist of pixels immediately above the block , otherwise , the spatially adjacent pixels consist of the pixels immediately left of the block and the pixels immediately above the block ;
predicting the pixel values of the block from the one or more obtained pixel values , and reconstructing the block from the predicted pixel values and a residual .

US7162091B2
CLAIM 8
. The method of claim 1 wherein each of the plurality of blocks is a 4 by 4 pixel blocks (current block) .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (pixel blocks) .
US7162091B2
CLAIM 8
. The method of claim 1 wherein each of the plurality of blocks is a 4 by 4 pixel blocks (current block) .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (pixel blocks) has a size of at least 64×64 pixels .
US7162091B2
CLAIM 8
. The method of claim 1 wherein each of the plurality of blocks is a 4 by 4 pixel blocks (current block) .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (pixel blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (frame prediction) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US7162091B2
CLAIM 1
. In a computer system , a computer-implemented method of decoding one or more video frames in a video sequence , the method comprising : for a plurality of blocks of pixels in a current video frame of the video sequence , selecting between multiple types , the multiple types including a first type and a second type ;
if the selected type is the first type , decoding the plurality of blocks using inter-frame prediction (encoding mode comprises one) with motion compensation , including computing one or more predictors for the plurality of blocks relative to a reference video frame ;
and if the selected type is the second type , decoding the plurality of blocks using intra-frame spatial prediction of pixel values , including for each block in the plurality of blocks : obtaining one or more pixel values for spatial prediction of pixel values of the block , wherein the one or more obtained pixel values for at least one of the plurality of blocks are from spatially adjacent pixels wherein : if the block is the top left block of the current video frame , the one or more obtained pixel values consist of a gray value , otherwise , if the block is in the top row of the current video frame , the spatially adjacent pixels consist of pixels immediately left of the block , otherwise , if the block is in the left column of the current video frame , the spatially adjacent pixels consist of pixels immediately above the block , otherwise , the spatially adjacent pixels consist of the pixels immediately left of the block and the pixels immediately above the block ;
predicting the pixel values of the block from the one or more obtained pixel values , and reconstructing the block from the predicted pixel values and a residual .

US7162091B2
CLAIM 8
. The method of claim 1 wherein each of the plurality of blocks is a 4 by 4 pixel blocks (current block) .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (pixel blocks) .
US7162091B2
CLAIM 8
. The method of claim 1 wherein each of the plurality of blocks is a 4 by 4 pixel blocks (current block) .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (pixel blocks) has a size of at least 64×64 pixels .
US7162091B2
CLAIM 8
. The method of claim 1 wherein each of the plurality of blocks is a 4 by 4 pixel blocks (current block) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (pixel blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (frame prediction) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US7162091B2
CLAIM 1
. In a computer system , a computer-implemented method of decoding one or more video frames in a video sequence , the method comprising : for a plurality of blocks of pixels in a current video frame of the video sequence , selecting between multiple types , the multiple types including a first type and a second type ;
if the selected type is the first type , decoding the plurality of blocks using inter-frame prediction (encoding mode comprises one) with motion compensation , including computing one or more predictors for the plurality of blocks relative to a reference video frame ;
and if the selected type is the second type , decoding the plurality of blocks using intra-frame spatial prediction of pixel values , including for each block in the plurality of blocks : obtaining one or more pixel values for spatial prediction of pixel values of the block , wherein the one or more obtained pixel values for at least one of the plurality of blocks are from spatially adjacent pixels wherein : if the block is the top left block of the current video frame , the one or more obtained pixel values consist of a gray value , otherwise , if the block is in the top row of the current video frame , the spatially adjacent pixels consist of pixels immediately left of the block , otherwise , if the block is in the left column of the current video frame , the spatially adjacent pixels consist of pixels immediately above the block , otherwise , the spatially adjacent pixels consist of the pixels immediately left of the block and the pixels immediately above the block ;
predicting the pixel values of the block from the one or more obtained pixel values , and reconstructing the block from the predicted pixel values and a residual .

US7162091B2
CLAIM 8
. The method of claim 1 wherein each of the plurality of blocks is a 4 by 4 pixel blocks (current block) .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (pixel blocks) .
US7162091B2
CLAIM 8
. The method of claim 1 wherein each of the plurality of blocks is a 4 by 4 pixel blocks (current block) .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (pixel blocks) has a size of at least 64×64 pixels .
US7162091B2
CLAIM 8
. The method of claim 1 wherein each of the plurality of blocks is a 4 by 4 pixel blocks (current block) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US7181072B2

Filed: 2003-03-31     Issued: 2007-02-20

Intra compression of pixel blocks using predicted mean

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Albert S. Wang, Sanjeev Mehrotra
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size (same frame) equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax (different video) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US7181072B2
CLAIM 4
. A method as recited in claim 1 , wherein inter compression comprises performing motion estimation of a first block in the plurality of blocks with a second block , having an identical spatial location , but in a different video (third syntax) frame .

US7181072B2
CLAIM 9
. A method as recited in claim 8 , wherein the video frame and the other video frame are the same frame (starting size) .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (same frame) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (different video) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US7181072B2
CLAIM 4
. A method as recited in claim 1 , wherein inter compression comprises performing motion estimation of a first block in the plurality of blocks with a second block , having an identical spatial location , but in a different video (third syntax) frame .

US7181072B2
CLAIM 9
. A method as recited in claim 8 , wherein the video frame and the other video frame are the same frame (starting size) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (same frame) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (different video) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US7181072B2
CLAIM 4
. A method as recited in claim 1 , wherein inter compression comprises performing motion estimation of a first block in the plurality of blocks with a second block , having an identical spatial location , but in a different video (third syntax) frame .

US7181072B2
CLAIM 9
. A method as recited in claim 8 , wherein the video frame and the other video frame are the same frame (starting size) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20030123738A1

Filed: 2002-11-27     Issued: 2003-07-03

Global motion compensation for video pictures

(Original Assignee) Telefonaktiebolaget LM Ericsson AB     (Current Assignee) Telefonaktiebolaget LM Ericsson AB

Per Frojdh, Rickard Sjoberg, Torbjorn Einarsson
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (pixel blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20030123738A1
CLAIM 1
. In a method of video data (video data) compression for use with video frames comprising a plurality of image blocks , wherein each image block shall be decoded according to one of a plurality of coding modes , a method of decoding a particular image block in a current frame according to an implicit global motion compensation (IGMC) mode comprising the steps of : copying from a previous frame a collocated block dislocated by a motion vector ;
and predicting said motion vector from neighboring image blocks of said current frame .

US20030123738A1
CLAIM 11
. The method of claim 1 wherein : an image block comprises 16×16 pixels partitioned into 4×4 pixel blocks (current block) having one motion vector each .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (pixel blocks) .
US20030123738A1
CLAIM 11
. The method of claim 1 wherein : an image block comprises 16×16 pixels partitioned into 4×4 pixel blocks (current block) having one motion vector each .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (pixel blocks) has a size of at least 64×64 pixels .
US20030123738A1
CLAIM 11
. The method of claim 1 wherein : an image block comprises 16×16 pixels partitioned into 4×4 pixel blocks (current block) having one motion vector each .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (pixel blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20030123738A1
CLAIM 1
. In a method of video data (video data) compression for use with video frames comprising a plurality of image blocks , wherein each image block shall be decoded according to one of a plurality of coding modes , a method of decoding a particular image block in a current frame according to an implicit global motion compensation (IGMC) mode comprising the steps of : copying from a previous frame a collocated block dislocated by a motion vector ;
and predicting said motion vector from neighboring image blocks of said current frame .

US20030123738A1
CLAIM 11
. The method of claim 1 wherein : an image block comprises 16×16 pixels partitioned into 4×4 pixel blocks (current block) having one motion vector each .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (pixel blocks) .
US20030123738A1
CLAIM 11
. The method of claim 1 wherein : an image block comprises 16×16 pixels partitioned into 4×4 pixel blocks (current block) having one motion vector each .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (pixel blocks) has a size of at least 64×64 pixels .
US20030123738A1
CLAIM 11
. The method of claim 1 wherein : an image block comprises 16×16 pixels partitioned into 4×4 pixel blocks (current block) having one motion vector each .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (pixel blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20030123738A1
CLAIM 1
. In a method of video data (video data) compression for use with video frames comprising a plurality of image blocks , wherein each image block shall be decoded according to one of a plurality of coding modes , a method of decoding a particular image block in a current frame according to an implicit global motion compensation (IGMC) mode comprising the steps of : copying from a previous frame a collocated block dislocated by a motion vector ;
and predicting said motion vector from neighboring image blocks of said current frame .

US20030123738A1
CLAIM 11
. The method of claim 1 wherein : an image block comprises 16×16 pixels partitioned into 4×4 pixel blocks (current block) having one motion vector each .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (pixel blocks) .
US20030123738A1
CLAIM 11
. The method of claim 1 wherein : an image block comprises 16×16 pixels partitioned into 4×4 pixel blocks (current block) having one motion vector each .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (pixel blocks) has a size of at least 64×64 pixels .
US20030123738A1
CLAIM 11
. The method of claim 1 wherein : an image block comprises 16×16 pixels partitioned into 4×4 pixel blocks (current block) having one motion vector each .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20030229719A1

Filed: 2002-06-11     Issued: 2003-12-11

System and method for data compression

(Original Assignee) Sony Interactive Entertainment Inc     (Current Assignee) Sony Interactive Entertainment Inc

Eiji Iwata, Masakazu Suzuoki
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20030229719A1
CLAIM 1
. A method for compressing video data (video data) comprising : generating in a graphics pipeline , from video graphics data modeling objects , vertex data corresponding to said objects ;
rendering said video graphics data to produce a current frame of pixel data and a reference frame of pixel data ;
based upon said vertex data , defining a search area within said reference frame for calculating a motion vector for a block of pixel data within said current frame .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20030229719A1
CLAIM 1
. A method for compressing video data (video data) comprising : generating in a graphics pipeline , from video graphics data modeling objects , vertex data corresponding to said objects ;
rendering said video graphics data to produce a current frame of pixel data and a reference frame of pixel data ;
based upon said vertex data , defining a search area within said reference frame for calculating a motion vector for a block of pixel data within said current frame .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device (said object) , a broadcast receiver device , or a set-top box .
US20030229719A1
CLAIM 1
. A method for compressing video data comprising : generating in a graphics pipeline , from video graphics data modeling objects , vertex data corresponding to said object (mobile device) s ;
rendering said video graphics data to produce a current frame of pixel data and a reference frame of pixel data ;
based upon said vertex data , defining a search area within said reference frame for calculating a motion vector for a block of pixel data within said current frame .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20030229719A1
CLAIM 1
. A method for compressing video data (video data) comprising : generating in a graphics pipeline , from video graphics data modeling objects , vertex data corresponding to said objects ;
rendering said video graphics data to produce a current frame of pixel data and a reference frame of pixel data ;
based upon said vertex data , defining a search area within said reference frame for calculating a motion vector for a block of pixel data within said current frame .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JP2003319394A

Filed: 2002-04-26     Issued: 2003-11-07

符号化装置および方法、復号装置および方法、記録媒体、並びにプログラム

(Original Assignee) Sony Corp; ソニー株式会社     

Kuhn Peter, Kazufumi Sato, Teruhiko Suzuki, Yoichi Yagasaki, クーン ピーター, 数史 佐藤, 陽一 矢ヶ崎, 輝彦 鈴木
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises (前記予測モード) determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (参照画) and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JP2003319394A
CLAIM 1
【請求項1】 飛び越し走査の画像情報を入力として、 マクロブロックレベルで適応的にフィールドベースまた はフレームベースの符号化処理を施す符号化装置におい て、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換手段と (second syntax) 、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定手段と、 前記直交変換手段の処理結果を量子化する量子化手段 と、 前記量子化手段の処理結果を可逆符号化する可逆符号化 手段とを含むことを特徴とする符号化装置。

JP2003319394A
CLAIM 29
【請求項29】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置において、 前記圧縮画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号手段と、 前記量子化されている直交変換係数を逆量子化する逆量 子化手段と、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換手段と、 前記予測モード (partitioning comprises) 情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画 (intra-prediction mode) 像を生成する参照画 像生成手段とを含むことを特徴とする復号装置。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises (前記予測モード) determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (参照画) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JP2003319394A
CLAIM 1
【請求項1】 飛び越し走査の画像情報を入力として、 マクロブロックレベルで適応的にフィールドベースまた はフレームベースの符号化処理を施す符号化装置におい て、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換手段と (second syntax) 、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定手段と、 前記直交変換手段の処理結果を量子化する量子化手段 と、 前記量子化手段の処理結果を可逆符号化する可逆符号化 手段とを含むことを特徴とする符号化装置。

JP2003319394A
CLAIM 29
【請求項29】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置において、 前記圧縮画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号手段と、 前記量子化されている直交変換係数を逆量子化する逆量 子化手段と、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換手段と、 前記予測モード (partitioning comprises) 情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画 (intra-prediction mode) 像を生成する参照画 像生成手段とを含むことを特徴とする復号装置。

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (参照画) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
JP2003319394A
CLAIM 29
【請求項29】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置において、 前記圧縮画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号手段と、 前記量子化されている直交変換係数を逆量子化する逆量 子化手段と、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換手段と、 前記予測モード情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画 (intra-prediction mode) 像を生成する参照画 像生成手段とを含むことを特徴とする復号装置。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises (前記予測モード) determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (参照画) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JP2003319394A
CLAIM 1
【請求項1】 飛び越し走査の画像情報を入力として、 マクロブロックレベルで適応的にフィールドベースまた はフレームベースの符号化処理を施す符号化装置におい て、 予め設定されている複数の直交変換用ブロックサイズの いずれかにマクロブロックを分割して直交変換する直交 変換手段と (second syntax) 、 最も符号化効率が高くなるように、前記直交変換用ブロ ックサイズを決定するブロックサイズ決定手段と、 前記直交変換手段の処理結果を量子化する量子化手段 と、 前記量子化手段の処理結果を可逆符号化する可逆符号化 手段とを含むことを特徴とする符号化装置。

JP2003319394A
CLAIM 29
【請求項29】 飛び越し走査の画像情報がマクロブロ ックレベルで適応的にフィールドベースまたはフレーム ベースで符号化されている圧縮画像情報を元に前記画像 情報を復元する復号装置において、 前記圧縮画像情報を復号して、前記マクロブロックレベ ルでの符号化処理が前記フィールドベースであるか前記 フレームベースであるかを示すfield/frame flag、前記 符号化処理に含まれる直交変換処理でのブロックサイズ を示すMB type、量子化されている直交変換係数、並び に予測モード情報および動きベクトル情報を生成する復 号手段と、 前記量子化されている直交変換係数を逆量子化する逆量 子化手段と、 前記MB typeに基づき、前記逆量子化手段の処理結果を 逆直交変換する逆直交変換手段と、 前記予測モード (partitioning comprises) 情報および前記動きベクトル情報に従 い、復号画像情報を元にして参照画 (intra-prediction mode) 像を生成する参照画 像生成手段とを含むことを特徴とする復号装置。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US6959116B2

Filed: 2001-09-18     Issued: 2005-10-25

Largest magnitude indices selection for (run, level) encoding of a block coded picture

(Original Assignee) EMC Corp     (Current Assignee) EMC Corp

Ugur Sezer, Sorin Faibish, Seyfullah H. Oguz, Wayne W. Duso
US9930365B2
CLAIM 1
. A method of decoding video data (discrete cosine) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (entropy encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US6959116B2
CLAIM 2
. The method as claimed in claim 1 , wherein the respective sets of non-zero transform coefficients for the blocks of pixels are produced by computing discrete cosine (decoding video data) transforms .

US6959116B2
CLAIM 8
. The method as claimed in claim 7 , which includes , for each of the 8×8 blocks of the non-scalable MPEG-2 video : a .) parsing and copying a differential DC coefficient variable-length code (VLC) ;
b .) parsing and decoding (run , level) event variable-length codes (VLCs) up to an end-of-block marker to identify a respective set of non-zero quantization indices ;
c .) finding up to the selected number of non-zero quantization indices having the largest level magnitudes in the respective set of non-zero quantization indices to identify a respective set of largest magnitude non-zero quantization indices ;
and d .) applying (run , level) event formation and entropy encoding (second syntax) to the set of largest magnitude non-zero quantization indices .

US9930365B2
CLAIM 7
. A device for decoding video data (discrete cosine) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (entropy encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US6959116B2
CLAIM 2
. The method as claimed in claim 1 , wherein the respective sets of non-zero transform coefficients for the blocks of pixels are produced by computing discrete cosine (decoding video data) transforms .

US6959116B2
CLAIM 8
. The method as claimed in claim 7 , which includes , for each of the 8×8 blocks of the non-scalable MPEG-2 video : a . ) parsing and copying a differential DC coefficient variable-length code (VLC) ;
b . ) parsing and decoding (run , level) event variable-length codes (VLCs) up to an end-of-block marker to identify a respective set of non-zero quantization indices ;
c . ) finding up to the selected number of non-zero quantization indices having the largest level magnitudes in the respective set of non-zero quantization indices to identify a respective set of largest magnitude non-zero quantization indices ;
and d . ) applying (run , level) event formation and entropy encoding (second syntax) to the set of largest magnitude non-zero quantization indices .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (entropy encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US6959116B2
CLAIM 8
. The method as claimed in claim 7 , which includes , for each of the 8×8 blocks of the non-scalable MPEG-2 video : a . ) parsing and copying a differential DC coefficient variable-length code (VLC) ;
b . ) parsing and decoding (run , level) event variable-length codes (VLCs) up to an end-of-block marker to identify a respective set of non-zero quantization indices ;
c . ) finding up to the selected number of non-zero quantization indices having the largest level magnitudes in the respective set of non-zero quantization indices to identify a respective set of largest magnitude non-zero quantization indices ;
and d . ) applying (run , level) event formation and entropy encoding (second syntax) to the set of largest magnitude non-zero quantization indices .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US6529634B1

Filed: 1999-11-08     Issued: 2003-03-04

Contrast sensitive variance based adaptive block size DCT image compression

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Kadayam Thyagarajan, Michael J. Merritt
US9930365B2
CLAIM 1
. A method of decoding video data (zero value) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (minimum block size, divided blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (input block) of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US6529634B1
CLAIM 1
. A method for determining a block size assignment for an input block (encoding mode comprises one) of image pixels to be used in compressing said input block , comprising the steps of : reading a block of pixel data ;
generating a block size assignment based on the variances of pixel values of said block of pixel data and subdivided blocks (current block) of said block of pixel data , said step of generating further comprising the steps of : determining a variance of pixel values for said block of pixel data ;
comparing said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated ;
making a decision to subdivide said block if said variance is greater than said threshold ;
if said decision is to subdivide said block , then repeating the steps of determining , comparing , and making for each subdivided block until a predetermined criteria is satisfied ;
designating as said block size assignment each block that is not further subdivided ;
and providing a data structure containing information on said block size assignment .

US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (current block) of pixel data .

US6529634B1
CLAIM 12
. The system of claim 11 wherein said Huffman coder employs multiple look-up tables to code run-length and non-zero value (decoding video data) s .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (minimum block size, divided blocks) .
US6529634B1
CLAIM 1
. A method for determining a block size assignment for an input block of image pixels to be used in compressing said input block , comprising the steps of : reading a block of pixel data ;
generating a block size assignment based on the variances of pixel values of said block of pixel data and subdivided blocks (current block) of said block of pixel data , said step of generating further comprising the steps of : determining a variance of pixel values for said block of pixel data ;
comparing said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated ;
making a decision to subdivide said block if said variance is greater than said threshold ;
if said decision is to subdivide said block , then repeating the steps of determining , comparing , and making for each subdivided block until a predetermined criteria is satisfied ;
designating as said block size assignment each block that is not further subdivided ;
and providing a data structure containing information on said block size assignment .

US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (current block) of pixel data .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (minimum block size, divided blocks) has a size of at least 64×64 pixels .
US6529634B1
CLAIM 1
. A method for determining a block size assignment for an input block of image pixels to be used in compressing said input block , comprising the steps of : reading a block of pixel data ;
generating a block size assignment based on the variances of pixel values of said block of pixel data and subdivided blocks (current block) of said block of pixel data , said step of generating further comprising the steps of : determining a variance of pixel values for said block of pixel data ;
comparing said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated ;
making a decision to subdivide said block if said variance is greater than said threshold ;
if said decision is to subdivide said block , then repeating the steps of determining , comparing , and making for each subdivided block until a predetermined criteria is satisfied ;
designating as said block size assignment each block that is not further subdivided ;
and providing a data structure containing information on said block size assignment .

US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (current block) of pixel data .

US9930365B2
CLAIM 6
. The method of claim 1 , further comprising receiving a quantization parameter modification value (Huffman code) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
US6529634B1
CLAIM 11
. The system of claim 4 wherein said variable length coding means comprises a Huffman code (quantization parameter modification value) r .

US9930365B2
CLAIM 7
. A device for decoding video data (zero value) , the device comprising : a memory configured to store decoded video blocks (transforming means, main data) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (minimum block size, divided blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (input block) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US6529634B1
CLAIM 1
. A method for determining a block size assignment for an input block (encoding mode comprises one) of image pixels to be used in compressing said input block , comprising the steps of : reading a block of pixel data ;
generating a block size assignment based on the variances of pixel values of said block of pixel data and subdivided blocks (current block) of said block of pixel data , said step of generating further comprising the steps of : determining a variance of pixel values for said block of pixel data ;
comparing said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated ;
making a decision to subdivide said block if said variance is greater than said threshold ;
if said decision is to subdivide said block , then repeating the steps of determining , comparing , and making for each subdivided block until a predetermined criteria is satisfied ;
designating as said block size assignment each block that is not further subdivided ;
and providing a data structure containing information on said block size assignment .

US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (current block) of pixel data .

US6529634B1
CLAIM 4
. An image compression system for compressing a block of pixel data , comprising : block size assignment means for selecting said block or subdivided blocks of said block to be compressed based on the variances of pixel values of said block of pixel data and subdivided blocks of said block of pixel data , wherein said block size assignment means determines a variance of pixel values for said block of pixel data , compares said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated , makes a decision to subdivide said block if said variance is greater than said predetermined threshold , if said decision is to subdivide said block , then repeats the determination of variance , the comparison with said predetermined threshold , and the decision to subdivide for each subdivided block until a predetermined criteria is satisfied , and designates as said block size assignment each block that is not further subdivided ;
transform means for transforming pixel data of said selected block or subdivided blocks into frequency domain data (video blocks) ;
quantizer means for quantizing said frequency domain data ;
serializer means for scanning said quantized data into a serialized stream of data ;
and variable length coding means for coding said serialized stream of data in preparation for transmission .

US6529634B1
CLAIM 12
. The system of claim 11 wherein said Huffman coder employs multiple look-up tables to code run-length and non-zero value (decoding video data) s .

US6529634B1
CLAIM 30
. The system of claim 27 , wherein said transforming means (video blocks) utilizes the Discrete Cosine Transform .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (minimum block size, divided blocks) .
US6529634B1
CLAIM 1
. A method for determining a block size assignment for an input block of image pixels to be used in compressing said input block , comprising the steps of : reading a block of pixel data ;
generating a block size assignment based on the variances of pixel values of said block of pixel data and subdivided blocks (current block) of said block of pixel data , said step of generating further comprising the steps of : determining a variance of pixel values for said block of pixel data ;
comparing said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated ;
making a decision to subdivide said block if said variance is greater than said threshold ;
if said decision is to subdivide said block , then repeating the steps of determining , comparing , and making for each subdivided block until a predetermined criteria is satisfied ;
designating as said block size assignment each block that is not further subdivided ;
and providing a data structure containing information on said block size assignment .

US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (current block) of pixel data .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (minimum block size, divided blocks) has a size of at least 64×64 pixels .
US6529634B1
CLAIM 1
. A method for determining a block size assignment for an input block of image pixels to be used in compressing said input block , comprising the steps of : reading a block of pixel data ;
generating a block size assignment based on the variances of pixel values of said block of pixel data and subdivided blocks (current block) of said block of pixel data , said step of generating further comprising the steps of : determining a variance of pixel values for said block of pixel data ;
comparing said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated ;
making a decision to subdivide said block if said variance is greater than said threshold ;
if said decision is to subdivide said block , then repeating the steps of determining , comparing , and making for each subdivided block until a predetermined criteria is satisfied ;
designating as said block size assignment each block that is not further subdivided ;
and providing a data structure containing information on said block size assignment .

US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (current block) of pixel data .

US9930365B2
CLAIM 12
. The device of claim 7 , wherein the processor is further configured to receive a quantization parameter modification value (Huffman code) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
US6529634B1
CLAIM 11
. The system of claim 4 wherein said variable length coding means comprises a Huffman code (quantization parameter modification value) r .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (minimum block size, divided blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (input block) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US6529634B1
CLAIM 1
. A method for determining a block size assignment for an input block (encoding mode comprises one) of image pixels to be used in compressing said input block , comprising the steps of : reading a block of pixel data ;
generating a block size assignment based on the variances of pixel values of said block of pixel data and subdivided blocks (current block) of said block of pixel data , said step of generating further comprising the steps of : determining a variance of pixel values for said block of pixel data ;
comparing said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated ;
making a decision to subdivide said block if said variance is greater than said threshold ;
if said decision is to subdivide said block , then repeating the steps of determining , comparing , and making for each subdivided block until a predetermined criteria is satisfied ;
designating as said block size assignment each block that is not further subdivided ;
and providing a data structure containing information on said block size assignment .

US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (current block) of pixel data .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one (th column) or more syntax elements representative of partitioning for the current block (minimum block size, divided blocks) .
US6529634B1
CLAIM 1
. A method for determining a block size assignment for an input block of image pixels to be used in compressing said input block , comprising the steps of : reading a block of pixel data ;
generating a block size assignment based on the variances of pixel values of said block of pixel data and subdivided blocks (current block) of said block of pixel data , said step of generating further comprising the steps of : determining a variance of pixel values for said block of pixel data ;
comparing said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated ;
making a decision to subdivide said block if said variance is greater than said threshold ;
if said decision is to subdivide said block , then repeating the steps of determining , comparing , and making for each subdivided block until a predetermined criteria is satisfied ;
designating as said block size assignment each block that is not further subdivided ;
and providing a data structure containing information on said block size assignment .

US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (current block) of pixel data .

US6529634B1
CLAIM 24
. The method of claim 1 , wherein said variance is determined by the following equation : var = 1 N 2  ∑ i = 0 N - 1  ∑ j = 0 N - 1  x i , j 2 - (1 N 2  ∑ i = 0 N - 1  ∑ j = 0 N - 1  x i , j) 2 where N=dimension of the block , and x ij is the pixel in the i th row , j th column (processor to decode one) within the N×N block .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (minimum block size, divided blocks) has a size of at least 64×64 pixels .
US6529634B1
CLAIM 1
. A method for determining a block size assignment for an input block of image pixels to be used in compressing said input block , comprising the steps of : reading a block of pixel data ;
generating a block size assignment based on the variances of pixel values of said block of pixel data and subdivided blocks (current block) of said block of pixel data , said step of generating further comprising the steps of : determining a variance of pixel values for said block of pixel data ;
comparing said variance with a predetermined threshold , wherein said threshold is a function of the mean of pixel values of the block being evaluated ;
making a decision to subdivide said block if said variance is greater than said threshold ;
if said decision is to subdivide said block , then repeating the steps of determining , comparing , and making for each subdivided block until a predetermined criteria is satisfied ;
designating as said block size assignment each block that is not further subdivided ;
and providing a data structure containing information on said block size assignment .

US6529634B1
CLAIM 3
. The method of claim 1 wherein said predetermined criteria for no longer repeating the steps of determining , comparing , and making is based on a pre-selected minimum block size (current block) of pixel data .

US9930365B2
CLAIM 20
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to receive a quantization parameter modification value (Huffman code) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
US6529634B1
CLAIM 11
. The system of claim 4 wherein said variable length coding means comprises a Huffman code (quantization parameter modification value) r .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US6633611B2

Filed: 1997-10-24     Issued: 2003-10-14

Method and apparatus for region-based moving image encoding and decoding

(Original Assignee) Mitsubishi Electric Corp     (Current Assignee) Mitsubishi Electric Corp

Shunichi Sekiguchi, Yoshimi Isu, Kohtaro Asai
US9930365B2
CLAIM 1
. A method of decoding video data (image signal) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US6633611B2
CLAIM 1
. An apparatus for encoding an image signal (video data) , said apparatus comprising : a region partitioning section which includes : a partitioning processing section partitioning the image signal into a first plurality of regions based on a partition judgment criterion ;
and an integration processing section integrating each region of said first plurality of regions with adjacent regions to convert said first plurality of regions into a second plurality of regions , wherein said integration processing section includes ;
a provisional encoder preliminarily encoding an image per region of said first plurality of regions and calculating the amount of code thereof ;
a decoder decoding the image encoded by said provisional encoder ;
an encoding distortion calculating section calculating encoding distortion by using the image decoded by said decoder ;
and an evaluation value calculating section calculating an evaluation value for judging merit of encoding while taking into consideration both said amount of code and said encoding distortion , wherein it is determined for each region of said first plurality of regions whether or not to perform integration based on a result of comparing the evaluation value that is obtained in the case where said each region is integrated with adjacent regions and the evaluation value that is obtained in the case where said each region is not integrated with adjacent regions ;
and an encoder separately encoding each region of said second plurality of regions .

US9930365B2
CLAIM 7
. A device for decoding video data (image signal) , the device comprising : a memory configured to store decoded video blocks (prediction error signal) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US6633611B2
CLAIM 1
. An apparatus for encoding an image signal (video data) , said apparatus comprising : a region partitioning section which includes : a partitioning processing section partitioning the image signal into a first plurality of regions based on a partition judgment criterion ;
and an integration processing section integrating each region of said first plurality of regions with adjacent regions to convert said first plurality of regions into a second plurality of regions , wherein said integration processing section includes ;
a provisional encoder preliminarily encoding an image per region of said first plurality of regions and calculating the amount of code thereof ;
a decoder decoding the image encoded by said provisional encoder ;
an encoding distortion calculating section calculating encoding distortion by using the image decoded by said decoder ;
and an evaluation value calculating section calculating an evaluation value for judging merit of encoding while taking into consideration both said amount of code and said encoding distortion , wherein it is determined for each region of said first plurality of regions whether or not to perform integration based on a result of comparing the evaluation value that is obtained in the case where said each region is integrated with adjacent regions and the evaluation value that is obtained in the case where said each region is not integrated with adjacent regions ;
and an encoder separately encoding each region of said second plurality of regions .

US6633611B2
CLAIM 13
. An apparatus as in claim 1 wherein a quantization parameter of a prediction error signal (video blocks) accompanying motion-compensated prediction is variably set in said provisional encoder , and said evaluation value calculating section calculates the evaluation value while varying the quantization parameter .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device (said object) , a broadcast receiver device , or a set-top box .
US6633611B2
CLAIM 9
. An apparatus as in claim 8 wherein said object (mobile device) structure is judged on the basis of original signal dispersion of the region , edge intensity , and degree of connection of the edge with adjacent regions .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (image signal) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US6633611B2
CLAIM 1
. An apparatus for encoding an image signal (video data) , said apparatus comprising : a region partitioning section which includes : a partitioning processing section partitioning the image signal into a first plurality of regions based on a partition judgment criterion ;
and an integration processing section integrating each region of said first plurality of regions with adjacent regions to convert said first plurality of regions into a second plurality of regions , wherein said integration processing section includes ;
a provisional encoder preliminarily encoding an image per region of said first plurality of regions and calculating the amount of code thereof ;
a decoder decoding the image encoded by said provisional encoder ;
an encoding distortion calculating section calculating encoding distortion by using the image decoded by said decoder ;
and an evaluation value calculating section calculating an evaluation value for judging merit of encoding while taking into consideration both said amount of code and said encoding distortion , wherein it is determined for each region of said first plurality of regions whether or not to perform integration based on a result of comparing the evaluation value that is obtained in the case where said each region is integrated with adjacent regions and the evaluation value that is obtained in the case where said each region is not integrated with adjacent regions ;
and an encoder separately encoding each region of said second plurality of regions .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5748116A

Filed: 1996-11-27     Issued: 1998-05-05

System and method for nested split coding of sparse data sets

(Original Assignee) Teralogic Inc     (Current Assignee) Qualcomm Inc ; CSR Technology Inc

Charles K. Chui, Rongxiang Yi
US9930365B2
CLAIM 1
. A method of decoding video data (zero value) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5748116A
CLAIM 1
. A method of encoding a specified data array , comprising the steps of : successively identifying successively smaller blocks of data within the specified data array ;
wherein said data blocks are identified in a predefined order ;
processing each identified data block by : determining a maximum number of bits required to represent any single data value in said identified data block ;
when said determined maximum number is equal to zero , outputting data indicating said identified data block contains only zero value (decoding video data) s ;
when said determined maximum number is not equal to zero , and said identified data block contains more than a predefined threshold number of data values , outputting data indicating said determined maximum number ;
and when said determined maximum number is not equal to zero , and said identified data block does not contain more than said predefined threshold number of data values , outputting data representing each of said data values in said identified data block ;
wherein said data output by said method is output in an order corresponding to said predefined order that data blocks are identified .

US9930365B2
CLAIM 7
. A device for decoding video data (zero value) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5748116A
CLAIM 1
. A method of encoding a specified data array , comprising the steps of : successively identifying successively smaller blocks of data within the specified data array ;
wherein said data blocks are identified in a predefined order ;
processing each identified data block by : determining a maximum number of bits required to represent any single data value in said identified data block ;
when said determined maximum number is equal to zero , outputting data indicating said identified data block contains only zero value (decoding video data) s ;
when said determined maximum number is not equal to zero , and said identified data block contains more than a predefined threshold number of data values , outputting data indicating said determined maximum number ;
and when said determined maximum number is not equal to zero , and said identified data block does not contain more than said predefined threshold number of data values , outputting data representing each of said data values in said identified data block ;
wherein said data output by said method is output in an order corresponding to said predefined order that data blocks are identified .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (said memory) , or a set-top box .
US5748116A
CLAIM 7
. A memory for storing data for access by programs being executed on a data processing system , said memory (broadcast receiver device) comprising : a data encoding procedure , said data encoding procedure including instructions for successively identifying successively smaller blocks of data within a specified data block ;
wherein said data blocks are identified in a predefined order ;
said data encoding procedure further including instructions for processing each identified data block , by : determining a maximum number of bits required to represent any single data value in said identified data block ;
when said determined maximum number is equal to zero , outputting data indicating said identified data block contains only zero values ;
when said determined maximum number is not equal to zero , and said identified data block contains more than a predefined threshold number of data values , outputting data indicating said determined maximum number ;
and when said determined maximum number is not equal to zero , and said identified data block does not contain more than said predefined threshold number of data values , outputting data representing each of said data values in said identified data block ;
wherein said data output by said data encoding procedure is output in an order corresponding to said predefined order that data blocks are identified .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US6084908A

Filed: 1996-10-23     Issued: 2000-07-04

Apparatus and method for quadtree based variable block size motion estimation

(Original Assignee) Sharp Corp; Sarnoff Corp     (Current Assignee) Sharp Corp ; MediaTek Inc

Tihao Chiang, Jungwoo Lee, Ya-Qin Zhang, Zixiang Xiong
US9930365B2
CLAIM 1
. A method of decoding video data (discrete cosine) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US6084908A
CLAIM 16
. The apparatus of claim 14 , wherein said transform module applies a forward discrete cosine (decoding video data) transform .

US9930365B2
CLAIM 7
. A device for decoding video data (discrete cosine) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US6084908A
CLAIM 16
. The apparatus of claim 14 , wherein said transform module applies a forward discrete cosine (decoding video data) transform .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions (control module) that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US6084908A
CLAIM 14
. Apparatus for encoding an input image which is partitioned in accordance with a tree structure , where said tree structure has a plurality of blocks organized in a plurality of levels with a parent-child relationship , said apparatus comprising : a variable block motion estimation module for computing a plurality of motion vectors for all of said plurality of blocks for the entire tree structure of the input image to obtain an optimal tree structure ;
a motion compensation module , coupled to said variable block motion estimation module , for generating a predicted image using said plurality of motion vectors for said optimal tree structure ;
a transform module , coupled to said motion compensation module , for applying a transformation to a difference signal between the input image and said predicted image , where said transformation produces a plurality of coefficients ;
a quantization module , coupled to said transform module , for quantizing said plurality of coefficients ;
and a rate control module (thereon instructions) , coupled to said quantization module , for selecting a quantizer scale that produces a minimum Lagrangian cost for each of said plurality of blocks .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US6125143A

Filed: 1996-10-18     Issued: 2000-09-26

Picture encoding device and method thereof, picture decoding device and method thereof, and recording medium

(Original Assignee) Sony Corp     (Current Assignee) Ironworks Patents LLC

Teruhiko Suzuki, Masami Ogata, Tak Yen Tong
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (mean value) of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (mean value) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US6125143A
CLAIM 11
. A picture encoding method (encoding mode) comprising the steps of : dividing picture data having certain characteristics into encoding blocks of certain sizes depending upon said characteristics of said picture data ;
and encoding said picture data using units of said encoding blocks .

US6125143A
CLAIM 21
. A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ;
and mean value (minimum size, maximum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (mean value) and greater than the minimum size (mean value) .
US6125143A
CLAIM 21
. A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ;
and mean value (minimum size, maximum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US6125143A
CLAIM 11
. A picture encoding method (encoding mode) comprising the steps of : dividing picture data having certain characteristics into encoding blocks of certain sizes depending upon said characteristics of said picture data ;
and encoding said picture data using units of said encoding blocks .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (mean value) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (mean value) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US6125143A
CLAIM 11
. A picture encoding method (encoding mode) comprising the steps of : dividing picture data having certain characteristics into encoding blocks of certain sizes depending upon said characteristics of said picture data ;
and encoding said picture data using units of said encoding blocks .

US6125143A
CLAIM 21
. A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ;
and mean value (minimum size, maximum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (mean value) and greater than the minimum size (mean value) .
US6125143A
CLAIM 21
. A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ;
and mean value (minimum size, maximum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US6125143A
CLAIM 11
. A picture encoding method (encoding mode) comprising the steps of : dividing picture data having certain characteristics into encoding blocks of certain sizes depending upon said characteristics of said picture data ;
and encoding said picture data using units of said encoding blocks .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size (mean value) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (mean value) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US6125143A
CLAIM 11
. A picture encoding method (encoding mode) comprising the steps of : dividing picture data having certain characteristics into encoding blocks of certain sizes depending upon said characteristics of said picture data ;
and encoding said picture data using units of said encoding blocks .

US6125143A
CLAIM 21
. A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ;
and mean value (minimum size, maximum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (mean value) and greater than the minimum size (mean value) .
US6125143A
CLAIM 21
. A picture encoding device according to claim 18 , further comprising : deciding means for deciding whether or not to encode said difference picture ;
and mean value (minimum size, maximum size) calculating means for calculating a mean value for said picture data of said encoded block , wherein said encoding means encodes said difference picture when said deciding means decides to encode said difference picture and encodes the value for a difference between said mean value for the recording data corresponding to said difference data and said picture data when said deciding means decides not to encode said difference picture .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US6125143A
CLAIM 11
. A picture encoding method (encoding mode) comprising the steps of : dividing picture data having certain characteristics into encoding blocks of certain sizes depending upon said characteristics of said picture data ;
and encoding said picture data using units of said encoding blocks .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5956088A

Filed: 1995-11-21     Issued: 1999-09-21

Method and apparatus for modifying encoded digital video for improved channel utilization

(Original Assignee) Imedia Corp     (Current Assignee) Terayon Corp ; Google Technology Holdings LLC

Paul Shen, Efraim Arazi, Edward A. Krause, Adam S. Tom
US9930365B2
CLAIM 1
. A method of decoding video data (flow channels) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (store packets) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5956088A
CLAIM 24
. An apparatus in accordance with claim 20 , further comprising a receiver for receiving the combined primary and overflow channels (video data) , the receiver comprising : primary and overflow demodulators configured to demodulate data from the primary and overflow channels , respectively ;
a first demultiplexer , coupled to the primary demodulator , configured to extract a primary packet stream from an output of the primary demodulator ;
a second demultiplexer , coupled to the overflow demodulator , configured to extract an overflow packet stream from an output of the overflow demodulator ;
a buffer coupled to the second demultiplexer ;
a time stamp comparator , coupled to the first demultiplexer and the buffer , configured to compare a time stamp associated with a next packet from the primary packet stream with a time stamp associated with a next packet from the overflow packet stream ;
and a packet multiplexer , coupled to the first demultiplexer , the buffer and the time stamp comparator , configured to select one of the next packets from the primary packet stream and the overflow packet stream in response to a comparison made by the time stamp comparator .

US5956088A
CLAIM 28
. An apparatus including an encoded digital video modification system for transmitting one or more encoded program streams which are decodeable by a corresponding decoder having a decoder buffer , the decoder buffers having a maximum allowable size , the encoded digital video modification system comprising : a program analyzer and modifier configured to receive the one or more encoded program streams and to select encoded pictures to be modified according to a criterion that includes preventing any underflow of any decoder buffer , the program analyzer and modifier further configured to modify each said selected encoded picture to form a corresponding modified encoded picture having less data than said selected encoded picture ;
a primary packet storage device , coupled to the program analyzer and modifier , configured to store packets (second syntax) that include the corresponding modified encoded pictures ;
an overflow packet storage device , coupled to the program analyzer and modifier , configured to store packets that include the selected encoded pictures ;
and a packet multiplexer coupled to and configured to select packets from the primary and overflow packet storage devices and to provide an output multiplex of selected packets .

US9930365B2
CLAIM 7
. A device for decoding video data (flow channels) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (store packets) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5956088A
CLAIM 24
. An apparatus in accordance with claim 20 , further comprising a receiver for receiving the combined primary and overflow channels (video data) , the receiver comprising : primary and overflow demodulators configured to demodulate data from the primary and overflow channels , respectively ;
a first demultiplexer , coupled to the primary demodulator , configured to extract a primary packet stream from an output of the primary demodulator ;
a second demultiplexer , coupled to the overflow demodulator , configured to extract an overflow packet stream from an output of the overflow demodulator ;
a buffer coupled to the second demultiplexer ;
a time stamp comparator , coupled to the first demultiplexer and the buffer , configured to compare a time stamp associated with a next packet from the primary packet stream with a time stamp associated with a next packet from the overflow packet stream ;
and a packet multiplexer , coupled to the first demultiplexer , the buffer and the time stamp comparator , configured to select one of the next packets from the primary packet stream and the overflow packet stream in response to a comparison made by the time stamp comparator .

US5956088A
CLAIM 28
. An apparatus including an encoded digital video modification system for transmitting one or more encoded program streams which are decodeable by a corresponding decoder having a decoder buffer , the decoder buffers having a maximum allowable size , the encoded digital video modification system comprising : a program analyzer and modifier configured to receive the one or more encoded program streams and to select encoded pictures to be modified according to a criterion that includes preventing any underflow of any decoder buffer , the program analyzer and modifier further configured to modify each said selected encoded picture to form a corresponding modified encoded picture having less data than said selected encoded picture ;
a primary packet storage device , coupled to the program analyzer and modifier , configured to store packets (second syntax) that include the corresponding modified encoded pictures ;
an overflow packet storage device , coupled to the program analyzer and modifier , configured to store packets that include the selected encoded pictures ;
and a packet multiplexer coupled to and configured to select packets from the primary and overflow packet storage devices and to provide an output multiplex of selected packets .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (flow channels) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (store packets) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5956088A
CLAIM 24
. An apparatus in accordance with claim 20 , further comprising a receiver for receiving the combined primary and overflow channels (video data) , the receiver comprising : primary and overflow demodulators configured to demodulate data from the primary and overflow channels , respectively ;
a first demultiplexer , coupled to the primary demodulator , configured to extract a primary packet stream from an output of the primary demodulator ;
a second demultiplexer , coupled to the overflow demodulator , configured to extract an overflow packet stream from an output of the overflow demodulator ;
a buffer coupled to the second demultiplexer ;
a time stamp comparator , coupled to the first demultiplexer and the buffer , configured to compare a time stamp associated with a next packet from the primary packet stream with a time stamp associated with a next packet from the overflow packet stream ;
and a packet multiplexer , coupled to the first demultiplexer , the buffer and the time stamp comparator , configured to select one of the next packets from the primary packet stream and the overflow packet stream in response to a comparison made by the time stamp comparator .

US5956088A
CLAIM 28
. An apparatus including an encoded digital video modification system for transmitting one or more encoded program streams which are decodeable by a corresponding decoder having a decoder buffer , the decoder buffers having a maximum allowable size , the encoded digital video modification system comprising : a program analyzer and modifier configured to receive the one or more encoded program streams and to select encoded pictures to be modified according to a criterion that includes preventing any underflow of any decoder buffer , the program analyzer and modifier further configured to modify each said selected encoded picture to form a corresponding modified encoded picture having less data than said selected encoded picture ;
a primary packet storage device , coupled to the program analyzer and modifier , configured to store packets (second syntax) that include the corresponding modified encoded pictures ;
an overflow packet storage device , coupled to the program analyzer and modifier , configured to store packets that include the selected encoded pictures ;
and a packet multiplexer coupled to and configured to select packets from the primary and overflow packet storage devices and to provide an output multiplex of selected packets .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5107345A

Filed: 1991-05-28     Issued: 1992-04-21

Adaptive block size image compression method and system

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Chong U. Lee
US9930365B2
CLAIM 1
. A method of decoding video data (image signal) (inverse discrete cosine transform, compressed image) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (receiving said one) of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5107345A
CLAIM 19
. A method for image signal (video data) compression of a block of pixel data comprising the steps of : performing a 16×16 discrete cosine transform (DCT) operation on an 16×16 block of pixel data so as to provide a corresponding 16×16 block of DCT coefficient values ;
performing four 8×8 DCT operations on said block of pixel data so as to provide a corresponding block of four 8×8 DCT coefficient value sub-blocks ;
performing sixteen 4×4 DCT operations on said block of pixel data so as to performing sixty-four 2×2 DCT operations on said block of pixel data so as to provide a corresponding block of sixty-four 2×2 DCT coefficient value sub-blocks ;
determining bit count values corresponding to a number of bits required to respectively encode said 16×16 block of DCT coefficient values and predetermined groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values according to a predetermined coding format ;
determining , from said bit count values , ones of said 16×16 block of DCT coefficient values and said corresponding groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values requiring a lesser number of bits to encode according to said predetermined coding format so as to provide a corresponding selection value ;
selecting ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values in accordance with said selection value ;
generating a composite block of DCT coefficient values formed from said selected ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values ;
ordering said composite block of DCT coefficient values according to a predetermined ordering format so as to provide a corresponding output of ordered DCT coefficient values ;
encoding said ordered DCT coefficient values according to said predetermined coding format so as to provide corresponding coded values ;
and assembling said selection value and said coded values in a predetermined order as a coded image value representative of said block of pixel data .

US5107345A
CLAIM 23
. A system for decoding a compressed image (video blocks, decoding video data) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ;
decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ;
ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ;
inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ;
and combiner means for , receiving said one (encoding mode comprises one) of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data .

US5107345A
CLAIM 28
. The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks, decoding video data) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ;
second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ;
third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ;
and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data .

US9930365B2
CLAIM 6
. The method of claim 1 , further comprising receiving a quantization parameter modification value (Huffman code) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
US5107345A
CLAIM 17
. The system of claim 8 wherein said encoder means comprises : a Huffman code (quantization parameter modification value) r having an input coupled to said ordering means and an output coupled to said assembler means .

US9930365B2
CLAIM 7
. A device for decoding video data (image signal) (inverse discrete cosine transform, compressed image) , the device comprising : a memory configured to store decoded video blocks (inverse discrete cosine transform, compressed image) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (receiving said one) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5107345A
CLAIM 19
. A method for image signal (video data) compression of a block of pixel data comprising the steps of : performing a 16×16 discrete cosine transform (DCT) operation on an 16×16 block of pixel data so as to provide a corresponding 16×16 block of DCT coefficient values ;
performing four 8×8 DCT operations on said block of pixel data so as to provide a corresponding block of four 8×8 DCT coefficient value sub-blocks ;
performing sixteen 4×4 DCT operations on said block of pixel data so as to performing sixty-four 2×2 DCT operations on said block of pixel data so as to provide a corresponding block of sixty-four 2×2 DCT coefficient value sub-blocks ;
determining bit count values corresponding to a number of bits required to respectively encode said 16×16 block of DCT coefficient values and predetermined groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values according to a predetermined coding format ;
determining , from said bit count values , ones of said 16×16 block of DCT coefficient values and said corresponding groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values requiring a lesser number of bits to encode according to said predetermined coding format so as to provide a corresponding selection value ;
selecting ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values in accordance with said selection value ;
generating a composite block of DCT coefficient values formed from said selected ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values ;
ordering said composite block of DCT coefficient values according to a predetermined ordering format so as to provide a corresponding output of ordered DCT coefficient values ;
encoding said ordered DCT coefficient values according to said predetermined coding format so as to provide corresponding coded values ;
and assembling said selection value and said coded values in a predetermined order as a coded image value representative of said block of pixel data .

US5107345A
CLAIM 23
. A system for decoding a compressed image (video blocks, decoding video data) signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ;
decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ;
ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ;
inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ;
and combiner means for , receiving said one (encoding mode comprises one) of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data .

US5107345A
CLAIM 28
. The system of claim 23 wherein said ordering means provides an output of ones of a 16×16 block and 8×8 , 4×4 and 2×2 sub-blocks of discrete cosine transform (DCT) coefficient values and wherein said inverse transform means comprises : first inverse discrete cosine transform (video blocks, decoding video data) (IDCT) means for receiving and performing a 2×2 inverse discrete cosine transform (IDCT) operation on each received 2×2 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 2×2 pixel data ;
second IDCT transform means for receiving and performing a 4×4 IDCT operation on each received 4×4 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 4×4 pixel data ;
third IDCT transform means for receiving and performing an 8×8 IDCT operation on each received 8×8 sub-block of DCT coefficient values , and providing corresponding sub-blocks of 8×8 pixel data ;
and fourth IDCT transform means for receiving and performing a 16×16 IDCT transform operation on said 16× 16 block of DCT coefficient values , and providing a corresponding 16×16 block of pixel data .

US9930365B2
CLAIM 12
. The device of claim 7 , wherein the processor is further configured to receive a quantization parameter modification value (Huffman code) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
US5107345A
CLAIM 17
. The system of claim 8 wherein said encoder means comprises : a Huffman code (quantization parameter modification value) r having an input coupled to said ordering means and an output coupled to said assembler means .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (said memory) , or a set-top box .
US5107345A
CLAIM 37
. The method of claim 36 wherein said step of decoding comprises the steps of : storing Huffman decode values in a memory ;
addressing said memory (broadcast receiver device) with said coded values ;
and providing a decoded value output from said memory corresponding to each input coded value .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (image signal) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (receiving said one) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5107345A
CLAIM 19
. A method for image signal (video data) compression of a block of pixel data comprising the steps of : performing a 16×16 discrete cosine transform (DCT) operation on an 16×16 block of pixel data so as to provide a corresponding 16×16 block of DCT coefficient values ;
performing four 8×8 DCT operations on said block of pixel data so as to provide a corresponding block of four 8×8 DCT coefficient value sub-blocks ;
performing sixteen 4×4 DCT operations on said block of pixel data so as to performing sixty-four 2×2 DCT operations on said block of pixel data so as to provide a corresponding block of sixty-four 2×2 DCT coefficient value sub-blocks ;
determining bit count values corresponding to a number of bits required to respectively encode said 16×16 block of DCT coefficient values and predetermined groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values according to a predetermined coding format ;
determining , from said bit count values , ones of said 16×16 block of DCT coefficient values and said corresponding groups of 8×8 , 4×4 and 2×2 sub-blocks of DCT coefficient values requiring a lesser number of bits to encode according to said predetermined coding format so as to provide a corresponding selection value ;
selecting ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values in accordance with said selection value ;
generating a composite block of DCT coefficient values formed from said selected ones of said 16×16 block of DCT coefficient values and groups of 8×8 , 4×4 and 2×2 sub-block of DCT coefficient values ;
ordering said composite block of DCT coefficient values according to a predetermined ordering format so as to provide a corresponding output of ordered DCT coefficient values ;
encoding said ordered DCT coefficient values according to said predetermined coding format so as to provide corresponding coded values ;
and assembling said selection value and said coded values in a predetermined order as a coded image value representative of said block of pixel data .

US5107345A
CLAIM 23
. A system for decoding a compressed image signal comprising : separator means for , receiving a compressed image signal representative of a selection value and coded values , separating said selection value and said coded values from one another , and providing an output of said separated selection value and said coded values ;
decoder means for receiving and decoding according to a predetermined coding format said coded values , and for providing corresponding decoded values ;
ordering means for , receiving said decoded values and said selection value , ordering said decoded values according to a predetermined ordering format in response to said selection value , and providing an output forming a composite block of one of a block of transform coefficient values and various sub-blocks of transform coefficient values ;
inverse transform means for , receiving said composite block of transform coefficient values , performing an inverse transform operation on said one of said block and each sub-block of transform coefficient values of said composite block transform coefficient values , and providing an output of a corresponding one of a block of pixel data and sub-blocks of pixel data ;
and combiner means for , receiving said one (encoding mode comprises one) of said block of pixel data and said sub-blocks of pixel data , receiving said selection value , and in accordance with said selection value combining when present said sub-blocks of pixel data into a combined block of pixel data and providing one of said block of pixel data and said combined block of pixel data as an output block of pixel data .

US9930365B2
CLAIM 20
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to receive a quantization parameter modification value (Huffman code) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
US5107345A
CLAIM 17
. The system of claim 8 wherein said encoder means comprises : a Huffman code (quantization parameter modification value) r having an input coupled to said ordering means and an output coupled to said assembler means .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2008117158A1

Filed: 2008-03-25     Issued: 2008-10-02

Method and system for motion vector predictions

(Original Assignee) Nokia Corporation; Nokia Inc.     

Jani Lainema
US9930365B2
CLAIM 1
. A method of decoding video data (encoded video signal, motion prediction) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (second motion vector predictor, second blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO2008117158A1
CLAIM 1
. A method of decoding an encoded video signal (video data, video blocks) , characterized by : retrieving a motion prediction (video data, video blocks) method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
if it is determined that the first block and the second block can be decoded in independently , the method further comprises : calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor (current block) of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing a motion-prediction operation for the first block and the second block independently .

WO2008117158A1
CLAIM 9
. The method of claim 7 , characterized in that said indication is also indicative of a pixel prediction for each of a plurality of pixels in the first and second blocks (current block) if one of the first and second blocks is coded in intra mode .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (second motion vector predictor, second blocks) .
WO2008117158A1
CLAIM 1
. A method of decoding an encoded video signal , characterized by : retrieving a motion prediction method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
if it is determined that the first block and the second block can be decoded in independently , the method further comprises : calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor (current block) of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing a motion-prediction operation for the first block and the second block independently .

WO2008117158A1
CLAIM 9
. The method of claim 7 , characterized in that said indication is also indicative of a pixel prediction for each of a plurality of pixels in the first and second blocks (current block) if one of the first and second blocks is coded in intra mode .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (second motion vector predictor, second blocks) has a size of at least 64×64 pixels .
WO2008117158A1
CLAIM 1
. A method of decoding an encoded video signal , characterized by : retrieving a motion prediction method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
if it is determined that the first block and the second block can be decoded in independently , the method further comprises : calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor (current block) of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing a motion-prediction operation for the first block and the second block independently .

WO2008117158A1
CLAIM 9
. The method of claim 7 , characterized in that said indication is also indicative of a pixel prediction for each of a plurality of pixels in the first and second blocks (current block) if one of the first and second blocks is coded in intra mode .

US9930365B2
CLAIM 7
. A device for decoding video data (encoded video signal, motion prediction) , the device comprising : a memory configured to store decoded video blocks (encoded video signal, motion prediction) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (second motion vector predictor, second blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO2008117158A1
CLAIM 1
. A method of decoding an encoded video signal (video data, video blocks) , characterized by : retrieving a motion prediction (video data, video blocks) method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
if it is determined that the first block and the second block can be decoded in independently , the method further comprises : calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor (current block) of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing a motion-prediction operation for the first block and the second block independently .

WO2008117158A1
CLAIM 9
. The method of claim 7 , characterized in that said indication is also indicative of a pixel prediction for each of a plurality of pixels in the first and second blocks (current block) if one of the first and second blocks is coded in intra mode .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (second motion vector predictor, second blocks) .
WO2008117158A1
CLAIM 1
. A method of decoding an encoded video signal , characterized by : retrieving a motion prediction method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
if it is determined that the first block and the second block can be decoded in independently , the method further comprises : calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor (current block) of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing a motion-prediction operation for the first block and the second block independently .

WO2008117158A1
CLAIM 9
. The method of claim 7 , characterized in that said indication is also indicative of a pixel prediction for each of a plurality of pixels in the first and second blocks (current block) if one of the first and second blocks is coded in intra mode .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (second motion vector predictor, second blocks) has a size of at least 64×64 pixels .
WO2008117158A1
CLAIM 1
. A method of decoding an encoded video signal , characterized by : retrieving a motion prediction method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
if it is determined that the first block and the second block can be decoded in independently , the method further comprises : calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor (current block) of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing a motion-prediction operation for the first block and the second block independently .

WO2008117158A1
CLAIM 9
. The method of claim 7 , characterized in that said indication is also indicative of a pixel prediction for each of a plurality of pixels in the first and second blocks (current block) if one of the first and second blocks is coded in intra mode .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (said memory) , or a set-top box .
WO2008117158A1
CLAIM 12
. An apparatus , comprising : a processor ;
and a memory unit communicatively connected to the processor , said memory (broadcast receiver device) unit characterized by : computer code for retrieving a motion prediction method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
and computer code for calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing motion-prediction operation for the first block and the second block independently , if it is determined that the first block and the second block can be decoded in independently .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (encoded video signal, motion prediction) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (second motion vector predictor, second blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
WO2008117158A1
CLAIM 1
. A method of decoding an encoded video signal (video data, video blocks) , characterized by : retrieving a motion prediction (video data, video blocks) method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
if it is determined that the first block and the second block can be decoded in independently , the method further comprises : calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor (current block) of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing a motion-prediction operation for the first block and the second block independently .

WO2008117158A1
CLAIM 9
. The method of claim 7 , characterized in that said indication is also indicative of a pixel prediction for each of a plurality of pixels in the first and second blocks (current block) if one of the first and second blocks is coded in intra mode .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (second motion vector predictor, second blocks) .
WO2008117158A1
CLAIM 1
. A method of decoding an encoded video signal , characterized by : retrieving a motion prediction method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
if it is determined that the first block and the second block can be decoded in independently , the method further comprises : calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor (current block) of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing a motion-prediction operation for the first block and the second block independently .

WO2008117158A1
CLAIM 9
. The method of claim 7 , characterized in that said indication is also indicative of a pixel prediction for each of a plurality of pixels in the first and second blocks (current block) if one of the first and second blocks is coded in intra mode .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (second motion vector predictor, second blocks) has a size of at least 64×64 pixels .
WO2008117158A1
CLAIM 1
. A method of decoding an encoded video signal , characterized by : retrieving a motion prediction method indicator in the encoded video signal , the motion prediction method indicator indicative of whether or not a first block and a second block can be decoded independently ;
if it is determined that the first block and the second block can be decoded in independently , the method further comprises : calculating a first motion vector predictor of a first block based on a motion vector of at least one surrounding block of the first block ;
reconstructing a motion vector for the first block based on the first motion vector predictor ;
calculating a second motion vector predictor (current block) of a second block based on a motion vector of at least one surrounding block of the second block , wherein the second motion predictor is independent of the motion vector reconstructed for the first block ;
and performing a motion-prediction operation for the first block and the second block independently .

WO2008117158A1
CLAIM 9
. The method of claim 7 , characterized in that said indication is also indicative of a pixel prediction for each of a plurality of pixels in the first and second blocks (current block) if one of the first and second blocks is coded in intra mode .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
CN101198064A

Filed: 2007-12-10     Issued: 2008-06-11

一种分辨率分层技术中的运动矢量预测方法

(Original Assignee) 武汉大学     

胡瑞敏, 兰诚栋, 王啟军
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (搜索运动矢量, 方法为) of an intra-prediction mode and an inter-prediction mode (量预测模式) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
CN101198064A
CLAIM 4
. 釆用权利要求1或2或3所述运动矢量预测方法的视频编码方法,其特征在 于:采集获取高分辨率的视频信号,将高分辨率视频信号下采样成低分辨率视 频信号,将两种分辨率的视频信号时域分解对齐,形成增强层和基本层,对增 强层和基本层的信息处理后复合成一路码流输出;其中增强层和基本层的纹理信息处理方法为 (encoding mode comprises one) ,经过帧内纹理预测后,进行变换和熵编码,增强层的帧内纹理预测通过基本层的重构上采样实现; 其中增强层和基本层的运动信息处理方法为,基本层使用全搜索运动估计获取 运动矢量并编码,增强层对时域候选运动矢量、空域候选运动矢量和层间域候 选运动矢量进行选择,获取增强层的预测运动矢量并编码。

CN101198064A
CLAIM 5
. 根据权利要求4所述的视频编码方法,其特征在于:宏块的数据编码组织过 程包括如下步骤,首先,在宏块数据头加一个模式标志位,写宏块码流操作开始后,如果选中了 多域运动矢量预测模式 (inter-prediction mode) ,该模式标志位被写为l,后面跟着写多域选择索引和精 细运动矢量差信息,所述精细运动矢量差信息是对运动矢量修正得到的压縮信 息;如果没有选中多域运动矢量预测模式,则模式标志位被写为0 ;
接着,写宏块模式和全搜索运动矢量 (encoding mode comprises one) ;最后,写预测残差数据。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (搜索运动矢量, 方法为) of an intra-prediction mode and an inter-prediction mode (量预测模式) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
CN101198064A
CLAIM 4
. 釆用权利要求1或2或3所述运动矢量预测方法的视频编码方法,其特征在 于:采集获取高分辨率的视频信号,将高分辨率视频信号下采样成低分辨率视 频信号,将两种分辨率的视频信号时域分解对齐,形成增强层和基本层,对增 强层和基本层的信息处理后复合成一路码流输出;其中增强层和基本层的纹理信息处理方法为 (encoding mode comprises one) ,经过帧内纹理预测后,进行变换和熵编码,增强层的帧内纹理预测通过基本层的重构上采样实现; 其中增强层和基本层的运动信息处理方法为,基本层使用全搜索运动估计获取 运动矢量并编码,增强层对时域候选运动矢量、空域候选运动矢量和层间域候 选运动矢量进行选择,获取增强层的预测运动矢量并编码。

CN101198064A
CLAIM 5
. 根据权利要求4所述的视频编码方法,其特征在于:宏块的数据编码组织过 程包括如下步骤,首先,在宏块数据头加一个模式标志位,写宏块码流操作开始后,如果选中了 多域运动矢量预测模式 (inter-prediction mode) ,该模式标志位被写为l,后面跟着写多域选择索引和精 细运动矢量差信息,所述精细运动矢量差信息是对运动矢量修正得到的压縮信 息;如果没有选中多域运动矢量预测模式,则模式标志位被写为0 ;
接着,写宏块模式和全搜索运动矢量 (encoding mode comprises one) ;最后,写预测残差数据。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (搜索运动矢量, 方法为) of an intra-prediction mode and an inter-prediction mode (量预测模式) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
CN101198064A
CLAIM 4
. 釆用权利要求1或2或3所述运动矢量预测方法的视频编码方法,其特征在 于:采集获取高分辨率的视频信号,将高分辨率视频信号下采样成低分辨率视 频信号,将两种分辨率的视频信号时域分解对齐,形成增强层和基本层,对增 强层和基本层的信息处理后复合成一路码流输出;其中增强层和基本层的纹理信息处理方法为 (encoding mode comprises one) ,经过帧内纹理预测后,进行变换和熵编码,增强层的帧内纹理预测通过基本层的重构上采样实现; 其中增强层和基本层的运动信息处理方法为,基本层使用全搜索运动估计获取 运动矢量并编码,增强层对时域候选运动矢量、空域候选运动矢量和层间域候 选运动矢量进行选择,获取增强层的预测运动矢量并编码。

CN101198064A
CLAIM 5
. 根据权利要求4所述的视频编码方法,其特征在于:宏块的数据编码组织过 程包括如下步骤,首先,在宏块数据头加一个模式标志位,写宏块码流操作开始后,如果选中了 多域运动矢量预测模式 (inter-prediction mode) ,该模式标志位被写为l,后面跟着写多域选择索引和精 细运动矢量差信息,所述精细运动矢量差信息是对运动矢量修正得到的压縮信 息;如果没有选中多域运动矢量预测模式,则模式标志位被写为0 ;
接着,写宏块模式和全搜索运动矢量 (encoding mode comprises one) ;最后,写预测残差数据。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2008054688A1

Filed: 2007-10-26     Issued: 2008-05-08

Method for decomposing a video sequence frame

(Original Assignee) Hewlett-Packard Development Company, L.P.     

Debargha Mukherjee, Sam Liu
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (different motion vectors) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO2008054688A1
CLAIM 6
. The method (300) according to any of claims 1-5 , wherein partitioning (308) the block (122-126) into wedges (130a , 130b) further comprises partitioning (308) the block (122-126) into two wedges (130a , 130b) and a parallel band of variable thickness between the two wedges (130a , 130b) , wherein each of the two wedges (130a , 130b) and the parallel band are associated with different motion vectors (second syntax) (132) .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (different motion vectors) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO2008054688A1
CLAIM 6
. The method (300) according to any of claims 1-5 , wherein partitioning (308) the block (122-126) into wedges (130a , 130b) further comprises partitioning (308) the block (122-126) into two wedges (130a , 130b) and a parallel band of variable thickness between the two wedges (130a , 130b) , wherein each of the two wedges (130a , 130b) and the parallel band are associated with different motion vectors (second syntax) (132) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (different motion vectors) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions (edge partition) .
WO2008054688A1
CLAIM 2
. The method (300) according to claim 1 , wherein partitioning (308) the block (122-126) into wedges (130a , 130b) further comprises restricting the partitioning of the blocks (122-126) into wedges (130a , 130b) to a pre-selected set of wedges (130a , 130b) , wherein the pre-selected set of wedges (130a , 130b) comprises a subset of all possible wedge partition (sub partitions) s in the block (122-126) , to thereby substantially reduce computational requirements associated with searching for the wedges (130a ;
130b) .

WO2008054688A1
CLAIM 6
. The method (300) according to any of claims 1-5 , wherein partitioning (308) the block (122-126) into wedges (130a , 130b) further comprises partitioning (308) the block (122-126) into two wedges (130a , 130b) and a parallel band of variable thickness between the two wedges (130a , 130b) , wherein each of the two wedges (130a , 130b) and the parallel band are associated with different motion vectors (second syntax) (132) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2008002881A2

Filed: 2007-06-25     Issued: 2008-01-03

Reduction of errors during computation of inverse discrete cosine transform

(Original Assignee) Qualcomm Incorporated     

Harinath Garudadri, Yuriy Reznik
US9930365B2
CLAIM 1
. A method of decoding video data (inverse discrete cosine transform, midpoint bias value) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises (point representations) determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO2008002881A2
CLAIM 1
. A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ;
generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ;
generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ;
and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks, decoding video data) ;
causing a media presentation unit to output audible or visible signals based on the matrix of output values ;
wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations (partitioning comprises) of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision .

WO2008002881A2
CLAIM 6
. The method of claim 1 , wherein the method further comprises adding a midpoint bias term to the coefficient ;
and wherein midpoint bias value (video blocks, decoding video data) is equal to 2 P ~ l , wherein P is equal to the number of mantissa bits in fixed-point numbers used in the transform plus a number of bits that are added to coefficients in the matrix of transformed coefficients by applying the transform .

US9930365B2
CLAIM 7
. A device for decoding video data (inverse discrete cosine transform, midpoint bias value) , the device comprising : a memory configured to store decoded video blocks (inverse discrete cosine transform, midpoint bias value) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises (point representations) determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO2008002881A2
CLAIM 1
. A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ;
generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ;
generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ;
and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform (video blocks, decoding video data) ;
causing a media presentation unit to output audible or visible signals based on the matrix of output values ;
wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations (partitioning comprises) of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision .

WO2008002881A2
CLAIM 6
. The method of claim 1 , wherein the method further comprises adding a midpoint bias term to the coefficient ;
and wherein midpoint bias value (video blocks, decoding video data) is equal to 2 P ~ l , wherein P is equal to the number of mantissa bits in fixed-point numbers used in the transform plus a number of bits that are added to coefficients in the matrix of transformed coefficients by applying the transform .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises (point representations) determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
WO2008002881A2
CLAIM 1
. A method comprising : generating a matrix of biased coefficients by adding one or more bias values to a coefficient of a matrix of source coefficients ;
generating a matrix of transformed coefficients by using fixed-point arithmetic to repeatedly apply a one-dimensional transform to coefficients in the matrix of biased coefficients ;
generating a matrix of output coefficients by right-shifting coefficients in the matrix of transformed coefficients ;
and wherein coefficients in the matrix of output coefficients are approximations of values that would be produced by transforming the matrix of source coefficients using an ideal inverse discrete cosine transform ;
causing a media presentation unit to output audible or visible signals based on the matrix of output values ;
wherein the addition of the bias values to the coefficient causes positive errors and negative errors to be , on average , equal in magnitude and to be , on average , symmetric about zero , and wherein the errors represent differences between values that result from right- shifting limited precision fixed-point representations (partitioning comprises) of coefficients in the matrix of transformed coefficients by a first magnitude and results from dividing the coefficients in the matrix of transformed coefficients by 2 raised to the power of the first magnitude , without regard to precision .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
KR100842558B1

Filed: 2007-01-26     Issued: 2008-07-01

동영상 부호화를 위한 블록 모드 결정 방법 및 그 장치

(Original Assignee) 삼성전자주식회사; 성균관대학교산학협력단     

김봉곤, 오윤제, 전병우, 정봉수, 주영훈, 최광표
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (프레임들의) of an intra-prediction mode and an inter-prediction mode (블록을) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
KR100842558B1
CLAIM 5
제1항에 있어서 , 상기 블록 크기 활동도는 다음 수학식에 의해 구해지는 것을 특징으로 하는 동영상 부호화를 위한 블록 모드 결정 방법 . 여기서 , Low는 과거 프레임들의 (encoding mode comprises one) 번째 매크로블록이 대부분 16x16의 큰 블록을 (inter-prediction mode) 나타내며 , High는 전반적으로 모든 블록의 크기를 나타낸다 . 또한 Medium은 High에 비하여 P8x8 모드의 발생 비율이 적었음을 나타내며 , 소정의 임계값 는 보다 큰 값을 가진다 .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (프레임들의) of an intra-prediction mode and an inter-prediction mode (블록을) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
KR100842558B1
CLAIM 5
제1항에 있어서 , 상기 블록 크기 활동도는 다음 수학식에 의해 구해지는 것을 특징으로 하는 동영상 부호화를 위한 블록 모드 결정 방법 . 여기서 , Low는 과거 프레임들의 (encoding mode comprises one) 번째 매크로블록이 대부분 16x16의 큰 블록을 (inter-prediction mode) 나타내며 , High는 전반적으로 모든 블록의 크기를 나타낸다 . 또한 Medium은 High에 비하여 P8x8 모드의 발생 비율이 적었음을 나타내며 , 소정의 임계값 는 보다 큰 값을 가진다 .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (프레임들의) of an intra-prediction mode and an inter-prediction mode (블록을) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
KR100842558B1
CLAIM 5
제1항에 있어서 , 상기 블록 크기 활동도는 다음 수학식에 의해 구해지는 것을 특징으로 하는 동영상 부호화를 위한 블록 모드 결정 방법 . 여기서 , Low는 과거 프레임들의 (encoding mode comprises one) 번째 매크로블록이 대부분 16x16의 큰 블록을 (inter-prediction mode) 나타내며 , High는 전반적으로 모든 블록의 크기를 나타낸다 . 또한 Medium은 High에 비하여 P8x8 모드의 발생 비율이 적었음을 나타내며 , 소정의 임계값 는 보다 큰 값을 가진다 .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2007081908A1

Filed: 2007-01-05     Issued: 2007-07-19

Method and apparatus for providing reduced resolution update mode for multi-view video coding

(Original Assignee) Thomson Licensing     

Peng Yin, Yeping Su, Cristina Gomila
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO2007081908A1
CLAIM 15
. A video encoding (second syntax) method , comprising : encoding a picture using a reduced resolution update mode , the picture being one of a set of pictures corresponding to multi-view content having different view points with respect to a same or similar scene , the picture representing one of the different view points (600 , 800) .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
WO2007081908A1
CLAIM 15
. A video encoding method (encoding mode) , comprising : encoding a picture using a reduced resolution update mode , the picture being one of a set of pictures corresponding to multi-view content having different view points with respect to a same or similar scene , the picture representing one of the different view points (600 , 800) .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO2007081908A1
CLAIM 15
. A video encoding (second syntax) method , comprising : encoding a picture using a reduced resolution update mode , the picture being one of a set of pictures corresponding to multi-view content having different view points with respect to a same or similar scene , the picture representing one of the different view points (600 , 800) .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
WO2007081908A1
CLAIM 15
. A video encoding method (encoding mode) , comprising : encoding a picture using a reduced resolution update mode , the picture being one of a set of pictures corresponding to multi-view content having different view points with respect to a same or similar scene , the picture representing one of the different view points (600 , 800) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
WO2007081908A1
CLAIM 15
. A video encoding (second syntax) method , comprising : encoding a picture using a reduced resolution update mode , the picture being one of a set of pictures corresponding to multi-view content having different view points with respect to a same or similar scene , the picture representing one of the different view points (600 , 800) .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
WO2007081908A1
CLAIM 15
. A video encoding method (encoding mode) , comprising : encoding a picture using a reduced resolution update mode , the picture being one of a set of pictures corresponding to multi-view content having different view points with respect to a same or similar scene , the picture representing one of the different view points (600 , 800) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JP2008092137A

Filed: 2006-09-29     Issued: 2008-04-17

画像符号化装置及び画像符号化方法

(Original Assignee) Toshiba Corp; 株式会社東芝     

Yuji Okuda, Koji Tsuchie, 江 孝 二 土, 田 裕 二 奥
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax (基づく第1) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (参照画) and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JP2008092137A
CLAIM 1
複数の予測モードに基づき符号化対象ブロックの予測画像を作成し出力する予測画像作成部と、 前記予測画像と原画像の残差画像を作成し出力する残差画像作成部と、 前記残差画像から前記複数の予測モードのうちの1つを選択する予測モード選択部と、 前記選択された予測モードに基づく予測画像を直交変換して予測画像周波数空間パターンに変換し出力する予測画像直交変換部と、 前記原画像を直交変換して原画像周波数空間パターンに変換し出力する原画像直交変換部と、 前記原画像周波数空間パターンと前記予測画像周波数空間パターンから残差画像周波数空間パターンを算出し出力する残差画像周波数空間パターン算出部と、 前記残差画像周波数空間パターンの画質調整を行う画質調整部と、 前記画質調整された残差画像周波数空間パターンを逆直交変換して実空間残差画像に変換し出力する逆直交変換部と、 前記実空間残差画像と前記選択された予測モードに基づく予測画像から参照画 (intra-prediction mode) 像を作成する参照画像作成部と、 を備えることを特徴とする画像符号化装置。

JP2008092137A
CLAIM 2
複数の予測モードに基づき第1の画素単位又は第2の画素単位の符号化対象ブロックの予測画像を作成し出力する予測画像作成部と、 前記予測画像と原画像の残差画像を作成し出力する残差画像作成部と、 前記残差画像から前記複数の予測モードのうちの1つを選択する予測モード選択部と、 前記選択された予測モードに基づく前記第1の画素単位の予測画像と前記原画像とから前記第1の画素単位の残差画像を作成し出力する残差画像作成部と、 前記第1の画素単位の残差画像を直交変換して前記第1の画素単位での残差画像周波数空間パターンに変換し出力する第1の残差画像直交変換部と、 前記選択された予測モードに基づく第1 (third syntax) の画素単位の予測画像を直交変換して第1の画素単位での予測画像周波数空間パターンに変換し出力する第1の予測画像直交変換部と、 前記第1の残差画像直交変換部から出力される残差画像周波数空間パターンと前記第1の予測画像直交変換部から出力される予測画像周波数空間パターンとを加算して原画像周波数空間パターンを作成し出力する加算部と、 前記選択された予測モードに基づく第2の画素単位の予測画像を直交変換して第2の画素単位での予測画像周波数空間パターンに変換し出力する第2の予測画像直交変換部と、 前記加算部から出力される原画像周波数空間パターンと前記第2の予測画像直交変換部から出力される予測画像周波数空間パターンとを減算して第2の画素単位での残差画像周波数空間パターンを作成し出力する減算部と、 前記第1又は第2の画素単位での残差画像周波数空間パターンの画質調整を行う画質調整部と、 前記画質調整された前記前記第1又は第2の画素単位での残差画像周波数空間パターンを逆直交変換して実空間残差画像に変換し出力する逆直交変換部と、 前記実空間残差画像と前記選択された予測モードに基づく予測画像から参照画像を作成する参照画像作成部と、 を備えることを特徴とする画像符号化装置。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (基づく第1) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (参照画) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JP2008092137A
CLAIM 1
複数の予測モードに基づき符号化対象ブロックの予測画像を作成し出力する予測画像作成部と、 前記予測画像と原画像の残差画像を作成し出力する残差画像作成部と、 前記残差画像から前記複数の予測モードのうちの1つを選択する予測モード選択部と、 前記選択された予測モードに基づく予測画像を直交変換して予測画像周波数空間パターンに変換し出力する予測画像直交変換部と、 前記原画像を直交変換して原画像周波数空間パターンに変換し出力する原画像直交変換部と、 前記原画像周波数空間パターンと前記予測画像周波数空間パターンから残差画像周波数空間パターンを算出し出力する残差画像周波数空間パターン算出部と、 前記残差画像周波数空間パターンの画質調整を行う画質調整部と、 前記画質調整された残差画像周波数空間パターンを逆直交変換して実空間残差画像に変換し出力する逆直交変換部と、 前記実空間残差画像と前記選択された予測モードに基づく予測画像から参照画 (intra-prediction mode) 像を作成する参照画像作成部と、 を備えることを特徴とする画像符号化装置。

JP2008092137A
CLAIM 2
複数の予測モードに基づき第1の画素単位又は第2の画素単位の符号化対象ブロックの予測画像を作成し出力する予測画像作成部と、 前記予測画像と原画像の残差画像を作成し出力する残差画像作成部と、 前記残差画像から前記複数の予測モードのうちの1つを選択する予測モード選択部と、 前記選択された予測モードに基づく前記第1の画素単位の予測画像と前記原画像とから前記第1の画素単位の残差画像を作成し出力する残差画像作成部と、 前記第1の画素単位の残差画像を直交変換して前記第1の画素単位での残差画像周波数空間パターンに変換し出力する第1の残差画像直交変換部と、 前記選択された予測モードに基づく第1 (third syntax) の画素単位の予測画像を直交変換して第1の画素単位での予測画像周波数空間パターンに変換し出力する第1の予測画像直交変換部と、 前記第1の残差画像直交変換部から出力される残差画像周波数空間パターンと前記第1の予測画像直交変換部から出力される予測画像周波数空間パターンとを加算して原画像周波数空間パターンを作成し出力する加算部と、 前記選択された予測モードに基づく第2の画素単位の予測画像を直交変換して第2の画素単位での予測画像周波数空間パターンに変換し出力する第2の予測画像直交変換部と、 前記加算部から出力される原画像周波数空間パターンと前記第2の予測画像直交変換部から出力される予測画像周波数空間パターンとを減算して第2の画素単位での残差画像周波数空間パターンを作成し出力する減算部と、 前記第1又は第2の画素単位での残差画像周波数空間パターンの画質調整を行う画質調整部と、 前記画質調整された前記前記第1又は第2の画素単位での残差画像周波数空間パターンを逆直交変換して実空間残差画像に変換し出力する逆直交変換部と、 前記実空間残差画像と前記選択された予測モードに基づく予測画像から参照画像を作成する参照画像作成部と、 を備えることを特徴とする画像符号化装置。

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (参照画) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
JP2008092137A
CLAIM 1
複数の予測モードに基づき符号化対象ブロックの予測画像を作成し出力する予測画像作成部と、 前記予測画像と原画像の残差画像を作成し出力する残差画像作成部と、 前記残差画像から前記複数の予測モードのうちの1つを選択する予測モード選択部と、 前記選択された予測モードに基づく予測画像を直交変換して予測画像周波数空間パターンに変換し出力する予測画像直交変換部と、 前記原画像を直交変換して原画像周波数空間パターンに変換し出力する原画像直交変換部と、 前記原画像周波数空間パターンと前記予測画像周波数空間パターンから残差画像周波数空間パターンを算出し出力する残差画像周波数空間パターン算出部と、 前記残差画像周波数空間パターンの画質調整を行う画質調整部と、 前記画質調整された残差画像周波数空間パターンを逆直交変換して実空間残差画像に変換し出力する逆直交変換部と、 前記実空間残差画像と前記選択された予測モードに基づく予測画像から参照画 (intra-prediction mode) 像を作成する参照画像作成部と、 を備えることを特徴とする画像符号化装置。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (基づく第1) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (参照画) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JP2008092137A
CLAIM 1
複数の予測モードに基づき符号化対象ブロックの予測画像を作成し出力する予測画像作成部と、 前記予測画像と原画像の残差画像を作成し出力する残差画像作成部と、 前記残差画像から前記複数の予測モードのうちの1つを選択する予測モード選択部と、 前記選択された予測モードに基づく予測画像を直交変換して予測画像周波数空間パターンに変換し出力する予測画像直交変換部と、 前記原画像を直交変換して原画像周波数空間パターンに変換し出力する原画像直交変換部と、 前記原画像周波数空間パターンと前記予測画像周波数空間パターンから残差画像周波数空間パターンを算出し出力する残差画像周波数空間パターン算出部と、 前記残差画像周波数空間パターンの画質調整を行う画質調整部と、 前記画質調整された残差画像周波数空間パターンを逆直交変換して実空間残差画像に変換し出力する逆直交変換部と、 前記実空間残差画像と前記選択された予測モードに基づく予測画像から参照画 (intra-prediction mode) 像を作成する参照画像作成部と、 を備えることを特徴とする画像符号化装置。

JP2008092137A
CLAIM 2
複数の予測モードに基づき第1の画素単位又は第2の画素単位の符号化対象ブロックの予測画像を作成し出力する予測画像作成部と、 前記予測画像と原画像の残差画像を作成し出力する残差画像作成部と、 前記残差画像から前記複数の予測モードのうちの1つを選択する予測モード選択部と、 前記選択された予測モードに基づく前記第1の画素単位の予測画像と前記原画像とから前記第1の画素単位の残差画像を作成し出力する残差画像作成部と、 前記第1の画素単位の残差画像を直交変換して前記第1の画素単位での残差画像周波数空間パターンに変換し出力する第1の残差画像直交変換部と、 前記選択された予測モードに基づく第1 (third syntax) の画素単位の予測画像を直交変換して第1の画素単位での予測画像周波数空間パターンに変換し出力する第1の予測画像直交変換部と、 前記第1の残差画像直交変換部から出力される残差画像周波数空間パターンと前記第1の予測画像直交変換部から出力される予測画像周波数空間パターンとを加算して原画像周波数空間パターンを作成し出力する加算部と、 前記選択された予測モードに基づく第2の画素単位の予測画像を直交変換して第2の画素単位での予測画像周波数空間パターンに変換し出力する第2の予測画像直交変換部と、 前記加算部から出力される原画像周波数空間パターンと前記第2の予測画像直交変換部から出力される予測画像周波数空間パターンとを減算して第2の画素単位での残差画像周波数空間パターンを作成し出力する減算部と、 前記第1又は第2の画素単位での残差画像周波数空間パターンの画質調整を行う画質調整部と、 前記画質調整された前記前記第1又は第2の画素単位での残差画像周波数空間パターンを逆直交変換して実空間残差画像に変換し出力する逆直交変換部と、 前記実空間残差画像と前記選択された予測モードに基づく予測画像から参照画像を作成する参照画像作成部と、 を備えることを特徴とする画像符号化装置。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
EP1811786A1

Filed: 2006-09-06     Issued: 2007-07-25

Encoding apparatus, encoding method and program thereof, and decoding apparatus, decoding method and program thereof

(Original Assignee) Sony Corp     (Current Assignee) Sony Corp

Liming Wang, Teruhiko Suzuki, Yoichi Yagasaki
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
EP1811786A1
CLAIM 10
An encoding method (encoding mode) for encoding progressive image data and interlace image data corresponding to the progressive image data , the encoding method including : a first step of encoding picture data composing the interlace image data to generate first encoded data and decoding and recomposing the first encoded data to generate recomposed image data , a second step of up sampling the recomposed image data generated at the first step to generate image data having the same resolution as that of the progressive image data , and a third step of encoding the picture data composing the progressive image data using the image data generated at the second step as predictive image data to generate second encoded data .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
EP1811786A1
CLAIM 10
An encoding method (encoding mode) for encoding progressive image data and interlace image data corresponding to the progressive image data , the encoding method including : a first step of encoding picture data composing the interlace image data to generate first encoded data and decoding and recomposing the first encoded data to generate recomposed image data , a second step of up sampling the recomposed image data generated at the first step to generate image data having the same resolution as that of the progressive image data , and a third step of encoding the picture data composing the progressive image data using the image data generated at the second step as predictive image data to generate second encoded data .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (motion prediction) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
EP1811786A1
CLAIM 7
An encoding apparatus as set forth in claim 1 , wherein the second encoding unit selects predictive image data minimizing the difference from the picture data to be encoded from among predictive image data generated by intra-prediction using the picture data to be encoded , predictive image data generated by motion prediction (video blocks) and/or compensation , and predictive image generated by the up sample processing unit and encodes the difference between the picture data to be encoded and the selected predictive image data .

EP1811786A1
CLAIM 10
An encoding method (encoding mode) for encoding progressive image data and interlace image data corresponding to the progressive image data , the encoding method including : a first step of encoding picture data composing the interlace image data to generate first encoded data and decoding and recomposing the first encoded data to generate recomposed image data , a second step of up sampling the recomposed image data generated at the first step to generate image data having the same resolution as that of the progressive image data , and a third step of encoding the picture data composing the progressive image data using the image data generated at the second step as predictive image data to generate second encoded data .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
EP1811786A1
CLAIM 10
An encoding method (encoding mode) for encoding progressive image data and interlace image data corresponding to the progressive image data , the encoding method including : a first step of encoding picture data composing the interlace image data to generate first encoded data and decoding and recomposing the first encoded data to generate recomposed image data , a second step of up sampling the recomposed image data generated at the first step to generate image data having the same resolution as that of the progressive image data , and a third step of encoding the picture data composing the progressive image data using the image data generated at the second step as predictive image data to generate second encoded data .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
EP1811786A1
CLAIM 10
An encoding method (encoding mode) for encoding progressive image data and interlace image data corresponding to the progressive image data , the encoding method including : a first step of encoding picture data composing the interlace image data to generate first encoded data and decoding and recomposing the first encoded data to generate recomposed image data , a second step of up sampling the recomposed image data generated at the first step to generate image data having the same resolution as that of the progressive image data , and a third step of encoding the picture data composing the progressive image data using the image data generated at the second step as predictive image data to generate second encoded data .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
EP1811786A1
CLAIM 10
An encoding method (encoding mode) for encoding progressive image data and interlace image data corresponding to the progressive image data , the encoding method including : a first step of encoding picture data composing the interlace image data to generate first encoded data and decoding and recomposing the first encoded data to generate recomposed image data , a second step of up sampling the recomposed image data generated at the first step to generate image data having the same resolution as that of the progressive image data , and a third step of encoding the picture data composing the progressive image data using the image data generated at the second step as predictive image data to generate second encoded data .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
CN101213842A

Filed: 2006-06-29     Issued: 2008-07-02

用于使用运动补偿时域滤波的视频编码中的更新步骤的方法和装置

(Original Assignee) 诺基亚公司     

王祥林, M·卡克扎威克兹, 鲍亦亮, J·里奇
US9930365B2
CLAIM 1
. A method of decoding video data (的视频数据) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (关于参) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
CN101213842A
CLAIM 1
. 一种使用运动补偿时域滤波对数字视频序列进行编码以用于提供具有代表已编码视频序列的视频数据 (video data) 的比特流的方法,所述数字视频序列包括多个帧,其中每个帧包括可被划分成多个宏块中的像素的阵列,所述方法包括: 对于宏块, 选择宏块模式; 基于所述宏块模式将宏块分割为多个块; 基于关于参 (encoding mode) 考视频帧的运动补偿预测以及运动向量来在块 上执行预测操作,以提供相应的预测残差块;以及 基于关于所述预测残差块和所述宏块模式的运动补偿预测, 并且还基于所述运动向量的逆向,更新所述视频参考帧。

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (关于参) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
CN101213842A
CLAIM 1
. 一种使用运动补偿时域滤波对数字视频序列进行编码以用于提供具有代表已编码视频序列的视频数据的比特流的方法,所述数字视频序列包括多个帧,其中每个帧包括可被划分成多个宏块中的像素的阵列,所述方法包括: 对于宏块, 选择宏块模式; 基于所述宏块模式将宏块分割为多个块; 基于关于参 (encoding mode) 考视频帧的运动补偿预测以及运动向量来在块 上执行预测操作,以提供相应的预测残差块;以及 基于关于所述预测残差块和所述宏块模式的运动补偿预测, 并且还基于所述运动向量的逆向,更新所述视频参考帧。

US9930365B2
CLAIM 7
. A device for decoding video data (的视频数据) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (关于参) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
CN101213842A
CLAIM 1
. 一种使用运动补偿时域滤波对数字视频序列进行编码以用于提供具有代表已编码视频序列的视频数据 (video data) 的比特流的方法,所述数字视频序列包括多个帧,其中每个帧包括可被划分成多个宏块中的像素的阵列,所述方法包括: 对于宏块, 选择宏块模式; 基于所述宏块模式将宏块分割为多个块; 基于关于参 (encoding mode) 考视频帧的运动补偿预测以及运动向量来在块 上执行预测操作,以提供相应的预测残差块;以及 基于关于所述预测残差块和所述宏块模式的运动补偿预测, 并且还基于所述运动向量的逆向,更新所述视频参考帧。

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (关于参) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
CN101213842A
CLAIM 1
. 一种使用运动补偿时域滤波对数字视频序列进行编码以用于提供具有代表已编码视频序列的视频数据的比特流的方法,所述数字视频序列包括多个帧,其中每个帧包括可被划分成多个宏块中的像素的阵列,所述方法包括: 对于宏块, 选择宏块模式; 基于所述宏块模式将宏块分割为多个块; 基于关于参 (encoding mode) 考视频帧的运动补偿预测以及运动向量来在块 上执行预测操作,以提供相应的预测残差块;以及 基于关于所述预测残差块和所述宏块模式的运动补偿预测, 并且还基于所述运动向量的逆向,更新所述视频参考帧。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (的视频数据) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (关于参) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
CN101213842A
CLAIM 1
. 一种使用运动补偿时域滤波对数字视频序列进行编码以用于提供具有代表已编码视频序列的视频数据 (video data) 的比特流的方法,所述数字视频序列包括多个帧,其中每个帧包括可被划分成多个宏块中的像素的阵列,所述方法包括: 对于宏块, 选择宏块模式; 基于所述宏块模式将宏块分割为多个块; 基于关于参 (encoding mode) 考视频帧的运动补偿预测以及运动向量来在块 上执行预测操作,以提供相应的预测残差块;以及 基于关于所述预测残差块和所述宏块模式的运动补偿预测, 并且还基于所述运动向量的逆向,更新所述视频参考帧。

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (关于参) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
CN101213842A
CLAIM 1
. 一种使用运动补偿时域滤波对数字视频序列进行编码以用于提供具有代表已编码视频序列的视频数据的比特流的方法,所述数字视频序列包括多个帧,其中每个帧包括可被划分成多个宏块中的像素的阵列,所述方法包括: 对于宏块, 选择宏块模式; 基于所述宏块模式将宏块分割为多个块; 基于关于参 (encoding mode) 考视频帧的运动补偿预测以及运动向量来在块 上执行预测操作,以提供相应的预测残差块;以及 基于关于所述预测残差块和所述宏块模式的运动补偿预测, 并且还基于所述运动向量的逆向,更新所述视频参考帧。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
CN101189641A

Filed: 2006-05-09     Issued: 2008-05-28

编码数字图像的像素或体素的方法及处理数字图像的方法

(Original Assignee) 布雷克成像有限公司     

I·戈里, M·马蒂乌齐
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (数值数据) of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (来确定的) of an intra-prediction mode (来确定的) and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
CN101189641A
CLAIM 11
. 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。

CN101189641A
CLAIM 21
. 根据前述权利要求11到20中的一个或多个所述的方法,其特征 在于,执行基于两种、三种或更多的滤波算法的两个、三个或更多的滤波 步骤;各种滤波算法(Al, A2 , . . . , Am)确定对所述图像的像素或体素的外 观进行定义的参数的值的变换(fl,f2,…,fm) ;
对于形成所述二维或三维图像的各个像素(P(i , j)) , 定义仅包括所述 像素(P(i , j))或所述体素的窗口 ;通过向量对所述图像的各个像素(P(i,j))或各个体素或者选定数目的 像素(P(i,j))或体素进行编码,所述向量的分量被定义为,通过定义所述 像素(P(i,j))或体素的所述参数的原始值的所述两种、三种或更多的变换 (fl, O, . . . , fm)来确定的 (intra-prediction mode, encoding mode comprises one) 各个像素或各个体素的参数的值。

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (数值数据) .
CN101189641A
CLAIM 11
. 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (数值数据) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (来确定的) of an intra-prediction mode (来确定的) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
CN101189641A
CLAIM 11
. 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。

CN101189641A
CLAIM 21
. 根据前述权利要求11到20中的一个或多个所述的方法,其特征 在于,执行基于两种、三种或更多的滤波算法的两个、三个或更多的滤波 步骤;各种滤波算法(Al, A2 , . . . , Am)确定对所述图像的像素或体素的外 观进行定义的参数的值的变换(fl,f2,…,fm) ;
对于形成所述二维或三维图像的各个像素(P(i , j)) , 定义仅包括所述 像素(P(i , j))或所述体素的窗口 ;通过向量对所述图像的各个像素(P(i,j))或各个体素或者选定数目的 像素(P(i,j))或体素进行编码,所述向量的分量被定义为,通过定义所述 像素(P(i,j))或体素的所述参数的原始值的所述两种、三种或更多的变换 (fl, O, . . . , fm)来确定的 (intra-prediction mode, encoding mode comprises one) 各个像素或各个体素的参数的值。

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (数值数据) .
CN101189641A
CLAIM 11
. 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (来确定的) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
CN101189641A
CLAIM 21
. 根据前述权利要求11到20中的一个或多个所述的方法,其特征 在于,执行基于两种、三种或更多的滤波算法的两个、三个或更多的滤波 步骤;各种滤波算法(Al, A2 , . . . , Am)确定对所述图像的像素或体素的外 观进行定义的参数的值的变换(fl,f2,…,fm) ;
对于形成所述二维或三维图像的各个像素(P(i , j)) , 定义仅包括所述 像素(P(i , j))或所述体素的窗口 ;通过向量对所述图像的各个像素(P(i,j))或各个体素或者选定数目的 像素(P(i,j))或体素进行编码,所述向量的分量被定义为,通过定义所述 像素(P(i,j))或体素的所述参数的原始值的所述两种、三种或更多的变换 (fl, O, . . . , fm)来确定的 (intra-prediction mode, encoding mode comprises one) 各个像素或各个体素的参数的值。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size (数值数据) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (来确定的) of an intra-prediction mode (来确定的) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
CN101189641A
CLAIM 11
. 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。

CN101189641A
CLAIM 21
. 根据前述权利要求11到20中的一个或多个所述的方法,其特征 在于,执行基于两种、三种或更多的滤波算法的两个、三个或更多的滤波 步骤;各种滤波算法(Al, A2 , . . . , Am)确定对所述图像的像素或体素的外 观进行定义的参数的值的变换(fl,f2,…,fm) ;
对于形成所述二维或三维图像的各个像素(P(i , j)) , 定义仅包括所述 像素(P(i , j))或所述体素的窗口 ;通过向量对所述图像的各个像素(P(i,j))或各个体素或者选定数目的 像素(P(i,j))或体素进行编码,所述向量的分量被定义为,通过定义所述 像素(P(i,j))或体素的所述参数的原始值的所述两种、三种或更多的变换 (fl, O, . . . , fm)来确定的 (intra-prediction mode, encoding mode comprises one) 各个像素或各个体素的参数的值。

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (数值数据) .
CN101189641A
CLAIM 11
. 根据前述权利要求中的一个或多个所述的方法,其特征在于,表 示所述图像的像素或体素的所述矩阵的数值数据 (minimum size) ,在所述像素或体素的编 码之前,经历一个或多个预处理步骤。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
KR20060027795A

Filed: 2005-11-21     Issued: 2006-03-28

하이브리드 비디오 압축 방법

(Original Assignee) 에이엠티 어드밴스트 멀티미디어 테그놀로지 에이비     

요제프 카롤리, 피터 토드
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (프레임들의, 편차값을) of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
KR20060027795A
CLAIM 4
제3항에 있어서 , 상기 편차 분석은 다음의 수식 을 이용하여 수행되고 , 상기 수식에서 , M은 주어진 블록 또는 서브-블록 내의 요소(element)의 수이고 , pixel(i) 는 압축되지 않은(원래의) 블록의 요소이며 , 상기 편차가 주어진 임계값을 초과하였는지 확인하기 위해 상기 계산된 편차값을 (encoding mode comprises one) 상기 주어진 임계값과 비교하는 것을 특징으로 하는 압축 방법 .

KR20060027795A
CLAIM 30
제29항에 있어서 , 상기 부호화할 블록을 서브-블록들로 분할한 것이 만족스런 결과를 가져오지 못한 경우에 , 다른 참조 프레임들의 (encoding mode comprises one) 최상의 위치의 주변에서 참조 서브-블록의 검색을 수행하는 것을 특징으로 하는 압축 방법 .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (프레임들의, 편차값을) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
KR20060027795A
CLAIM 4
제3항에 있어서 , 상기 편차 분석은 다음의 수식 을 이용하여 수행되고 , 상기 수식에서 , M은 주어진 블록 또는 서브-블록 내의 요소(element)의 수이고 , pixel(i) 는 압축되지 않은(원래의) 블록의 요소이며 , 상기 편차가 주어진 임계값을 초과하였는지 확인하기 위해 상기 계산된 편차값을 (encoding mode comprises one) 상기 주어진 임계값과 비교하는 것을 특징으로 하는 압축 방법 .

KR20060027795A
CLAIM 30
제29항에 있어서 , 상기 부호화할 블록을 서브-블록들로 분할한 것이 만족스런 결과를 가져오지 못한 경우에 , 다른 참조 프레임들의 (encoding mode comprises one) 최상의 위치의 주변에서 참조 서브-블록의 검색을 수행하는 것을 특징으로 하는 압축 방법 .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (프레임들의, 편차값을) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
KR20060027795A
CLAIM 4
제3항에 있어서 , 상기 편차 분석은 다음의 수식 을 이용하여 수행되고 , 상기 수식에서 , M은 주어진 블록 또는 서브-블록 내의 요소(element)의 수이고 , pixel(i) 는 압축되지 않은(원래의) 블록의 요소이며 , 상기 편차가 주어진 임계값을 초과하였는지 확인하기 위해 상기 계산된 편차값을 (encoding mode comprises one) 상기 주어진 임계값과 비교하는 것을 특징으로 하는 압축 방법 .

KR20060027795A
CLAIM 30
제29항에 있어서 , 상기 부호화할 블록을 서브-블록들로 분할한 것이 만족스런 결과를 가져오지 못한 경우에 , 다른 참조 프레임들의 (encoding mode comprises one) 최상의 위치의 주변에서 참조 서브-블록의 검색을 수행하는 것을 특징으로 하는 압축 방법 .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
EP1655967A1

Filed: 2005-11-07     Issued: 2006-05-10

Method and apparatus for encoding and decoding image data

(Original Assignee) Samsung Electronics Co Ltd     (Current Assignee) Samsung Electronics Co Ltd

Kim Wooshik
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
EP1655967A1
CLAIM 7
The image data encoding method of any preceding claim , further comprising : spatially predicting pixel values of a current block (current block) using blocks spatially adjacent to the current block , or temporally predicting pixel values of the current block using a frame previous to the current block , and then beginning the process of dividing the block into the sub-blocks .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (current block) .
EP1655967A1
CLAIM 7
The image data encoding method of any preceding claim , further comprising : spatially predicting pixel values of a current block (current block) using blocks spatially adjacent to the current block , or temporally predicting pixel values of the current block using a frame previous to the current block , and then beginning the process of dividing the block into the sub-blocks .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (current block) has a size of at least 64×64 pixels .
EP1655967A1
CLAIM 7
The image data encoding method of any preceding claim , further comprising : spatially predicting pixel values of a current block (current block) using blocks spatially adjacent to the current block , or temporally predicting pixel values of the current block using a frame previous to the current block , and then beginning the process of dividing the block into the sub-blocks .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
EP1655967A1
CLAIM 7
The image data encoding method of any preceding claim , further comprising : spatially predicting pixel values of a current block (current block) using blocks spatially adjacent to the current block , or temporally predicting pixel values of the current block using a frame previous to the current block , and then beginning the process of dividing the block into the sub-blocks .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (current block) .
EP1655967A1
CLAIM 7
The image data encoding method of any preceding claim , further comprising : spatially predicting pixel values of a current block (current block) using blocks spatially adjacent to the current block , or temporally predicting pixel values of the current block using a frame previous to the current block , and then beginning the process of dividing the block into the sub-blocks .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (current block) has a size of at least 64×64 pixels .
EP1655967A1
CLAIM 7
The image data encoding method of any preceding claim , further comprising : spatially predicting pixel values of a current block (current block) using blocks spatially adjacent to the current block , or temporally predicting pixel values of the current block using a frame previous to the current block , and then beginning the process of dividing the block into the sub-blocks .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
EP1655967A1
CLAIM 7
The image data encoding method of any preceding claim , further comprising : spatially predicting pixel values of a current block (current block) using blocks spatially adjacent to the current block , or temporally predicting pixel values of the current block using a frame previous to the current block , and then beginning the process of dividing the block into the sub-blocks .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (current block) .
EP1655967A1
CLAIM 7
The image data encoding method of any preceding claim , further comprising : spatially predicting pixel values of a current block (current block) using blocks spatially adjacent to the current block , or temporally predicting pixel values of the current block using a frame previous to the current block , and then beginning the process of dividing the block into the sub-blocks .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (current block) has a size of at least 64×64 pixels .
EP1655967A1
CLAIM 7
The image data encoding method of any preceding claim , further comprising : spatially predicting pixel values of a current block (current block) using blocks spatially adjacent to the current block , or temporally predicting pixel values of the current block using a frame previous to the current block , and then beginning the process of dividing the block into the sub-blocks .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20050135484A1

Filed: 2004-06-07     Issued: 2005-06-23

Method of encoding mode determination, method of motion estimation and encoding apparatus

(Original Assignee) Samsung Electronics Co Ltd; Sejong University Industry-Academy Cooperation Group     (Current Assignee) Samsung Electronics Co Ltd ; Sejong University Industry-Academy Cooperation Group

Yung-Lyul Lee, Ki-hoon Han
US9930365B2
CLAIM 1
. A method of decoding video data (discrete cosine) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (other ones) of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (current picture) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (identical size, divided blocks) of a plurality of blocks of the sequence of pictures has a starting size (other ones) equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20050135484A1
CLAIM 2
. The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ;
and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison .

US20050135484A1
CLAIM 10
. A method of motion estimation of a macro block comprising : performing motion estimation by using an entire macro block ;
dividing the macro block into two in the horizontal direction or the vertical direction and then performing motion estimation by using each divided block ;
dividing the macro block into 4 blocks of an identical size (current block) and performing motion estimation by using each divided block ;
dividing each of the four divided blocks (current block) into two in the horizontal direction or the vertical direction and performing motion estimation by using each further divided block ;
and dividing each of the four divided blocks into four and performing motion estimation by using each further divided block .

US20050135484A1
CLAIM 14
. An encoding apparatus comprising : a DCT+Q performing unit which receives picture data and performs discrete cosine (decoding video data) transform (DCT) and quantization ;
a rate-distortion optimization unit which calculates a rate-distortion cost of the picture data and determines an encoding block mode to be used in encoding the picture , and transfers the determined block mode to the DCT+Q performing unit ;
and a motion estimator and a motion compensator which by using a reference frame and the input picture , performs motion estimation and compensation and transfers the result to the DCT+Q performing unit .

US20050135484A1
CLAIM 16
. A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ;
determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ;
calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size, starting size) of the plurality of modes ;
selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ;
and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (current picture) and greater than the minimum size (other ones) .
US20050135484A1
CLAIM 2
. The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ;
and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison .

US20050135484A1
CLAIM 16
. A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ;
determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ;
calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size, starting size) of the plurality of modes ;
selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ;
and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (identical size, divided blocks) .
US20050135484A1
CLAIM 10
. A method of motion estimation of a macro block comprising : performing motion estimation by using an entire macro block ;
dividing the macro block into two in the horizontal direction or the vertical direction and then performing motion estimation by using each divided block ;
dividing the macro block into 4 blocks of an identical size (current block) and performing motion estimation by using each divided block ;
dividing each of the four divided blocks (current block) into two in the horizontal direction or the vertical direction and performing motion estimation by using each further divided block ;
and dividing each of the four divided blocks into four and performing motion estimation by using each further divided block .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (identical size, divided blocks) has a size of at least 64×64 pixels .
US20050135484A1
CLAIM 10
. A method of motion estimation of a macro block comprising : performing motion estimation by using an entire macro block ;
dividing the macro block into two in the horizontal direction or the vertical direction and then performing motion estimation by using each divided block ;
dividing the macro block into 4 blocks of an identical size (current block) and performing motion estimation by using each divided block ;
dividing each of the four divided blocks (current block) into two in the horizontal direction or the vertical direction and performing motion estimation by using each further divided block ;
and dividing each of the four divided blocks into four and performing motion estimation by using each further divided block .

US9930365B2
CLAIM 7
. A device for decoding video data (discrete cosine) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (other ones) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (current picture) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (identical size, divided blocks) of a plurality of blocks of the sequence of pictures has a starting size (other ones) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20050135484A1
CLAIM 2
. The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ;
and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison .

US20050135484A1
CLAIM 10
. A method of motion estimation of a macro block comprising : performing motion estimation by using an entire macro block ;
dividing the macro block into two in the horizontal direction or the vertical direction and then performing motion estimation by using each divided block ;
dividing the macro block into 4 blocks of an identical size (current block) and performing motion estimation by using each divided block ;
dividing each of the four divided blocks (current block) into two in the horizontal direction or the vertical direction and performing motion estimation by using each further divided block ;
and dividing each of the four divided blocks into four and performing motion estimation by using each further divided block .

US20050135484A1
CLAIM 14
. An encoding apparatus comprising : a DCT+Q performing unit which receives picture data and performs discrete cosine (decoding video data) transform (DCT) and quantization ;
a rate-distortion optimization unit which calculates a rate-distortion cost of the picture data and determines an encoding block mode to be used in encoding the picture , and transfers the determined block mode to the DCT+Q performing unit ;
and a motion estimator and a motion compensator which by using a reference frame and the input picture , performs motion estimation and compensation and transfers the result to the DCT+Q performing unit .

US20050135484A1
CLAIM 16
. A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ;
determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ;
calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size, starting size) of the plurality of modes ;
selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ;
and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (current picture) and greater than the minimum size (other ones) .
US20050135484A1
CLAIM 2
. The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ;
and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison .

US20050135484A1
CLAIM 16
. A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ;
determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ;
calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size, starting size) of the plurality of modes ;
selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ;
and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (identical size, divided blocks) .
US20050135484A1
CLAIM 10
. A method of motion estimation of a macro block comprising : performing motion estimation by using an entire macro block ;
dividing the macro block into two in the horizontal direction or the vertical direction and then performing motion estimation by using each divided block ;
dividing the macro block into 4 blocks of an identical size (current block) and performing motion estimation by using each divided block ;
dividing each of the four divided blocks (current block) into two in the horizontal direction or the vertical direction and performing motion estimation by using each further divided block ;
and dividing each of the four divided blocks into four and performing motion estimation by using each further divided block .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (identical size, divided blocks) has a size of at least 64×64 pixels .
US20050135484A1
CLAIM 10
. A method of motion estimation of a macro block comprising : performing motion estimation by using an entire macro block ;
dividing the macro block into two in the horizontal direction or the vertical direction and then performing motion estimation by using each divided block ;
dividing the macro block into 4 blocks of an identical size (current block) and performing motion estimation by using each divided block ;
dividing each of the four divided blocks (current block) into two in the horizontal direction or the vertical direction and performing motion estimation by using each further divided block ;
and dividing each of the four divided blocks into four and performing motion estimation by using each further divided block .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size (other ones) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (current picture) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (identical size, divided blocks) of a plurality of blocks of the sequence of pictures has a starting size (other ones) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20050135484A1
CLAIM 2
. The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ;
and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison .

US20050135484A1
CLAIM 10
. A method of motion estimation of a macro block comprising : performing motion estimation by using an entire macro block ;
dividing the macro block into two in the horizontal direction or the vertical direction and then performing motion estimation by using each divided block ;
dividing the macro block into 4 blocks of an identical size (current block) and performing motion estimation by using each divided block ;
dividing each of the four divided blocks (current block) into two in the horizontal direction or the vertical direction and performing motion estimation by using each further divided block ;
and dividing each of the four divided blocks into four and performing motion estimation by using each further divided block .

US20050135484A1
CLAIM 16
. A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ;
determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ;
calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size, starting size) of the plurality of modes ;
selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ;
and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (current picture) and greater than the minimum size (other ones) .
US20050135484A1
CLAIM 2
. The method of claim 1 , wherein the determining of whether to further perform motion estimation in P8×8 mode comprises : for each of the Inter16×16 mode , the Inter16×8 mode , and the Inter8×16 mode , calculating a sum of absolute difference (SAD) value , which is a difference of a pixel value of a current picture (maximum size) and a pixel value of a previous picture , and calculating MVcost , which is a difference value of a motion vector of the current picture and a motion vector of the previous picture ;
and comparing a sum of the SAD and MVcost values (SAD+MVcost) of the respective modes and determining whether to further perform motion estimation in the P8×8 mode based on the comparison .

US20050135484A1
CLAIM 16
. A method of determining an encoding mode , comprising : performing motion estimation of a macro block in a plurality of modes including an inter16×16 mode , an inter16×8 mode , and an inter8×16 mode ;
determining a SAD value , an MVcost value and a sum of the SAD value and the MVcost value (SAD+MVcost) for each of the plurality of modes , the SAD value being a difference of a pixel value of a current picture and a pixel value of a previous picture and the MVcost value being a difference value of a motion vector of the current picture and a motion vector of a previous picture ;
calculating a rate distortion cost (RDcost) in each of the plurality of modes and in a skip mode , if the (SAD+MVcost) for the inter 16×16mode is smaller than the (SAD+MVcost) corresponding to the other ones (minimum size, starting size) of the plurality of modes ;
selecting the mode having the smallest RDcost from among the plurality of modes and the skip mode ;
and if the RDcost of the selected mode is less than M , determining the selected mode as the encoding mode , where M is a mean RDcost value of a plurality Intra macro blocks encoded in a spatial prediction mode and occurring in previous frames .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (identical size, divided blocks) .
US20050135484A1
CLAIM 10
. A method of motion estimation of a macro block comprising : performing motion estimation by using an entire macro block ;
dividing the macro block into two in the horizontal direction or the vertical direction and then performing motion estimation by using each divided block ;
dividing the macro block into 4 blocks of an identical size (current block) and performing motion estimation by using each divided block ;
dividing each of the four divided blocks (current block) into two in the horizontal direction or the vertical direction and performing motion estimation by using each further divided block ;
and dividing each of the four divided blocks into four and performing motion estimation by using each further divided block .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (identical size, divided blocks) has a size of at least 64×64 pixels .
US20050135484A1
CLAIM 10
. A method of motion estimation of a macro block comprising : performing motion estimation by using an entire macro block ;
dividing the macro block into two in the horizontal direction or the vertical direction and then performing motion estimation by using each divided block ;
dividing the macro block into 4 blocks of an identical size (current block) and performing motion estimation by using each divided block ;
dividing each of the four divided blocks (current block) into two in the horizontal direction or the vertical direction and performing motion estimation by using each further divided block ;
and dividing each of the four divided blocks into four and performing motion estimation by using each further divided block .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
CN1857001A

Filed: 2004-05-20     Issued: 2006-11-01

混合视频压缩方法

(Original Assignee) Amt先进多媒体科技公司     

P·托思, J·卡罗利
US9930365B2
CLAIM 1
. A method of decoding video data (的视频数据) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (的尺寸, 尺寸的) of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
CN1857001A
CLAIM 5
. 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。

CN1857001A
CLAIM 14
. 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size) 和可用于传输编码数据的带宽而修改的。

CN1857001A
CLAIM 60
. 用于从编码序列中解压编码的视频数据 (video data) 的方法,所述编码序列是由按照权利要求1-55中任意一条所描述的压缩方法产生的。

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (的尺寸, 尺寸的) .
CN1857001A
CLAIM 5
. 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。

CN1857001A
CLAIM 14
. 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size) 和可用于传输编码数据的带宽而修改的。

US9930365B2
CLAIM 7
. A device for decoding video data (的视频数据) , the device comprising : a memory configured to store decoded video blocks (块进行编码) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (的尺寸, 尺寸的) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
CN1857001A
CLAIM 5
. 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。

CN1857001A
CLAIM 14
. 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size) 和可用于传输编码数据的带宽而修改的。

CN1857001A
CLAIM 23
. 用于对数字编码的视频帧序列进行压缩的方法,其中某些帧的信息内容根据前导或后继帧(参考帧)的内容进行编码,该方法进一步包括如下步骤:a . 将所述的待编码帧划分成块,b . 在包含所述待编码块的帧之前或后续的参考图像中为给定的待编码块搜索一个匹配的参考块,c . 通过将匹配的参考块与待编码块进行比较来执行一个可压缩性分析,d . 根据可压缩性分析的结果选择最佳参考块,以及e . 使用刚刚选择的最佳参考块对所述的块进行编码 (video blocks) ,其特征在于,在步骤b中,在搜索参考块的过程中:i)将待编码块划分成子块,ii)分析子块的内容,iii)根据预定义的标准,选择预定数量的子块,优选地,至少为两块,iv)使用所选择的子块执行参考块搜索,所述的搜索是在所选择的参考帧中的特定搜索范围内执行的,以便搜索包含与所选择的子块区别最小的子块的参考块,在所述的搜索过程中,所选择块的相对位置保持不变,以及v)使用所选择的子块来选择最佳参考块作为搜索结果。

CN1857001A
CLAIM 60
. 用于从编码序列中解压编码的视频数据 (video data) 的方法,所述编码序列是由按照权利要求1-55中任意一条所描述的压缩方法产生的。

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (的尺寸, 尺寸的) .
CN1857001A
CLAIM 5
. 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。

CN1857001A
CLAIM 14
. 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size) 和可用于传输编码数据的带宽而修改的。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (的视频数据) , the first syntax element representing a minimum size (的尺寸, 尺寸的) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
CN1857001A
CLAIM 5
. 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。

CN1857001A
CLAIM 14
. 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size) 和可用于传输编码数据的带宽而修改的。

CN1857001A
CLAIM 60
. 用于从编码序列中解压编码的视频数据 (video data) 的方法,所述编码序列是由按照权利要求1-55中任意一条所描述的压缩方法产生的。

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (的尺寸, 尺寸的) .
CN1857001A
CLAIM 5
. 按照权利要求1-4中任何一条权利要求的方法,其特征在于:用熵编码对赋予一个给定帧中具有最大允许块尺寸的 (minimum size) 块的特定数据进行编码,所述特定数据表示其被赋予的块的块分割(四杈树)。

CN1857001A
CLAIM 14
. 用于对数字编码的视频帧序列进行压缩的方法,包括如下步骤:a . 将一个给定的帧划分成二维块,b . 建立该帧的一个块分割,在特定情况下,通过将各个块进一步划分成子块,c . 在所述块的信息内容上执行一个变换(DCT),该变换将空间表示转换成频率表示,因而产生了多元素二维变换块(DCT系数矩阵)以及d . 根据外部边界条件修改变换块的元素(量化),并且最后e . 用熵编码对包含在变换块中的信息进行编码。该方法的特色在于:在步骤d中,对多元素二维变换块中的数据的修改是基于所述块的尺寸 (minimum size) 和可用于传输编码数据的带宽而修改的。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO2004104930A2

Filed: 2004-05-20     Issued: 2004-12-02

Hybrid video compression method

(Original Assignee) Amt Advanced Multimedia Technology Ab     

Peter Toth, József KÁROLY
US9930365B2
CLAIM 1
. A method of decoding video data (encoded video data, discrete cosine) , the method comprising : decoding a first syntax (processing element) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (data elements) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (individual blocks, reference blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax (data elements) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (prediction modes) and an inter-prediction mode (prediction modes) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks (current block) into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 6
. The method according to any one of Claims 1-5 , characterised by that discrete cosine (video data, decoding video data) transform (DCT) is applied as the transformation that converts the representation in the spatial domain into a representation in the frequency domain .

WO2004104930A2
CLAIM 9
. The method according to any one of Claims 8 , characterised by that during the prediction of individual blocks prediction is carried out using multiple prediction modes (intra-prediction mode, inter-prediction mode) , with the prediction mode yielding the lowest " ;
sum" ;
value being applied on the given block .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frames (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks (current block) and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

WO2004104930A2
CLAIM 35
. Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ;
characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network .

WO2004104930A2
CLAIM 46
. The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (second syntax, third syntax) stored in the time window .

WO2004104930A2
CLAIM 60
. Method for decompressing encoded video data (video data, decoding video data) from a coded sequence produced by the ' ;
compression method according to any one of Claims 1-55 .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (individual blocks, reference blocks) .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks (current block) into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frames (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks (current block) and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (individual blocks, reference blocks) has a size of at least 64×64 pixels .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks (current block) into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frames (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks (current block) and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

US9930365B2
CLAIM 7
. A device for decoding video data (encoded video data, discrete cosine) , the device comprising : a memory configured to store decoded video blocks (given block) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (processing element) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (data elements) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (individual blocks, reference blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (data elements) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (prediction modes) and an inter-prediction mode (prediction modes) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks (current block) into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 4
. The method according to Claim 3 , characterised by that the variance an using the following formula : M M ∑ pixel - (∑ pixel j) 2 variance = J=0 j=0 M where M is the number of elements in the given block (video blocks) or sub-block and pixel(i) is an element of the uncompressed (original) block , with the computed variance value being compared with a given threshold value to establish if the variance exceeds said given threshold value .

WO2004104930A2
CLAIM 6
. The method according to any one of Claims 1-5 , characterised by that discrete cosine (video data, decoding video data) transform (DCT) is applied as the transformation that converts the representation in the spatial domain into a representation in the frequency domain .

WO2004104930A2
CLAIM 9
. The method according to any one of Claims 8 , characterised by that during the prediction of individual blocks prediction is carried out using multiple prediction modes (intra-prediction mode, inter-prediction mode) , with the prediction mode yielding the lowest " ;
sum" ;
value being applied on the given block .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frames (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks (current block) and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

WO2004104930A2
CLAIM 35
. Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ;
characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network .

WO2004104930A2
CLAIM 46
. The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (second syntax, third syntax) stored in the time window .

WO2004104930A2
CLAIM 60
. Method for decompressing encoded video data (video data, decoding video data) from a coded sequence produced by the ' ;
compression method according to any one of Claims 1-55 .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (individual blocks, reference blocks) .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks (current block) into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frames (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks (current block) and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (prediction modes) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
WO2004104930A2
CLAIM 9
. The method according to any one of Claims 8 , characterised by that during the prediction of individual blocks prediction is carried out using multiple prediction modes (intra-prediction mode, inter-prediction mode) , with the prediction mode yielding the lowest " ;
sum" ;
value being applied on the given block .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (individual blocks, reference blocks) has a size of at least 64×64 pixels .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks (current block) into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frames (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks (current block) and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (processing element) element associated with a sequence of pictures of video data (encoded video data, discrete cosine) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (data elements) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (individual blocks, reference blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (data elements) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (prediction modes) and an inter-prediction mode (prediction modes) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks (current block) into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 6
. The method according to any one of Claims 1-5 , characterised by that discrete cosine (video data, decoding video data) transform (DCT) is applied as the transformation that converts the representation in the spatial domain into a representation in the frequency domain .

WO2004104930A2
CLAIM 9
. The method according to any one of Claims 8 , characterised by that during the prediction of individual blocks prediction is carried out using multiple prediction modes (intra-prediction mode, inter-prediction mode) , with the prediction mode yielding the lowest " ;
sum" ;
value being applied on the given block .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frames (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks (current block) and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

WO2004104930A2
CLAIM 35
. Method for compressing a digitally coded video frame sequence , com a , dividing each frame into blocks that are to be separately coded , b , carrying out on the information content of the blocks a transformation (DCT) that converts spatial representation into frequency representation , producing thereby transformed blocks , and finally e , encoding the information contained in transformed blocks by entropy coding , and applying arithmetic coding as entropy coding , during which - a bit sequence is encoded by modifying the lower and upper limit of an interval as a function of values of consecutive bits of the bit sequence , and - the distribution of the already arrived bits of the sequence is taken into account in the function that modifies the limits of said interval , . < ;
characterised by that - addresses are generated from already arrived bit values of the bit sequence , - said addresses are applied for addressing individual processing element (first syntax) s of a neural network comprising multiple processing elements , and - parameters of the processing element are modified such that the frequency of individual addressing operations and the value of the currently arriving bit of the bit sequence are used as input data , and the output of the neural network is applied for determining a parameter that modifies the lower or upper limit the interval , - after an initial learning phase involving the processing of multiple bits , the upper or lower limits of the interval being determined during the encoding of incoming bits as a function of the output of the neural network .

WO2004104930A2
CLAIM 46
. The method according to Claim 45 , characterised by that the number of neurons in the input layer of the network equals the number of data elements (second syntax, third syntax) stored in the time window .

WO2004104930A2
CLAIM 60
. Method for decompressing encoded video data (video data, decoding video data) from a coded sequence produced by the ' ;
compression method according to any one of Claims 1-55 .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (individual blocks, reference blocks) .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks (current block) into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frames (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks (current block) and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (individual blocks, reference blocks) has a size of at least 64×64 pixels .
WO2004104930A2
CLAIM 1
. Method for compressing a digitally coded video frame sequence , comprising the steps of a , dividing a given frame into blocks , b , optionally , further dividing individual blocks (current block) into smaller blocks , c , modifying the information content of selected blocks relying on information contained in a neighbouring block or blocks (prediction) , d , generating transformed blocks by carrying out on the selected blocks a transformation (DCT) that converts spatial representation into frequency> ;
representation , and finally e , encoding the information content of the transformed blocks by entropy coding , characterised by that * . ' ;
* - i , compressibility analysis is performed on said selected blocks before carrying out the transformation specified in step d , and , depending on the result of the analysis ii , steps c , (prediction) and d , (DCT) are carried out on the block or iii , optionally , the block is further partitioned into sub-blocks , and the compressibility analysis specified in step i , is performed again on the blocks resulting from individual partitioning , and iv , the block partitioning that will potentially yield the best results is chosen relying on results given by steps i and iii , and finally v , the transformation specified in step d , is carried out using the block partitioning with the best potential results , relying on the prediction specified in step c .

WO2004104930A2
CLAIM 23
. Method for compressing a digitally coded video frame sequence , where the information content of certain frames is encoded from the contents of the preceding or subsequent frames (reference frames) , the method further comprising the steps of a , dividing the frame to be encoded into blocks , b , searching a matching reference block for the given block to be encoded in the reference image preceding or following the frame containing said block to be encoded , c , carrying out a compressibility analysis by comparing matched reference blocks (current block) and the block to be encoded , d , selecting the best reference block relying on the result of the compressibility analysis , and e , encoding said block using the best reference block just selected , characterised by that in step b , during the search for reference blocks i) the block to be encoded is divided into sub-blocks , 5 ii) the contents of the sub-blocks are analysed , iii) according to pre-defined criteria , a predetermined number of sub-blocks , preferably at least two , are selected , iv) a reference block search is performed using the selected sub-blocks , said search being performed in a specific search range in the selected reference frame for the reference block 0 . ■ containing sub-blocks that differ the least from , the -selected sub-blocks , with the relative position of the selected blocks kept constant during said search , and . . v) the best reference block is chosen as a result of a search , using the selected sub-blocks .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JP2004254327A

Filed: 2004-02-20     Issued: 2004-09-09

画像符号化方法および画像復号化方法

(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     

Yoshiichiro Kashiwagi, Masayuki Kozuka, Jiuhuai Lu, Shinya Sumino, ルー ジュファイ, 雅之 小塚, 吉一郎 柏木, 眞也 角野
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (ブロックサイズ) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JP2004254327A
CLAIM 5
前記第1のサイズは縦横4画素からなるブロックサイズ (maximum size) を示し、前記第2のサイズは縦横8画素からなるブロックサイズを示す ことを特徴とする請求項4記載の画像符号化方法。

JP2004254327A
CLAIM 23
ピクチャをブロック単位で符号化する画像符号化装置であって、 直交変換すべきブロックのサイズとして、少なくとも2つのサイズの中1つを決定する決定手段と (second syntax) 、 決定されたサイズを単位にブロックを直交変換する変換手段と、 変換手段で得られたブロックデータを符号化する符号化手段と、 符号化後のブロックデータと、決定手段において決定されたサイズに関するサイズ情報とを含む符号列を生成する生成手段と を有することを特徴とする画像符号化装置。

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (ブロックサイズ) and greater than the minimum size .
JP2004254327A
CLAIM 5
前記第1のサイズは縦横4画素からなるブロックサイズ (maximum size) を示し、前記第2のサイズは縦横8画素からなるブロックサイズを示す ことを特徴とする請求項4記載の画像符号化方法。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (ブロックサイズ) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JP2004254327A
CLAIM 5
前記第1のサイズは縦横4画素からなるブロックサイズ (maximum size) を示し、前記第2のサイズは縦横8画素からなるブロックサイズを示す ことを特徴とする請求項4記載の画像符号化方法。

JP2004254327A
CLAIM 23
ピクチャをブロック単位で符号化する画像符号化装置であって、 直交変換すべきブロックのサイズとして、少なくとも2つのサイズの中1つを決定する決定手段と (second syntax) 、 決定されたサイズを単位にブロックを直交変換する変換手段と、 変換手段で得られたブロックデータを符号化する符号化手段と、 符号化後のブロックデータと、決定手段において決定されたサイズに関するサイズ情報とを含む符号列を生成する生成手段と を有することを特徴とする画像符号化装置。

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (ブロックサイズ) and greater than the minimum size .
JP2004254327A
CLAIM 5
前記第1のサイズは縦横4画素からなるブロックサイズ (maximum size) を示し、前記第2のサイズは縦横8画素からなるブロックサイズを示す ことを特徴とする請求項4記載の画像符号化方法。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (ブロックサイズ) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JP2004254327A
CLAIM 5
前記第1のサイズは縦横4画素からなるブロックサイズ (maximum size) を示し、前記第2のサイズは縦横8画素からなるブロックサイズを示す ことを特徴とする請求項4記載の画像符号化方法。

JP2004254327A
CLAIM 23
ピクチャをブロック単位で符号化する画像符号化装置であって、 直交変換すべきブロックのサイズとして、少なくとも2つのサイズの中1つを決定する決定手段と (second syntax) 、 決定されたサイズを単位にブロックを直交変換する変換手段と、 変換手段で得られたブロックデータを符号化する符号化手段と、 符号化後のブロックデータと、決定手段において決定されたサイズに関するサイズ情報とを含む符号列を生成する生成手段と を有することを特徴とする画像符号化装置。

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (ブロックサイズ) and greater than the minimum size .
JP2004254327A
CLAIM 5
前記第1のサイズは縦横4画素からなるブロックサイズ (maximum size) を示し、前記第2のサイズは縦横8画素からなるブロックサイズを示す ことを特徴とする請求項4記載の画像符号化方法。

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one (マトリクス, 逆変換) or more syntax elements representative of partitioning for the current block .
JP2004254327A
CLAIM 6
前記変換ステップにおいて、直交変換後のブロックをそのサイズに応じた重みマトリクス (processor to decode one) を用いて量子化する ことを特徴とする請求項4記載の画像符号化方法。

JP2004254327A
CLAIM 12
ピクチャをブロック単位で復号化する画像復号化方法であって、 符号化されたピクチャを含む符号列を復号することにより、直交変換されたブロックデータとブロックサイズに関するサイズ情報とを抽出する抽出ステップと サイズ情報に従うブロックサイズを単位にブロックデータを逆直交変換する逆変換 (processor to decode one) ステップと を有することを特徴とする画像復号化方法。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20040234144A1

Filed: 2004-02-18     Issued: 2004-11-25

Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program

(Original Assignee) NTT Docomo Inc     (Current Assignee) NTT Docomo Inc

Kazuo Sugimoto, Satoru Adachi, Sadaatsu Kato, Minoru Etoh, Mitsuru Kobayashi, Hiroyuki Yamaguchi
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (predetermined shape) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20040234144A1
CLAIM 1
. An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US20040234144A1
CLAIM 39
. An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (predetermined shape) and greater than the minimum size .
US20040234144A1
CLAIM 1
. An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US20040234144A1
CLAIM 39
. An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (predetermined shape) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20040234144A1
CLAIM 1
. An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US20040234144A1
CLAIM 39
. An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (predetermined shape) and greater than the minimum size .
US20040234144A1
CLAIM 1
. An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US20040234144A1
CLAIM 39
. An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (predetermined shape) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20040234144A1
CLAIM 1
. An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US20040234144A1
CLAIM 39
. An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (predetermined shape) and greater than the minimum size .
US20040234144A1
CLAIM 1
. An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from them subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US20040234144A1
CLAIM 39
. An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US7170937B2

Filed: 2003-05-01     Issued: 2007-01-30

Complexity-scalable intra-frame prediction technique

(Original Assignee) Texas Instruments Inc     (Current Assignee) Texas Instruments Inc

Minhua Zhou
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax (prediction direction) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US7170937B2
CLAIM 1
. A method of motion picture encoding comprising the steps of : defining a plurality of intra frame prediction modes , said plurality of intra frame prediction modes including a DC mode and a plurality of modes having respective differing prediction direction (first syntax, first syntax element) s ;
forming a subset having a predetermined number of said plurality of intra frame prediction modes including said DC mode and some but not all of said plurality of modes having respective differing prediction directions ;
dividing a picture frame into a plurality of sub-blocks ;
selecting one of a low-complexity encoding and a high-complexity encoding ;
upon selection of said low-complexity encoding , determining an intra frame prediction mode for each sub-block from among said subset of intra frame prediction modes ;
upon selection of said high-complexity encoding , determining an intra frame prediction mode for each sub-block from among said plurality of intra frame prediction modes ;
transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder .

US7170937B2
CLAIM 3
. The method of claim 1 , further comprising the steps of : forming a probability table relating the intra frame prediction mode assigned to two adjacent sub-blocks to the intra frame prediction mode assigned to a current sub-block , for each combination of intra frame prediction mode assigned to each of said two adjacent sub-blocks , said probability table including a list of intra frame prediction modes arranged in order of probability of expected occurrence for the current block (current block) from the most probable to the least probable ;
adjusting each list for each combination of said probability table wherein the intra frame prediction modes of both adjacent sub-blocks within are said subset of intra frame prediction modes to include said predetermined number of intra frame prediction modes within said subset of intra frame prediction modes and a further number of intra frame prediction modes outside said subset of intra frame prediction modes ;
and wherein said step of transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder includes transmitting a variable length code reference to an entry in said probability table for each sub-block indicating which prediction mode within said corresponding list is assigned to said sub-block .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (current block) .
US7170937B2
CLAIM 3
. The method of claim 1 , further comprising the steps of : forming a probability table relating the intra frame prediction mode assigned to two adjacent sub-blocks to the intra frame prediction mode assigned to a current sub-block , for each combination of intra frame prediction mode assigned to each of said two adjacent sub-blocks , said probability table including a list of intra frame prediction modes arranged in order of probability of expected occurrence for the current block (current block) from the most probable to the least probable ;
adjusting each list for each combination of said probability table wherein the intra frame prediction modes of both adjacent sub-blocks within are said subset of intra frame prediction modes to include said predetermined number of intra frame prediction modes within said subset of intra frame prediction modes and a further number of intra frame prediction modes outside said subset of intra frame prediction modes ;
and wherein said step of transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder includes transmitting a variable length code reference to an entry in said probability table for each sub-block indicating which prediction mode within said corresponding list is assigned to said sub-block .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (current block) has a size of at least 64×64 pixels .
US7170937B2
CLAIM 3
. The method of claim 1 , further comprising the steps of : forming a probability table relating the intra frame prediction mode assigned to two adjacent sub-blocks to the intra frame prediction mode assigned to a current sub-block , for each combination of intra frame prediction mode assigned to each of said two adjacent sub-blocks , said probability table including a list of intra frame prediction modes arranged in order of probability of expected occurrence for the current block (current block) from the most probable to the least probable ;
adjusting each list for each combination of said probability table wherein the intra frame prediction modes of both adjacent sub-blocks within are said subset of intra frame prediction modes to include said predetermined number of intra frame prediction modes within said subset of intra frame prediction modes and a further number of intra frame prediction modes outside said subset of intra frame prediction modes ;
and wherein said step of transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder includes transmitting a variable length code reference to an entry in said probability table for each sub-block indicating which prediction mode within said corresponding list is assigned to said sub-block .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (prediction direction) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US7170937B2
CLAIM 1
. A method of motion picture encoding comprising the steps of : defining a plurality of intra frame prediction modes , said plurality of intra frame prediction modes including a DC mode and a plurality of modes having respective differing prediction direction (first syntax, first syntax element) s ;
forming a subset having a predetermined number of said plurality of intra frame prediction modes including said DC mode and some but not all of said plurality of modes having respective differing prediction directions ;
dividing a picture frame into a plurality of sub-blocks ;
selecting one of a low-complexity encoding and a high-complexity encoding ;
upon selection of said low-complexity encoding , determining an intra frame prediction mode for each sub-block from among said subset of intra frame prediction modes ;
upon selection of said high-complexity encoding , determining an intra frame prediction mode for each sub-block from among said plurality of intra frame prediction modes ;
transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder .

US7170937B2
CLAIM 3
. The method of claim 1 , further comprising the steps of : forming a probability table relating the intra frame prediction mode assigned to two adjacent sub-blocks to the intra frame prediction mode assigned to a current sub-block , for each combination of intra frame prediction mode assigned to each of said two adjacent sub-blocks , said probability table including a list of intra frame prediction modes arranged in order of probability of expected occurrence for the current block (current block) from the most probable to the least probable ;
adjusting each list for each combination of said probability table wherein the intra frame prediction modes of both adjacent sub-blocks within are said subset of intra frame prediction modes to include said predetermined number of intra frame prediction modes within said subset of intra frame prediction modes and a further number of intra frame prediction modes outside said subset of intra frame prediction modes ;
and wherein said step of transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder includes transmitting a variable length code reference to an entry in said probability table for each sub-block indicating which prediction mode within said corresponding list is assigned to said sub-block .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (current block) .
US7170937B2
CLAIM 3
. The method of claim 1 , further comprising the steps of : forming a probability table relating the intra frame prediction mode assigned to two adjacent sub-blocks to the intra frame prediction mode assigned to a current sub-block , for each combination of intra frame prediction mode assigned to each of said two adjacent sub-blocks , said probability table including a list of intra frame prediction modes arranged in order of probability of expected occurrence for the current block (current block) from the most probable to the least probable ;
adjusting each list for each combination of said probability table wherein the intra frame prediction modes of both adjacent sub-blocks within are said subset of intra frame prediction modes to include said predetermined number of intra frame prediction modes within said subset of intra frame prediction modes and a further number of intra frame prediction modes outside said subset of intra frame prediction modes ;
and wherein said step of transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder includes transmitting a variable length code reference to an entry in said probability table for each sub-block indicating which prediction mode within said corresponding list is assigned to said sub-block .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (current block) has a size of at least 64×64 pixels .
US7170937B2
CLAIM 3
. The method of claim 1 , further comprising the steps of : forming a probability table relating the intra frame prediction mode assigned to two adjacent sub-blocks to the intra frame prediction mode assigned to a current sub-block , for each combination of intra frame prediction mode assigned to each of said two adjacent sub-blocks , said probability table including a list of intra frame prediction modes arranged in order of probability of expected occurrence for the current block (current block) from the most probable to the least probable ;
adjusting each list for each combination of said probability table wherein the intra frame prediction modes of both adjacent sub-blocks within are said subset of intra frame prediction modes to include said predetermined number of intra frame prediction modes within said subset of intra frame prediction modes and a further number of intra frame prediction modes outside said subset of intra frame prediction modes ;
and wherein said step of transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder includes transmitting a variable length code reference to an entry in said probability table for each sub-block indicating which prediction mode within said corresponding list is assigned to said sub-block .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (prediction direction) element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US7170937B2
CLAIM 1
. A method of motion picture encoding comprising the steps of : defining a plurality of intra frame prediction modes , said plurality of intra frame prediction modes including a DC mode and a plurality of modes having respective differing prediction direction (first syntax, first syntax element) s ;
forming a subset having a predetermined number of said plurality of intra frame prediction modes including said DC mode and some but not all of said plurality of modes having respective differing prediction directions ;
dividing a picture frame into a plurality of sub-blocks ;
selecting one of a low-complexity encoding and a high-complexity encoding ;
upon selection of said low-complexity encoding , determining an intra frame prediction mode for each sub-block from among said subset of intra frame prediction modes ;
upon selection of said high-complexity encoding , determining an intra frame prediction mode for each sub-block from among said plurality of intra frame prediction modes ;
transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder .

US7170937B2
CLAIM 3
. The method of claim 1 , further comprising the steps of : forming a probability table relating the intra frame prediction mode assigned to two adjacent sub-blocks to the intra frame prediction mode assigned to a current sub-block , for each combination of intra frame prediction mode assigned to each of said two adjacent sub-blocks , said probability table including a list of intra frame prediction modes arranged in order of probability of expected occurrence for the current block (current block) from the most probable to the least probable ;
adjusting each list for each combination of said probability table wherein the intra frame prediction modes of both adjacent sub-blocks within are said subset of intra frame prediction modes to include said predetermined number of intra frame prediction modes within said subset of intra frame prediction modes and a further number of intra frame prediction modes outside said subset of intra frame prediction modes ;
and wherein said step of transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder includes transmitting a variable length code reference to an entry in said probability table for each sub-block indicating which prediction mode within said corresponding list is assigned to said sub-block .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (current block) .
US7170937B2
CLAIM 3
. The method of claim 1 , further comprising the steps of : forming a probability table relating the intra frame prediction mode assigned to two adjacent sub-blocks to the intra frame prediction mode assigned to a current sub-block , for each combination of intra frame prediction mode assigned to each of said two adjacent sub-blocks , said probability table including a list of intra frame prediction modes arranged in order of probability of expected occurrence for the current block (current block) from the most probable to the least probable ;
adjusting each list for each combination of said probability table wherein the intra frame prediction modes of both adjacent sub-blocks within are said subset of intra frame prediction modes to include said predetermined number of intra frame prediction modes within said subset of intra frame prediction modes and a further number of intra frame prediction modes outside said subset of intra frame prediction modes ;
and wherein said step of transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder includes transmitting a variable length code reference to an entry in said probability table for each sub-block indicating which prediction mode within said corresponding list is assigned to said sub-block .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (current block) has a size of at least 64×64 pixels .
US7170937B2
CLAIM 3
. The method of claim 1 , further comprising the steps of : forming a probability table relating the intra frame prediction mode assigned to two adjacent sub-blocks to the intra frame prediction mode assigned to a current sub-block , for each combination of intra frame prediction mode assigned to each of said two adjacent sub-blocks , said probability table including a list of intra frame prediction modes arranged in order of probability of expected occurrence for the current block (current block) from the most probable to the least probable ;
adjusting each list for each combination of said probability table wherein the intra frame prediction modes of both adjacent sub-blocks within are said subset of intra frame prediction modes to include said predetermined number of intra frame prediction modes within said subset of intra frame prediction modes and a further number of intra frame prediction modes outside said subset of intra frame prediction modes ;
and wherein said step of transmitting compressed data including said determined intra frame prediction mode for each sub-block to a decoder includes transmitting a variable length code reference to an entry in said probability table for each sub-block indicating which prediction mode within said corresponding list is assigned to said sub-block .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
CN1457196A

Filed: 2003-04-25     Issued: 2003-11-19

基于时空域相关性运动矢量预测的视频编码方法

(Original Assignee) 北京工业大学     

薛金柱, 沈兰荪
US9930365B2
CLAIM 1
. A method of decoding video data (的视频数据) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (方法为) of an intra-prediction mode (编码标准) and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
CN1457196A
CLAIM 1
. 基于时空域相关性运动矢量预测的视频编码方法,是由摄像机(1),将目标物状态转换成视频信号置于采集卡(2)中;采集卡将视频信号转换成数字视频序列,并存放在视频缓存中,这些数字视频序列或是以视频序列文件形式存放在计算机硬盘中的视频数据 (video data) ,作为该系统的输入用于压缩;计算机(3)存放原始视频序列和执行视频编码程序,并生成压缩后的码流文件,本发明的特征在于:计算机首先从采集卡的视频存储器或存放在计算机硬盘上的视频序列文件中读一帧视频数据到计算机的缓冲区中,通过执行视频编码子程序对该帧进行编码;视频编码子程序在视频编码时,一个输入的视频序列帧被分为16×16的宏块,编码过程是以宏块为单位进行的;计算机开始执行视频编码子程序后,首先对编码进行初始化;然后计算机按照从上到下、从左到右的次序先对第一个宏块做运动估计与补偿;运动估计与补偿之后得到该宏块的运动矢量和残差图像,对其中的运动矢量,首先执行运动矢量子程序进行预测,然后对预测差值进行上下文自适应的变长编码(CAVLC);而对残差图像则首先进行DCT变换,然后对DCT系数进行量化和CAVLC编码;之后输出该宏块的压缩码流放入计算机的缓冲区中;最后进行解码、生成相应参考宏块;该宏块编码结束后,对当前编码帧的所有宏块循环执行上述编码过程,完成对一个视频序列帧的编码;一帧编码结束后循环编码下一帧,至到视频序列的最后一帧编码结束,并生成压缩后的码流文件,系统执行程序结束;在上述的视频编码子程序中,运动矢量预测子程序中所采用的预测方法,是根据相邻宏块间运动相关性的不同,综合利用了时间和空间域上的相关性对其进行预测编码的。

CN1457196A
CLAIM 2
. 根据权利要求1所述的基于时空域相关性运动矢量预测的视频编码方法,其特征在于,视频编码子程序采用了基于运动补偿/DCT的混合编码方法,除运动矢量预测子程序这一部分外,采用了国际视频编码标准 (intra-prediction mode) H . 264 JM6 . 0的编码框架。

CN1457196A
CLAIM 3
. 根据权利要求1所述的基于时空域相关性运动矢量预测的视频编码方法,其特征在于所述的运动矢量预测子程序中所采用的根据相邻宏块间运动相关性的不同,综合利用运动矢量在时间域和空间域上相关性的运动矢量预测方法,其预测方法如下:(1)首先进行相邻宏块间运动相关性的判断;对当前任一编码宏块,其相邻宏块间运动相关性的描述方法为 (encoding mode comprises one) :设Ei是当前帧i的编码宏块,Ai、Bi、Ci、Di、Gi、Hi分别为其左侧、上侧、右上侧、左上侧、右侧和下侧的相邻宏块,在前一帧(i-1)对应位置上的宏块分别表示为Ei-1、Ai-1、Bi-1、Ci-1、Di-1、Gi-1和Hi-1;这里设B={C1C2C3C4C5}为Ei相邻宏块Ai、Bi、Ei-1、Gi-1和Hi-1的运动矢量组成的集合;(a)如果对于i,i∈[1,5],都满足|Ci-C3|≤TH则Ei相邻宏块间的运动相关性为高;(b)如果对于i,i∈[1,5],都满足|Ci-C3|>TH则Ei相邻宏块间的运动相关性为低;(2)根据(1)的判断结果,当前编码宏块的运动矢量的预测方法如下:为了利用时间、空间域上的相关性,本发明选取Ai、Bi和Ci的运动矢量作为空间域上的侯选预测因子,Ei-1、Gi-1和Hi-1的运动矢量作为时间域上的侯选预测因子,运动矢量预测的具体算法如下:(1)当Ei-1、Gi-1和Hi-1都不存在时,则以Ai、Bi、Ci作为侯选预测因子;(2)当相邻宏块间的运动相关性为高时,则选取Ai、Bi和Ei-1为侯选预测因子;(3)当相邻宏块间的运动相关性为低时,则选取Ai、Bi、Gi-1和Hi-1为侯选预测因子;(4)当任意一个Ai、Bi、Ci和Di采用帧内编码时,如果在(i-1)帧与其对应位置宏块的运动矢量存在,则该宏块的运动矢量选取在(i-1)帧与其对应位置块相同的运动矢量和参考帧,否则该块的运动矢量置为(0,0);(5)如果在Ai、Bi、Ci、Ei-1、Gi-1和Hi-1中存在与Ei具有相同参考帧的宏块,则预测因子选取该宏块相同的运动矢量,否则预测因子选取侯选预测因子的中值;预测因子即为当前编码宏块的运动矢量的预测值;运动矢量在水平和垂直方向上的分量都采用上述预测方法;计算机需要对运动矢量在水平和垂直方向上的分量单独执行上述程序,完成运动矢量的预测。

US9930365B2
CLAIM 7
. A device for decoding video data (的视频数据) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (方法为) of an intra-prediction mode (编码标准) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
CN1457196A
CLAIM 1
. 基于时空域相关性运动矢量预测的视频编码方法,是由摄像机(1),将目标物状态转换成视频信号置于采集卡(2)中;采集卡将视频信号转换成数字视频序列,并存放在视频缓存中,这些数字视频序列或是以视频序列文件形式存放在计算机硬盘中的视频数据 (video data) ,作为该系统的输入用于压缩;计算机(3)存放原始视频序列和执行视频编码程序,并生成压缩后的码流文件,本发明的特征在于:计算机首先从采集卡的视频存储器或存放在计算机硬盘上的视频序列文件中读一帧视频数据到计算机的缓冲区中,通过执行视频编码子程序对该帧进行编码;视频编码子程序在视频编码时,一个输入的视频序列帧被分为16×16的宏块,编码过程是以宏块为单位进行的;计算机开始执行视频编码子程序后,首先对编码进行初始化;然后计算机按照从上到下、从左到右的次序先对第一个宏块做运动估计与补偿;运动估计与补偿之后得到该宏块的运动矢量和残差图像,对其中的运动矢量,首先执行运动矢量子程序进行预测,然后对预测差值进行上下文自适应的变长编码(CAVLC);而对残差图像则首先进行DCT变换,然后对DCT系数进行量化和CAVLC编码;之后输出该宏块的压缩码流放入计算机的缓冲区中;最后进行解码、生成相应参考宏块;该宏块编码结束后,对当前编码帧的所有宏块循环执行上述编码过程,完成对一个视频序列帧的编码;一帧编码结束后循环编码下一帧,至到视频序列的最后一帧编码结束,并生成压缩后的码流文件,系统执行程序结束;在上述的视频编码子程序中,运动矢量预测子程序中所采用的预测方法,是根据相邻宏块间运动相关性的不同,综合利用了时间和空间域上的相关性对其进行预测编码的。

CN1457196A
CLAIM 2
. 根据权利要求1所述的基于时空域相关性运动矢量预测的视频编码方法,其特征在于,视频编码子程序采用了基于运动补偿/DCT的混合编码方法,除运动矢量预测子程序这一部分外,采用了国际视频编码标准 (intra-prediction mode) H . 264 JM6 . 0的编码框架。

CN1457196A
CLAIM 3
. 根据权利要求1所述的基于时空域相关性运动矢量预测的视频编码方法,其特征在于所述的运动矢量预测子程序中所采用的根据相邻宏块间运动相关性的不同,综合利用运动矢量在时间域和空间域上相关性的运动矢量预测方法,其预测方法如下:(1)首先进行相邻宏块间运动相关性的判断;对当前任一编码宏块,其相邻宏块间运动相关性的描述方法为 (encoding mode comprises one) :设Ei是当前帧i的编码宏块,Ai、Bi、Ci、Di、Gi、Hi分别为其左侧、上侧、右上侧、左上侧、右侧和下侧的相邻宏块,在前一帧(i-1)对应位置上的宏块分别表示为Ei-1、Ai-1、Bi-1、Ci-1、Di-1、Gi-1和Hi-1;这里设B={C1C2C3C4C5}为Ei相邻宏块Ai、Bi、Ei-1、Gi-1和Hi-1的运动矢量组成的集合;(a)如果对于i,i∈[1,5],都满足|Ci-C3|≤TH则Ei相邻宏块间的运动相关性为高;(b)如果对于i,i∈[1,5],都满足|Ci-C3|>TH则Ei相邻宏块间的运动相关性为低;(2)根据(1)的判断结果,当前编码宏块的运动矢量的预测方法如下:为了利用时间、空间域上的相关性,本发明选取Ai、Bi和Ci的运动矢量作为空间域上的侯选预测因子,Ei-1、Gi-1和Hi-1的运动矢量作为时间域上的侯选预测因子,运动矢量预测的具体算法如下:(1)当Ei-1、Gi-1和Hi-1都不存在时,则以Ai、Bi、Ci作为侯选预测因子;(2)当相邻宏块间的运动相关性为高时,则选取Ai、Bi和Ei-1为侯选预测因子;(3)当相邻宏块间的运动相关性为低时,则选取Ai、Bi、Gi-1和Hi-1为侯选预测因子;(4)当任意一个Ai、Bi、Ci和Di采用帧内编码时,如果在(i-1)帧与其对应位置宏块的运动矢量存在,则该宏块的运动矢量选取在(i-1)帧与其对应位置块相同的运动矢量和参考帧,否则该块的运动矢量置为(0,0);(5)如果在Ai、Bi、Ci、Ei-1、Gi-1和Hi-1中存在与Ei具有相同参考帧的宏块,则预测因子选取该宏块相同的运动矢量,否则预测因子选取侯选预测因子的中值;预测因子即为当前编码宏块的运动矢量的预测值;运动矢量在水平和垂直方向上的分量都采用上述预测方法;计算机需要对运动矢量在水平和垂直方向上的分量单独执行上述程序,完成运动矢量的预测。

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (编码标准) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
CN1457196A
CLAIM 2
. 根据权利要求1所述的基于时空域相关性运动矢量预测的视频编码方法,其特征在于,视频编码子程序采用了基于运动补偿/DCT的混合编码方法,除运动矢量预测子程序这一部分外,采用了国际视频编码标准 (intra-prediction mode) H . 264 JM6 . 0的编码框架。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (的视频数据) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (方法为) of an intra-prediction mode (编码标准) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
CN1457196A
CLAIM 1
. 基于时空域相关性运动矢量预测的视频编码方法,是由摄像机(1),将目标物状态转换成视频信号置于采集卡(2)中;采集卡将视频信号转换成数字视频序列,并存放在视频缓存中,这些数字视频序列或是以视频序列文件形式存放在计算机硬盘中的视频数据 (video data) ,作为该系统的输入用于压缩;计算机(3)存放原始视频序列和执行视频编码程序,并生成压缩后的码流文件,本发明的特征在于:计算机首先从采集卡的视频存储器或存放在计算机硬盘上的视频序列文件中读一帧视频数据到计算机的缓冲区中,通过执行视频编码子程序对该帧进行编码;视频编码子程序在视频编码时,一个输入的视频序列帧被分为16×16的宏块,编码过程是以宏块为单位进行的;计算机开始执行视频编码子程序后,首先对编码进行初始化;然后计算机按照从上到下、从左到右的次序先对第一个宏块做运动估计与补偿;运动估计与补偿之后得到该宏块的运动矢量和残差图像,对其中的运动矢量,首先执行运动矢量子程序进行预测,然后对预测差值进行上下文自适应的变长编码(CAVLC);而对残差图像则首先进行DCT变换,然后对DCT系数进行量化和CAVLC编码;之后输出该宏块的压缩码流放入计算机的缓冲区中;最后进行解码、生成相应参考宏块;该宏块编码结束后,对当前编码帧的所有宏块循环执行上述编码过程,完成对一个视频序列帧的编码;一帧编码结束后循环编码下一帧,至到视频序列的最后一帧编码结束,并生成压缩后的码流文件,系统执行程序结束;在上述的视频编码子程序中,运动矢量预测子程序中所采用的预测方法,是根据相邻宏块间运动相关性的不同,综合利用了时间和空间域上的相关性对其进行预测编码的。

CN1457196A
CLAIM 2
. 根据权利要求1所述的基于时空域相关性运动矢量预测的视频编码方法,其特征在于,视频编码子程序采用了基于运动补偿/DCT的混合编码方法,除运动矢量预测子程序这一部分外,采用了国际视频编码标准 (intra-prediction mode) H . 264 JM6 . 0的编码框架。

CN1457196A
CLAIM 3
. 根据权利要求1所述的基于时空域相关性运动矢量预测的视频编码方法,其特征在于所述的运动矢量预测子程序中所采用的根据相邻宏块间运动相关性的不同,综合利用运动矢量在时间域和空间域上相关性的运动矢量预测方法,其预测方法如下:(1)首先进行相邻宏块间运动相关性的判断;对当前任一编码宏块,其相邻宏块间运动相关性的描述方法为 (encoding mode comprises one) :设Ei是当前帧i的编码宏块,Ai、Bi、Ci、Di、Gi、Hi分别为其左侧、上侧、右上侧、左上侧、右侧和下侧的相邻宏块,在前一帧(i-1)对应位置上的宏块分别表示为Ei-1、Ai-1、Bi-1、Ci-1、Di-1、Gi-1和Hi-1;这里设B={C1C2C3C4C5}为Ei相邻宏块Ai、Bi、Ei-1、Gi-1和Hi-1的运动矢量组成的集合;(a)如果对于i,i∈[1,5],都满足|Ci-C3|≤TH则Ei相邻宏块间的运动相关性为高;(b)如果对于i,i∈[1,5],都满足|Ci-C3|>TH则Ei相邻宏块间的运动相关性为低;(2)根据(1)的判断结果,当前编码宏块的运动矢量的预测方法如下:为了利用时间、空间域上的相关性,本发明选取Ai、Bi和Ci的运动矢量作为空间域上的侯选预测因子,Ei-1、Gi-1和Hi-1的运动矢量作为时间域上的侯选预测因子,运动矢量预测的具体算法如下:(1)当Ei-1、Gi-1和Hi-1都不存在时,则以Ai、Bi、Ci作为侯选预测因子;(2)当相邻宏块间的运动相关性为高时,则选取Ai、Bi和Ei-1为侯选预测因子;(3)当相邻宏块间的运动相关性为低时,则选取Ai、Bi、Gi-1和Hi-1为侯选预测因子;(4)当任意一个Ai、Bi、Ci和Di采用帧内编码时,如果在(i-1)帧与其对应位置宏块的运动矢量存在,则该宏块的运动矢量选取在(i-1)帧与其对应位置块相同的运动矢量和参考帧,否则该块的运动矢量置为(0,0);(5)如果在Ai、Bi、Ci、Ei-1、Gi-1和Hi-1中存在与Ei具有相同参考帧的宏块,则预测因子选取该宏块相同的运动矢量,否则预测因子选取侯选预测因子的中值;预测因子即为当前编码宏块的运动矢量的预测值;运动矢量在水平和垂直方向上的分量都采用上述预测方法;计算机需要对运动矢量在水平和垂直方向上的分量单独执行上述程序,完成运动矢量的预测。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
EP1478190A1

Filed: 2003-04-25     Issued: 2004-11-17

Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program

(Original Assignee) NTT Docomo Inc     (Current Assignee) NTT Docomo Inc

Satoru c/o NTT DoCoMo Inc. ADACHI, Minoru c/o NTT DoCoMo Inc. ETOH, Sadaatsu c/o NTT DoCoMo Inc. KATO, Mitsuru c/o NTT DoCoMo Inc. KOBAYASHI, Kazuo c/o NTT DoCoMo Inc. SUGIMOTO, Hiroyuki NTT DoCoMo Inc. YAMAGUCHI
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (predetermined shape) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
EP1478190A1
CLAIM 1
An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
   wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

EP1478190A1
CLAIM 39
An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
   wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (predetermined shape) and greater than the minimum size .
EP1478190A1
CLAIM 1
An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
   wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
EP1478190A1
CLAIM 39
An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
   wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (predetermined shape) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
EP1478190A1
CLAIM 1
An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
   wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

EP1478190A1
CLAIM 39
An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
   wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (predetermined shape) and greater than the minimum size .
EP1478190A1
CLAIM 1
An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
   wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
EP1478190A1
CLAIM 39
An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
   wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (predetermined shape) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
EP1478190A1
CLAIM 1
An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
   wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

EP1478190A1
CLAIM 39
An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
   wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (predetermined shape) and greater than the minimum size .
EP1478190A1
CLAIM 1
An image encoding apparatus comprising : a motion detection part for dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape (maximum size) and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation part for generating a predicted image from a reference image on the basis of the motion vector outputted from the motion detection part ;
a subtraction part for calculating a difference between the predicted image generated by the motion compensation part , and the coding target image to output a predictive residual image ;
a transform part for performing transform coding of the predictive residual image outputted from the subtraction part , in each unit of a predetermined transform block to output transform coefficients ;
a quantization part for quantizing the transform coefficients outputted from the transform part , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding part for performing compression coding of coding-related information containing the motion vector outputted from the motion detection part , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted from the quantization part , to output coded data ;
   wherein the encoding part comprises a coded symbol memory for memorizing the coding-related information ;
a coding procedure provider for providing one or more types of coding procedures ;
and a coded data outputting part for selecting one of the coding procedures provided by the coding procedure provider , based on a predetermined coding procedure selection criterion , using the coding-related information stored in the coded symbol memory , and for performing the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
EP1478190A1
CLAIM 39
An image encoding method (encoding mode) comprising : a motion detection step of dividing a coding target image into macroblocks of a predetermined size , dividing each said macroblock into blocks of a predetermined shape and size specified based on a macroblock type , and performing a motion compensated prediction in each unit of said block to output a motion vector ;
a motion compensation step of generating a predicted image from a reference image on the basis of the motion vector outputted in the motion detection step ;
a subtraction step of calculating a difference between the predicted image generated in the motion compensation step , and the coding target image to output a predictive residual image ;
a transform step of performing transform coding of the predictive residual image outputted in the subtraction step , in each unit of a predetermined transform block to output transform coefficients ;
a quantization step of quantizing the transform coefficients outputted in the transform step , based on a quantization parameter , to output quantized transform coefficients ;
and an encoding step of performing compression coding of coding-related information containing the motion vector outputted in the motion detection step , the macroblock type , the quantization parameter , and the quantized transform coefficients outputted in the quantization step , to output coded data ;
   wherein the encoding step is to prepare one or more types of coding procedures , select a coding procedure out of the one or more types of coding procedures , based on a predetermined coding procedure selection criterion , using the coding-related information stored in a coded symbol memory , and perform the compression coding of the coding-related information on the basis of the selected coding procedure to output the coded data .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
CN1647541A

Filed: 2003-04-23     Issued: 2005-07-27

用于在视频编码系统中指示量化器参数的方法与设备

(Original Assignee) 诺基亚有限公司     

J·莱恩马
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax (多个分) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (的解码方法) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
CN1647541A
CLAIM 13
. 根据权利要求8到12任何一个的编码方法,其特征在于:所述数字视频序列的一帧被划分到多个像素块,所述多个像素块被归组到一个或多个分 (first syntax) 段,并且其中应用到所述转换系数值组的量化级从一帧的一分段到另一分段进行调整,以使对于一帧的一个特定分段,应用到转换系数组的实际量化级不同于缺省量化级。

CN1647541A
CLAIM 29
. 根据权利要求28的解码方法 (inter-prediction mode) ,其特征在于:所述缺省反向量化级对正在被解码的已编码视频序列是特定的。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (多个分) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (的解码方法) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
CN1647541A
CLAIM 13
. 根据权利要求8到12任何一个的编码方法,其特征在于:所述数字视频序列的一帧被划分到多个像素块,所述多个像素块被归组到一个或多个分 (first syntax) 段,并且其中应用到所述转换系数值组的量化级从一帧的一分段到另一分段进行调整,以使对于一帧的一个特定分段,应用到转换系数组的实际量化级不同于缺省量化级。

CN1647541A
CLAIM 29
. 根据权利要求28的解码方法 (inter-prediction mode) ,其特征在于:所述缺省反向量化级对正在被解码的已编码视频序列是特定的。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (多个分) element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (的解码方法) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
CN1647541A
CLAIM 13
. 根据权利要求8到12任何一个的编码方法,其特征在于:所述数字视频序列的一帧被划分到多个像素块,所述多个像素块被归组到一个或多个分 (first syntax) 段,并且其中应用到所述转换系数值组的量化级从一帧的一分段到另一分段进行调整,以使对于一帧的一个特定分段,应用到转换系数组的实际量化级不同于缺省量化级。

CN1647541A
CLAIM 29
. 根据权利要求28的解码方法 (inter-prediction mode) ,其特征在于:所述缺省反向量化级对正在被解码的已编码视频序列是特定的。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US20030202594A1

Filed: 2003-03-14     Issued: 2003-10-30

Method for coding motion in a video sequence

(Original Assignee) Nokia Oyj     (Current Assignee) Nokia Technologies Oy

Jani Lainema
US9930365B2
CLAIM 1
. A method of decoding video data (coded region) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US20030202594A1
CLAIM 9
. A method according to claim 8 , wherein the determining step comprises : performing an analysis of motion in a previously decoded region (decoding video data) of the first frame surrounding the segment to be decoded in order to determine a characteristic of the motion in said region .

US20030202594A1
CLAIM 12
. A video encoder (second syntax element) arranged to encode a digital video sequence using motion compensated prediction , said digital video sequence comprising a number of frames , said encoder comprising : means for assigning a coding mode to a segment of a first frame of the sequence to be encoded using motion compensated prediction with respect to a second frame of the sequence , wherein said coding mode is one of a set of possible modes including a skip mode ;
and means for forming a prediction for the segment , wherein said skip mode , when assigned to said segment of the first frame indicates either (i) that the segment has an insignificant level of motion , in which case , said forming means forms a prediction for the segment by copying directly from a corresponding region of the second frame , or (ii) that the segment has motion characteristic of global or regional motion in the video sequence , in which case , said forming means forms a prediction for the segment with respect to the second frame using motion information representative of said global or regional motion .

US9930365B2
CLAIM 7
. A device for decoding video data (coded region) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US20030202594A1
CLAIM 9
. A method according to claim 8 , wherein the determining step comprises : performing an analysis of motion in a previously decoded region (decoding video data) of the first frame surrounding the segment to be decoded in order to determine a characteristic of the motion in said region .

US20030202594A1
CLAIM 12
. A video encoder (second syntax element) arranged to encode a digital video sequence using motion compensated prediction , said digital video sequence comprising a number of frames , said encoder comprising : means for assigning a coding mode to a segment of a first frame of the sequence to be encoded using motion compensated prediction with respect to a second frame of the sequence , wherein said coding mode is one of a set of possible modes including a skip mode ;
and means for forming a prediction for the segment , wherein said skip mode , when assigned to said segment of the first frame indicates either (i) that the segment has an insignificant level of motion , in which case , said forming means forms a prediction for the segment by copying directly from a corresponding region of the second frame , or (ii) that the segment has motion characteristic of global or regional motion in the video sequence , in which case , said forming means forms a prediction for the segment with respect to the second frame using motion information representative of said global or regional motion .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element (video encoder) , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US20030202594A1
CLAIM 12
. A video encoder (second syntax element) arranged to encode a digital video sequence using motion compensated prediction , said digital video sequence comprising a number of frames , said encoder comprising : means for assigning a coding mode to a segment of a first frame of the sequence to be encoded using motion compensated prediction with respect to a second frame of the sequence , wherein said coding mode is one of a set of possible modes including a skip mode ;
and means for forming a prediction for the segment , wherein said skip mode , when assigned to said segment of the first frame indicates either (i) that the segment has an insignificant level of motion , in which case , said forming means forms a prediction for the segment by copying directly from a corresponding region of the second frame , or (ii) that the segment has motion characteristic of global or regional motion in the video sequence , in which case , said forming means forms a prediction for the segment with respect to the second frame using motion information representative of said global or regional motion .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
EP1876838A2

Filed: 2003-01-10     Issued: 2008-01-09

Method for block prediction of a bi-predictive picture in direct mode

(Original Assignee) LG Electronics Inc     (Current Assignee) LG Electronics Inc

Byeong Moon Jeon
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size (backward reference) equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
EP1876838A2
CLAIM 1
A method of predicting a current block (current block) (B c ' ;
) of a bi-predictive picture (B 5) in direct mode , comprising : obtaining a forward motion-compensated block (B f) pertaining to a forward reference picture (P 1 ;
P 4) by using a forward motion vector (MV f) , and obtaining a backward motion-compensated block (B b) pertaining to a backward reference (starting size) picture (P 7) by using a backward motion vector (MV b) ;
and predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) using a weighted sum of the forward motion-compensated block (B f) and the backward motion-compensated block (B b) , wherein in predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) , a first weight factor is applied to the forward motion-compensated block (B f) and a second weight factor is applied to the backward motion-compensated block (B b) , wherein the first and second weight factors are derived using a picture order count (T b) of the backward reference picture (P 7) , a picture order count (T c) of the bi-predictive picture (B 5) and a picture order count (T f) of the forward reference picture (P 1 ;
P 4) , and wherein the forward motion vector (MV f) and the backward motion vector (MV b) are calculated from a motion vector (MV) of a co-located block (B S) in the backward reference picture (P 7) .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (current block) .
EP1876838A2
CLAIM 1
A method of predicting a current block (current block) (B c ' ;
) of a bi-predictive picture (B 5) in direct mode , comprising : obtaining a forward motion-compensated block (B f) pertaining to a forward reference picture (P 1 ;
P 4) by using a forward motion vector (MV f) , and obtaining a backward motion-compensated block (B b) pertaining to a backward reference picture (P 7) by using a backward motion vector (MV b) ;
and predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) using a weighted sum of the forward motion-compensated block (B f) and the backward motion-compensated block (B b) , wherein in predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) , a first weight factor is applied to the forward motion-compensated block (B f) and a second weight factor is applied to the backward motion-compensated block (B b) , wherein the first and second weight factors are derived using a picture order count (T b) of the backward reference picture (P 7) , a picture order count (T c) of the bi-predictive picture (B 5) and a picture order count (T f) of the forward reference picture (P 1 ;
P 4) , and wherein the forward motion vector (MV f) and the backward motion vector (MV b) are calculated from a motion vector (MV) of a co-located block (B S) in the backward reference picture (P 7) .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (current block) has a size of at least 64×64 pixels .
EP1876838A2
CLAIM 1
A method of predicting a current block (current block) (B c ' ;
) of a bi-predictive picture (B 5) in direct mode , comprising : obtaining a forward motion-compensated block (B f) pertaining to a forward reference picture (P 1 ;
P 4) by using a forward motion vector (MV f) , and obtaining a backward motion-compensated block (B b) pertaining to a backward reference picture (P 7) by using a backward motion vector (MV b) ;
and predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) using a weighted sum of the forward motion-compensated block (B f) and the backward motion-compensated block (B b) , wherein in predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) , a first weight factor is applied to the forward motion-compensated block (B f) and a second weight factor is applied to the backward motion-compensated block (B b) , wherein the first and second weight factors are derived using a picture order count (T b) of the backward reference picture (P 7) , a picture order count (T c) of the bi-predictive picture (B 5) and a picture order count (T f) of the forward reference picture (P 1 ;
P 4) , and wherein the forward motion vector (MV f) and the backward motion vector (MV b) are calculated from a motion vector (MV) of a co-located block (B S) in the backward reference picture (P 7) .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size (backward reference) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
EP1876838A2
CLAIM 1
A method of predicting a current block (current block) (B c ' ;
) of a bi-predictive picture (B 5) in direct mode , comprising : obtaining a forward motion-compensated block (B f) pertaining to a forward reference picture (P 1 ;
P 4) by using a forward motion vector (MV f) , and obtaining a backward motion-compensated block (B b) pertaining to a backward reference (starting size) picture (P 7) by using a backward motion vector (MV b) ;
and predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) using a weighted sum of the forward motion-compensated block (B f) and the backward motion-compensated block (B b) , wherein in predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) , a first weight factor is applied to the forward motion-compensated block (B f) and a second weight factor is applied to the backward motion-compensated block (B b) , wherein the first and second weight factors are derived using a picture order count (T b) of the backward reference picture (P 7) , a picture order count (T c) of the bi-predictive picture (B 5) and a picture order count (T f) of the forward reference picture (P 1 ;
P 4) , and wherein the forward motion vector (MV f) and the backward motion vector (MV b) are calculated from a motion vector (MV) of a co-located block (B S) in the backward reference picture (P 7) .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (current block) .
EP1876838A2
CLAIM 1
A method of predicting a current block (current block) (B c ' ;
) of a bi-predictive picture (B 5) in direct mode , comprising : obtaining a forward motion-compensated block (B f) pertaining to a forward reference picture (P 1 ;
P 4) by using a forward motion vector (MV f) , and obtaining a backward motion-compensated block (B b) pertaining to a backward reference picture (P 7) by using a backward motion vector (MV b) ;
and predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) using a weighted sum of the forward motion-compensated block (B f) and the backward motion-compensated block (B b) , wherein in predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) , a first weight factor is applied to the forward motion-compensated block (B f) and a second weight factor is applied to the backward motion-compensated block (B b) , wherein the first and second weight factors are derived using a picture order count (T b) of the backward reference picture (P 7) , a picture order count (T c) of the bi-predictive picture (B 5) and a picture order count (T f) of the forward reference picture (P 1 ;
P 4) , and wherein the forward motion vector (MV f) and the backward motion vector (MV b) are calculated from a motion vector (MV) of a co-located block (B S) in the backward reference picture (P 7) .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (current block) has a size of at least 64×64 pixels .
EP1876838A2
CLAIM 1
A method of predicting a current block (current block) (B c ' ;
) of a bi-predictive picture (B 5) in direct mode , comprising : obtaining a forward motion-compensated block (B f) pertaining to a forward reference picture (P 1 ;
P 4) by using a forward motion vector (MV f) , and obtaining a backward motion-compensated block (B b) pertaining to a backward reference picture (P 7) by using a backward motion vector (MV b) ;
and predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) using a weighted sum of the forward motion-compensated block (B f) and the backward motion-compensated block (B b) , wherein in predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) , a first weight factor is applied to the forward motion-compensated block (B f) and a second weight factor is applied to the backward motion-compensated block (B b) , wherein the first and second weight factors are derived using a picture order count (T b) of the backward reference picture (P 7) , a picture order count (T c) of the bi-predictive picture (B 5) and a picture order count (T f) of the forward reference picture (P 1 ;
P 4) , and wherein the forward motion vector (MV f) and the backward motion vector (MV b) are calculated from a motion vector (MV) of a co-located block (B S) in the backward reference picture (P 7) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (current block) of a plurality of blocks of the sequence of pictures has a starting size (backward reference) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
EP1876838A2
CLAIM 1
A method of predicting a current block (current block) (B c ' ;
) of a bi-predictive picture (B 5) in direct mode , comprising : obtaining a forward motion-compensated block (B f) pertaining to a forward reference picture (P 1 ;
P 4) by using a forward motion vector (MV f) , and obtaining a backward motion-compensated block (B b) pertaining to a backward reference (starting size) picture (P 7) by using a backward motion vector (MV b) ;
and predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) using a weighted sum of the forward motion-compensated block (B f) and the backward motion-compensated block (B b) , wherein in predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) , a first weight factor is applied to the forward motion-compensated block (B f) and a second weight factor is applied to the backward motion-compensated block (B b) , wherein the first and second weight factors are derived using a picture order count (T b) of the backward reference picture (P 7) , a picture order count (T c) of the bi-predictive picture (B 5) and a picture order count (T f) of the forward reference picture (P 1 ;
P 4) , and wherein the forward motion vector (MV f) and the backward motion vector (MV b) are calculated from a motion vector (MV) of a co-located block (B S) in the backward reference picture (P 7) .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (current block) .
EP1876838A2
CLAIM 1
A method of predicting a current block (current block) (B c ' ;
) of a bi-predictive picture (B 5) in direct mode , comprising : obtaining a forward motion-compensated block (B f) pertaining to a forward reference picture (P 1 ;
P 4) by using a forward motion vector (MV f) , and obtaining a backward motion-compensated block (B b) pertaining to a backward reference picture (P 7) by using a backward motion vector (MV b) ;
and predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) using a weighted sum of the forward motion-compensated block (B f) and the backward motion-compensated block (B b) , wherein in predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) , a first weight factor is applied to the forward motion-compensated block (B f) and a second weight factor is applied to the backward motion-compensated block (B b) , wherein the first and second weight factors are derived using a picture order count (T b) of the backward reference picture (P 7) , a picture order count (T c) of the bi-predictive picture (B 5) and a picture order count (T f) of the forward reference picture (P 1 ;
P 4) , and wherein the forward motion vector (MV f) and the backward motion vector (MV b) are calculated from a motion vector (MV) of a co-located block (B S) in the backward reference picture (P 7) .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (current block) has a size of at least 64×64 pixels .
EP1876838A2
CLAIM 1
A method of predicting a current block (current block) (B c ' ;
) of a bi-predictive picture (B 5) in direct mode , comprising : obtaining a forward motion-compensated block (B f) pertaining to a forward reference picture (P 1 ;
P 4) by using a forward motion vector (MV f) , and obtaining a backward motion-compensated block (B b) pertaining to a backward reference picture (P 7) by using a backward motion vector (MV b) ;
and predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) using a weighted sum of the forward motion-compensated block (B f) and the backward motion-compensated block (B b) , wherein in predicting the current block (B c ' ;
) of the bi-predictive picture (B 5) , a first weight factor is applied to the forward motion-compensated block (B f) and a second weight factor is applied to the backward motion-compensated block (B b) , wherein the first and second weight factors are derived using a picture order count (T b) of the backward reference picture (P 7) , a picture order count (T c) of the bi-predictive picture (B 5) and a picture order count (T f) of the forward reference picture (P 1 ;
P 4) , and wherein the forward motion vector (MV f) and the backward motion vector (MV b) are calculated from a motion vector (MV) of a co-located block (B S) in the backward reference picture (P 7) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JP2002359852A

Filed: 2002-03-22     Issued: 2002-12-13

画像予測復号化装置及び方法

(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     

Sen Bun Chun, Mei Shen Shen, Thiow Keng Tan, メイ・シェン シェン, セン・ブン チュン, ケン・タン ティオ
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (量子化ステップサイズ) of blocks of the sequence of pictures ;

decoding a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size (量子化ステップサイズ) equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JP2002359852A
CLAIM 1
【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と (second syntax) 、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size, starting size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (量子化ステップサイズ) .
JP2002359852A
CLAIM 1
【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size, starting size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (量子化ステップサイズ) of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (量子化ステップサイズ) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JP2002359852A
CLAIM 1
【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と (second syntax) 、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size, starting size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (量子化ステップサイズ) .
JP2002359852A
CLAIM 1
【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size, starting size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size (量子化ステップサイズ) of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (量子化ステップサイズ) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JP2002359852A
CLAIM 1
【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と (second syntax) 、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size, starting size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (量子化ステップサイズ) .
JP2002359852A
CLAIM 1
【請求項1】 可変長符号化されたDCT係数を可変長 復号化する可変長復号化手段と、 上記可変長復号化されたDCT係数を量子化された二次 元列係数に変換する逆スキャン手段と、 カレントブロック(C)に隣接する上ブロック(A)又 は左ブロック(B)のいずれかから上記カレントブロッ ク(C)のAC係数を予測するための予測ブロックを適 応的に選択する選択手段と、 上記予測ブロックの量子化されたAC係数(QFa)に 対して、上記カレントブロック(C)の量子化ステップ サイズと上記予測ブロックの量子化ステップサイズ (minimum size, starting size) との 比を用いてスケーリングを行うスケーリング手段と、 上記スケーリング手段によりスケーリングされた上記予 測ブロックの量子化されたAC係数(QFa×(QPa /QPx))と、上記逆スキャン手段により得られた量 子化された二次元列係数のAC係数データ(PQFx) とを加算することにより、上記カレントブロック(C) の量子化されたAC係数(QFx)を予測復号化する加 算手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を記憶する記憶手段と、 上記加算手段により予測復号化されたカレントブロック (C)の量子化されたAC係数を逆量子化する逆量子化 手段と、 上記逆量子化手段で得られた逆量子化されたAC係数を 逆DCT変換する逆DCT変換手段とを備え、 上記カレントブロック(C)よりも遅れて復号化される ブロックの復号化時には、上記選択された予測ブロック の量子化されたAC係数として、上記記憶手段に記憶さ れた量子化されたAC係数を用いることを特徴とする画 像予測復号化装置。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JP2001036912A

Filed: 2000-06-15     Issued: 2001-02-09

画像圧縮方法

(Original Assignee) Thomson Multimedia Sa; トムソン マルチメデイア ソシエテ アノニム     

Pierre Ruellou, Dominique Thoreau, トロ ドミニク, リュル ピエール
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (デフォルト, 予測モード) and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JP2001036912A
CLAIM 15
【請求項15】 値MB activは、量MB activ−α . Qが所 定の閾値よりも降下したとき、デフォルト (intra-prediction mode) 値として上記 量MB activ−α . Qに割り当てられることを特徴とする請 求項9乃至14のうちいずれか一項記載の画像圧縮方 法。

JP2001036912A
CLAIM 20
【請求項20】 MB src(i , j)がマクロブロックMB内の 画素(i , j)のルミナンスに対応するとき、項Hdr MBは、 ソースマクロブロックの平均動き量、すなわち、 【数4】 と、ソースマクロブロックの平均値、すなわち、 【数5】 とが所定の閾値に満たない場合に、双方向予測モード (intra-prediction mode) を 優先するよう値valに関する式に従って重み付けされる ことを特徴とする請求項12記載の画像圧縮方法。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (デフォルト, 予測モード) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JP2001036912A
CLAIM 15
【請求項15】 値MB activは、量MB activ−α . Qが所 定の閾値よりも降下したとき、デフォルト (intra-prediction mode) 値として上記 量MB activ−α . Qに割り当てられることを特徴とする請 求項9乃至14のうちいずれか一項記載の画像圧縮方 法。

JP2001036912A
CLAIM 20
【請求項20】 MB src(i , j)がマクロブロックMB内の 画素(i , j)のルミナンスに対応するとき、項Hdr MBは、 ソースマクロブロックの平均動き量、すなわち、 【数4】 と、ソースマクロブロックの平均値、すなわち、 【数5】 とが所定の閾値に満たない場合に、双方向予測モード (intra-prediction mode) を 優先するよう値valに関する式に従って重み付けされる ことを特徴とする請求項12記載の画像圧縮方法。

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (デフォルト, 予測モード) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
JP2001036912A
CLAIM 15
【請求項15】 値MB activは、量MB activ−α . Qが所 定の閾値よりも降下したとき、デフォルト (intra-prediction mode) 値として上記 量MB activ−α . Qに割り当てられることを特徴とする請 求項9乃至14のうちいずれか一項記載の画像圧縮方 法。

JP2001036912A
CLAIM 20
【請求項20】 MB src(i , j)がマクロブロックMB内の 画素(i , j)のルミナンスに対応するとき、項Hdr MBは、 ソースマクロブロックの平均動き量、すなわち、 【数4】 と、ソースマクロブロックの平均値、すなわち、 【数5】 とが所定の閾値に満たない場合に、双方向予測モード (intra-prediction mode) を 優先するよう値valに関する式に従って重み付けされる ことを特徴とする請求項12記載の画像圧縮方法。

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (割り当てら) , or a set-top box .
JP2001036912A
CLAIM 13
【請求項13】 上記パラメータMB dfd はイントラ符号 化モードの場合にインタ符号化モードよりも大きい係数 が割り当てら (receiver device) れることを特徴とする請求項9記載の画像 圧縮方法。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (デフォルト, 予測モード) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JP2001036912A
CLAIM 15
【請求項15】 値MB activは、量MB activ−α . Qが所 定の閾値よりも降下したとき、デフォルト (intra-prediction mode) 値として上記 量MB activ−α . Qに割り当てられることを特徴とする請 求項9乃至14のうちいずれか一項記載の画像圧縮方 法。

JP2001036912A
CLAIM 20
【請求項20】 MB src(i , j)がマクロブロックMB内の 画素(i , j)のルミナンスに対応するとき、項Hdr MBは、 ソースマクロブロックの平均動き量、すなわち、 【数4】 と、ソースマクロブロックの平均値、すなわち、 【数5】 とが所定の閾値に満たない場合に、双方向予測モード (intra-prediction mode) を 優先するよう値valに関する式に従って重み付けされる ことを特徴とする請求項12記載の画像圧縮方法。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
KR20000053028A

Filed: 1999-05-03     Issued: 2000-08-25

움직임 보상을 이용한 예측 코딩 방법 및 장치

(Original Assignee) 브릭크만 게오르그; 톰슨 멀티미디어     

비알쟝-프랑수아, 프랑수아에두아르
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (weighting) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (weighting) and an inter-prediction mode (블록을, 그룹화) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
KR20000053028A
CLAIM 1
비디오 영상의 디지털 데이터를 코딩하기 위한 , 움직임 보상을 이용한 예측 코딩 방법으로서 , 상기 코딩 방법은 상기 영상을 여러개의 영상 블록들로 분할하는 단계와 , 현재 블록과 하나 이상의 재구성된 블록들에 대한 움직임 보상(9) 처리를 한 이후에 계산하여 얻어지는 예측 블록간의 차이값(2)에 근거하여 생성된 잔류 블록(residual block)을 코딩하는 단계를 포함하는 움직임 보상을 이용한 예측 코딩 방법에 있어서 , 상기 현재 블록과 상기 예측 블록들은 상기 영상 블록에 대하여 이산적 코사인 변환(1)이 수행된 이후에 얻어진 계수들로 이루어진 블록들이며 , 상기 예측 계수 블록에서의 예측된 계수들의 계산(8)은 하나 이상의 상기 재구성 계수 블록에서의 재구성된 계수들에 직접적으로 기초하여 실행되고 , 상기 재구성은 상기 재구성 계수에 대하여 계산된 예측 계수의 함수로서 가중(weighting (intra-prediction mode, encoding mode) )시킴으로써 잔류 블록과 대응하는 예측 블록의 가산(6)에 기초하여 이루어지는 것을 특징으로 하는 움직임 보상을 이용한 예측 코딩 방법 .

KR20000053028A
CLAIM 5
제 1항에 있어서 , 영상 품질 레벨은 계수 블록에 있는 계수들에 대한 그룹화 (inter-prediction mode) 로서 범위가 결정되며 , 높은 품질 레벨은 더 높은 주파수 등급의 계수를 고려한 것이며 , 또한 상기 가중치는 상기 계산된 예측 계수의 품질 레벨에 해당하는 계수들에 대하여 1인 것을 특징으로 하는 움직임 보상을 이용하는 예측 코딩 방법 .

KR20000053028A
CLAIM 10
디지털 비디오 데이터 인코더로서 , 영상을 여러 개의 영상 블록들로 분할하는 회로와 , 계수 블록들을 생성하기 위하여 상기 영상 블록들(1)에 대하여 이산적 코사인 변환을 수행하는 회로와 , 움직임 보상 회로(8)를 구비하는 예측 루프를 포함하는 디지털 비디오 데이터 인코더에 있어서 , 상기 인코더가 또한 현재 계수 블록과 예측 계수 블록간의 차이값(2)을 계산하여 잔류 계수 블록을 (inter-prediction mode) 제공하는 회로를 더 포함하고 , 또한 상기 움직임 보상 회로는 하나 이상의 재구성 계수 블록들의 계수로부터 직접 예측 계수 블록을 계산하며 , 상기 재구성은 상기 재구성 계수들을 상기 계산된 예측 계수 함수로서 가중치 처리를 함으로써 잔류 블록을 상기 해당하는 예측 블록에 가산하는 것에 기초하여 수행되는 것을 특징으로 하는 디지털 비디오 데이터 인코더 .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (weighting) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
KR20000053028A
CLAIM 1
비디오 영상의 디지털 데이터를 코딩하기 위한 , 움직임 보상을 이용한 예측 코딩 방법으로서 , 상기 코딩 방법은 상기 영상을 여러개의 영상 블록들로 분할하는 단계와 , 현재 블록과 하나 이상의 재구성된 블록들에 대한 움직임 보상(9) 처리를 한 이후에 계산하여 얻어지는 예측 블록간의 차이값(2)에 근거하여 생성된 잔류 블록(residual block)을 코딩하는 단계를 포함하는 움직임 보상을 이용한 예측 코딩 방법에 있어서 , 상기 현재 블록과 상기 예측 블록들은 상기 영상 블록에 대하여 이산적 코사인 변환(1)이 수행된 이후에 얻어진 계수들로 이루어진 블록들이며 , 상기 예측 계수 블록에서의 예측된 계수들의 계산(8)은 하나 이상의 상기 재구성 계수 블록에서의 재구성된 계수들에 직접적으로 기초하여 실행되고 , 상기 재구성은 상기 재구성 계수에 대하여 계산된 예측 계수의 함수로서 가중(weighting (intra-prediction mode, encoding mode) )시킴으로써 잔류 블록과 대응하는 예측 블록의 가산(6)에 기초하여 이루어지는 것을 특징으로 하는 움직임 보상을 이용한 예측 코딩 방법 .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (weighting) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (weighting) and an inter-prediction mode (블록을, 그룹화) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
KR20000053028A
CLAIM 1
비디오 영상의 디지털 데이터를 코딩하기 위한 , 움직임 보상을 이용한 예측 코딩 방법으로서 , 상기 코딩 방법은 상기 영상을 여러개의 영상 블록들로 분할하는 단계와 , 현재 블록과 하나 이상의 재구성된 블록들에 대한 움직임 보상(9) 처리를 한 이후에 계산하여 얻어지는 예측 블록간의 차이값(2)에 근거하여 생성된 잔류 블록(residual block)을 코딩하는 단계를 포함하는 움직임 보상을 이용한 예측 코딩 방법에 있어서 , 상기 현재 블록과 상기 예측 블록들은 상기 영상 블록에 대하여 이산적 코사인 변환(1)이 수행된 이후에 얻어진 계수들로 이루어진 블록들이며 , 상기 예측 계수 블록에서의 예측된 계수들의 계산(8)은 하나 이상의 상기 재구성 계수 블록에서의 재구성된 계수들에 직접적으로 기초하여 실행되고 , 상기 재구성은 상기 재구성 계수에 대하여 계산된 예측 계수의 함수로서 가중(weighting (intra-prediction mode, encoding mode) )시킴으로써 잔류 블록과 대응하는 예측 블록의 가산(6)에 기초하여 이루어지는 것을 특징으로 하는 움직임 보상을 이용한 예측 코딩 방법 .

KR20000053028A
CLAIM 5
제 1항에 있어서 , 영상 품질 레벨은 계수 블록에 있는 계수들에 대한 그룹화 (inter-prediction mode) 로서 범위가 결정되며 , 높은 품질 레벨은 더 높은 주파수 등급의 계수를 고려한 것이며 , 또한 상기 가중치는 상기 계산된 예측 계수의 품질 레벨에 해당하는 계수들에 대하여 1인 것을 특징으로 하는 움직임 보상을 이용하는 예측 코딩 방법 .

KR20000053028A
CLAIM 10
디지털 비디오 데이터 인코더로서 , 영상을 여러 개의 영상 블록들로 분할하는 회로와 , 계수 블록들을 생성하기 위하여 상기 영상 블록들(1)에 대하여 이산적 코사인 변환을 수행하는 회로와 , 움직임 보상 회로(8)를 구비하는 예측 루프를 포함하는 디지털 비디오 데이터 인코더에 있어서 , 상기 인코더가 또한 현재 계수 블록과 예측 계수 블록간의 차이값(2)을 계산하여 잔류 계수 블록을 (inter-prediction mode) 제공하는 회로를 더 포함하고 , 또한 상기 움직임 보상 회로는 하나 이상의 재구성 계수 블록들의 계수로부터 직접 예측 계수 블록을 계산하며 , 상기 재구성은 상기 재구성 계수들을 상기 계산된 예측 계수 함수로서 가중치 처리를 함으로써 잔류 블록을 상기 해당하는 예측 블록에 가산하는 것에 기초하여 수행되는 것을 특징으로 하는 디지털 비디오 데이터 인코더 .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (weighting) is the intra-prediction mode (weighting) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
KR20000053028A
CLAIM 1
비디오 영상의 디지털 데이터를 코딩하기 위한 , 움직임 보상을 이용한 예측 코딩 방법으로서 , 상기 코딩 방법은 상기 영상을 여러개의 영상 블록들로 분할하는 단계와 , 현재 블록과 하나 이상의 재구성된 블록들에 대한 움직임 보상(9) 처리를 한 이후에 계산하여 얻어지는 예측 블록간의 차이값(2)에 근거하여 생성된 잔류 블록(residual block)을 코딩하는 단계를 포함하는 움직임 보상을 이용한 예측 코딩 방법에 있어서 , 상기 현재 블록과 상기 예측 블록들은 상기 영상 블록에 대하여 이산적 코사인 변환(1)이 수행된 이후에 얻어진 계수들로 이루어진 블록들이며 , 상기 예측 계수 블록에서의 예측된 계수들의 계산(8)은 하나 이상의 상기 재구성 계수 블록에서의 재구성된 계수들에 직접적으로 기초하여 실행되고 , 상기 재구성은 상기 재구성 계수에 대하여 계산된 예측 계수의 함수로서 가중(weighting (intra-prediction mode, encoding mode) )시킴으로써 잔류 블록과 대응하는 예측 블록의 가산(6)에 기초하여 이루어지는 것을 특징으로 하는 움직임 보상을 이용한 예측 코딩 방법 .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (weighting) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (weighting) and an inter-prediction mode (블록을, 그룹화) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
KR20000053028A
CLAIM 1
비디오 영상의 디지털 데이터를 코딩하기 위한 , 움직임 보상을 이용한 예측 코딩 방법으로서 , 상기 코딩 방법은 상기 영상을 여러개의 영상 블록들로 분할하는 단계와 , 현재 블록과 하나 이상의 재구성된 블록들에 대한 움직임 보상(9) 처리를 한 이후에 계산하여 얻어지는 예측 블록간의 차이값(2)에 근거하여 생성된 잔류 블록(residual block)을 코딩하는 단계를 포함하는 움직임 보상을 이용한 예측 코딩 방법에 있어서 , 상기 현재 블록과 상기 예측 블록들은 상기 영상 블록에 대하여 이산적 코사인 변환(1)이 수행된 이후에 얻어진 계수들로 이루어진 블록들이며 , 상기 예측 계수 블록에서의 예측된 계수들의 계산(8)은 하나 이상의 상기 재구성 계수 블록에서의 재구성된 계수들에 직접적으로 기초하여 실행되고 , 상기 재구성은 상기 재구성 계수에 대하여 계산된 예측 계수의 함수로서 가중(weighting (intra-prediction mode, encoding mode) )시킴으로써 잔류 블록과 대응하는 예측 블록의 가산(6)에 기초하여 이루어지는 것을 특징으로 하는 움직임 보상을 이용한 예측 코딩 방법 .

KR20000053028A
CLAIM 5
제 1항에 있어서 , 영상 품질 레벨은 계수 블록에 있는 계수들에 대한 그룹화 (inter-prediction mode) 로서 범위가 결정되며 , 높은 품질 레벨은 더 높은 주파수 등급의 계수를 고려한 것이며 , 또한 상기 가중치는 상기 계산된 예측 계수의 품질 레벨에 해당하는 계수들에 대하여 1인 것을 특징으로 하는 움직임 보상을 이용하는 예측 코딩 방법 .

KR20000053028A
CLAIM 10
디지털 비디오 데이터 인코더로서 , 영상을 여러 개의 영상 블록들로 분할하는 회로와 , 계수 블록들을 생성하기 위하여 상기 영상 블록들(1)에 대하여 이산적 코사인 변환을 수행하는 회로와 , 움직임 보상 회로(8)를 구비하는 예측 루프를 포함하는 디지털 비디오 데이터 인코더에 있어서 , 상기 인코더가 또한 현재 계수 블록과 예측 계수 블록간의 차이값(2)을 계산하여 잔류 계수 블록을 (inter-prediction mode) 제공하는 회로를 더 포함하고 , 또한 상기 움직임 보상 회로는 하나 이상의 재구성 계수 블록들의 계수로부터 직접 예측 계수 블록을 계산하며 , 상기 재구성은 상기 재구성 계수들을 상기 계산된 예측 계수 함수로서 가중치 처리를 함으로써 잔류 블록을 상기 해당하는 예측 블록에 가산하는 것에 기초하여 수행되는 것을 특징으로 하는 디지털 비디오 데이터 인코더 .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (weighting) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
KR20000053028A
CLAIM 1
비디오 영상의 디지털 데이터를 코딩하기 위한 , 움직임 보상을 이용한 예측 코딩 방법으로서 , 상기 코딩 방법은 상기 영상을 여러개의 영상 블록들로 분할하는 단계와 , 현재 블록과 하나 이상의 재구성된 블록들에 대한 움직임 보상(9) 처리를 한 이후에 계산하여 얻어지는 예측 블록간의 차이값(2)에 근거하여 생성된 잔류 블록(residual block)을 코딩하는 단계를 포함하는 움직임 보상을 이용한 예측 코딩 방법에 있어서 , 상기 현재 블록과 상기 예측 블록들은 상기 영상 블록에 대하여 이산적 코사인 변환(1)이 수행된 이후에 얻어진 계수들로 이루어진 블록들이며 , 상기 예측 계수 블록에서의 예측된 계수들의 계산(8)은 하나 이상의 상기 재구성 계수 블록에서의 재구성된 계수들에 직접적으로 기초하여 실행되고 , 상기 재구성은 상기 재구성 계수에 대하여 계산된 예측 계수의 함수로서 가중(weighting (intra-prediction mode, encoding mode) )시킴으로써 잔류 블록과 대응하는 예측 블록의 가산(6)에 기초하여 이루어지는 것을 특징으로 하는 움직임 보상을 이용한 예측 코딩 방법 .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JP2000059769A

Filed: 1999-01-21     Issued: 2000-02-25

映像デ―タル―プフィルタリング方法及び装置

(Original Assignee) Korea Advanced Inst Of Sci Technol; Samsung Electronics Co Ltd; 三星電子株式会社; 韓國科學技術院     

Yung-Lyul Lee, Hyun-Wook Park, 玄旭 朴, 英烈 李
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax (前記一つ) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (予測モード) and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JP2000059769A
CLAIM 9
【請求項9】 映像フレームがインターフレームの場 合、現インターフレームのブロックAcに対するリンギ ングフラグ抽出は、 インターブロックの8×8ブロックにおける残差信号の 逆量子化係数IQCのAC成分のいずれか一つさえ ‘0’でなければ、現ブロックAcのリンギングフラグ RFを‘1’にセットし、前記IQCのAC成分がいず れも‘0’であれば、RFを‘0’にする段階と、 マクロブロックMBに対する動きベクトルを伝達する8 ×8予測モード (intra-prediction mode) が現ブロックに使用されると、RFを ‘1’にセットする段階とよりなることを特徴とする請 求項3に記載の映像データループフィルタリング方法。

JP2000059769A
CLAIM 21
【請求項21】 映像データをブロックに基づきエンコ ーディング及びデコーディングする時、ブロック4つが 会うクロスポイントのコーナーで発生するコーナー異常 値を低減させるために、 逆量子化された前記映像データの8×8ブロックのコー ナー異常値を検出する検出段階と、 前記検出されたコーナー異常値を補償する補償段階とを さらに具備することを特徴とし、 前記検出段階は、前記クロスポイントを中心とした4つ の画素を画素A,B,C及びDとし、A,B,C、及び Dの各画素値をa,b,c,dとし、平均値Avera ge=(a+b+c+d+2)/4とおき、各画素値と Averageとの差をH.263の量子化要素QPと 比較して、前記差がQPより大きければ、コーナー異常 値候補画素の個数を累積してコーナー異常値を検出し、 前記補償段階は、前記Aの属するブロックにおいて前記 Aと隣り合う画素をA 1 ,A 2 とし、前記画素Aと対角線 に位置する画素をA 3 とし、その画素値を各々a 1 ,a 2 及びa 3 とするとき、前記検出段階で検出されたコーナ ー異常値候補が一つで、該画素がAであれば、前記aと a 3 との差が3QP/2より小さければ、コーナー異常 値補償は次式 【数1】 のようになり、 前記コーナー異常値候補画素数が二つ以上であれば、 (a 3 +b 3 +c 3 +d 3 +2)/4(ここで、b 3 ,c 3 , d 3 もa 3 と同様の方式で定義される)から最大の差を有 する候補が選ばれ、該補償は前記一つ (third syntax) の候補の場合での ような方法で行われることを特徴とする請求項1に記載 の映像データループフィルタリング方法。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (前記一つ) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (予測モード) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JP2000059769A
CLAIM 9
【請求項9】 映像フレームがインターフレームの場 合、現インターフレームのブロックAcに対するリンギ ングフラグ抽出は、 インターブロックの8×8ブロックにおける残差信号の 逆量子化係数IQCのAC成分のいずれか一つさえ ‘0’でなければ、現ブロックAcのリンギングフラグ RFを‘1’にセットし、前記IQCのAC成分がいず れも‘0’であれば、RFを‘0’にする段階と、 マクロブロックMBに対する動きベクトルを伝達する8 ×8予測モード (intra-prediction mode) が現ブロックに使用されると、RFを ‘1’にセットする段階とよりなることを特徴とする請 求項3に記載の映像データループフィルタリング方法。

JP2000059769A
CLAIM 21
【請求項21】 映像データをブロックに基づきエンコ ーディング及びデコーディングする時、ブロック4つが 会うクロスポイントのコーナーで発生するコーナー異常 値を低減させるために、 逆量子化された前記映像データの8×8ブロックのコー ナー異常値を検出する検出段階と、 前記検出されたコーナー異常値を補償する補償段階とを さらに具備することを特徴とし、 前記検出段階は、前記クロスポイントを中心とした4つ の画素を画素A,B,C及びDとし、A,B,C、及び Dの各画素値をa,b,c,dとし、平均値Avera ge=(a+b+c+d+2)/4とおき、各画素値と Averageとの差をH.263の量子化要素QPと 比較して、前記差がQPより大きければ、コーナー異常 値候補画素の個数を累積してコーナー異常値を検出し、 前記補償段階は、前記Aの属するブロックにおいて前記 Aと隣り合う画素をA 1 ,A 2 とし、前記画素Aと対角線 に位置する画素をA 3 とし、その画素値を各々a 1 ,a 2 及びa 3 とするとき、前記検出段階で検出されたコーナ ー異常値候補が一つで、該画素がAであれば、前記aと a 3 との差が3QP/2より小さければ、コーナー異常 値補償は次式 【数1】 のようになり、 前記コーナー異常値候補画素数が二つ以上であれば、 (a 3 +b 3 +c 3 +d 3 +2)/4(ここで、b 3 ,c 3 , d 3 もa 3 と同様の方式で定義される)から最大の差を有 する候補が選ばれ、該補償は前記一つ (third syntax) の候補の場合での ような方法で行われることを特徴とする請求項1に記載 の映像データループフィルタリング方法。

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (予測モード) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
JP2000059769A
CLAIM 9
【請求項9】 映像フレームがインターフレームの場 合、現インターフレームのブロックAcに対するリンギ ングフラグ抽出は、 インターブロックの8×8ブロックにおける残差信号の 逆量子化係数IQCのAC成分のいずれか一つさえ ‘0’でなければ、現ブロックAcのリンギングフラグ RFを‘1’にセットし、前記IQCのAC成分がいず れも‘0’であれば、RFを‘0’にする段階と、 マクロブロックMBに対する動きベクトルを伝達する8 ×8予測モード (intra-prediction mode) が現ブロックに使用されると、RFを ‘1’にセットする段階とよりなることを特徴とする請 求項3に記載の映像データループフィルタリング方法。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (前記一つ) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (予測モード) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JP2000059769A
CLAIM 9
【請求項9】 映像フレームがインターフレームの場 合、現インターフレームのブロックAcに対するリンギ ングフラグ抽出は、 インターブロックの8×8ブロックにおける残差信号の 逆量子化係数IQCのAC成分のいずれか一つさえ ‘0’でなければ、現ブロックAcのリンギングフラグ RFを‘1’にセットし、前記IQCのAC成分がいず れも‘0’であれば、RFを‘0’にする段階と、 マクロブロックMBに対する動きベクトルを伝達する8 ×8予測モード (intra-prediction mode) が現ブロックに使用されると、RFを ‘1’にセットする段階とよりなることを特徴とする請 求項3に記載の映像データループフィルタリング方法。

JP2000059769A
CLAIM 21
【請求項21】 映像データをブロックに基づきエンコ ーディング及びデコーディングする時、ブロック4つが 会うクロスポイントのコーナーで発生するコーナー異常 値を低減させるために、 逆量子化された前記映像データの8×8ブロックのコー ナー異常値を検出する検出段階と、 前記検出されたコーナー異常値を補償する補償段階とを さらに具備することを特徴とし、 前記検出段階は、前記クロスポイントを中心とした4つ の画素を画素A,B,C及びDとし、A,B,C、及び Dの各画素値をa,b,c,dとし、平均値Avera ge=(a+b+c+d+2)/4とおき、各画素値と Averageとの差をH.263の量子化要素QPと 比較して、前記差がQPより大きければ、コーナー異常 値候補画素の個数を累積してコーナー異常値を検出し、 前記補償段階は、前記Aの属するブロックにおいて前記 Aと隣り合う画素をA 1 ,A 2 とし、前記画素Aと対角線 に位置する画素をA 3 とし、その画素値を各々a 1 ,a 2 及びa 3 とするとき、前記検出段階で検出されたコーナ ー異常値候補が一つで、該画素がAであれば、前記aと a 3 との差が3QP/2より小さければ、コーナー異常 値補償は次式 【数1】 のようになり、 前記コーナー異常値候補画素数が二つ以上であれば、 (a 3 +b 3 +c 3 +d 3 +2)/4(ここで、b 3 ,c 3 , d 3 もa 3 と同様の方式で定義される)から最大の差を有 する候補が選ばれ、該補償は前記一つ (third syntax) の候補の場合での ような方法で行われることを特徴とする請求項1に記載 の映像データループフィルタリング方法。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US6148109A

Filed: 1998-01-28     Issued: 2000-11-14

Image predictive coding method

(Original Assignee) Panasonic Corp     (Current Assignee) Sun Patent Trust

Choong Seng Boon, Sheng Mei Shen, Thiow Keng Tan
US9930365B2
CLAIM 1
. A method of decoding video data (image signal) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US6148109A
CLAIM 1
. An image predictive coding method (encoding mode) comprising : sampling an image signal (video data) into a plurality of blocks ;
transforming the image signal of the blocks into a two-dimensional array of DCT coefficients having a DC coefficient and AC coefficients ;
quantizing the DCT coefficients into a quantized DCT value ;
and predicting a quantized DC coefficient of a current block from the DC coefficient of an immediately adjacent block adaptively selected from either a left block or an above block ;
wherein the DC coefficient of the selected block used for said predicting a quantized DC coefficient of the current block is scaled by a ratio of a quantization step-size of the current block and a quantization step-size of the selected block .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US6148109A
CLAIM 1
. An image predictive coding method (encoding mode) comprising : sampling an image signal into a plurality of blocks ;
transforming the image signal of the blocks into a two-dimensional array of DCT coefficients having a DC coefficient and AC coefficients ;
quantizing the DCT coefficients into a quantized DCT value ;
and predicting a quantized DC coefficient of a current block from the DC coefficient of an immediately adjacent block adaptively selected from either a left block or an above block ;
wherein the DC coefficient of the selected block used for said predicting a quantized DC coefficient of the current block is scaled by a ratio of a quantization step-size of the current block and a quantization step-size of the selected block .

US9930365B2
CLAIM 7
. A device for decoding video data (image signal) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US6148109A
CLAIM 1
. An image predictive coding method (encoding mode) comprising : sampling an image signal (video data) into a plurality of blocks ;
transforming the image signal of the blocks into a two-dimensional array of DCT coefficients having a DC coefficient and AC coefficients ;
quantizing the DCT coefficients into a quantized DCT value ;
and predicting a quantized DC coefficient of a current block from the DC coefficient of an immediately adjacent block adaptively selected from either a left block or an above block ;
wherein the DC coefficient of the selected block used for said predicting a quantized DC coefficient of the current block is scaled by a ratio of a quantization step-size of the current block and a quantization step-size of the selected block .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US6148109A
CLAIM 1
. An image predictive coding method (encoding mode) comprising : sampling an image signal into a plurality of blocks ;
transforming the image signal of the blocks into a two-dimensional array of DCT coefficients having a DC coefficient and AC coefficients ;
quantizing the DCT coefficients into a quantized DCT value ;
and predicting a quantized DC coefficient of a current block from the DC coefficient of an immediately adjacent block adaptively selected from either a left block or an above block ;
wherein the DC coefficient of the selected block used for said predicting a quantized DC coefficient of the current block is scaled by a ratio of a quantization step-size of the current block and a quantization step-size of the selected block .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (image signal) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US6148109A
CLAIM 1
. An image predictive coding method (encoding mode) comprising : sampling an image signal (video data) into a plurality of blocks ;
transforming the image signal of the blocks into a two-dimensional array of DCT coefficients having a DC coefficient and AC coefficients ;
quantizing the DCT coefficients into a quantized DCT value ;
and predicting a quantized DC coefficient of a current block from the DC coefficient of an immediately adjacent block adaptively selected from either a left block or an above block ;
wherein the DC coefficient of the selected block used for said predicting a quantized DC coefficient of the current block is scaled by a ratio of a quantization step-size of the current block and a quantization step-size of the selected block .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US6148109A
CLAIM 1
. An image predictive coding method (encoding mode) comprising : sampling an image signal into a plurality of blocks ;
transforming the image signal of the blocks into a two-dimensional array of DCT coefficients having a DC coefficient and AC coefficients ;
quantizing the DCT coefficients into a quantized DCT value ;
and predicting a quantized DC coefficient of a current block from the DC coefficient of an immediately adjacent block adaptively selected from either a left block or an above block ;
wherein the DC coefficient of the selected block used for said predicting a quantized DC coefficient of the current block is scaled by a ratio of a quantization step-size of the current block and a quantization step-size of the selected block .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US6094225A

Filed: 1997-12-02     Issued: 2000-07-25

Method and apparatus for encoding mode signals for use in a binary shape coder

(Original Assignee) Daewoo Electronics Co Ltd     (Current Assignee) Quarterhill Inc

Seok-Won Han
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (reference blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (encoding mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US6094225A
CLAIM 1
. A method for encoding mode (encoding mode) signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks (current block) having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (reference blocks) .
US6094225A
CLAIM 1
. A method for encoding mode signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks (current block) having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (encoding mode) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US6094225A
CLAIM 1
. A method for encoding mode (encoding mode) signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (reference blocks) has a size of at least 64×64 pixels .
US6094225A
CLAIM 1
. A method for encoding mode signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks (current block) having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (reference blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (encoding mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US6094225A
CLAIM 1
. A method for encoding mode (encoding mode) signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks (current block) having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (reference blocks) .
US6094225A
CLAIM 1
. A method for encoding mode signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks (current block) having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (encoding mode) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US6094225A
CLAIM 1
. A method for encoding mode (encoding mode) signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (reference blocks) has a size of at least 64×64 pixels .
US6094225A
CLAIM 1
. A method for encoding mode signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks (current block) having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (reference blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (encoding mode) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US6094225A
CLAIM 1
. A method for encoding mode (encoding mode) signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks (current block) having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (reference blocks) .
US6094225A
CLAIM 1
. A method for encoding mode signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks (current block) having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (encoding mode) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US6094225A
CLAIM 1
. A method for encoding mode (encoding mode) signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (reference blocks) has a size of at least 64×64 pixels .
US6094225A
CLAIM 1
. A method for encoding mode signals of a target block of a binary shape signal , wherein the binary shape signal includes a plurality of pictures , each picture is divided into a multiplicity of blocks of M×N pixels having one of a first and a second binary values and the target block is encoded by either a frame-based coding to be encoded on an M×N pixels basis or a field-based coding to be encoded on an M/2×N pixels basis , the target block representing one of the blocks of a current picture to be encoded and M and N being positive even integers , respectively , comprising the steps of : (a) generating a first indication signal , if error of the target block with respect to a first reference block is not greater than a predetermined threshold , and generating a second indication signal , if error of the target block with respect to a second reference block is not greater than the predetermined threshold , the respective reference blocks (current block) having M×N pixels and all pixels of the first and the second reference blocks being of the first and the second binary values , respectively ;
(b) if none of the first and the second indication signals are generated in step (a) , selecting either the frame-based coding or the field-based coding to generate a coding mode signal , wherein the coding mode signal indicates whether the target block is encoded by using either the frame-based coding or the field-based coding ;
(c) generating a base mode of the target block based on the first and the second indication signals and the coding mode signal ;
(d) if the frame-based coding is selected in step (b) , coding the target block by the frame-based coding to generate a frame mode and frame-coded data , wherein the frame mode represents a coding condition of the frame-coded data ;
(e) if the field-based coding is selected in step (b) , coding the target block by the field-based coding to generate a field mode and field coded data ;
and (f) combining the base mode and the frame mode or the field mode to generate a mode signal for transmission of the target block .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JPH10191324A

Filed: 1997-11-05     Issued: 1998-07-21

画像符号化方法および画像符号化装置並びに画像符号化プログラムを記録した記録媒体

(Original Assignee) Matsushita Electric Ind Co Ltd; 松下電器産業株式会社     

Taisuke Matsumoto, 泰輔 松本
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax (ブロック分) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (符号化/復号) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JPH10191324A
CLAIM 2
【請求項2】 複数の画素からなる2次元のブロックに 分割されたディジタル画像をブロック毎に符号化する際 に、各ブロックの符号化方法を決定し、モード情報を出 力する行程と、前記モード情報を1フレーム内のすべて のブロック分 (third syntax, third syntax element) まとめて符号化する行程と、前記モード情 報にしたがって各ブロックのデータを順次符号化する行 程とを具備することを特徴とする画像符号化方法。

JPH10191324A
CLAIM 5
【請求項5】 複数の画素からなる2次元のブロックに 分割されたディジタル画像をブロック毎に符号化する際 に、各ブロックの符号化方式を決定する符号化モード決 定手段と (second syntax) 、前記符号化モード決定手段によって決定され たモード情報にしたがってブロック内のデータを符号化 するブロックデータ符号化手段と、前記符号化モード決 定手段によって決定されたモード情報を周辺のすでに符 号化済みのブロックのモード情報から予測し符号化する モード情報符号化手段を具備する画像符号化装置。

JPH10191324A
CLAIM 15
【請求項15】 請求項5または9記載の画像符号化装 置と請求項7または10記載の画像復号化装置よりなる 画像符号化/復号 (inter-prediction mode) 化装置。

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (ブロック分) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (符号化/復号) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JPH10191324A
CLAIM 2
【請求項2】 複数の画素からなる2次元のブロックに 分割されたディジタル画像をブロック毎に符号化する際 に、各ブロックの符号化方法を決定し、モード情報を出 力する行程と、前記モード情報を1フレーム内のすべて のブロック分 (third syntax, third syntax element) まとめて符号化する行程と、前記モード情 報にしたがって各ブロックのデータを順次符号化する行 程とを具備することを特徴とする画像符号化方法。

JPH10191324A
CLAIM 5
【請求項5】 複数の画素からなる2次元のブロックに 分割されたディジタル画像をブロック毎に符号化する際 に、各ブロックの符号化方式を決定する符号化モード決 定手段と (second syntax) 、前記符号化モード決定手段によって決定され たモード情報にしたがってブロック内のデータを符号化 するブロックデータ符号化手段と、前記符号化モード決 定手段によって決定されたモード情報を周辺のすでに符 号化済みのブロックのモード情報から予測し符号化する モード情報符号化手段を具備する画像符号化装置。

JPH10191324A
CLAIM 15
【請求項15】 請求項5または9記載の画像符号化装 置と請求項7または10記載の画像復号化装置よりなる 画像符号化/復号 (inter-prediction mode) 化装置。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (ブロック分) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (符号化/復号) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JPH10191324A
CLAIM 2
【請求項2】 複数の画素からなる2次元のブロックに 分割されたディジタル画像をブロック毎に符号化する際 に、各ブロックの符号化方法を決定し、モード情報を出 力する行程と、前記モード情報を1フレーム内のすべて のブロック分 (third syntax, third syntax element) まとめて符号化する行程と、前記モード情 報にしたがって各ブロックのデータを順次符号化する行 程とを具備することを特徴とする画像符号化方法。

JPH10191324A
CLAIM 5
【請求項5】 複数の画素からなる2次元のブロックに 分割されたディジタル画像をブロック毎に符号化する際 に、各ブロックの符号化方式を決定する符号化モード決 定手段と (second syntax) 、前記符号化モード決定手段によって決定され たモード情報にしたがってブロック内のデータを符号化 するブロックデータ符号化手段と、前記符号化モード決 定手段によって決定されたモード情報を周辺のすでに符 号化済みのブロックのモード情報から予測し符号化する モード情報符号化手段を具備する画像符号化装置。

JPH10191324A
CLAIM 15
【請求項15】 請求項5または9記載の画像符号化装 置と請求項7または10記載の画像復号化装置よりなる 画像符号化/復号 (inter-prediction mode) 化装置。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
WO9803018A2

Filed: 1997-06-12     Issued: 1998-01-22

Transmission and reception of encoded video pictures

(Original Assignee) Philips Electronics N.V.; Philips Norden Ab     

Antonius Adrianus Cornelis Maria Kalker, Rob Anne Beuker, Hendrik Gemmualdus Jacobus Theunis
US9930365B2
CLAIM 1
. A method of decoding video data (encoded video signal) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
WO9803018A2
CLAIM 17
. An encoded video signal (video data, video blocks) , including a segmentation map defining a segmentation of each picture into picture blocks having variable block sizes and coded picture data for each picture block , characterized in that the segmentation map is encoded in the form of a one-dimensional series of block-size codes , the size of each block being determined by the block-size code and the position of each block being in accordance with a predetermined scanning pattern .

US9930365B2
CLAIM 6
. The method of claim 1 , further comprising receiving a quantization parameter modification value (Huffman codes) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
WO9803018A2
CLAIM 12
. A method as claimed in claim 9 or 1 1 , wherein the runlengths are Huffman-coded , and comprising the step of decoding said Huffman codes (quantization parameter modification value) .

US9930365B2
CLAIM 7
. A device for decoding video data (encoded video signal) , the device comprising : a memory configured to store decoded video blocks (encoded video signal) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
WO9803018A2
CLAIM 17
. An encoded video signal (video data, video blocks) , including a segmentation map defining a segmentation of each picture into picture blocks having variable block sizes and coded picture data for each picture block , characterized in that the segmentation map is encoded in the form of a one-dimensional series of block-size codes , the size of each block being determined by the block-size code and the position of each block being in accordance with a predetermined scanning pattern .

US9930365B2
CLAIM 12
. The device of claim 7 , wherein the processor is further configured to receive a quantization parameter modification value (Huffman codes) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
WO9803018A2
CLAIM 12
. A method as claimed in claim 9 or 1 1 , wherein the runlengths are Huffman-coded , and comprising the step of decoding said Huffman codes (quantization parameter modification value) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (encoded video signal) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
WO9803018A2
CLAIM 17
. An encoded video signal (video data, video blocks) , including a segmentation map defining a segmentation of each picture into picture blocks having variable block sizes and coded picture data for each picture block , characterized in that the segmentation map is encoded in the form of a one-dimensional series of block-size codes , the size of each block being determined by the block-size code and the position of each block being in accordance with a predetermined scanning pattern .

US9930365B2
CLAIM 20
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to receive a quantization parameter modification value (Huffman codes) for the sub-block , wherein decoding the sub-block comprises dequantizing the sub-block according to the quantization parameter modification value .
WO9803018A2
CLAIM 12
. A method as claimed in claim 9 or 1 1 , wherein the runlengths are Huffman-coded , and comprising the step of decoding said Huffman codes (quantization parameter modification value) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5748789A

Filed: 1996-10-31     Issued: 1998-05-05

Transparent block skipping in object-based video coding systems

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Ming-Chieh Lee, Wei-ge Chen
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5748789A
CLAIM 1
. In an object-based video coding method (encoding mode) , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a first object , evaluating the shape of the first object to determine whether a transformation block in the first object is transparent based on the shape for the first object , and if so , then skipping texture coding for the transformation block ;
and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a first object , evaluating whether a transformation block in the first object is transparent based on the shape of the first object , and if so , then skipping texture decoding for the transformation block .

US5748789A
CLAIM 14
. A computer readable medium on which is stored software for coding video data (video data) , which when executed by a computer , perform the steps of : separately encoding video objects in a sequence of video frames including : separately coding shape for each of the objects , separately coding texture for each of the objects , and while coding texture for a first object , evaluating whether a transformation block is covered by the shape of the first object based on the shape for the first object , and if not , then skipping texture coding for the transformation block .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US5748789A
CLAIM 1
. In an object-based video coding method (encoding mode) , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a first object , evaluating the shape of the first object to determine whether a transformation block in the first object is transparent based on the shape for the first object , and if so , then skipping texture coding for the transformation block ;
and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a first object , evaluating whether a transformation block in the first object is transparent based on the shape of the first object , and if so , then skipping texture decoding for the transformation block .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5748789A
CLAIM 1
. In an object-based video coding method (encoding mode) , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a first object , evaluating the shape of the first object to determine whether a transformation block in the first object is transparent based on the shape for the first object , and if so , then skipping texture coding for the transformation block ;
and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a first object , evaluating whether a transformation block in the first object is transparent based on the shape of the first object , and if so , then skipping texture decoding for the transformation block .

US5748789A
CLAIM 14
. A computer readable medium on which is stored software for coding video data (video data) , which when executed by a computer , perform the steps of : separately encoding video objects in a sequence of video frames including : separately coding shape for each of the objects , separately coding texture for each of the objects , and while coding texture for a first object , evaluating whether a transformation block is covered by the shape of the first object based on the shape for the first object , and if not , then skipping texture coding for the transformation block .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US5748789A
CLAIM 1
. In an object-based video coding method (encoding mode) , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a first object , evaluating the shape of the first object to determine whether a transformation block in the first object is transparent based on the shape for the first object , and if so , then skipping texture coding for the transformation block ;
and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a first object , evaluating whether a transformation block in the first object is transparent based on the shape of the first object , and if so , then skipping texture decoding for the transformation block .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5748789A
CLAIM 1
. In an object-based video coding method (encoding mode) , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a first object , evaluating the shape of the first object to determine whether a transformation block in the first object is transparent based on the shape for the first object , and if so , then skipping texture coding for the transformation block ;
and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a first object , evaluating whether a transformation block in the first object is transparent based on the shape of the first object , and if so , then skipping texture decoding for the transformation block .

US5748789A
CLAIM 14
. A computer readable medium on which is stored software for coding video data (video data) , which when executed by a computer , perform the steps of : separately encoding video objects in a sequence of video frames including : separately coding shape for each of the objects , separately coding texture for each of the objects , and while coding texture for a first object , evaluating whether a transformation block is covered by the shape of the first object based on the shape for the first object , and if not , then skipping texture coding for the transformation block .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US5748789A
CLAIM 1
. In an object-based video coding method (encoding mode) , a method for reducing coding overhead comprising : separately encoding video objects in a sequence of video frames including : separately encoding shape for each of the objects , separately encoding texture for each of the objects , and while coding texture for a first object , evaluating the shape of the first object to determine whether a transformation block in the first object is transparent based on the shape for the first object , and if so , then skipping texture coding for the transformation block ;
and separately decoding video objects in the sequence of video frames including : separately decoding shape for each of the objects , separately decoding texture for each of the objects , and while decoding texture for a first object , evaluating whether a transformation block in the first object is transparent based on the shape of the first object , and if so , then skipping texture decoding for the transformation block .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5959673A

Filed: 1996-06-04     Issued: 1999-09-28

Transform coding of dense motion vector fields for frame and object based video coding applications

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Ming-Chieh Lee, Wei-ge Chen
US9930365B2
CLAIM 1
. A method of decoding video data (discrete cosine, second video) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5959673A
CLAIM 1
. A compression method of representing changes between corresponding pixels in first and second video (decoding video data) image frames , comprising : determining motion vectors for all pixel locations in each region in a set of regions of the first video image frame by estimating the motion of all of the pixels in each region relative to pixels in the second video image frame ;
wherein the regions correspond to regions of a first image feature having a non-rectangular boundary in the first image frame and the motion vectors for at least one of the regions represent a motion vector field having a non-rectangular shape corresponding to the boundary of the first image feature ;
transforming the motion vector field having a non-rectangular shape to a motion vector field of rectangular configuration for encoding in the first compressed format , where the rectangular configuration comprises a two dimensional array of motion vectors , and transforming the motion vector field includes assigning motion vectors to pixel locations within the rectangular configuration but not overlapping the non-rectangular shape of the motion vector field ;
and encoding the motion vectors for each of the regions in a first compressed format to produce compressed motion vectors for each region , including transforming the motion vectors for each region into spatial frequency components and transforming the motion vectors for each pixel location in the rectangular configuration into spatial frequency components .

US5959673A
CLAIM 6
. The method of claim 1 in which the first compressed format includes a discrete cosine (decoding video data) transformation or lattice wavelet compression .

US9930365B2
CLAIM 7
. A device for decoding video data (discrete cosine, second video) , the device comprising : a memory configured to store decoded video blocks (lossy compression) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5959673A
CLAIM 1
. A compression method of representing changes between corresponding pixels in first and second video (decoding video data) image frames , comprising : determining motion vectors for all pixel locations in each region in a set of regions of the first video image frame by estimating the motion of all of the pixels in each region relative to pixels in the second video image frame ;
wherein the regions correspond to regions of a first image feature having a non-rectangular boundary in the first image frame and the motion vectors for at least one of the regions represent a motion vector field having a non-rectangular shape corresponding to the boundary of the first image feature ;
transforming the motion vector field having a non-rectangular shape to a motion vector field of rectangular configuration for encoding in the first compressed format , where the rectangular configuration comprises a two dimensional array of motion vectors , and transforming the motion vector field includes assigning motion vectors to pixel locations within the rectangular configuration but not overlapping the non-rectangular shape of the motion vector field ;
and encoding the motion vectors for each of the regions in a first compressed format to produce compressed motion vectors for each region , including transforming the motion vectors for each region into spatial frequency components and transforming the motion vectors for each pixel location in the rectangular configuration into spatial frequency components .

US5959673A
CLAIM 5
. The method of claim 1 in which the first compressed format includes lossy compression (video blocks) .

US5959673A
CLAIM 6
. The method of claim 1 in which the first compressed format includes a discrete cosine (decoding video data) transformation or lattice wavelet compression .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (two dimensional array) , or a set-top box .
US5959673A
CLAIM 1
. A compression method of representing changes between corresponding pixels in first and second video image frames , comprising : determining motion vectors for all pixel locations in each region in a set of regions of the first video image frame by estimating the motion of all of the pixels in each region relative to pixels in the second video image frame ;
wherein the regions correspond to regions of a first image feature having a non-rectangular boundary in the first image frame and the motion vectors for at least one of the regions represent a motion vector field having a non-rectangular shape corresponding to the boundary of the first image feature ;
transforming the motion vector field having a non-rectangular shape to a motion vector field of rectangular configuration for encoding in the first compressed format , where the rectangular configuration comprises a two dimensional array (receiver device) of motion vectors , and transforming the motion vector field includes assigning motion vectors to pixel locations within the rectangular configuration but not overlapping the non-rectangular shape of the motion vector field ;
and encoding the motion vectors for each of the regions in a first compressed format to produce compressed motion vectors for each region , including transforming the motion vectors for each region into spatial frequency components and transforming the motion vectors for each pixel location in the rectangular configuration into spatial frequency components .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5970173A

Filed: 1996-06-04     Issued: 1999-10-19

Image compression and affine transformation for image motion compensation

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Ming-Chieh Lee, Wei-ge Chen
US9930365B2
CLAIM 1
. A method of decoding video data (affine transformation) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (singular value) of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5970173A
CLAIM 2
. The method of claim 1 in which the multi-dimensional affine motion transformations between the corresponding image components are represented as affine transformation (video data) coefficients and quantizing the multi-dimensional affine motion transformations includes representing the affine transformation coefficients by selected pairs of corresponding pixels in the first and second video image frames .

US5970173A
CLAIM 8
. The method of claim 6 in which step (c) includes performing a singular value (minimum size) decomposition process .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (singular value) .
US5970173A
CLAIM 8
. The method of claim 6 in which step (c) includes performing a singular value (minimum size) decomposition process .

US9930365B2
CLAIM 7
. A device for decoding video data (affine transformation) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (singular value) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5970173A
CLAIM 2
. The method of claim 1 in which the multi-dimensional affine motion transformations between the corresponding image components are represented as affine transformation (video data) coefficients and quantizing the multi-dimensional affine motion transformations includes representing the affine transformation coefficients by selected pairs of corresponding pixels in the first and second video image frames .

US5970173A
CLAIM 8
. The method of claim 6 in which step (c) includes performing a singular value (minimum size) decomposition process .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (singular value) .
US5970173A
CLAIM 8
. The method of claim 6 in which step (c) includes performing a singular value (minimum size) decomposition process .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (affine transformation) , the first syntax element representing a minimum size (singular value) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5970173A
CLAIM 2
. The method of claim 1 in which the multi-dimensional affine motion transformations between the corresponding image components are represented as affine transformation (video data) coefficients and quantizing the multi-dimensional affine motion transformations includes representing the affine transformation coefficients by selected pairs of corresponding pixels in the first and second video image frames .

US5970173A
CLAIM 8
. The method of claim 6 in which step (c) includes performing a singular value (minimum size) decomposition process .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size and greater than the minimum size (singular value) .
US5970173A
CLAIM 8
. The method of claim 6 in which step (c) includes performing a singular value (minimum size) decomposition process .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
EP0727910A2

Filed: 1996-02-05     Issued: 1996-08-21

Temporal-spatial error concealment apparatus and method for video signal processors

(Original Assignee) Thomson Multimedia SA     (Current Assignee) Technicolor SA

Max Chien, Wilson Kwok, Huifang Sun
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (bad block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
EP0727910A2
CLAIM 13
The apparatus set forth in claim 11 characterized in that said said apparatus for determining the relative correlation of image motion includes apparatus for calculating the relative correlation of motion vectors associated with blocks of compressed video data (video data) adjacent said block of bad pixel data .

EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (current block) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (bad block) .
EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (current block) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (bad block) has a size of at least 64×64 pixels .
EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (current block) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (bad block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
EP0727910A2
CLAIM 13
The apparatus set forth in claim 11 characterized in that said said apparatus for determining the relative correlation of image motion includes apparatus for calculating the relative correlation of motion vectors associated with blocks of compressed video data (video data) adjacent said block of bad pixel data .

EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (current block) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (bad block) .
EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (current block) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (bad block) has a size of at least 64×64 pixels .
EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (current block) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (image area) , or a set-top box .
EP0727910A2
CLAIM 2
The apparatus set forth in claim 1 characterized in that said interpolator includes digital signal processing apparatus operative upon said pixel data located spatially adjacent to a block of bad pixel data for detecting a direction of a dominant image gradient within an image area (broadcast receiver device) represented by pixel data located spatially adjacent to a block of bad pixel data .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (bad block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
EP0727910A2
CLAIM 13
The apparatus set forth in claim 11 characterized in that said said apparatus for determining the relative correlation of image motion includes apparatus for calculating the relative correlation of motion vectors associated with blocks of compressed video data (video data) adjacent said block of bad pixel data .

EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (current block) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (bad block) .
EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (current block) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (bad block) has a size of at least 64×64 pixels .
EP0727910A2
CLAIM 14
A method for generating a block of pixel data for a missing or corrupt block of pixel data (hereafter bad block (current block) ) , characterized by ;
   providing blocks of pixel data and an error signal indicating which of said blocks are bad blocks ;
   determining a dominant image gradient in blocks of pixels adjacent a bad block ;
   spatially interpolating , in accordance with said dominant image gradient , a substitute block of data for said bad block in response to said error signal ;
   temporally predicting a substitute block of data for said bad block in response to said error signal ;
   converting the spatially interpolated substitute block to a block of transform coefficients , TC1 , representing frequency spectra ;
   converting the temporally predicted substitute block to a block of transform coefficients , TC2 , representing frequency spectra ;
   forming a further block of coefficients by selecting respective coefficients from said blocks of transform coefficients TC1 and TC2 according to a predetermined criteria ;
   inverse transforming said further block of coefficients to produce a substitute block of pixel data .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5692063A

Filed: 1996-01-19     Issued: 1997-11-25

Method and system for unrestricted motion estimation for video

(Original Assignee) Microsoft Corp     (Current Assignee) Microsoft Technology Licensing LLC

Ming-Chieh Lee, Wei-ge Chen
US9930365B2
CLAIM 1
. A method of decoding video data (second video) , the method comprising : decoding a first syntax (horizontal boundary, vertical boundary) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax (horizontal boundary, vertical boundary) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5692063A
CLAIM 1
. A method of motion estimation , comprising : identifying a group of border pixels of an object in a first video frame in contact with a boundary of said first video image frame , the border pixels having pixel characteristics representing the object ;
appending a group of virtual pixels outside said first video image frame adjacent to said group of border pixels ;
assigning to said group of virtual pixels , pixel characteristics related to the pixel characteristics of said group of border pixels ;
and estimating a portion of said object in a second video (decoding video data) image frame , subsequent to the first video image frame , from the pixel characteristics assigned to said group of virtual pixels .

US5692063A
CLAIM 3
. The method of claim 2 where the object in the first video frame contacts a vertical boundary (first syntax, third syntax) of said first video image frame and the group of virtual pixels includes rows of virtual pixels corresponding to the border pixels contacting the vertical boundary .

US5692063A
CLAIM 5
. The method of claim 2 where the object in the first video frame contacts a horizontal boundary (first syntax, third syntax) of said first video image frame and the group of virtual pixels includes columns of virtual pixels corresponding to the border pixels contacting the horizontal boundary .

US9930365B2
CLAIM 7
. A device for decoding video data (second video) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax (horizontal boundary, vertical boundary) element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (horizontal boundary, vertical boundary) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5692063A
CLAIM 1
. A method of motion estimation , comprising : identifying a group of border pixels of an object in a first video frame in contact with a boundary of said first video image frame , the border pixels having pixel characteristics representing the object ;
appending a group of virtual pixels outside said first video image frame adjacent to said group of border pixels ;
assigning to said group of virtual pixels , pixel characteristics related to the pixel characteristics of said group of border pixels ;
and estimating a portion of said object in a second video (decoding video data) image frame , subsequent to the first video image frame , from the pixel characteristics assigned to said group of virtual pixels .

US5692063A
CLAIM 3
. The method of claim 2 where the object in the first video frame contacts a vertical boundary (first syntax, third syntax) of said first video image frame and the group of virtual pixels includes rows of virtual pixels corresponding to the border pixels contacting the vertical boundary .

US5692063A
CLAIM 5
. The method of claim 2 where the object in the first video frame contacts a horizontal boundary (first syntax, third syntax) of said first video image frame and the group of virtual pixels includes columns of virtual pixels corresponding to the border pixels contacting the horizontal boundary .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device (said object) , a broadcast receiver device , or a set-top box .
US5692063A
CLAIM 1
. A method of motion estimation , comprising : identifying a group of border pixels of an object in a first video frame in contact with a boundary of said first video image frame , the border pixels having pixel characteristics representing the object ;
appending a group of virtual pixels outside said first video image frame adjacent to said group of border pixels ;
assigning to said group of virtual pixels , pixel characteristics related to the pixel characteristics of said group of border pixels ;
and estimating a portion of said object (mobile device) in a second video image frame , subsequent to the first video image frame , from the pixel characteristics assigned to said group of virtual pixels .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax (horizontal boundary, vertical boundary) element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax (horizontal boundary, vertical boundary) element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5692063A
CLAIM 3
. The method of claim 2 where the object in the first video frame contacts a vertical boundary (first syntax, third syntax) of said first video image frame and the group of virtual pixels includes rows of virtual pixels corresponding to the border pixels contacting the vertical boundary .

US5692063A
CLAIM 5
. The method of claim 2 where the object in the first video frame contacts a horizontal boundary (first syntax, third syntax) of said first video image frame and the group of virtual pixels includes columns of virtual pixels corresponding to the border pixels contacting the horizontal boundary .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5576767A

Filed: 1995-09-21     Issued: 1996-11-19

Interframe video encoding and decoding system

(Original Assignee) Qualcomm Inc     (Current Assignee) Qualcomm Inc

Chong U. Lee, Donald Pian
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (reference blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference blocks (current block) of data .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (reference blocks) .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference blocks (current block) of data .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (reference blocks) has a size of at least 64×64 pixels .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference blocks (current block) of data .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (reference blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference blocks (current block) of data .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (reference blocks) .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference blocks (current block) of data .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (reference blocks) has a size of at least 64×64 pixels .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference blocks (current block) of data .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (reference blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference blocks (current block) of data .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (reference blocks) .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference blocks (current block) of data .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (reference blocks) has a size of at least 64×64 pixels .
US5576767A
CLAIM 4
. The system of claim 2 wherein said at least one additional motion predictor compares N/2×N/2 sub-blocks of pixel data with N/2×N/2 sub-blocks of data in said additional reference blocks (current block) of data .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5768434A

Filed: 1995-06-07     Issued: 1998-06-16

Quadtree-structured walsh transform coding

(Original Assignee) National Semiconductor Corp     (Current Assignee) National Semiconductor Corp

Xiaonong Ran
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (more blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks (current block) , wherein no two blocks in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (more blocks) .
US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks (current block) , wherein no two blocks in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (more blocks) has a size of at least 64×64 pixels .
US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks (current block) , wherein no two blocks in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (more blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks (current block) , wherein no two blocks in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (more blocks) .
US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks (current block) , wherein no two blocks in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (more blocks) has a size of at least 64×64 pixels .
US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks (current block) , wherein no two blocks in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (two dimensional array) , or a set-top box .
US5768434A
CLAIM 7
. The method of claim 6 , wherein partitioning the two-dimensional array to form a quadtree partition , comprises : forming an intermediate partition of the two dimensional array (receiver device) , the intermediate partition containing four blocks which when combined form a merged block ;
determining a measure of additional error caused by representing the merged block with the fixed number of Walsh transform coefficients instead of separately representing each of the four blocks with the fixed number of Walsh transform coefficients ;
and replacing the four blocks in the intermediate partition with the merged block if the additional error is less than a threshold value .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (more blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions (first partition) .
US5768434A
CLAIM 16
. The method of claim 6 , wherein the step of generating signals which identify the blocks in the quadtree partition comprises the sequential steps of : providing a signal for each block which is in a first partition (sub partitions) from a series of uniform partitions , wherein the signal for a block has a first value if the block is in the quadtree partition and has a second value otherwise ;
and for each block in a second partition from the series of uniform partitions : providing a signal having the first value if the block in the quadtree partition ;
and providing a signal having the second value if the block is not in the quadtree partition and is not a subset of a block in the quadtree partition .

US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks (current block) , wherein no two blocks in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (more blocks) .
US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks (current block) , wherein no two blocks in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (more blocks) has a size of at least 64×64 pixels .
US5768434A
CLAIM 17
. The method of claim 6 , wherein : a series of partitions of the two-dimensional array contains a partition corresponding to each value of an index I with N< ;
I< ;
M , wherein the partition corresponding to each value of the index I contains 4 I non-intersecting blocks of data from the two-dimensional array , wherein N and M are integers with N< ;
M , and wherein for each integer J with N< ;
J< ;
M , each block in a partition corresponding to J equals a union of four blocks in a partition corresponding to J+1 ;
and generating signals which identify the blocks in the quadtree partition further comprises the sequential steps of : providing a signal having a first value for each of a sequence of one or more blocks (current block) , wherein no two blocks in the sequence of blocks are in the same partition in the series of partitions , each block in the sequence is a subset of preceding block , and none of the blocks in the sequence is in the quadtree partition ;
and providing a signal having a second value for a block which is in the quadtree partition and is a subset of the last block of the sequence .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5666461A

Filed: 1995-05-30     Issued: 1997-09-09

High efficiency encoding and decoding of picture signals and recording medium containing same

(Original Assignee) Sony Corp     (Current Assignee) Sony Corp

Katsuji Igarashi, Jun Yonemitsu, Yoichi Yagasaki, Yasushi Fujinami, Tomoyuki Sato, Motoki Kato, Teruhiko Suzuki
US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (picture portion) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5666461A
CLAIM 1
. A recording medium having data recorded thereon which represents : (i) an encoded picture signal having a plurality of macroblocks and including picture portion (video blocks) s having been frame-based and/or field-based type predictively encoded and frame-based and/or field-based type orthogonally transformed processed in which each of said picture portions includes at least one of said macroblocks and in which said encoded picture signal represents a sequence of pictures such that said sequence of pictures provides a substantially continuous video image when reproduced from said recording medium , and (ii) header information which includes (a) information identifying the type of predictive encoding utilized for each respective picture portion , (b) information identifying the type of orthogonal transformation processing utilized for each respective picture portion , and (c) macroblock address information indicating the respective location of each macroblock , in which the type of predictive encoding utilized for each respective picture portion is selected from among the frame-based type predictive encoding and field-based type predictive encoding and in which the type of orthogonal transformation processing is selected from among the frame-based type orthogonal transformation processing and field-based orthogonal transformation processing so as to provide a relatively high efficient encoding .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5539466A

Filed: 1994-09-26     Issued: 1996-07-23

Efficient coding apparatus for picture signal and decoding apparatus therefor

(Original Assignee) Sony Corp     (Current Assignee) Sony Corp

Katsuji Igarashi, Tomoyuki Sato
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5539466A
CLAIM 1
. A high efficiency encoding method (encoding mode) for a sequence of frame pictures , each frame picture being comprised of plural macroblocks , each of said frame pictures having a first field picture and a second field picture , comprising the steps of : selecting either a first encoding mode or a second encoding mode on a frame picture basis ;
when said first encoding mode is selected , carrying out orthogonal transform encoding of said plural macroblocks by adaptively selecting on a macroblock basis either a frame processing mode for orthogonally transforming a block comprised of a first field component and a second field component , or a field processing mode for orthogonally transforming a block comprised of either said first field component or said second field component ;
and when said second encoding mode is selected , carrying out orthogonal transform encoding of a block composed only of said first field component for one frame picture and then carrying out orthogonal transform encoding of a block composed only of said second field component for one frame picture so that said second field component of said block can be predicted by using said first field component of said block .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US5539466A
CLAIM 1
. A high efficiency encoding method (encoding mode) for a sequence of frame pictures , each frame picture being comprised of plural macroblocks , each of said frame pictures having a first field picture and a second field picture , comprising the steps of : selecting either a first encoding mode or a second encoding mode on a frame picture basis ;
when said first encoding mode is selected , carrying out orthogonal transform encoding of said plural macroblocks by adaptively selecting on a macroblock basis either a frame processing mode for orthogonally transforming a block comprised of a first field component and a second field component , or a field processing mode for orthogonally transforming a block comprised of either said first field component or said second field component ;
and when said second encoding mode is selected , carrying out orthogonal transform encoding of a block composed only of said first field component for one frame picture and then carrying out orthogonal transform encoding of a block composed only of said second field component for one frame picture so that said second field component of said block can be predicted by using said first field component of said block .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (first encoding mode) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5539466A
CLAIM 1
. A high efficiency encoding method (encoding mode) for a sequence of frame pictures , each frame picture being comprised of plural macroblocks , each of said frame pictures having a first field picture and a second field picture , comprising the steps of : selecting either a first encoding mode (video blocks) or a second encoding mode on a frame picture basis ;
when said first encoding mode is selected , carrying out orthogonal transform encoding of said plural macroblocks by adaptively selecting on a macroblock basis either a frame processing mode for orthogonally transforming a block comprised of a first field component and a second field component , or a field processing mode for orthogonally transforming a block comprised of either said first field component or said second field component ;
and when said second encoding mode is selected , carrying out orthogonal transform encoding of a block composed only of said first field component for one frame picture and then carrying out orthogonal transform encoding of a block composed only of said second field component for one frame picture so that said second field component of said block can be predicted by using said first field component of said block .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US5539466A
CLAIM 1
. A high efficiency encoding method (encoding mode) for a sequence of frame pictures , each frame picture being comprised of plural macroblocks , each of said frame pictures having a first field picture and a second field picture , comprising the steps of : selecting either a first encoding mode or a second encoding mode on a frame picture basis ;
when said first encoding mode is selected , carrying out orthogonal transform encoding of said plural macroblocks by adaptively selecting on a macroblock basis either a frame processing mode for orthogonally transforming a block comprised of a first field component and a second field component , or a field processing mode for orthogonally transforming a block comprised of either said first field component or said second field component ;
and when said second encoding mode is selected , carrying out orthogonal transform encoding of a block composed only of said first field component for one frame picture and then carrying out orthogonal transform encoding of a block composed only of said second field component for one frame picture so that said second field component of said block can be predicted by using said first field component of said block .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5539466A
CLAIM 1
. A high efficiency encoding method (encoding mode) for a sequence of frame pictures , each frame picture being comprised of plural macroblocks , each of said frame pictures having a first field picture and a second field picture , comprising the steps of : selecting either a first encoding mode or a second encoding mode on a frame picture basis ;
when said first encoding mode is selected , carrying out orthogonal transform encoding of said plural macroblocks by adaptively selecting on a macroblock basis either a frame processing mode for orthogonally transforming a block comprised of a first field component and a second field component , or a field processing mode for orthogonally transforming a block comprised of either said first field component or said second field component ;
and when said second encoding mode is selected , carrying out orthogonal transform encoding of a block composed only of said first field component for one frame picture and then carrying out orthogonal transform encoding of a block composed only of said second field component for one frame picture so that said second field component of said block can be predicted by using said first field component of said block .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US5539466A
CLAIM 1
. A high efficiency encoding method (encoding mode) for a sequence of frame pictures , each frame picture being comprised of plural macroblocks , each of said frame pictures having a first field picture and a second field picture , comprising the steps of : selecting either a first encoding mode or a second encoding mode on a frame picture basis ;
when said first encoding mode is selected , carrying out orthogonal transform encoding of said plural macroblocks by adaptively selecting on a macroblock basis either a frame processing mode for orthogonally transforming a block comprised of a first field component and a second field component , or a field processing mode for orthogonally transforming a block comprised of either said first field component or said second field component ;
and when said second encoding mode is selected , carrying out orthogonal transform encoding of a block composed only of said first field component for one frame picture and then carrying out orthogonal transform encoding of a block composed only of said second field component for one frame picture so that said second field component of said block can be predicted by using said first field component of said block .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5594504A

Filed: 1994-07-06     Issued: 1997-01-14

Predictive video coding using a motion vector updating routine

(Original Assignee) Nokia of America Corp     (Current Assignee) Nokia of America Corp

Touradj Ebrahimi
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (predetermined shape) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (frame prediction) of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5594504A
CLAIM 1
. An improved method of motion field estimation for use in motion compensated frame-to-frame prediction (encoding mode comprises one) coding comprising the steps of : dividing a frame having a plurality of pixels into a plurality of smaller regions to form a first segmentation level ;
assigning to each of said plurality of smaller regions an initial motion vector ;
computing a plurality of matching errors for each smaller region ;
determining the value of the smallest matching error among the computed matching errors for each smaller region ;
and performing for each of said plurality of smaller regions a motion vector updating routine which updates the motion vector of a smaller region by assigning to it a best motion vector selected from among the initial motion vector assigned to the smaller region , a matched motion vector obtained by performing a block matching technique for the smaller region , and motion vectors of the smaller region' ;
s neighboring regions , wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions .

US5594504A
CLAIM 4
. The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ;
(b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ;
and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (predetermined shape) and greater than the minimum size .
US5594504A
CLAIM 4
. The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ;
(b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ;
and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (motion estimation unit) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (predetermined shape) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (frame prediction) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5594504A
CLAIM 1
. An improved method of motion field estimation for use in motion compensated frame-to-frame prediction (encoding mode comprises one) coding comprising the steps of : dividing a frame having a plurality of pixels into a plurality of smaller regions to form a first segmentation level ;
assigning to each of said plurality of smaller regions an initial motion vector ;
computing a plurality of matching errors for each smaller region ;
determining the value of the smallest matching error among the computed matching errors for each smaller region ;
and performing for each of said plurality of smaller regions a motion vector updating routine which updates the motion vector of a smaller region by assigning to it a best motion vector selected from among the initial motion vector assigned to the smaller region , a matched motion vector obtained by performing a block matching technique for the smaller region , and motion vectors of the smaller region' ;
s neighboring regions , wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions .

US5594504A
CLAIM 4
. The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ;
(b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ;
and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level .

US5594504A
CLAIM 19
. A system for performing motion field estimation and coding of a video image , said system comprising : a motion estimation unit (video blocks) for refining a motion vector assigned to a block of pixels in a current frame by assigning to the block of pixels a best motion vector selected from among a plurality of candidate motion vectors , wherein said plurality of candidate motion vectors includes an initial motion vector assigned to said block of pixels , a matched motion vector obtained by performing a block matching technique for said block of pixels , and motion vectors of regions neighboring said block of pixels , said motion estimation unit further computing a smallest matching error from among the plurality of candidate vectors and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions ;
a merge and label unit for merging adjacent regions of the current frame that have similar motion vectors to form merged regions and for assigning to each pixel in the current frame a region label ;
a first region label memory unit for storing a motion vector associated with each merged region ;
a second region label memory unit for storing the region label assigned to each pixel in the current frame ;
a region shape representation unit for forming a set of contour pixels that defines the merged regions ;
a contour coding unit for encoding the set of contour pixels ;
and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the system .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (predetermined shape) and greater than the minimum size .
US5594504A
CLAIM 4
. The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ;
(b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ;
and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions (frame memory) that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (predetermined shape) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one (frame prediction) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5594504A
CLAIM 1
. An improved method of motion field estimation for use in motion compensated frame-to-frame prediction (encoding mode comprises one) coding comprising the steps of : dividing a frame having a plurality of pixels into a plurality of smaller regions to form a first segmentation level ;
assigning to each of said plurality of smaller regions an initial motion vector ;
computing a plurality of matching errors for each smaller region ;
determining the value of the smallest matching error among the computed matching errors for each smaller region ;
and performing for each of said plurality of smaller regions a motion vector updating routine which updates the motion vector of a smaller region by assigning to it a best motion vector selected from among the initial motion vector assigned to the smaller region , a matched motion vector obtained by performing a block matching technique for the smaller region , and motion vectors of the smaller region' ;
s neighboring regions , wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions .

US5594504A
CLAIM 4
. The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ;
(b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ;
and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level .

US5594504A
CLAIM 17
. An apparatus for use in improved motion field estimation of a video image , said apparatus comprising : a previous decoded frame memory (thereon instructions) unit for storing a monochrome intensity for each pixel in a preceding frame ;
a current frame memory unit for storing a monochrome intensity for each pixel in a current frame ;
a motion field memory unit for storing a motion vector for each pixel in the current frame ;
a motion refinement unit for performing a block matching technique for a block of pixels in the current frame and for providing a matched motion vector ;
a candidate motion vector memory unit for storing motion vectors which are candidates for updating the motion vector of said block of pixels , wherein the candidate motion vector memory unit is connected to said motion field memory unit for receiving and storing an initial motion vector assigned to said block of pixels and motion vectors of regions neighboring said block of pixels , and wherein the candidate motion vector memory unit is further connected to said motion refinement unit for receiving and storing the matched motion vector ;
a matching error computing unit for computing matching errors obtained by assigning to said block of pixels the motion vectors stored in the candidate motion vector memory unit ;
a matching error memory unit for storing the matching errors computed by said matching error computing unit ;
a minimum detector unit for determining the value of the smallest matching error among the matching errors stored in the matching error memory unit ;
a best motion vector selection unit , connected to the motion field memory unit , for determining a best motion vector for refining the motion vector of the block of pixels , wherein the best motion vector is selected from among the motion vectors stored in the candidate motion vector memory unit , and wherein the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one of the matching errors stored in the matching error memory unit ;
and a control unit for controlling the other units and their interaction , wherein the control unit is connected to each of the other units in the apparatus .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (predetermined shape) and greater than the minimum size .
US5594504A
CLAIM 4
. The method of claim 3 further including the steps of : (a) dividing each smaller region in the previous segmentation level into a plurality of smaller regions of predetermined shape (maximum size) and size to form a subsequent segmentation level ;
(b) assigning to each of the plurality of smaller regions in the subsequent segmentation level an initial motion vector equal to the motion vector of its parent region ;
and (c) performing the motion vector updating routine for each of said plurality of smaller regions in the subsequent segmentation level .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one (initial vector) or more syntax elements representative of partitioning for the current block .
US5594504A
CLAIM 15
. In a method of motion compensated frame-to-frame prediction coding , wherein a displacement vector is transmitted to a video receiver decoder for each region of a current frame , an improvement for selecting the displacement vector corresponding to a group of pixels in the current frame , where said group of pixels was segmented from a larger region having a motion vector , said improvement comprising the steps of : assigning to said group of pixels an initial motion vector equal to the motion vector of the larger region ;
computing a matching error obtained by assigning the initial vector (processor to decode one) to said group of pixels ;
performing a block matching motion estimation technique with respect to the group of pixels to provide a matched motion vector ;
computing a matching error obtained by assigning the matched motion vector to the group of pixels ;
computing matching errors obtained by assigning motion vectors of neighboring regions to the group of pixels ;
determining the smallest matching error from among said computed matching error and said computed matching errors ;
and assigning to the group of pixels a best motion vector selected from among the initial motion vector , the motion vectors of the neighboring regions , and the matched motion vector , where the best motion vector is selected by comparing a predetermined threshold value to the magnitude of the difference between the smallest matching error and at least one matching error corresponding to said initial motion vector , said matched motion vector , or one of the motion vectors of said neighboring regions .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
JPH07274176A

Filed: 1994-03-30     Issued: 1995-10-20

動画像伝送装置

(Original Assignee) Toshiba Corp; 株式会社東芝     

Shinichiro Koto, 晋一郎 古藤
US9930365B2
CLAIM 1
. A method of decoding video data (前記送信) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (参照画) and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
JPH07274176A
CLAIM 1
【請求項1】 動画像信号を参照画 (intra-prediction mode) 像を用いて符号化す る符号化手段と (second syntax) 、符号化された動画像信号を送信する送 信手段と、発信側からの符号化動画像信号を受信する受 信手段と、受信した符号化動画像信号を復号する復号手 段とをそれぞれ備えた発信側及び受信側からなる動画像 伝送装置において、受信側は、前記復号手段により復号 された動画像信号で修正を施した部分を前記符号化手段 により符号化し、前記送信 (video data) 手段により該発信側へ送信 し、該発信側は、受信した前記修正を施した部分の符号 化信号を前記復号手段により復号し、復号した前記修正 を施した部分を前記参照画像へ書込むことを特徴とする 動画像伝送装置。

US9930365B2
CLAIM 7
. A device for decoding video data (前記送信) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (参照画) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
JPH07274176A
CLAIM 1
【請求項1】 動画像信号を参照画 (intra-prediction mode) 像を用いて符号化す る符号化手段と (second syntax) 、符号化された動画像信号を送信する送 信手段と、発信側からの符号化動画像信号を受信する受 信手段と、受信した符号化動画像信号を復号する復号手 段とをそれぞれ備えた発信側及び受信側からなる動画像 伝送装置において、受信側は、前記復号手段により復号 された動画像信号で修正を施した部分を前記符号化手段 により符号化し、前記送信 (video data) 手段により該発信側へ送信 し、該発信側は、受信した前記修正を施した部分の符号 化信号を前記復号手段により復号し、復号した前記修正 を施した部分を前記参照画像へ書込むことを特徴とする 動画像伝送装置。

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (参照画) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
JPH07274176A
CLAIM 1
【請求項1】 動画像信号を参照画 (intra-prediction mode) 像を用いて符号化す る符号化手段と、符号化された動画像信号を送信する送 信手段と、発信側からの符号化動画像信号を受信する受 信手段と、受信した符号化動画像信号を復号する復号手 段とをそれぞれ備えた発信側及び受信側からなる動画像 伝送装置において、受信側は、前記復号手段により復号 された動画像信号で修正を施した部分を前記符号化手段 により符号化し、前記送信手段により該発信側へ送信 し、該発信側は、受信した前記修正を施した部分の符号 化信号を前記復号手段により復号し、復号した前記修正 を施した部分を前記参照画像へ書込むことを特徴とする 動画像伝送装置。

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (前記送信) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (手段と) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (参照画) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
JPH07274176A
CLAIM 1
【請求項1】 動画像信号を参照画 (intra-prediction mode) 像を用いて符号化す る符号化手段と (second syntax) 、符号化された動画像信号を送信する送 信手段と、発信側からの符号化動画像信号を受信する受 信手段と、受信した符号化動画像信号を復号する復号手 段とをそれぞれ備えた発信側及び受信側からなる動画像 伝送装置において、受信側は、前記復号手段により復号 された動画像信号で修正を施した部分を前記符号化手段 により符号化し、前記送信 (video data) 手段により該発信側へ送信 し、該発信側は、受信した前記修正を施した部分の符号 化信号を前記復号手段により復号し、復号した前記修正 を施した部分を前記参照画像へ書込むことを特徴とする 動画像伝送装置。




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
EP0651574A1

Filed: 1994-03-24     Issued: 1995-05-03

Method and apparatus for coding/decoding motion vector, and method and apparatus for coding/decoding image signal

(Original Assignee) Sony Corp     (Current Assignee) Sony Corp

Motoki Sony Corporation Kato
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one (frame prediction) of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
EP0651574A1
CLAIM 2
A motion vector coding method (encoding mode) as set forth in claim 1 ,    wherein the memories selected on the basis of the order of transmission are sequentially updated by the current motion vectors in the predetermined direction .

EP0651574A1
CLAIM 14
A picture signal coding method comprising the step of :    coding an input picture signal every macroblock on the basis of a plurality of modes ,    whereby when picture coding type in the plurality of modes is P picture and picture structure in the plurality of modes is frame structure ,    the method further comprises the step of :    judging , every coded macroblock , whether or not there are satisfied the conditions where    predictive error signal is zero ,    motion vector is zero ,    motion compensation mode in the plurality of modes is the forward prediction , and    prediction type in the plurality of modes is frame prediction (encoding mode comprises one) ,    thus to process the coded macroblock as a skipped block when said respective conditions are satisfied .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
EP0651574A1
CLAIM 2
A motion vector coding method (encoding mode) as set forth in claim 1 ,    wherein the memories selected on the basis of the order of transmission are sequentially updated by the current motion vectors in the predetermined direction .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one (frame prediction) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
EP0651574A1
CLAIM 2
A motion vector coding method (encoding mode) as set forth in claim 1 ,    wherein the memories selected on the basis of the order of transmission are sequentially updated by the current motion vectors in the predetermined direction .

EP0651574A1
CLAIM 14
A picture signal coding method comprising the step of :    coding an input picture signal every macroblock on the basis of a plurality of modes ,    whereby when picture coding type in the plurality of modes is P picture and picture structure in the plurality of modes is frame structure ,    the method further comprises the step of :    judging , every coded macroblock , whether or not there are satisfied the conditions where    predictive error signal is zero ,    motion vector is zero ,    motion compensation mode in the plurality of modes is the forward prediction , and    prediction type in the plurality of modes is frame prediction (encoding mode comprises one) ,    thus to process the coded macroblock as a skipped block when said respective conditions are satisfied .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
EP0651574A1
CLAIM 2
A motion vector coding method (encoding mode) as set forth in claim 1 ,    wherein the memories selected on the basis of the order of transmission are sequentially updated by the current motion vectors in the predetermined direction .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one (frame prediction) of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
EP0651574A1
CLAIM 2
A motion vector coding method (encoding mode) as set forth in claim 1 ,    wherein the memories selected on the basis of the order of transmission are sequentially updated by the current motion vectors in the predetermined direction .

EP0651574A1
CLAIM 14
A picture signal coding method comprising the step of :    coding an input picture signal every macroblock on the basis of a plurality of modes ,    whereby when picture coding type in the plurality of modes is P picture and picture structure in the plurality of modes is frame structure ,    the method further comprises the step of :    judging , every coded macroblock , whether or not there are satisfied the conditions where    predictive error signal is zero ,    motion vector is zero ,    motion compensation mode in the plurality of modes is the forward prediction , and    prediction type in the plurality of modes is frame prediction (encoding mode comprises one) ,    thus to process the coded macroblock as a skipped block when said respective conditions are satisfied .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
EP0651574A1
CLAIM 2
A motion vector coding method (encoding mode) as set forth in claim 1 ,    wherein the memories selected on the basis of the order of transmission are sequentially updated by the current motion vectors in the predetermined direction .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
EP0614318A2

Filed: 1994-03-03     Issued: 1994-09-07

Video encoder, video decoder, and video motion estimation apparatus

(Original Assignee) Toshiba Corp     (Current Assignee) Toshiba Corp

Kenshi C/O Intellectual Prop. Div. Dachiku, Shogo C/O Intellectual Prop. Div. Yamaguchi, Kazuo C/O Intellectual Prop. Div. Ozeki, Katsumi C/O Intellectual Prop. Div. Takahashi, Mitsunori C/O Intellectual Prop. Div. Omokawa, Takaaki C/O Intellectual Prop. Div. Kuratate
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
EP0614318A2
CLAIM 1
A video encoding (second syntax) apparatus comprising    a moving object analyzing means for extracting a moving object from an input video signal , analyzing its motion and outputing a residual signal relative to a reconstruction picture and motion parameters ,    a residual coding means for encoding said residual signal from said moving object analyzing means to form an encoded residual data ,    a reconstruction means for reconstructing a picture image using said motion parameters from said moving object analyzing means and said encoded residual data from said residual coding means , and    a variable length coding means which performs a variable length coding of said motion parameters from said moving object analyzing means and said encoded residual data from said residual coding means ,    wherein said moving object analyzing means comprising    a global motion estimating means for estimating a global motion parameter of the moving region within the input picture ,    a polygonal patch motion estimating means for estimating a polygonal patch motion parameters by applying polygonal patches to the moving region within the input picture or reconstructed picture , and    a motion compensating means for performing a motion compensation of the reconstructed picture using the global motion parameter estimated by said global motion estimating means and the polygonal patch motion parameteres estimated by said polygonal patch motion estimating means .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks (transforming means, motion prediction) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
EP0614318A2
CLAIM 1
A video encoding (second syntax) apparatus comprising    a moving object analyzing means for extracting a moving object from an input video signal , analyzing its motion and outputing a residual signal relative to a reconstruction picture and motion parameters ,    a residual coding means for encoding said residual signal from said moving object analyzing means to form an encoded residual data ,    a reconstruction means for reconstructing a picture image using said motion parameters from said moving object analyzing means and said encoded residual data from said residual coding means , and    a variable length coding means which performs a variable length coding of said motion parameters from said moving object analyzing means and said encoded residual data from said residual coding means ,    wherein said moving object analyzing means comprising    a global motion estimating means for estimating a global motion parameter of the moving region within the input picture ,    a polygonal patch motion estimating means for estimating a polygonal patch motion parameters by applying polygonal patches to the moving region within the input picture or reconstructed picture , and    a motion compensating means for performing a motion compensation of the reconstructed picture using the global motion parameter estimated by said global motion estimating means and the polygonal patch motion parameteres estimated by said polygonal patch motion estimating means .

EP0614318A2
CLAIM 9
The video encoding apparatus of claim 8 wherein said residual coding means performs a motion prediction (video blocks) , in a further encoding of said residual coded signal on a pixel basis for each block , from already coded pixels in the adjacent blocks and the particular block .

EP0614318A2
CLAIM 11
A video encoding apparatus comprising    a prediction means for performing a prediction by taking a difference in time direction of input picture signals ,    a selection means for selecting one transform from among a plurality of transforms ,    a transforming means (video blocks) for transforming and quantizing a prediction residual signal from said prediction means according to the transform selected by said selecting means ,    a coding means for coding and outputing the quantized signal from said transforming means ,    a dequantizing means for dequantizing the quantized signal from said transforming means and    an inverse transforming means for inverse transforming the dequantized signal from said dequantizing means according to an inverse transform corresponding to said selected transform to give a local decoded signal .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions (frame memory) that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax (video encoding) element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
EP0614318A2
CLAIM 1
A video encoding (second syntax) apparatus comprising    a moving object analyzing means for extracting a moving object from an input video signal , analyzing its motion and outputing a residual signal relative to a reconstruction picture and motion parameters ,    a residual coding means for encoding said residual signal from said moving object analyzing means to form an encoded residual data ,    a reconstruction means for reconstructing a picture image using said motion parameters from said moving object analyzing means and said encoded residual data from said residual coding means , and    a variable length coding means which performs a variable length coding of said motion parameters from said moving object analyzing means and said encoded residual data from said residual coding means ,    wherein said moving object analyzing means comprising    a global motion estimating means for estimating a global motion parameter of the moving region within the input picture ,    a polygonal patch motion estimating means for estimating a polygonal patch motion parameters by applying polygonal patches to the moving region within the input picture or reconstructed picture , and    a motion compensating means for performing a motion compensation of the reconstructed picture using the global motion parameter estimated by said global motion estimating means and the polygonal patch motion parameteres estimated by said polygonal patch motion estimating means .

EP0614318A2
CLAIM 7
A video decoding apparatus comprising a variable length decoding means for receiving variable length-coded data , decoding the coded data and resolving it into a global motion parameter , polygonal patch motion parameters and a residual code ,    a residual code decoding means for decoding the residual code from said variable length decoding means and outputs a residual signal ,    a global motion compensating means for performing a motion compensation of the entire moving region using the decoded picture of a preceding frame stored in a frame memory (thereon instructions) and the global motion parameter output from said variable length decoding means and outputs a global motion compensated image ,    a polygonal patch compensating means for performing a local motion compensation within the moving region using the global motion compensated image output from said global motion compensating means and the polygonal patch motion parameter output from said variable length decoding means and outputs a motion compensated prediction image , and    an adding means for adding the motion compensated prediction image from said polygonal patch compensating means and the residual code output from said residual code decoding means to construct a decoded picture and outputs the decoded picutre to a frame memory .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
EP0607042A2

Filed: 1994-01-13     Issued: 1994-07-20

Error concealment apparatus for video signal processors

(Original Assignee) RCA Licensing Corp     (Current Assignee) RCA Licensing Corp

Wilson Kwok, Huifang Sun
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (background pixel, mean value) of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (background pixel, mean value) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (bad block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
EP0607042A2
CLAIM 4
The apparatus set forth in claim 2 characterized in that said means for selectively combining includes means for differentiating between foreground and background pixel (minimum size, maximum size) data ;
for differentiating between bright and dark foreground pixel data ;
and for selecting pixel data representing the brightest (darkest) foreground feature from corresponding coordinates of said interpolated blocks containing bright (dark) foreground pixel data to fill in a corresponding coordinate of said further block .

EP0607042A2
CLAIM 5
The apparatus set forth in claim 1 characterized in that said means responsive to said indicators Di for generating respective blocks of image data includes :    means for performing a spatial to frequency transform on a block of said pixel data adjacent said bad block (current block) , to generate a matrix (block) of transform coefficients , wherein respective transform coefficients represent different directional image information ;
   means , responsive to said indicators Di , for forming respective intermediate matrices (blocks) , each intermediate matrix formed by selection of ones of said transform coefficients which represent similar directionally oriented image information .

EP0607042A2
CLAIM 10
The apparatus set forth in claim 8 characterized in that said means for selectively non additively combining includes ;
   means for determining the mean value (minimum size, maximum size) of pixel data for respective interpolated blocks , and selecting pixel data greater and lesser than said mean value by a predetermined amount to represent foreground features .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (background pixel, mean value) and greater than the minimum size (background pixel, mean value) .
EP0607042A2
CLAIM 4
The apparatus set forth in claim 2 characterized in that said means for selectively combining includes means for differentiating between foreground and background pixel (minimum size, maximum size) data ;
for differentiating between bright and dark foreground pixel data ;
and for selecting pixel data representing the brightest (darkest) foreground feature from corresponding coordinates of said interpolated blocks containing bright (dark) foreground pixel data to fill in a corresponding coordinate of said further block .

EP0607042A2
CLAIM 10
The apparatus set forth in claim 8 characterized in that said means for selectively non additively combining includes ;
   means for determining the mean value (minimum size, maximum size) of pixel data for respective interpolated blocks , and selecting pixel data greater and lesser than said mean value by a predetermined amount to represent foreground features .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (bad block) .
EP0607042A2
CLAIM 5
The apparatus set forth in claim 1 characterized in that said means responsive to said indicators Di for generating respective blocks of image data includes :    means for performing a spatial to frequency transform on a block of said pixel data adjacent said bad block (current block) , to generate a matrix (block) of transform coefficients , wherein respective transform coefficients represent different directional image information ;
   means , responsive to said indicators Di , for forming respective intermediate matrices (blocks) , each intermediate matrix formed by selection of ones of said transform coefficients which represent similar directionally oriented image information .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (bad block) has a size of at least 64×64 pixels .
EP0607042A2
CLAIM 5
The apparatus set forth in claim 1 characterized in that said means responsive to said indicators Di for generating respective blocks of image data includes :    means for performing a spatial to frequency transform on a block of said pixel data adjacent said bad block (current block) , to generate a matrix (block) of transform coefficients , wherein respective transform coefficients represent different directional image information ;
   means , responsive to said indicators Di , for forming respective intermediate matrices (blocks) , each intermediate matrix formed by selection of ones of said transform coefficients which represent similar directionally oriented image information .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size (background pixel, mean value) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (background pixel, mean value) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (bad block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
EP0607042A2
CLAIM 4
The apparatus set forth in claim 2 characterized in that said means for selectively combining includes means for differentiating between foreground and background pixel (minimum size, maximum size) data ;
for differentiating between bright and dark foreground pixel data ;
and for selecting pixel data representing the brightest (darkest) foreground feature from corresponding coordinates of said interpolated blocks containing bright (dark) foreground pixel data to fill in a corresponding coordinate of said further block .

EP0607042A2
CLAIM 5
The apparatus set forth in claim 1 characterized in that said means responsive to said indicators Di for generating respective blocks of image data includes :    means for performing a spatial to frequency transform on a block of said pixel data adjacent said bad block (current block) , to generate a matrix (block) of transform coefficients , wherein respective transform coefficients represent different directional image information ;
   means , responsive to said indicators Di , for forming respective intermediate matrices (blocks) , each intermediate matrix formed by selection of ones of said transform coefficients which represent similar directionally oriented image information .

EP0607042A2
CLAIM 10
The apparatus set forth in claim 8 characterized in that said means for selectively non additively combining includes ;
   means for determining the mean value (minimum size, maximum size) of pixel data for respective interpolated blocks , and selecting pixel data greater and lesser than said mean value by a predetermined amount to represent foreground features .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (background pixel, mean value) and greater than the minimum size (background pixel, mean value) .
EP0607042A2
CLAIM 4
The apparatus set forth in claim 2 characterized in that said means for selectively combining includes means for differentiating between foreground and background pixel (minimum size, maximum size) data ;
for differentiating between bright and dark foreground pixel data ;
and for selecting pixel data representing the brightest (darkest) foreground feature from corresponding coordinates of said interpolated blocks containing bright (dark) foreground pixel data to fill in a corresponding coordinate of said further block .

EP0607042A2
CLAIM 10
The apparatus set forth in claim 8 characterized in that said means for selectively non additively combining includes ;
   means for determining the mean value (minimum size, maximum size) of pixel data for respective interpolated blocks , and selecting pixel data greater and lesser than said mean value by a predetermined amount to represent foreground features .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (bad block) .
EP0607042A2
CLAIM 5
The apparatus set forth in claim 1 characterized in that said means responsive to said indicators Di for generating respective blocks of image data includes :    means for performing a spatial to frequency transform on a block of said pixel data adjacent said bad block (current block) , to generate a matrix (block) of transform coefficients , wherein respective transform coefficients represent different directional image information ;
   means , responsive to said indicators Di , for forming respective intermediate matrices (blocks) , each intermediate matrix formed by selection of ones of said transform coefficients which represent similar directionally oriented image information .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (bad block) has a size of at least 64×64 pixels .
EP0607042A2
CLAIM 5
The apparatus set forth in claim 1 characterized in that said means responsive to said indicators Di for generating respective blocks of image data includes :    means for performing a spatial to frequency transform on a block of said pixel data adjacent said bad block (current block) , to generate a matrix (block) of transform coefficients , wherein respective transform coefficients represent different directional image information ;
   means , responsive to said indicators Di , for forming respective intermediate matrices (blocks) , each intermediate matrix formed by selection of ones of said transform coefficients which represent similar directionally oriented image information .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size (background pixel, mean value) of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (background pixel, mean value) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (bad block) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
EP0607042A2
CLAIM 4
The apparatus set forth in claim 2 characterized in that said means for selectively combining includes means for differentiating between foreground and background pixel (minimum size, maximum size) data ;
for differentiating between bright and dark foreground pixel data ;
and for selecting pixel data representing the brightest (darkest) foreground feature from corresponding coordinates of said interpolated blocks containing bright (dark) foreground pixel data to fill in a corresponding coordinate of said further block .

EP0607042A2
CLAIM 5
The apparatus set forth in claim 1 characterized in that said means responsive to said indicators Di for generating respective blocks of image data includes :    means for performing a spatial to frequency transform on a block of said pixel data adjacent said bad block (current block) , to generate a matrix (block) of transform coefficients , wherein respective transform coefficients represent different directional image information ;
   means , responsive to said indicators Di , for forming respective intermediate matrices (blocks) , each intermediate matrix formed by selection of ones of said transform coefficients which represent similar directionally oriented image information .

EP0607042A2
CLAIM 10
The apparatus set forth in claim 8 characterized in that said means for selectively non additively combining includes ;
   means for determining the mean value (minimum size, maximum size) of pixel data for respective interpolated blocks , and selecting pixel data greater and lesser than said mean value by a predetermined amount to represent foreground features .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (background pixel, mean value) and greater than the minimum size (background pixel, mean value) .
EP0607042A2
CLAIM 4
The apparatus set forth in claim 2 characterized in that said means for selectively combining includes means for differentiating between foreground and background pixel (minimum size, maximum size) data ;
for differentiating between bright and dark foreground pixel data ;
and for selecting pixel data representing the brightest (darkest) foreground feature from corresponding coordinates of said interpolated blocks containing bright (dark) foreground pixel data to fill in a corresponding coordinate of said further block .

EP0607042A2
CLAIM 10
The apparatus set forth in claim 8 characterized in that said means for selectively non additively combining includes ;
   means for determining the mean value (minimum size, maximum size) of pixel data for respective interpolated blocks , and selecting pixel data greater and lesser than said mean value by a predetermined amount to represent foreground features .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (bad block) .
EP0607042A2
CLAIM 5
The apparatus set forth in claim 1 characterized in that said means responsive to said indicators Di for generating respective blocks of image data includes :    means for performing a spatial to frequency transform on a block of said pixel data adjacent said bad block (current block) , to generate a matrix (block) of transform coefficients , wherein respective transform coefficients represent different directional image information ;
   means , responsive to said indicators Di , for forming respective intermediate matrices (blocks) , each intermediate matrix formed by selection of ones of said transform coefficients which represent similar directionally oriented image information .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (bad block) has a size of at least 64×64 pixels .
EP0607042A2
CLAIM 5
The apparatus set forth in claim 1 characterized in that said means responsive to said indicators Di for generating respective blocks of image data includes :    means for performing a spatial to frequency transform on a block of said pixel data adjacent said bad block (current block) , to generate a matrix (block) of transform coefficients , wherein respective transform coefficients represent different directional image information ;
   means , responsive to said indicators Di , for forming respective intermediate matrices (blocks) , each intermediate matrix formed by selection of ones of said transform coefficients which represent similar directionally oriented image information .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5461421A

Filed: 1993-11-29     Issued: 1995-10-24

Encoding and decoding method and apparatus thereof

(Original Assignee) Samsung Electronics Co Ltd     (Current Assignee) Samsung Electronics Co Ltd

Heon H. Moon
US9930365B2
CLAIM 1
. A method of decoding video data (inverse discrete cosine transform, respective frame, encoded video data) (inverse discrete cosine transform, respective frame, encoded video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5461421A
CLAIM 1
. An encoding apparatus for video data producing an interlace scanning method comprising : means for arranging input block (encoding mode comprises one) data having a predetermined magnitude into block data of a frame pattern and a field pattern ;
first means receiving the block data of the frame pattern and the block data of the field pattern , for discrete-cosine-transforming the respective received data and for quantizing the discrete-cosine-transformed data to produce first and second data ;
second means receiving the block data of the frame pattern , the block data of the field pattern and respective motion compensated data of frame and field patterns , for discrete-cosine-transforming the respective received data and for quantizing the discrete-cosine-transformed data to produce third and fourth data ;
means for receiving the quantized block first through fourth data and selecting a mode according to the number of generated corresponding code bits ;
third means for selecting any one among the quantized first through fourth data according to the mode signal selected in the mode selection means and for performing a direct current differential pulse code modulation (DC DPCM) operation , means for variable-length-coding the direct current differential pulse code modulated data and outputting the variable-length-coded data ;
and means for restoring the direct current differential pulse code modulated data .

US5461421A
CLAIM 9
. An encoder apparatus according to claim 8 , wherein said video restoring unit further comprises : a block delay for delaying the mode selection signal by one block interval ;
means for delaying a motion compensated data representative of said second means output and arranged in a respective frame (video blocks, decoding video data, video data) /field pattern so as to temporally synchronize said inverse DCT portion output data ;
a block DC calculator receiving delayed frame/field arranged motion compensated data for calculating an average DC value of each block data ;
means for receiving said DC detector output data and said average DC value for outputting a restored DC value for each block data ;
and adder means for selectively adding said motion compensator or the restored DC value to said inverse DCT portion output data according to the mode selection signal .

US5461421A
CLAIM 17
. An encoding method (encoding mode) for dividing video data processed according to an interlace scanning method into blocks , each of which has a predetermined magnitude , and encoding the respective divided block data , said method comprising the steps of : (a) discriminatively arranging the block data into block data of a frame pattern and a field pattern ;
(b) receiving the block data of the frame pattern and the block data of the field pattern , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(c) receiving the block data of the frame pattern , the block data of the field pattern and motion compensated data , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(d) accumulatively counting the number of bits of the block unit according to the respective run-length level which is generated during said steps (b) and (c) ;
(e) comparing the respective count results with one another so as to select a mode which generated a smallest number of the bits ;
(f) delaying the first and second quantized data by a predetermined time during the mode selection according to steps (d) and (e) ;
(g) selectively outputting the delayed quantized data according to the selected mode ;
(h) modulating by direct current (DC) differential pulse code modulation (DCPM) data corresponding to the selected mode ;
(i) coding the DC DPCM data ;
and (j) restoring the DC DPCM data .

US5461421A
CLAIM 21
. A decoding apparatus for decoding inversely quantized video data into a state prior to being encoded , by using an inverse quantizer for inverse-quantizing video data which is divided into blocks each of which has a predetermined magnitude and is encoded by intramode processing or intermode processing according to transmitted quantization level , and motion compensation means , the decoding apparatus comprising : input means to which encoded data is input ;
direct current (DC) detection means receiving output data of the inverse quantizer supplied through the input means , a mode selection signal representing said intramode processing or intermode processing and a block DC latch signal representing a block interval of the block data , for detecting DC values of the respective block data ;
and video restoring means receiving output data of an inverse discrete cosine transform (video blocks, decoding video data, video data) (DCT) portion , which transforms inverse quantizer output data into a state prior to being discrete-cosine-transformed , DC data output from the DC detection means , motion compensation means output data , the mode selection signal and the block DC selection signal , for outputting an original video signal of which the DC level is restored .

US5461421A
CLAIM 33
. A decoding method for receiving encoded video data (video blocks, decoding video data, video data) and decoding the received encoded video data into an original signal prior to being encoded , the decoding method comprising the steps of : (a) detecting direct current values of respective block data from the received video data (b) motion-compensating restored block data in correspondence to a motion vector which is calculated from a motion compensator in an encoding apparatus ;
(c) discriminatively arranging the motion compensated data into frame and field patterns ;
and (d) receiving the DC data and the motion compensated frame/field patterned data and restoring and outputting an original video signal of which the DC level is restored .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US5461421A
CLAIM 17
. An encoding method (encoding mode) for dividing video data processed according to an interlace scanning method into blocks , each of which has a predetermined magnitude , and encoding the respective divided block data , said method comprising the steps of : (a) discriminatively arranging the block data into block data of a frame pattern and a field pattern ;
(b) receiving the block data of the frame pattern and the block data of the field pattern , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(c) receiving the block data of the frame pattern , the block data of the field pattern and motion compensated data , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(d) accumulatively counting the number of bits of the block unit according to the respective run-length level which is generated during said steps (b) and (c) ;
(e) comparing the respective count results with one another so as to select a mode which generated a smallest number of the bits ;
(f) delaying the first and second quantized data by a predetermined time during the mode selection according to steps (d) and (e) ;
(g) selectively outputting the delayed quantized data according to the selected mode ;
(h) modulating by direct current (DC) differential pulse code modulation (DCPM) data corresponding to the selected mode ;
(i) coding the DC DPCM data ;
and (j) restoring the DC DPCM data .

US9930365B2
CLAIM 7
. A device for decoding video data (inverse discrete cosine transform, respective frame, encoded video data) (inverse discrete cosine transform, respective frame, encoded video data) , the device comprising : a memory configured to store decoded video blocks (inverse discrete cosine transform, respective frame, encoded video data) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5461421A
CLAIM 1
. An encoding apparatus for video data producing an interlace scanning method comprising : means for arranging input block (encoding mode comprises one) data having a predetermined magnitude into block data of a frame pattern and a field pattern ;
first means receiving the block data of the frame pattern and the block data of the field pattern , for discrete-cosine-transforming the respective received data and for quantizing the discrete-cosine-transformed data to produce first and second data ;
second means receiving the block data of the frame pattern , the block data of the field pattern and respective motion compensated data of frame and field patterns , for discrete-cosine-transforming the respective received data and for quantizing the discrete-cosine-transformed data to produce third and fourth data ;
means for receiving the quantized block first through fourth data and selecting a mode according to the number of generated corresponding code bits ;
third means for selecting any one among the quantized first through fourth data according to the mode signal selected in the mode selection means and for performing a direct current differential pulse code modulation (DC DPCM) operation , means for variable-length-coding the direct current differential pulse code modulated data and outputting the variable-length-coded data ;
and means for restoring the direct current differential pulse code modulated data .

US5461421A
CLAIM 9
. An encoder apparatus according to claim 8 , wherein said video restoring unit further comprises : a block delay for delaying the mode selection signal by one block interval ;
means for delaying a motion compensated data representative of said second means output and arranged in a respective frame (video blocks, decoding video data, video data) /field pattern so as to temporally synchronize said inverse DCT portion output data ;
a block DC calculator receiving delayed frame/field arranged motion compensated data for calculating an average DC value of each block data ;
means for receiving said DC detector output data and said average DC value for outputting a restored DC value for each block data ;
and adder means for selectively adding said motion compensator or the restored DC value to said inverse DCT portion output data according to the mode selection signal .

US5461421A
CLAIM 17
. An encoding method (encoding mode) for dividing video data processed according to an interlace scanning method into blocks , each of which has a predetermined magnitude , and encoding the respective divided block data , said method comprising the steps of : (a) discriminatively arranging the block data into block data of a frame pattern and a field pattern ;
(b) receiving the block data of the frame pattern and the block data of the field pattern , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(c) receiving the block data of the frame pattern , the block data of the field pattern and motion compensated data , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(d) accumulatively counting the number of bits of the block unit according to the respective run-length level which is generated during said steps (b) and (c) ;
(e) comparing the respective count results with one another so as to select a mode which generated a smallest number of the bits ;
(f) delaying the first and second quantized data by a predetermined time during the mode selection according to steps (d) and (e) ;
(g) selectively outputting the delayed quantized data according to the selected mode ;
(h) modulating by direct current (DC) differential pulse code modulation (DCPM) data corresponding to the selected mode ;
(i) coding the DC DPCM data ;
and (j) restoring the DC DPCM data .

US5461421A
CLAIM 21
. A decoding apparatus for decoding inversely quantized video data into a state prior to being encoded , by using an inverse quantizer for inverse-quantizing video data which is divided into blocks each of which has a predetermined magnitude and is encoded by intramode processing or intermode processing according to transmitted quantization level , and motion compensation means , the decoding apparatus comprising : input means to which encoded data is input ;
direct current (DC) detection means receiving output data of the inverse quantizer supplied through the input means , a mode selection signal representing said intramode processing or intermode processing and a block DC latch signal representing a block interval of the block data , for detecting DC values of the respective block data ;
and video restoring means receiving output data of an inverse discrete cosine transform (video blocks, decoding video data, video data) (DCT) portion , which transforms inverse quantizer output data into a state prior to being discrete-cosine-transformed , DC data output from the DC detection means , motion compensation means output data , the mode selection signal and the block DC selection signal , for outputting an original video signal of which the DC level is restored .

US5461421A
CLAIM 33
. A decoding method for receiving encoded video data (video blocks, decoding video data, video data) and decoding the received encoded video data into an original signal prior to being encoded , the decoding method comprising the steps of : (a) detecting direct current values of respective block data from the received video data (b) motion-compensating restored block data in correspondence to a motion vector which is calculated from a motion compensator in an encoding apparatus ;
(c) discriminatively arranging the motion compensated data into frame and field patterns ;
and (d) receiving the DC data and the motion compensated frame/field patterned data and restoring and outputting an original video signal of which the DC level is restored .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US5461421A
CLAIM 17
. An encoding method (encoding mode) for dividing video data processed according to an interlace scanning method into blocks , each of which has a predetermined magnitude , and encoding the respective divided block data , said method comprising the steps of : (a) discriminatively arranging the block data into block data of a frame pattern and a field pattern ;
(b) receiving the block data of the frame pattern and the block data of the field pattern , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(c) receiving the block data of the frame pattern , the block data of the field pattern and motion compensated data , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(d) accumulatively counting the number of bits of the block unit according to the respective run-length level which is generated during said steps (b) and (c) ;
(e) comparing the respective count results with one another so as to select a mode which generated a smallest number of the bits ;
(f) delaying the first and second quantized data by a predetermined time during the mode selection according to steps (d) and (e) ;
(g) selectively outputting the delayed quantized data according to the selected mode ;
(h) modulating by direct current (DC) differential pulse code modulation (DCPM) data corresponding to the selected mode ;
(i) coding the DC DPCM data ;
and (j) restoring the DC DPCM data .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (inverse discrete cosine transform, respective frame, encoded video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5461421A
CLAIM 1
. An encoding apparatus for video data producing an interlace scanning method comprising : means for arranging input block (encoding mode comprises one) data having a predetermined magnitude into block data of a frame pattern and a field pattern ;
first means receiving the block data of the frame pattern and the block data of the field pattern , for discrete-cosine-transforming the respective received data and for quantizing the discrete-cosine-transformed data to produce first and second data ;
second means receiving the block data of the frame pattern , the block data of the field pattern and respective motion compensated data of frame and field patterns , for discrete-cosine-transforming the respective received data and for quantizing the discrete-cosine-transformed data to produce third and fourth data ;
means for receiving the quantized block first through fourth data and selecting a mode according to the number of generated corresponding code bits ;
third means for selecting any one among the quantized first through fourth data according to the mode signal selected in the mode selection means and for performing a direct current differential pulse code modulation (DC DPCM) operation , means for variable-length-coding the direct current differential pulse code modulated data and outputting the variable-length-coded data ;
and means for restoring the direct current differential pulse code modulated data .

US5461421A
CLAIM 9
. An encoder apparatus according to claim 8 , wherein said video restoring unit further comprises : a block delay for delaying the mode selection signal by one block interval ;
means for delaying a motion compensated data representative of said second means output and arranged in a respective frame (video blocks, decoding video data, video data) /field pattern so as to temporally synchronize said inverse DCT portion output data ;
a block DC calculator receiving delayed frame/field arranged motion compensated data for calculating an average DC value of each block data ;
means for receiving said DC detector output data and said average DC value for outputting a restored DC value for each block data ;
and adder means for selectively adding said motion compensator or the restored DC value to said inverse DCT portion output data according to the mode selection signal .

US5461421A
CLAIM 17
. An encoding method (encoding mode) for dividing video data processed according to an interlace scanning method into blocks , each of which has a predetermined magnitude , and encoding the respective divided block data , said method comprising the steps of : (a) discriminatively arranging the block data into block data of a frame pattern and a field pattern ;
(b) receiving the block data of the frame pattern and the block data of the field pattern , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(c) receiving the block data of the frame pattern , the block data of the field pattern and motion compensated data , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(d) accumulatively counting the number of bits of the block unit according to the respective run-length level which is generated during said steps (b) and (c) ;
(e) comparing the respective count results with one another so as to select a mode which generated a smallest number of the bits ;
(f) delaying the first and second quantized data by a predetermined time during the mode selection according to steps (d) and (e) ;
(g) selectively outputting the delayed quantized data according to the selected mode ;
(h) modulating by direct current (DC) differential pulse code modulation (DCPM) data corresponding to the selected mode ;
(i) coding the DC DPCM data ;
and (j) restoring the DC DPCM data .

US5461421A
CLAIM 21
. A decoding apparatus for decoding inversely quantized video data into a state prior to being encoded , by using an inverse quantizer for inverse-quantizing video data which is divided into blocks each of which has a predetermined magnitude and is encoded by intramode processing or intermode processing according to transmitted quantization level , and motion compensation means , the decoding apparatus comprising : input means to which encoded data is input ;
direct current (DC) detection means receiving output data of the inverse quantizer supplied through the input means , a mode selection signal representing said intramode processing or intermode processing and a block DC latch signal representing a block interval of the block data , for detecting DC values of the respective block data ;
and video restoring means receiving output data of an inverse discrete cosine transform (video blocks, decoding video data, video data) (DCT) portion , which transforms inverse quantizer output data into a state prior to being discrete-cosine-transformed , DC data output from the DC detection means , motion compensation means output data , the mode selection signal and the block DC selection signal , for outputting an original video signal of which the DC level is restored .

US5461421A
CLAIM 33
. A decoding method for receiving encoded video data (video blocks, decoding video data, video data) and decoding the received encoded video data into an original signal prior to being encoded , the decoding method comprising the steps of : (a) detecting direct current values of respective block data from the received video data (b) motion-compensating restored block data in correspondence to a motion vector which is calculated from a motion compensator in an encoding apparatus ;
(c) discriminatively arranging the motion compensated data into frame and field patterns ;
and (d) receiving the DC data and the motion compensated frame/field patterned data and restoring and outputting an original video signal of which the DC level is restored .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US5461421A
CLAIM 17
. An encoding method (encoding mode) for dividing video data processed according to an interlace scanning method into blocks , each of which has a predetermined magnitude , and encoding the respective divided block data , said method comprising the steps of : (a) discriminatively arranging the block data into block data of a frame pattern and a field pattern ;
(b) receiving the block data of the frame pattern and the block data of the field pattern , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(c) receiving the block data of the frame pattern , the block data of the field pattern and motion compensated data , discrete-cosine-transforming the respective received data and quantizing the discrete-cosine-transformed data ;
(d) accumulatively counting the number of bits of the block unit according to the respective run-length level which is generated during said steps (b) and (c) ;
(e) comparing the respective count results with one another so as to select a mode which generated a smallest number of the bits ;
(f) delaying the first and second quantized data by a predetermined time during the mode selection according to steps (d) and (e) ;
(g) selectively outputting the delayed quantized data according to the selected mode ;
(h) modulating by direct current (DC) differential pulse code modulation (DCPM) data corresponding to the selected mode ;
(i) coding the DC DPCM data ;
and (j) restoring the DC DPCM data .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5422676A

Filed: 1993-10-22     Issued: 1995-06-06

System for coding an image representative signal

(Original Assignee) Deutsche Thomson Brandt GmbH     

Carsten Herpel, Heinz-Werner Keesen
US9930365B2
CLAIM 1
. A method of decoding video data (image signal) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block (second blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal (video data) so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (current block) .

US5422676A
CLAIM 8
. A method for decoding a signal representing an image comprising picture element (pixel) blocks of predetermined size , said signal having been coded using a hybrid coder suitable for transforming blocks of scanned pixels , said coding comprising the steps of : (a) transforming progressively scanned pixel blocks independent of motion , (b) transforming interlaced scanned pixel blocks with an original line structure in the presence of a static image ;
(c) rearranging the original line structure of an interlace scanned image signal in the presence of motion so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (d) transforming said first and second blocks in the presence of motion ;
said decoding method (encoding mode) comprising the steps of : inverse transforming progressively scanned pixel blocks independent of image motion ;
inverse transforming said interlace scanned pixel blocks with an original line structure in the presence of a static image ;
inverse transforming said first and second blocks in the presence of image motion ;
and rearranging said line structure of said interlace scanned image signal back to said original line structure after said inverse transforming in the presence of motion .

US9930365B2
CLAIM 3
. The method of claim 1 , further comprising decoding one or more syntax elements representative of partitioning for the current block (second blocks) .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (current block) .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US5422676A
CLAIM 8
. A method for decoding a signal representing an image comprising picture element (pixel) blocks of predetermined size , said signal having been coded using a hybrid coder suitable for transforming blocks of scanned pixels , said coding comprising the steps of : (a) transforming progressively scanned pixel blocks independent of motion , (b) transforming interlaced scanned pixel blocks with an original line structure in the presence of a static image ;
(c) rearranging the original line structure of an interlace scanned image signal in the presence of motion so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (d) transforming said first and second blocks in the presence of motion ;
said decoding method (encoding mode) comprising the steps of : inverse transforming progressively scanned pixel blocks independent of image motion ;
inverse transforming said interlace scanned pixel blocks with an original line structure in the presence of a static image ;
inverse transforming said first and second blocks in the presence of image motion ;
and rearranging said line structure of said interlace scanned image signal back to said original line structure after said inverse transforming in the presence of motion .

US9930365B2
CLAIM 5
. The method of claim 1 , wherein the current block (second blocks) has a size of at least 64×64 pixels .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (current block) .

US9930365B2
CLAIM 7
. A device for decoding video data (image signal) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (second blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal (video data) so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (current block) .

US5422676A
CLAIM 8
. A method for decoding a signal representing an image comprising picture element (pixel) blocks of predetermined size , said signal having been coded using a hybrid coder suitable for transforming blocks of scanned pixels , said coding comprising the steps of : (a) transforming progressively scanned pixel blocks independent of motion , (b) transforming interlaced scanned pixel blocks with an original line structure in the presence of a static image ;
(c) rearranging the original line structure of an interlace scanned image signal in the presence of motion so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (d) transforming said first and second blocks in the presence of motion ;
said decoding method (encoding mode) comprising the steps of : inverse transforming progressively scanned pixel blocks independent of image motion ;
inverse transforming said interlace scanned pixel blocks with an original line structure in the presence of a static image ;
inverse transforming said first and second blocks in the presence of image motion ;
and rearranging said line structure of said interlace scanned image signal back to said original line structure after said inverse transforming in the presence of motion .

US9930365B2
CLAIM 9
. The device of claim 7 , wherein the processor is further configured to decode one or more syntax elements representative of partitioning for the current block (second blocks) .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (current block) .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US5422676A
CLAIM 8
. A method for decoding a signal representing an image comprising picture element (pixel) blocks of predetermined size , said signal having been coded using a hybrid coder suitable for transforming blocks of scanned pixels , said coding comprising the steps of : (a) transforming progressively scanned pixel blocks independent of motion , (b) transforming interlaced scanned pixel blocks with an original line structure in the presence of a static image ;
(c) rearranging the original line structure of an interlace scanned image signal in the presence of motion so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (d) transforming said first and second blocks in the presence of motion ;
said decoding method (encoding mode) comprising the steps of : inverse transforming progressively scanned pixel blocks independent of image motion ;
inverse transforming said interlace scanned pixel blocks with an original line structure in the presence of a static image ;
inverse transforming said first and second blocks in the presence of image motion ;
and rearranging said line structure of said interlace scanned image signal back to said original line structure after said inverse transforming in the presence of motion .

US9930365B2
CLAIM 11
. The device of claim 7 , wherein the current block (second blocks) has a size of at least 64×64 pixels .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (current block) .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (image signal) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block (second blocks) of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal (video data) so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (current block) .

US5422676A
CLAIM 8
. A method for decoding a signal representing an image comprising picture element (pixel) blocks of predetermined size , said signal having been coded using a hybrid coder suitable for transforming blocks of scanned pixels , said coding comprising the steps of : (a) transforming progressively scanned pixel blocks independent of motion , (b) transforming interlaced scanned pixel blocks with an original line structure in the presence of a static image ;
(c) rearranging the original line structure of an interlace scanned image signal in the presence of motion so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (d) transforming said first and second blocks in the presence of motion ;
said decoding method (encoding mode) comprising the steps of : inverse transforming progressively scanned pixel blocks independent of image motion ;
inverse transforming said interlace scanned pixel blocks with an original line structure in the presence of a static image ;
inverse transforming said first and second blocks in the presence of image motion ;
and rearranging said line structure of said interlace scanned image signal back to said original line structure after said inverse transforming in the presence of motion .

US9930365B2
CLAIM 17
. The non-transitory computer-readable storage medium of claim 15 , further comprising instructions that cause the processor to decode one or more syntax elements representative of partitioning for the current block (second blocks) .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (current block) .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US5422676A
CLAIM 8
. A method for decoding a signal representing an image comprising picture element (pixel) blocks of predetermined size , said signal having been coded using a hybrid coder suitable for transforming blocks of scanned pixels , said coding comprising the steps of : (a) transforming progressively scanned pixel blocks independent of motion , (b) transforming interlaced scanned pixel blocks with an original line structure in the presence of a static image ;
(c) rearranging the original line structure of an interlace scanned image signal in the presence of motion so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (d) transforming said first and second blocks in the presence of motion ;
said decoding method (encoding mode) comprising the steps of : inverse transforming progressively scanned pixel blocks independent of image motion ;
inverse transforming said interlace scanned pixel blocks with an original line structure in the presence of a static image ;
inverse transforming said first and second blocks in the presence of image motion ;
and rearranging said line structure of said interlace scanned image signal back to said original line structure after said inverse transforming in the presence of motion .

US9930365B2
CLAIM 19
. The non-transitory computer-readable storage medium of claim 15 , wherein the current block (second blocks) has a size of at least 64×64 pixels .
US5422676A
CLAIM 1
. A method for coding a signal representing an image comprising picture element (pixel) blocks of predetermined size using a hybrid coder suitable for transforming blocks of scanned pixels , comprising the steps of : transforming progressively scanned pixel blocks independent of dynamic image content ;
for static images , transforming interlaced scanned pixel blocks with an original line structure ;
and for images with dynamic content , (a) rearranging the original line structure of an interlace scanned image signal so as to produce first and second vertically superimposed pixel blocks of similar predetermined size , said first block containing pixels associated with image lines of a first field , said second block containing pixels associated with image lines of an associated adjacent second field ;
and (b) transforming said first and second blocks (current block) .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
EP0588653A2

Filed: 1993-09-16     Issued: 1994-03-23

Image data coding and restoring method and appatatus for coding and restoring the same

(Original Assignee) Fujitsu Ltd     (Current Assignee) Fujitsu Ltd

Kimitaka C/O Fujitsu Limited Murashita, Tsugio C/O Fujitsu Limited Noda
US9930365B2
CLAIM 1
. A method of decoding video data (image changes) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
EP0588653A2
CLAIM 1
A method of coding data in units of block by dividing a frame into a plurality of blocks comprising :    steps (S1 , S2) of determining the existence of change of image data within a block of the present frame by comparing an image element within the block of the present frame to the corresponding image element of the block in the same position in the reference frame ;
   based on the result of the determination , step (S3) of outputting block information designating whether the block in the present frame is a valid block in which the image changes (video data) from the reference frame ;
   and steps (S4 , S5) of coding only the image data of the valid block .

EP0588653A2
CLAIM 4
The image data coding method according to claim 1 wherein the block information is coded and then output (mobile device, decoding video data) ted .

US9930365B2
CLAIM 7
. A device for decoding video data (image changes) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
EP0588653A2
CLAIM 1
A method of coding data in units of block by dividing a frame into a plurality of blocks comprising :    steps (S1 , S2) of determining the existence of change of image data within a block of the present frame by comparing an image element within the block of the present frame to the corresponding image element of the block in the same position in the reference frame ;
   based on the result of the determination , step (S3) of outputting block information designating whether the block in the present frame is a valid block in which the image changes (video data) from the reference frame ;
   and steps (S4 , S5) of coding only the image data of the valid block .

EP0588653A2
CLAIM 4
The image data coding method according to claim 1 wherein the block information is coded and then output (mobile device, decoding video data) ted .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device (then output) , a broadcast receiver device , or a set-top box .
EP0588653A2
CLAIM 4
The image data coding method according to claim 1 wherein the block information is coded and then output (mobile device, decoding video data) ted .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (image changes) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
EP0588653A2
CLAIM 1
A method of coding data in units of block by dividing a frame into a plurality of blocks comprising :    steps (S1 , S2) of determining the existence of change of image data within a block of the present frame by comparing an image element within the block of the present frame to the corresponding image element of the block in the same position in the reference frame ;
   based on the result of the determination , step (S3) of outputting block information designating whether the block in the present frame is a valid block in which the image changes (video data) from the reference frame ;
   and steps (S4 , S5) of coding only the image data of the valid block .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5448297A

Filed: 1993-09-09     Issued: 1995-09-05

Method and system for encoding images using skip blocks

(Original Assignee) Intel Corp     (Current Assignee) Intel Corp

Adnan Alattar, Rohan Coelho
US9930365B2
CLAIM 1
. A method of decoding video data (image signal) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size (horizontal dimension) equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5448297A
CLAIM 1
. A computer-implemented method for encoding a current region of an image , comprising the steps of : (a) comparing the current region to a reference region using a processor ;
and (b) encoding the current region as a skip block in accordance with the comparison of step (a) using the processor , wherein : the reference region comprises a corresponding region of a previous image ;
step (a) comprises the steps of : (1) encoding the current region to generate a temporary encoded region ;
(2) decoding the temporary encoded region to generate a decoded region ;
(3) generating a Y-component partial error measure SAE y in accordance with the following equation : ##EQU12## wherein F is a function of y r (i , j) and y d (i , j) , N is the vertical dimension of a Y-component block corresponding to the current region , M is the horizontal dimension (starting size) of the Y-component block , y r (i , j) are the decoded Y-component signals corresponding to the decoded region , and y d (i , j) are the Y-component signals corresponding to the reference region ;
(4) generating a V-component partial error measure SAE y in accordance with the following equation : ##EQU13## wherein F is a function of v r (i , j) and v d (i , j) , v r (i , j) are the decoded V-component signals corresponding to the decoded region , and v d (i , j) are the V-component signals corresponding to the reference region ;
(5) generating a U-component partial error measure SAE 1 in accordance with the following equation : ##EQU14## wherein F is a function of u r (i , j) and u d (i , j) , u r (i , j) are the decoded U-component signals corresponding to the decoded region , and u d (i , j) are the U-component signals corresponding to the reference region ;
and (6) generating an error measure E in accordance with the following equation : E=(SAE . sub . y +A*SAE . sub . v +B*SAE . sub . u) . wherein A and B are constants ;
and (7) comparing the error measure E to a selected threshold value ;
and step (b) comprises the step of encoding the current region as a skipped region , if the error measure is less than a specified threshold value ;
otherwise , encoding the current region as the temporary encoded region .

US5448297A
CLAIM 12
. The method of claim 10 , wherein step (b) comprises the steps of : (1) decoding a corresponding region of a previous encoded image ;
(2) storing the decoded image signal (video data) s corresponding to the decoded corresponding region in a memory device ;
(3) decoding the skip block designation of the encoded region ;
and (4) retaining the decoded image signals corresponding to the decoded corresponding region in the memory device as the decoded image signals corresponding to the encoded region .

US9930365B2
CLAIM 7
. A device for decoding video data (image signal) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (horizontal dimension) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5448297A
CLAIM 1
. A computer-implemented method for encoding a current region of an image , comprising the steps of : (a) comparing the current region to a reference region using a processor ;
and (b) encoding the current region as a skip block in accordance with the comparison of step (a) using the processor , wherein : the reference region comprises a corresponding region of a previous image ;
step (a) comprises the steps of : (1) encoding the current region to generate a temporary encoded region ;
(2) decoding the temporary encoded region to generate a decoded region ;
(3) generating a Y-component partial error measure SAE y in accordance with the following equation : ##EQU12## wherein F is a function of y r (i , j) and y d (i , j) , N is the vertical dimension of a Y-component block corresponding to the current region , M is the horizontal dimension (starting size) of the Y-component block , y r (i , j) are the decoded Y-component signals corresponding to the decoded region , and y d (i , j) are the Y-component signals corresponding to the reference region ;
(4) generating a V-component partial error measure SAE y in accordance with the following equation : ##EQU13## wherein F is a function of v r (i , j) and v d (i , j) , v r (i , j) are the decoded V-component signals corresponding to the decoded region , and v d (i , j) are the V-component signals corresponding to the reference region ;
(5) generating a U-component partial error measure SAE 1 in accordance with the following equation : ##EQU14## wherein F is a function of u r (i , j) and u d (i , j) , u r (i , j) are the decoded U-component signals corresponding to the decoded region , and u d (i , j) are the U-component signals corresponding to the reference region ;
and (6) generating an error measure E in accordance with the following equation : E=(SAE . sub . y +A*SAE . sub . v +B*SAE . sub . u) . wherein A and B are constants ;
and (7) comparing the error measure E to a selected threshold value ;
and step (b) comprises the step of encoding the current region as a skipped region , if the error measure is less than a specified threshold value ;
otherwise , encoding the current region as the temporary encoded region .

US5448297A
CLAIM 12
. The method of claim 10 , wherein step (b) comprises the steps of : (1) decoding a corresponding region of a previous encoded image ;
(2) storing the decoded image signal (video data) s corresponding to the decoded corresponding region in a memory device ;
(3) decoding the skip block designation of the encoded region ;
and (4) retaining the decoded image signals corresponding to the decoded corresponding region in the memory device as the decoded image signals corresponding to the encoded region .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (image signal) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size (horizontal dimension) equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5448297A
CLAIM 1
. A computer-implemented method for encoding a current region of an image , comprising the steps of : (a) comparing the current region to a reference region using a processor ;
and (b) encoding the current region as a skip block in accordance with the comparison of step (a) using the processor , wherein : the reference region comprises a corresponding region of a previous image ;
step (a) comprises the steps of : (1) encoding the current region to generate a temporary encoded region ;
(2) decoding the temporary encoded region to generate a decoded region ;
(3) generating a Y-component partial error measure SAE y in accordance with the following equation : ##EQU12## wherein F is a function of y r (i , j) and y d (i , j) , N is the vertical dimension of a Y-component block corresponding to the current region , M is the horizontal dimension (starting size) of the Y-component block , y r (i , j) are the decoded Y-component signals corresponding to the decoded region , and y d (i , j) are the Y-component signals corresponding to the reference region ;
(4) generating a V-component partial error measure SAE y in accordance with the following equation : ##EQU13## wherein F is a function of v r (i , j) and v d (i , j) , v r (i , j) are the decoded V-component signals corresponding to the decoded region , and v d (i , j) are the V-component signals corresponding to the reference region ;
(5) generating a U-component partial error measure SAE 1 in accordance with the following equation : ##EQU14## wherein F is a function of u r (i , j) and u d (i , j) , u r (i , j) are the decoded U-component signals corresponding to the decoded region , and u d (i , j) are the U-component signals corresponding to the reference region ;
and (6) generating an error measure E in accordance with the following equation : E=(SAE . sub . y +A*SAE . sub . v +B*SAE . sub . u) . wherein A and B are constants ;
and (7) comparing the error measure E to a selected threshold value ;
and step (b) comprises the step of encoding the current region as a skipped region , if the error measure is less than a specified threshold value ;
otherwise , encoding the current region as the temporary encoded region .

US5448297A
CLAIM 12
. The method of claim 10 , wherein step (b) comprises the steps of : (1) decoding a corresponding region of a previous encoded image ;
(2) storing the decoded image signal (video data) s corresponding to the decoded corresponding region in a memory device ;
(3) decoding the skip block designation of the encoded region ;
and (4) retaining the decoded image signals corresponding to the decoded corresponding region in the memory device as the decoded image signals corresponding to the encoded region .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5467086A

Filed: 1993-06-18     Issued: 1995-11-14

Apparatus and method of coding/decoding video data

(Original Assignee) Samsung Electronics Co Ltd     (Current Assignee) Samsung Electronics Co Ltd

Jechang Jeong
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (said switch) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5467086A
CLAIM 3
. A coder as defined in Claim 1 , further comprising a second adder having a first input coupled to an output of said reconstruction means , and a second input ;
and switching means having an input coupled to an output of said displacement compensation means , and an output coupled to said second input of said second adder , said switch (maximum size, thereon instructions) ing means being responsive to said mode value ;
and said second adder supplying the output of said reconstruction means or the sum of the outputs of said reconstruction means and said displacement compensation means to said frame memory in accordance with said mode value .

US5467086A
CLAIM 14
. A coding method (encoding mode) as defined in claim 13 further comprising the step of combining said mode value with said displacement vector into a data pack .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (said switch) and greater than the minimum size .
US5467086A
CLAIM 3
. A coder as defined in Claim 1 , further comprising a second adder having a first input coupled to an output of said reconstruction means , and a second input ;
and switching means having an input coupled to an output of said displacement compensation means , and an output coupled to said second input of said second adder , said switch (maximum size, thereon instructions) ing means being responsive to said mode value ;
and said second adder supplying the output of said reconstruction means or the sum of the outputs of said reconstruction means and said displacement compensation means to said frame memory in accordance with said mode value .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (coding method) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US5467086A
CLAIM 14
. A coding method (encoding mode) as defined in claim 13 further comprising the step of combining said mode value with said displacement vector into a data pack .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (said switch) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5467086A
CLAIM 3
. A coder as defined in Claim 1 , further comprising a second adder having a first input coupled to an output of said reconstruction means , and a second input ;
and switching means having an input coupled to an output of said displacement compensation means , and an output coupled to said second input of said second adder , said switch (maximum size, thereon instructions) ing means being responsive to said mode value ;
and said second adder supplying the output of said reconstruction means or the sum of the outputs of said reconstruction means and said displacement compensation means to said frame memory in accordance with said mode value .

US5467086A
CLAIM 14
. A coding method (encoding mode) as defined in claim 13 further comprising the step of combining said mode value with said displacement vector into a data pack .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (said switch) and greater than the minimum size .
US5467086A
CLAIM 3
. A coder as defined in Claim 1 , further comprising a second adder having a first input coupled to an output of said reconstruction means , and a second input ;
and switching means having an input coupled to an output of said displacement compensation means , and an output coupled to said second input of said second adder , said switch (maximum size, thereon instructions) ing means being responsive to said mode value ;
and said second adder supplying the output of said reconstruction means or the sum of the outputs of said reconstruction means and said displacement compensation means to said frame memory in accordance with said mode value .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (coding method) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US5467086A
CLAIM 14
. A coding method (encoding mode) as defined in claim 13 further comprising the step of combining said mode value with said displacement vector into a data pack .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions (said switch) that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (said switch) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (coding method) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5467086A
CLAIM 3
. A coder as defined in Claim 1 , further comprising a second adder having a first input coupled to an output of said reconstruction means , and a second input ;
and switching means having an input coupled to an output of said displacement compensation means , and an output coupled to said second input of said second adder , said switch (maximum size, thereon instructions) ing means being responsive to said mode value ;
and said second adder supplying the output of said reconstruction means or the sum of the outputs of said reconstruction means and said displacement compensation means to said frame memory in accordance with said mode value .

US5467086A
CLAIM 14
. A coding method (encoding mode) as defined in claim 13 further comprising the step of combining said mode value with said displacement vector into a data pack .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (said switch) and greater than the minimum size .
US5467086A
CLAIM 3
. A coder as defined in Claim 1 , further comprising a second adder having a first input coupled to an output of said reconstruction means , and a second input ;
and switching means having an input coupled to an output of said displacement compensation means , and an output coupled to said second input of said second adder , said switch (maximum size, thereon instructions) ing means being responsive to said mode value ;
and said second adder supplying the output of said reconstruction means or the sum of the outputs of said reconstruction means and said displacement compensation means to said frame memory in accordance with said mode value .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (coding method) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US5467086A
CLAIM 14
. A coding method (encoding mode) as defined in claim 13 further comprising the step of combining said mode value with said displacement vector into a data pack .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5442400A

Filed: 1993-04-29     Issued: 1995-08-15

Error concealment apparatus for MPEG-like video data

(Original Assignee) RCA Licensing Corp     (Current Assignee) RCA Licensing Corp

Huifang Sun, Joel W. Zdepski, Kamil M. Uz
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5442400A
CLAIM 1
. Apparatus for compressing video data (video data) comprising : a source of video signal to be compressed ;
motion compensated predictive compression means for compressing respective frames of said video signal according to either intraframe processing or interflame processing and on a block by block basis to generate blocks of compressed data corresponding to respective image areas , said motion compensated predictive compression means including means for providing interframe motion vectors MV i for blocks B i of both intraframe and interflame processed video signal , said motion vectors MV i relating respective blocks of predicted video signal to blocks of original video signal which are substantially similar ;
and formatting means for arranging said blocks of compressed data and associated said motion vectors according to a desired signal protocol , wherein said interframe motion vectors MV i for blocks B i of said interframe processed video signal are attached to blocks B i of interframe compressed data to which they relate and said interframe motion vectors MV i for blocks B i of said intraframe processed video signal are attached to blocks B k of intraframe compressed data different than blocks B i of intraframe compressed data to which the respective interframe motion vectors relate , where index k is not equal to index i .

US5442400A
CLAIM 5
. The apparatus set forth in claim 4 wherein said decompression means includes : inverse transform means having an input port coupled to receive said blocks of compressed data , and having an output port for providing blocks of inverse transformed data ;
an adder having a first input port coupled to the output port of said inverse transform means , having an output port at which decompressed signal is available and having a second input port ;
a motion compensated predictive decompressor coupled to the output port of said adder and responsive to said decompressed signal and said motion vectors for generating at an output port of said motion compensated predictive decompressor , blocks of data corresponding to predicted frames of said video signal ;
and means for coupling the output of said motion compensated predictive decompressor to the second input port of said adder when said interframe processed data is being decompressed , and for normally applying a zero value (decoding video data) to the second input port of said adder when said intraframe processed data is being decompressed , and for concurrently coupling said motion compensated predictive decompressor to the second input port of said adder and applying a zero value to the first input port of said adder to provide decompressed data for discarded blocks of intraframe data .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks (respective frame) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5442400A
CLAIM 1
. Apparatus for compressing video data (video data) comprising : a source of video signal to be compressed ;
motion compensated predictive compression means for compressing respective frame (video blocks) s of said video signal according to either intraframe processing or interflame processing and on a block by block basis to generate blocks of compressed data corresponding to respective image areas , said motion compensated predictive compression means including means for providing interframe motion vectors MV i for blocks B i of both intraframe and interflame processed video signal , said motion vectors MV i relating respective blocks of predicted video signal to blocks of original video signal which are substantially similar ;
and formatting means for arranging said blocks of compressed data and associated said motion vectors according to a desired signal protocol , wherein said interframe motion vectors MV i for blocks B i of said interframe processed video signal are attached to blocks B i of interframe compressed data to which they relate and said interframe motion vectors MV i for blocks B i of said intraframe processed video signal are attached to blocks B k of intraframe compressed data different than blocks B i of intraframe compressed data to which the respective interframe motion vectors relate , where index k is not equal to index i .

US5442400A
CLAIM 5
. The apparatus set forth in claim 4 wherein said decompression means includes : inverse transform means having an input port coupled to receive said blocks of compressed data , and having an output port for providing blocks of inverse transformed data ;
an adder having a first input port coupled to the output port of said inverse transform means , having an output port at which decompressed signal is available and having a second input port ;
a motion compensated predictive decompressor coupled to the output port of said adder and responsive to said decompressed signal and said motion vectors for generating at an output port of said motion compensated predictive decompressor , blocks of data corresponding to predicted frames of said video signal ;
and means for coupling the output of said motion compensated predictive decompressor to the second input port of said adder when said interframe processed data is being decompressed , and for normally applying a zero value (decoding video data) to the second input port of said adder when said intraframe processed data is being decompressed , and for concurrently coupling said motion compensated predictive decompressor to the second input port of said adder and applying a zero value to the first input port of said adder to provide decompressed data for discarded blocks of intraframe data .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (said memory, image area) , or a set-top box .
US5442400A
CLAIM 1
. Apparatus for compressing video data comprising : a source of video signal to be compressed ;
motion compensated predictive compression means for compressing respective frames of said video signal according to either intraframe processing or interflame processing and on a block by block basis to generate blocks of compressed data corresponding to respective image area (broadcast receiver device) s , said motion compensated predictive compression means including means for providing interframe motion vectors MV i for blocks B i of both intraframe and interflame processed video signal , said motion vectors MV i relating respective blocks of predicted video signal to blocks of original video signal which are substantially similar ;
and formatting means for arranging said blocks of compressed data and associated said motion vectors according to a desired signal protocol , wherein said interframe motion vectors MV i for blocks B i of said interframe processed video signal are attached to blocks B i of interframe compressed data to which they relate and said interframe motion vectors MV i for blocks B i of said intraframe processed video signal are attached to blocks B k of intraframe compressed data different than blocks B i of intraframe compressed data to which the respective interframe motion vectors relate , where index k is not equal to index i .

US5442400A
CLAIM 4
. Apparatus for processing motion compensated compressed video data wherein ones of frames of video signal are compressed according to intraframe processing and other frames of said video signal are compressed according to interframe processing , and wherein said intraframe and interframe processing generates compressed data on a block by block basis , and wherein motion vectors are generated for respective blocks of intraframe processed data , and respective motion vectors of intraframe processed blocks are attached to blocks of data other than blocks to which they respectively correspond , said apparatus comprising : a source of said motion compensated compressed video data ;
means for detecting errors in said motion compensated compressed video data and for discarding blocks of data including errors ;
means for separating said motion vectors from respective blocks of intraframe compressed data , and for storing said motion vectors in memory , and for applying blocks of intraframe compressed data without corresponding intraframe motion vectors to decompression means for decompression ;
means , responsive to said motion compensated compressed video data and said means for detecting errors , for providing blocks of synthesized data for said blocks of intraframe compressed data discarded by said means for detecting errors , said synthesized data formed using intraframe motion vectors corresponding to said discarded blocks , which said motion vectors are accessed from said memory (broadcast receiver device) ;
and means for substituting said blocks of synthesized data for corresponding said discarded blocks of intraframe compressed data .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions (other frame) .
US5442400A
CLAIM 1
. Apparatus for compressing video data (video data) comprising : a source of video signal to be compressed ;
motion compensated predictive compression means for compressing respective frames of said video signal according to either intraframe processing or interflame processing and on a block by block basis to generate blocks of compressed data corresponding to respective image areas , said motion compensated predictive compression means including means for providing interframe motion vectors MV i for blocks B i of both intraframe and interflame processed video signal , said motion vectors MV i relating respective blocks of predicted video signal to blocks of original video signal which are substantially similar ;
and formatting means for arranging said blocks of compressed data and associated said motion vectors according to a desired signal protocol , wherein said interframe motion vectors MV i for blocks B i of said interframe processed video signal are attached to blocks B i of interframe compressed data to which they relate and said interframe motion vectors MV i for blocks B i of said intraframe processed video signal are attached to blocks B k of intraframe compressed data different than blocks B i of intraframe compressed data to which the respective interframe motion vectors relate , where index k is not equal to index i .

US5442400A
CLAIM 4
. Apparatus for processing motion compensated compressed video data wherein ones of frames of video signal are compressed according to intraframe processing and other frame (sub partitions) s of said video signal are compressed according to interframe processing , and wherein said intraframe and interframe processing generates compressed data on a block by block basis , and wherein motion vectors are generated for respective blocks of intraframe processed data , and respective motion vectors of intraframe processed blocks are attached to blocks of data other than blocks to which they respectively correspond , said apparatus comprising : a source of said motion compensated compressed video data ;
means for detecting errors in said motion compensated compressed video data and for discarding blocks of data including errors ;
means for separating said motion vectors from respective blocks of intraframe compressed data , and for storing said motion vectors in memory , and for applying blocks of intraframe compressed data without corresponding intraframe motion vectors to decompression means for decompression ;
means , responsive to said motion compensated compressed video data and said means for detecting errors , for providing blocks of synthesized data for said blocks of intraframe compressed data discarded by said means for detecting errors , said synthesized data formed using intraframe motion vectors corresponding to said discarded blocks , which said motion vectors are accessed from said memory ;
and means for substituting said blocks of synthesized data for corresponding said discarded blocks of intraframe compressed data .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
USRE35158E

Filed: 1992-12-28     Issued: 1996-02-20

Apparatus for adaptive inter-frame predictive encoding of video signal

(Original Assignee) Victor Company of Japan Ltd     (Current Assignee) Victor Company of Japan Ltd

Kenji Sugiyama
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (prediction modes) and an inter-prediction mode (prediction modes) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
USRE35158E
CLAIM 2
. An adaptive predictive encoding apparatus according to claim 1 , in which said adaptive prediction means comprises : means for deriving a first prediction signal based on a combination of pixel data of said preceding and succeeding reference frames , a second prediction signal derived only from said preceding reference frame , a third prediction signal derived only from said succeeding , and a non-prediction signal derived only from said dependent frame ;
and predictive mode selection means for selecting , for each of said blocks , one out of four prediction modes (intra-prediction mode, inter-prediction mode) in which said first , second and third prediction signals and said non-prediction signal are respectively used in deriving predictive error values for respective pixel data of said block , to be sent to said encoder means and encoded thereby , said selection being based upon judgement of said errors , said predictive mode selection means further supplying to said encoding means , to be encoded thereby , predictive mode data indicating predictive modes which have been selected for respective ones of the blocks .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (prediction modes) and an inter-prediction mode (prediction modes) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
USRE35158E
CLAIM 2
. An adaptive predictive encoding apparatus according to claim 1 , in which said adaptive prediction means comprises : means for deriving a first prediction signal based on a combination of pixel data of said preceding and succeeding reference frames , a second prediction signal derived only from said preceding reference frame , a third prediction signal derived only from said succeeding , and a non-prediction signal derived only from said dependent frame ;
and predictive mode selection means for selecting , for each of said blocks , one out of four prediction modes (intra-prediction mode, inter-prediction mode) in which said first , second and third prediction signals and said non-prediction signal are respectively used in deriving predictive error values for respective pixel data of said block , to be sent to said encoder means and encoded thereby , said selection being based upon judgement of said errors , said predictive mode selection means further supplying to said encoding means , to be encoded thereby , predictive mode data indicating predictive modes which have been selected for respective ones of the blocks .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (prediction modes) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
USRE35158E
CLAIM 2
. An adaptive predictive encoding apparatus according to claim 1 , in which said adaptive prediction means comprises : means for deriving a first prediction signal based on a combination of pixel data of said preceding and succeeding reference frames , a second prediction signal derived only from said preceding reference frame , a third prediction signal derived only from said succeeding , and a non-prediction signal derived only from said dependent frame ;
and predictive mode selection means for selecting , for each of said blocks , one out of four prediction modes (intra-prediction mode, inter-prediction mode) in which said first , second and third prediction signals and said non-prediction signal are respectively used in deriving predictive error values for respective pixel data of said block , to be sent to said encoder means and encoded thereby , said selection being based upon judgement of said errors , said predictive mode selection means further supplying to said encoding means , to be encoded thereby , predictive mode data indicating predictive modes which have been selected for respective ones of the blocks .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions (frame memory) that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (prediction modes) and an inter-prediction mode (prediction modes) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
USRE35158E
CLAIM 2
. An adaptive predictive encoding apparatus according to claim 1 , in which said adaptive prediction means comprises : means for deriving a first prediction signal based on a combination of pixel data of said preceding and succeeding reference frames , a second prediction signal derived only from said preceding reference frame , a third prediction signal derived only from said succeeding , and a non-prediction signal derived only from said dependent frame ;
and predictive mode selection means for selecting , for each of said blocks , one out of four prediction modes (intra-prediction mode, inter-prediction mode) in which said first , second and third prediction signals and said non-prediction signal are respectively used in deriving predictive error values for respective pixel data of said block , to be sent to said encoder means and encoded thereby , said selection being based upon judgement of said errors , said predictive mode selection means further supplying to said encoding means , to be encoded thereby , predictive mode data indicating predictive modes which have been selected for respective ones of the blocks .

USRE35158E
CLAIM 5
. An adaptive predictive encoding apparatus according to claim 1 , and further comprising an (N-1) frame memory (thereon instructions) for temporarily storing each dependent frame of said video signal and outputting said each dependent frame to said adaptive prediction means after a fixed delay time , and first and second 1-frame memories for respectively holding pixel data of said preceding and succeeding reference frames and supplying pixel data of said preceding and succeeding reference frames to said adaptive prediction means during adaptive prediction processing of successive ones of said dependent frames . . Iadd .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5347308A

Filed: 1992-10-05     Issued: 1994-09-13

Adaptive coding method for interlaced scan digital video sequences

(Original Assignee) Panasonic Corp     (Current Assignee) Panasonic Corp

Lucas H. Y. Wai
US9930365B2
CLAIM 1
. A method of decoding video data (discrete cosine) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5347308A
CLAIM 4
. A method according to claim 3 , wherein the block coding process comprises the steps of : discrete cosine (decoding video data) transforming each of said sub-blocks into DCT coefficients ;
quantizing the DCT coefficients ;
zig-zag scanning the quantized DCT coefficients ;
variable length coding (VLC) the result of the zig-zag scanning to produce encoded information .

US9930365B2
CLAIM 7
. A device for decoding video data (discrete cosine) , the device comprising : a memory configured to store decoded video blocks (motion prediction) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5347308A
CLAIM 3
. A method according to claim 2 , wherein the inter-frame motion compensation process comprises the steps of : determining whether to subject said respective block (i) to an inter-frame motion prediction (video blocks) process or (ii) to an inter-frame motion interpolation process and generating a second determined result ;
selectively subjecting said respective block , according to the second determined result , to said inter-frame motion prediction process where a forward motion vector is detected with reference to a previous coded frame , wherein motion compensation is performed on said respective block using the detected forward motion vector so as to generate a motion compensated block , and comparing the motion compensated block with said respective block ;
and selectively subjecting said respective block , according to the second determined result , to said inter-frame motion interpolation process where a forward motion vector is detected with reference to a previous coded frame , a backward motion vector is detected with reference to a future coded frame , including performing three motion compensations respectively using the detected forward motion vector , the detected backward motion vector , and both the detected forward and backward motion vectors , and performing comparisons between results of the three motion compensation together with said respective block .

US5347308A
CLAIM 4
. A method according to claim 3 , wherein the block coding process comprises the steps of : discrete cosine (decoding video data) transforming each of said sub-blocks into DCT coefficients ;
quantizing the DCT coefficients ;
zig-zag scanning the quantized DCT coefficients ;
variable length coding (VLC) the result of the zig-zag scanning to produce encoded information .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5227878A

Filed: 1991-11-15     Issued: 1993-07-13

Adaptive coding and decoding of frames and fields of video

(Original Assignee) Nokia Bell Labs     (Current Assignee) AT&T Corp ; Multimedia Patent Trust

Atul Puri, Rangarajan Aravind
US9930365B2
CLAIM 1
. A method of decoding video data (discrete cosine) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (stored value) and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5227878A
CLAIM 21
. The apparatus of claim 18 , further comprising : a means for adaptively scanning stored value (intra-prediction mode) s related to the representations in response to the coding type signal .

US5227878A
CLAIM 26
. The apparatus of claim 24 , in which the means for selectively transforming comprises a means for adaptively performing a discrete cosine (decoding video data) transform of the input video signal in response to the frame/field encoding type signal .

US9930365B2
CLAIM 7
. A device for decoding video data (discrete cosine) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (stored value) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5227878A
CLAIM 21
. The apparatus of claim 18 , further comprising : a means for adaptively scanning stored value (intra-prediction mode) s related to the representations in response to the coding type signal .

US5227878A
CLAIM 26
. The apparatus of claim 24 , in which the means for selectively transforming comprises a means for adaptively performing a discrete cosine (decoding video data) transform of the input video signal in response to the frame/field encoding type signal .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode is the intra-prediction mode (stored value) , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US5227878A
CLAIM 21
. The apparatus of claim 18 , further comprising : a means for adaptively scanning stored value (intra-prediction mode) s related to the representations in response to the coding type signal .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode (stored value) and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5227878A
CLAIM 21
. The apparatus of claim 18 , further comprising : a means for adaptively scanning stored value (intra-prediction mode) s related to the representations in response to the coding type signal .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5155594A

Filed: 1991-05-10     Issued: 1992-10-13

Hierarchical encoding method and apparatus employing background references for efficiently communicating image sequences

(Original Assignee) PictureTel Corp     (Current Assignee) Polycom Inc

Jeffrey Bernstein, Bernd Girod, Xiancheng Yuan
US9930365B2
CLAIM 1
. A method of decoding video data , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element (reconstructed image) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5155594A
CLAIM 10
. The method of claim 3 wherein said step of maintaining a background pyramid structure comprises the steps of : preparing a reconstructed image (third syntax element) pyramid data structure representative of a receiver image data structure prepared by said receiver in response to said transmitted electrical signals , for each component of said reconstructed image pyramid data structure , determining the number of frames since said component was last changed , generating a weighting factor which is proportional to said determined number of frames , and for each component of said background pyramid data structure , computing a new component value equal to a weighted sum of the background component value and the value of a corresponding component of said reconstructed image pyramid data structure , the relative weight accorded said background component value being determined by said weighting factor .

US9930365B2
CLAIM 7
. A device for decoding video data , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element (reconstructed image) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5155594A
CLAIM 10
. The method of claim 3 wherein said step of maintaining a background pyramid structure comprises the steps of : preparing a reconstructed image (third syntax element) pyramid data structure representative of a receiver image data structure prepared by said receiver in response to said transmitted electrical signals , for each component of said reconstructed image pyramid data structure , determining the number of frames since said component was last changed , generating a weighting factor which is proportional to said determined number of frames , and for each component of said background pyramid data structure , computing a new component value equal to a weighted sum of the background component value and the value of a corresponding component of said reconstructed image pyramid data structure , the relative weight accorded said background component value being determined by said weighting factor .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions (estimated number) that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element (reconstructed image) , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5155594A
CLAIM 4
. The method of claim 3 wherein selecting one of said first and second prediction components comprises the steps of : for each of said first and second error signals estimating the number of bits required to encode the error signal , and selecting from said first and second error signals , the error signal requiring the smallest estimated number (thereon instructions) of bits .

US5155594A
CLAIM 10
. The method of claim 3 wherein said step of maintaining a background pyramid structure comprises the steps of : preparing a reconstructed image (third syntax element) pyramid data structure representative of a receiver image data structure prepared by said receiver in response to said transmitted electrical signals , for each component of said reconstructed image pyramid data structure , determining the number of frames since said component was last changed , generating a weighting factor which is proportional to said determined number of frames , and for each component of said background pyramid data structure , computing a new component value equal to a weighted sum of the background component value and the value of a corresponding component of said reconstructed image pyramid data structure , the relative weight accorded said background component value being determined by said weighting factor .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5068724A

Filed: 1990-06-15     Issued: 1991-11-26

Adaptive motion compensation for digital television

(Original Assignee) Arris Technology Inc     (Current Assignee) CIF LICENSING LLC

Edward A. Krause, Woo H. Paik, Vincent Liu, Jim Essserman
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (said switch) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (least number) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5068724A
CLAIM 12
. Apparatus in accordance with claim 6 further comprising : switch means for obtaining compressed data from said first compressing means when the first compressed video signal is selected and for obtaining compressed data from said second compressing means when the second compressed video signal is selected ;
and means coupled to said switch (maximum size, thereon instructions) means for decompressing the data obtained by said switch means for use by said predicting means .

US5068724A
CLAIM 19
. A digital television system for processing blocks of pixel data to produce television pictures comprising : first means for compressing blocks of pixel data without motion compensation ;
second means for compressing said blocks of pixel data with motion compensation ;
bit counter means for quantifying the compressed data from said first and second compression means ;
comparator means coupled to said bit counter means for selecting the compressed data containing the fewest bits for each block by comparing the number of bits in the compressed data from said first and second compression means ;
means for encoding the selected data for each block to identify it as non-motion compensated or motion compensated ;
and means for combining the encoded selected data to provide a compressed video data (video data) stream for transmission by a transmitter .

US5068724A
CLAIM 22
. A method for encoding television signals for digital transmission comprising the steps of : dividing a digitized video signal into blocks of pixel data ;
compressing said blocks in a first path to provide a first compressed video signal ;
compressing said blocks in a second path to provide a second encompassed video signal ;
quantifying the data in said first and second compressed video signals by determining the number of bits produced by said compressing means for equivalent portions of the pixel data ;
selecting , for each block , the compressed video signal containing the least number (encoding mode) of bits ;
encoding the selected signals to identify them as first path or second path signals ;
and combining the encoded signals .

US9930365B2
CLAIM 2
. The method of claim 1 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (said switch) and greater than the minimum size .
US5068724A
CLAIM 12
. Apparatus in accordance with claim 6 further comprising : switch means for obtaining compressed data from said first compressing means when the first compressed video signal is selected and for obtaining compressed data from said second compressing means when the second compressed video signal is selected ;
and means coupled to said switch (maximum size, thereon instructions) means for decompressing the data obtained by said switch means for use by said predicting means .

US9930365B2
CLAIM 4
. The method of claim 1 , wherein the encoding mode (least number) is the intra-predicting mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-predicting mode .
US5068724A
CLAIM 22
. A method for encoding television signals for digital transmission comprising the steps of : dividing a digitized video signal into blocks of pixel data ;
compressing said blocks in a first path to provide a first compressed video signal ;
compressing said blocks in a second path to provide a second encompassed video signal ;
quantifying the data in said first and second compressed video signals by determining the number of bits produced by said compressing means for equivalent portions of the pixel data ;
selecting , for each block , the compressed video signal containing the least number (encoding mode) of bits ;
encoding the selected signals to identify them as first path or second path signals ;
and combining the encoded signals .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (said switch) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (least number) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5068724A
CLAIM 12
. Apparatus in accordance with claim 6 further comprising : switch means for obtaining compressed data from said first compressing means when the first compressed video signal is selected and for obtaining compressed data from said second compressing means when the second compressed video signal is selected ;
and means coupled to said switch (maximum size, thereon instructions) means for decompressing the data obtained by said switch means for use by said predicting means .

US5068724A
CLAIM 19
. A digital television system for processing blocks of pixel data to produce television pictures comprising : first means for compressing blocks of pixel data without motion compensation ;
second means for compressing said blocks of pixel data with motion compensation ;
bit counter means for quantifying the compressed data from said first and second compression means ;
comparator means coupled to said bit counter means for selecting the compressed data containing the fewest bits for each block by comparing the number of bits in the compressed data from said first and second compression means ;
means for encoding the selected data for each block to identify it as non-motion compensated or motion compensated ;
and means for combining the encoded selected data to provide a compressed video data (video data) stream for transmission by a transmitter .

US5068724A
CLAIM 22
. A method for encoding television signals for digital transmission comprising the steps of : dividing a digitized video signal into blocks of pixel data ;
compressing said blocks in a first path to provide a first compressed video signal ;
compressing said blocks in a second path to provide a second encompassed video signal ;
quantifying the data in said first and second compressed video signals by determining the number of bits produced by said compressing means for equivalent portions of the pixel data ;
selecting , for each block , the compressed video signal containing the least number (encoding mode) of bits ;
encoding the selected signals to identify them as first path or second path signals ;
and combining the encoded signals .

US9930365B2
CLAIM 8
. The device of claim 7 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (said switch) and greater than the minimum size .
US5068724A
CLAIM 12
. Apparatus in accordance with claim 6 further comprising : switch means for obtaining compressed data from said first compressing means when the first compressed video signal is selected and for obtaining compressed data from said second compressing means when the second compressed video signal is selected ;
and means coupled to said switch (maximum size, thereon instructions) means for decompressing the data obtained by said switch means for use by said predicting means .

US9930365B2
CLAIM 10
. The device of claim 7 , wherein the encoding mode (least number) is the intra-prediction mode , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction mode .
US5068724A
CLAIM 22
. A method for encoding television signals for digital transmission comprising the steps of : dividing a digitized video signal into blocks of pixel data ;
compressing said blocks in a first path to provide a first compressed video signal ;
compressing said blocks in a second path to provide a second encompassed video signal ;
quantifying the data in said first and second compressed video signals by determining the number of bits produced by said compressing means for equivalent portions of the pixel data ;
selecting , for each block , the compressed video signal containing the least number (encoding mode) of bits ;
encoding the selected signals to identify them as first path or second path signals ;
and combining the encoded signals .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (digital television) , or a set-top box .
US5068724A
CLAIM 19
. A digital television (receiver device) system for processing blocks of pixel data to produce television pictures comprising : first means for compressing blocks of pixel data without motion compensation ;
second means for compressing said blocks of pixel data with motion compensation ;
bit counter means for quantifying the compressed data from said first and second compression means ;
comparator means coupled to said bit counter means for selecting the compressed data containing the fewest bits for each block by comparing the number of bits in the compressed data from said first and second compression means ;
means for encoding the selected data for each block to identify it as non-motion compensated or motion compensated ;
and means for combining the encoded selected data to provide a compressed video data stream for transmission by a transmitter .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions (said switch) that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size (said switch) of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode (least number) used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5068724A
CLAIM 12
. Apparatus in accordance with claim 6 further comprising : switch means for obtaining compressed data from said first compressing means when the first compressed video signal is selected and for obtaining compressed data from said second compressing means when the second compressed video signal is selected ;
and means coupled to said switch (maximum size, thereon instructions) means for decompressing the data obtained by said switch means for use by said predicting means .

US5068724A
CLAIM 19
. A digital television system for processing blocks of pixel data to produce television pictures comprising : first means for compressing blocks of pixel data without motion compensation ;
second means for compressing said blocks of pixel data with motion compensation ;
bit counter means for quantifying the compressed data from said first and second compression means ;
comparator means coupled to said bit counter means for selecting the compressed data containing the fewest bits for each block by comparing the number of bits in the compressed data from said first and second compression means ;
means for encoding the selected data for each block to identify it as non-motion compensated or motion compensated ;
and means for combining the encoded selected data to provide a compressed video data (video data) stream for transmission by a transmitter .

US5068724A
CLAIM 22
. A method for encoding television signals for digital transmission comprising the steps of : dividing a digitized video signal into blocks of pixel data ;
compressing said blocks in a first path to provide a first compressed video signal ;
compressing said blocks in a second path to provide a second encompassed video signal ;
quantifying the data in said first and second compressed video signals by determining the number of bits produced by said compressing means for equivalent portions of the pixel data ;
selecting , for each block , the compressed video signal containing the least number (encoding mode) of bits ;
encoding the selected signals to identify them as first path or second path signals ;
and combining the encoded signals .

US9930365B2
CLAIM 16
. The non-transitory computer-readable storage medium of claim 15 , wherein at least one other block of the plurality of blocks has a size that is less than the maximum size (said switch) and greater than the minimum size .
US5068724A
CLAIM 12
. Apparatus in accordance with claim 6 further comprising : switch means for obtaining compressed data from said first compressing means when the first compressed video signal is selected and for obtaining compressed data from said second compressing means when the second compressed video signal is selected ;
and means coupled to said switch (maximum size, thereon instructions) means for decompressing the data obtained by said switch means for use by said predicting means .

US9930365B2
CLAIM 18
. The non-transitory computer-readable storage medium of claim 15 , wherein the encoding mode (least number) is intra-prediction , and decoding the sub-block according to the encoding mode comprises predicting the sub-block from one or more neighboring pixels according to the intra-prediction .
US5068724A
CLAIM 22
. A method for encoding television signals for digital transmission comprising the steps of : dividing a digitized video signal into blocks of pixel data ;
compressing said blocks in a first path to provide a first compressed video signal ;
compressing said blocks in a second path to provide a second encompassed video signal ;
quantifying the data in said first and second compressed video signals by determining the number of bits produced by said compressing means for equivalent portions of the pixel data ;
selecting , for each block , the compressed video signal containing the least number (encoding mode) of bits ;
encoding the selected signals to identify them as first path or second path signals ;
and combining the encoded signals .




US9930365B2

Filed: 2008-10-03     Issued: 2018-03-27

Video coding with large macroblocks

(Original Assignee) Velos Media LLC     (Current Assignee) Velos Media LLC

Peisong Chen, Marta Karczewicz, Yan Ye
US5091782A

Filed: 1990-04-09     Issued: 1992-02-25

Apparatus and method for adaptively compressing successive blocks of digital video

(Original Assignee) Arris Technology Inc     (Current Assignee) CIF LICENSING LLC

Edward A. Krause, Woo H. Paik
US9930365B2
CLAIM 1
. A method of decoding video data (video data) , the method comprising : decoding a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decoding a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determining that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partitioning the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decoding a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (interlaced video) ;

and decoding the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded subpartitions .
US5091782A
CLAIM 1
. Apparatus for processing digitized interlaced video (inter-prediction mode) signals for transmission in a compressed form comprising : first means for compressing a set of pixel data presented in a field format to provide a first compressed video signal ;
second means for compressing said set of pixel data presented in a frame format to provide a second compressed video signal ;
means coupled to said first means for evaluating errors in the first compressed video signal and coupled to said second means for evaluating errors in the second compressed video signal ;
and means responsive to said error evaluating means for selecting the compressed video signal having the least error .

US5091782A
CLAIM 27
. A digital television system comprising : first data compression means for compressing field formatted blocks of pixel data derived from a digitized interlaced video signal ;
second data compression means for compressing frame formatted blocks of pixel data corresponding to said field formatted blocks ;
means for evaluating errors in compressed data from said first and second data compression means ;
means responsive to said error evaluating means for selecting the compressed data for each block having the least error ;
means for encoding the selected data for each block to identify it as field processed or frame processed data ;
and means for combining the encoded selected data to provide a compressed video data (video data) stream containing interspersed blocks of field processed pixel data and frame processed pixel data for transmission by a transmitter .

US9930365B2
CLAIM 7
. A device for decoding video data (video data) , the device comprising : a memory configured to store decoded video blocks (transforming means) of the video data , and a processor , in communication with the memory , configured to : decode a first syntax element associated with a sequence of pictures of the video data , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (interlaced video) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub-partitions .
US5091782A
CLAIM 1
. Apparatus for processing digitized interlaced video (inter-prediction mode) signals for transmission in a compressed form comprising : first means for compressing a set of pixel data presented in a field format to provide a first compressed video signal ;
second means for compressing said set of pixel data presented in a frame format to provide a second compressed video signal ;
means coupled to said first means for evaluating errors in the first compressed video signal and coupled to said second means for evaluating errors in the second compressed video signal ;
and means responsive to said error evaluating means for selecting the compressed video signal having the least error .

US5091782A
CLAIM 25
. Apparatus in accordance with claim 24 wherein the inverse transforming means (video blocks) is shared by said first and second means .

US5091782A
CLAIM 27
. A digital television system comprising : first data compression means for compressing field formatted blocks of pixel data derived from a digitized interlaced video signal ;
second data compression means for compressing frame formatted blocks of pixel data corresponding to said field formatted blocks ;
means for evaluating errors in compressed data from said first and second data compression means ;
means responsive to said error evaluating means for selecting the compressed data for each block having the least error ;
means for encoding the selected data for each block to identify it as field processed or frame processed data ;
and means for combining the encoded selected data to provide a compressed video data (video data) stream containing interspersed blocks of field processed pixel data and frame processed pixel data for transmission by a transmitter .

US9930365B2
CLAIM 14
. The device of claim 7 , wherein the device is one or more of a camera , a computer , a mobile device , a broadcast receiver device (digital television) , or a set-top box .
US5091782A
CLAIM 27
. A digital television (receiver device) system comprising : first data compression means for compressing field formatted blocks of pixel data derived from a digitized interlaced video signal ;
second data compression means for compressing frame formatted blocks of pixel data corresponding to said field formatted blocks ;
means for evaluating errors in compressed data from said first and second data compression means ;
means responsive to said error evaluating means for selecting the compressed data for each block having the least error ;
means for encoding the selected data for each block to identify it as field processed or frame processed data ;
and means for combining the encoded selected data to provide a compressed video data stream containing interspersed blocks of field processed pixel data and frame processed pixel data for transmission by a transmitter .

US9930365B2
CLAIM 15
. A non-transitory computer-readable storage medium having stored thereon instructions that , when executed , cause a processor to : decode a first syntax element associated with a sequence of pictures of video data (video data) , the first syntax element representing a minimum size of blocks of the sequence of pictures ;

decode a second syntax element , separate from the first syntax element , associated with the sequence of pictures , the second syntax element representing a maximum size of the blocks of the sequence of pictures , wherein the maximum size is greater than 16×16 pixels ;

determine that a current block of a plurality of blocks of the sequence of pictures has a starting size equal to the maximum size using the second syntax element ;

partition the current block to obtain a plurality of sub-blocks for the current block , wherein partitioning comprises determining that a sub-block of the sub-blocks of the current block does not include further separately encoded sub-partitions when the size of the sub-block is equal to the minimum size indicated by the first syntax element ;

decode a third syntax element , separate from the first syntax element and the second syntax element , the third syntax element representing an encoding mode used to encode the sub-block , wherein the encoding mode comprises one of an intra-prediction mode and an inter-prediction mode (interlaced video) ;

and decode the sub-block according to the encoding mode , without further partitioning the sub-block , based on the determination that the block does not include further separately encoded sub partitions .
US5091782A
CLAIM 1
. Apparatus for processing digitized interlaced video (inter-prediction mode) signals for transmission in a compressed form comprising : first means for compressing a set of pixel data presented in a field format to provide a first compressed video signal ;
second means for compressing said set of pixel data presented in a frame format to provide a second compressed video signal ;
means coupled to said first means for evaluating errors in the first compressed video signal and coupled to said second means for evaluating errors in the second compressed video signal ;
and means responsive to said error evaluating means for selecting the compressed video signal having the least error .

US5091782A
CLAIM 27
. A digital television system comprising : first data compression means for compressing field formatted blocks of pixel data derived from a digitized interlaced video signal ;
second data compression means for compressing frame formatted blocks of pixel data corresponding to said field formatted blocks ;
means for evaluating errors in compressed data from said first and second data compression means ;
means responsive to said error evaluating means for selecting the compressed data for each block having the least error ;
means for encoding the selected data for each block to identify it as field processed or frame processed data ;
and means for combining the encoded selected data to provide a compressed video data (video data) stream containing interspersed blocks of field processed pixel data and frame processed pixel data for transmission by a transmitter .