|Table of Contents|

[1] Wang Gang, Wang Qing, Li Bing, et al. High-performance multi-transform architecture for H.264/AVC [J]. Journal of Southeast University (English Edition), 2013, 29 (3): 276-281. [doi:10.3969/j.issn.1003-7985.2013.03.009]
Copy

High-performance multi-transform architecture for H.264/AVC()
一种用于H.264/AVC的高性能多变换结构
Share:

Journal of Southeast University (English Edition)[ISSN:1003-7985/CN:32-1325/N]

Volumn:
29
Issue:
2013 3
Page:
276-281
Research Field:
Computer Science and Engineering
Publishing date:
2013-09-20

Info

Title:
High-performance multi-transform architecture for H.264/AVC
一种用于H.264/AVC的高性能多变换结构
Author(s):
Wang Gang1 2 Wang Qing1 Li Bing2 Chen Rui3
1 School of Instrument Science and Engineering, Southeast University, Nanjing 210096, China
2 Wuxi Branch, Southeast University, Wuxi 214135, China
3 Institute of Electronics, Chinese Academy of Sciences, Beijing 100190, China
王刚1 2 王庆1 李冰2 陈锐3
1东南大学仪器科学与工程学院, 南京 210096; 2东南大学无锡分校, 无锡 214135; 3中国科学院电子学研究所, 北京 100190
Keywords:
H.264/AVC multi-transform architecture Hadamard transform integer transform
H.264/AVC 多变换结构 哈达码变换 整数变换
PACS:
TP302
DOI:
10.3969/j.issn.1003-7985.2013.03.009
Abstract:
In order to increase the hardware utilization and minimize the chip area, a multi-transform coding architecture which includes 4×4 forward integer transform, 4×4 inverse integer transform, 4×4 Hadamard transform and 2×2 Hadamard transform is proposed. By simplifying these transforms and exploring their similarities, the proposed design merges the architectures, processing individual transforms into a high-performance multi-transform coding architecture. Using a semiconductor manufacturing international corporation(SMIC)0.18 μm complementary metal oxide semiconductor(CMOS)technology, the proposed architecture achieves the maximum operating clock frequency of 200 MHz and the throughput rate of 800×106 pixel/s with the hardware cost of 3 704 gates. The results demonstrate that the data throughput rate per unit area(DTUA)of this design is at least 40.28% higher than that of the reference design. This design can meet the requirements of real-time decoding digital cinema video(4 096×2 048@30 Hz)at 62.9 MHz, which helps to reduce the power consumption.
为了提高硬件使用率、减少芯片面积, 提出了一种包括4×4正反变换、4×4 哈达码变换及2×2 哈达码变换在内的多变换编码结构.通过将这几种变换算法化简, 并且研究它们之间的相似性, 将处理单个变换的结构合并成一个高性能多变换编码结构.采用SMIC 0.18 μm CMOS工艺, 该结构最高工作频率可达200 MHz, 并在消耗3704门电路的情况下, 达到800×106 pixel/s的吞吐率.结果表明, 该设计的吞吐率和电路规模的比值DTUA比参考设计至少提高了40.28%, 并且在62.9 MHz的频率下支持分辨率为4 096×2 048、刷新频率为30 Hz的视频实时解码需求, 从而有助于降低功耗.

References:

[1] Hong E P, Jung E G, Fraz H, et al. Parallel 4×4 transform architecture based on bit extended arithmetic for H.264/AVC [C]//Proc of International Symposium on Signals, Circuits and Systems. New York, USA, 2005, 1:95-98.
[2] Rubin G. Parallel 4×4 transform on bit serial shared memory architecture for H.264/AVC [C]//Proc of the 16th International Conference on Mixed Design of Integrated Circuits & Systems. Lodz, Poland, 2009:675-680.
[3] Porto R, Bampi M, Agostini S, et al. High throughput architecture for forward transforms of H.264/AVC video coding standard [C]//Proc of the 14th IEEE International Conference on Electronics, Circuits and Systems. Marrakech, Morocco, 2007:150-153.
[4] Wang T C, Huang Y W, Fang H C, et al. Parallel 4×4 2D transform and inverse transform architecture for MPEG-4 AVC/H.264 [C]//Proc of the 2003 International Symposium on Circuits and Systems. Bangkok, Thailand, 2003: Ⅱ-800-Ⅱ-803.
[5] Cao W, Hou H, Lai J M, et al. A high-performance reconfigurable 2-D transform architecture for H.264 [C]//Proc of the 15th IEEE International Conference on Electronics, Circuits and Systems. St. Julien’s, 2008:606-609.
[6] Cao W, Hou H, Lai J M, et al. A novel dynamic reconfigurable VLSI architecture for H.264 transforms [C]//Proc of IEEE Asia Pacific Conference on Circuits and Systems. Macao, China, 2008:1810-1813.
[7] Chen K H, Guo J I, Wang J S. A high-performance direct 2-D transform coding IP design for MPEG-4 AVC/H.264 [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2006, 16(4): 472-483.
[8] Peng C, Yu D, Cao X, et al. A new high throughput VLSI architecture for H.264 transform and quantization [C]//Proc of the 7th International Conference on ASIC. Guilin, China, 2007:950-953.
[9] Hwangbo W, Kim J, Kyung C M. A high-performance 2-D inverse transform architecture for the H.264/AVC decoder[C]//Proc of IEEE International Symposium on Circuits and Systems. New Orleans, LA, USA, 2007:1613-1616.
[10] Hwangbo W, Kyung C M. A multi-transform architecture for H.264/AVC high-profile coders [J]. IEEE Transactions on Multimedia, 2010, 12(3):157-167.
[11] Wang K W, Chen J L, Cao W, et al. A reconfigurable multi-transform VLSI architecture supporting video codec design[J]. IEEE Transactions on Circuits and Systems Ⅱ, 2011, 58(7):432-436.
[12] Huang C Y, Chen L F, Lai Y K. A high-speed 2-D transform architecture with unique kernel for multi-standard video applications [C]//Proc of IEEE International Symposium on Circuits and Systems. Seattle, WA, USA, 2008:21-24.

Memo

Memo:
Biographies: Wang Gang(1977—), male, graduate; Wang Qing(corresponding author), male, doctor, professor, w3398@263. net.
Foundation item: The National Key Technology R& D Program of China during the 12th Five Year Plan Period(No.2013BAJ05B03).
Citation: Wang Gang, Wang Qing, Li Bing, et al. High-performance multi-transform architecture for H.264/AVC[J].Journal of Southeast University(English Edition), 2013, 29(3):276-281.[doi:10.3969/j.issn.1003-7985.2013.03.009]
Last Update: 2013-09-20