IEICE Transactions on Electronics
Online ISSN : 1745-1353
Print ISSN : 0916-8524
E92.C 巻, 10 号
選択された号の論文の17件中1~17を表示しています
Special Section on Hardware and Software Technologies on Advanced Microprocessors
  • Kunio UCHIYAMA
    2009 年E92.C 巻10 号 p. 1231
    発行日: 2009/10/01
    公開日: 2009/10/01
    ジャーナル 認証あり
  • Naohiko IRIE, Toshihiro HATTORI
    原稿種別: PAPER
    2009 年E92.C 巻10 号 p. 1232-1239
    発行日: 2009/10/01
    公開日: 2009/10/01
    ジャーナル 認証あり
    SoC has driven the evolution of embedded systems or consumer electronics. Multi-core/multi-IP is the key technology to integrate many functions on a SoC for future embedded applications. In this paper, the transition of SoC and its required functions for cellular phones as an example is described. And the state-of-the-art multi-core technology of homogeneous type and heterogeneous type are shown. When many cores and IPs are integrated on a chip, collaboration between cores and IPs becomes important to meet requirement. To realize it, “MPSoC Platform” concept and elementary technology for this platform is described.
  • Junji MICHIYAMA
    原稿種別: PAPER
    2009 年E92.C 巻10 号 p. 1240-1248
    発行日: 2009/10/01
    公開日: 2009/10/01
    ジャーナル 認証あり
    This paper describes the architecture of an integrated platform developed for improving the development efficiency of system LSIs built into digital consumer electronics equipment such as flat-panel TVs and optical disc recorders. The reason for developing an integrated platform is to improve the development efficiency of system LSIs that serve the principal functions of the said equipment. The key is to build a common interface between each software layer, with the system LSI located at the lowest layer. To make this possible, the hardware architecture of the system LSI is divided into five blocks according to its main functionality. In addition, a middleware layer is placed over the operating system to improve the ease of porting old applications and developing new applications in the higher layer. Based on this platform, a system LSI called UniPhier™ has been developed and used in 156 product families of digital consumer electronics equipment (as of December 2008).
  • Shanq-Jang RUAN, Jui-Yuan HSIEH, Chia-Han LEE
    原稿種別: PAPER
    2009 年E92.C 巻10 号 p. 1249-1257
    発行日: 2009/10/01
    公開日: 2009/10/01
    ジャーナル 認証あり
    This paper presents a gate-block selection algorithm, which can synthesize a proper parameter extractor of the pre-computation-based content-addressable memory (PB-CAM) to enhance power efficiency for specific applications such as embedded systems, microprocessor and SOC, etc. Furthermore, a novel CAM cell design with single bit-line is proposed. The proposed CAM cell design requires only one heavy loading bit-line and merely is constructed with eight transistors. The whole PB-CAM design was described in Spice with TSMC 0.35µm double-poly quadruple-metal CMOS process. We used Synopsys Nanosim to estimate power consumption. With a 128 words by 32bits CAM size, the experimental results showed that our proposed PB-CAM effectively reduces 18.21% of comparison operations in the CAM and saves 16.75% in power reduction by synthesizing a proper parameter extractor of the PB-CAM compared with the 1's count PB-CAM. This implies that our proposed PB-CAM is more flexible and adaptive for specific applications.
  • Shinya KAJIYAMA, Masamichi FUJITO, Hideo KASAI, Makoto MIZUNO, Takanor ...
    原稿種別: PAPER
    2009 年E92.C 巻10 号 p. 1258-1264
    発行日: 2009/10/01
    公開日: 2009/10/01
    ジャーナル 認証あり
    A novel 300MHz embedded flash memory for dual-core microcontrollers with a shared ROM architecture is proposed. One of its features is a three-stage pipeline read operation, which enables reduced access pitch and therefore reduces performance penalty due to conflict of shared ROM accesses. Another feature is a highly sensitive sense amplifier that achieves efficient pipeline operation with two-cycle latency one-cycle pitch as a result of a shortened sense time of 0.63ns. The combination of the pipeline architecture and proposed sense amplifiers significantly reduces access-conflict penalties with shared ROM and enhances performance of 32-bit RISC dual-core microcontrollers by 30%.
  • Pradeep RAO, Kazuaki MURAKAMI
    原稿種別: PAPER
    2009 年E92.C 巻10 号 p. 1265-1275
    発行日: 2009/10/01
    公開日: 2009/10/01
    ジャーナル 認証あり
    Despite the prevalence of Java workloads across a variety of processor architectures, there is very little published data on the impact of the various processor design decisions on Java performance. We attribute the lack of data to the large design space resulting from the complexity of the modern superscalar processor and the additional complexities associated with executing Java bytecode using a virtual machine. To address this shortcoming, we use a statistically rigorous methodology to systematically quantify the the impact of the various processor microarchitecture parameters on Java execution performance. The adopted methodology enables efficient screening of significant factor effects in a large design space consisting of 35 factors (32-billion potential configurations) using merely 72 observations per benchmark application. We quantify and tabulate the significance of each of the 35 factors for 13 benchmark applications. While these tables provide various insights into Java performance, they consistently highlight the performance significance of the instruction delivery mechanism, especially the instruction cache and the ITLB design parameters. Furthermore, these tables enable the architect to identify processor bottlenecks for Java workloads by providing an estimate of the relative impact of various design decisions.
  • Yue QIAN, Zhonghai LU, Wenhua DOU, Qiang DOU
    原稿種別: PAPER
    2009 年E92.C 巻10 号 p. 1276-1283
    発行日: 2009/10/01
    公開日: 2009/10/01
    ジャーナル 認証あり
    Credit-based router-to-router flow control is one main link-level flow control mechanism proposed for Networks on Chip (NoCs). Based on network calculus, we analyze its performance and optimal buffer size. To model the feedback control behavior due to credits, we introduce a virtual network service element called flow controller. Then we derive its service curve, and further the system service curve. In addition, we give and prove a theorem that determines the optimal buffer size guaranteeing the maximum system service curve. Moreover, assuming the latency-rate server model for routers, we give closed-form formulas to calculate the flit delay bound and optimal buffer size. Our experiments with real on-chip traffic traces validate that our analysis is correct; delay bounds are tight and the optimal buffer size is exact.
  • Chongyong YIN, Shouyi YIN, Leibo LIU, Shaojun WEI
    原稿種別: BRIEF PAPER
    2009 年E92.C 巻10 号 p. 1284-1290
    発行日: 2009/10/01
    公開日: 2009/10/01
    ジャーナル 認証あり
    Compiler is the most important supporting tool to facilitate the use of reconfigurable computing architecture (RCA). In this paper, a template-based compiler framework is proposed. This compiler can synthesize the executables for RCA from native high-level programming language source code directly. It supports to generate run-time dynamic configuration context. And it is capable to generate both full configuration context and partial configuration context. Experimental results show that the executables generated by the proposed compiler can achieve better execution performance and smaller configuration context size than previous compilers. Moreover, this compiler does not require the programmer to have any extra knowledge about the hardware architecture of RCA.
Regular Section
feedback
Top