IPSJ Transactions on System and LSI Design Methodology

Message from the Editor-in-Chief

Hiroyuki Tomiyama

Article type: Editorial
Subject area: Editorial
2014 Volume 7 Pages 1
Published: 2014
Released on J-STAGE: February 14, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.1

JOURNAL FREE ACCESS

Download PDF (34K)
All-Digital RF Phase-Locked Loops Exploiting Phase Prediction

Jingcheng Zhuang, Robert Bogdan Staszewski

Article type: Invited Paper
Subject area: Analog Circuit Design
2014 Volume 7 Pages 2-15
Published: 2014
Released on J-STAGE: February 14, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.2

JOURNAL FREE ACCESS

Show abstractHide abstract

This paper presents an all-digital phase-locked loop (ADPLL) architecture in a new light that allows it to significantly save power through complexity reduction of its phase locking and detection mechanisms. The predictive nature of the ADPLL to estimate next edge occurrence of the reference clock is exploited here to reduce the timing range and thus complexity of the fractional part of the phase detection mechanism as implemented by a time-to-digital converter (TDC) and to ease the clock retiming circuit. In addition, the integer part, which counts the DCO clock edges, can be disabled to save power once the loop has achieved lock. It can be widely used in fields of fractional-N frequency multiplication and frequency/phase modulation. The presented principles and techniques have been validated through extensive behavioral simulations as well as fabricated IC chips.

View full abstract

Download PDF (1534K)
Design Automation for Digital Microfluidic Biochips

Tsung-Yi Ho

Article type: Invited Paper
Subject area: Emerging Technology
2014 Volume 7 Pages 16-26
Published: 2014
Released on J-STAGE: February 14, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.16

JOURNAL FREE ACCESS

Show abstractHide abstract

Microfluidic biochips are replacing the conventional biochemical analyzers, and are able to integrate on-chip all the basic functions for biochemical analysis. The “digital” microfluidic biochips (DMFBs) are manipulating liquids not as a continuous flow, but as discrete droplets on a two-dimensional array of electrodes. Basic microfluidic operations, such as mixing and dilution, are performed on the array, by routing the corresponding droplets on a series of electrodes. The challenges facing biochips are similar to those faced by microelectronics some decades ago. To meet the challenges of increasing design complexity, computer-aided-design (CAD) tools are being developed for DMFBs. This paper provides an overview of DMFBs and describes emerging CAD tools for the automated synthesis and optimization of DMFB designs, from fluidic-level synthesis and chip-level design to testing. Design automations are expected to alleviate the burden of manual optimization of bioassays, time-consuming chip designs, and costly testing and maintenance procedures. With the assistance of CAD tools, users can concentrate on the development and abstraction of nanoscale bioassays while leaving chip optimization and implementation details to CAD tools.

View full abstract

Download PDF (1623K)
Dynamic Power Consumption Optimization for Inductive-Coupling based Wireless 3D NoCs

Hao Zhang, Hiroki Matsutani, Michihiro Koibuchi, Hideharu Amano

Article type: Regular Paper
Subject area: System-Level Power Optimization
2014 Volume 7 Pages 27-36
Published: 2014
Released on J-STAGE: February 14, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.27

JOURNAL FREE ACCESS

Show abstractHide abstract

Inductive-coupling is yet another 3D integration technique that can be used to stack more than three known-good-dies in a SiP without wire connections. Its power consumed for communication by inductive coupling link is one of big problems. A dynamic on/off link control for topology-agnostic 3D NoC (Network on Chip) architecture using inductive-coupling is proposed. The proposed low-power techniques stop the transistors by cutting off the bias voltage in the transmitter of the wireless vertical links only when their utilization is higher than the threshold. Meanwhile, the whole wireless vertical link will be shut down when the utilization is lower than the threshold in order to reduce the power consumption of wireless 3D NoCs. Full-system many-core simulations using power parameters derived from a real chip implementation show that the proposed low-power techniques reduce the power consumption by 43.8-55.0%, while the average performance overhead is 1.4% in wireless topology-agnostic 3D NoC.

View full abstract

Download PDF (4197K)
Impact of Resource Sharing and Register Retiming on Area and Performance of FPGA-based Designs

Yuko Hara-Azumi, Toshinobu Matsuba, Hiroyuki Tomiyama, Shinya Honda, H ...

Article type: Regular Paper
Subject area: Behavioral Synthesis
2014 Volume 7 Pages 37-45
Published: 2014
Released on J-STAGE: February 14, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.37

JOURNAL FREE ACCESS

Show abstractHide abstract

Due to the increasing diversity and complexity of embedded systems, the use of high-level synthesis (HLS) and that of FPGAs have been both becoming prevalent in order to enhance the design productivity. Although a number of works for FPGA-oriented optimizations, particularly about resource binding, have been studied in HLS, the HLS technologies are still immature since most of them overlook some important facts on resource sharing. In this paper, for FPGA-based designs, we quantitatively evaluate effects of several resource sharing approaches in HLS using practically large benchmarks, on various FPGA devices. Through the comprehensive evaluation, the effects on clock frequency, execution time, area, and multiplexer distribution are examined. Several important discussions and findings will be disclosed, which are essential for further advance of the practical HLS technology.

View full abstract

Download PDF (1228K)
SAT-based Automatic Rectification and Debugging of Combinational Circuits with LUT Insertions

Satoshi Jo, Takeshi Matsumoto, Masahiro Fujita

Article type: Regular Paper
Subject area: Logic-Level Debugging
2014 Volume 7 Pages 46-55
Published: 2014
Released on J-STAGE: February 14, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.46

JOURNAL FREE ACCESS

Show abstractHide abstract

Introducing partial programmability in circuits by replacing some gates with look up tables (LUTs) can be an effective way to improve post-silicon or in-field rectification and debugging. Although finding configurations of LUTs that can correct the circuits can be formulated as a QBF problem, solving it by state-of-the-art QBF solvers is still a hard problem for large circuits and many LUTs. In this paper, we present a rectification and debugging method for combinational circuits with LUTs by repeatedly applying Boolean SAT solvers. The proposed method first finds a candidate of LUT configurations that can correct a given circuit by SAT solvers. Then, it checks the correctness of the candidate by checking equivalence between the circuit with LUTs and its specification. Although this can be solved as SAT problem, we introduce to use commercial equivalence checker to improve the efficiency. If the result of the check is “not equivalent”, an input pattern showing the non-equivalence will be added, and the method repeats with the pattern added. Through the experimental results on ISCAS '85 benchmark circuits and Open RISC 1200 microprocessor design, we show our proposed method can quickly find LUT configurations for large circuits with many LUTs, which cannot be solved by a QBF solver.

View full abstract

Download PDF (730K)
Test and Design-for-Testability Solutions for 3D Integrated Circuits

Krishnendu Chakrabarty, Mukesh Agrawal, Sergej Deutsch, Brandon Noia, ...

Article type: Invited Paper
Subject area: Testing
2014 Volume 7 Pages 56-73
Published: 2014
Released on J-STAGE: August 04, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.56

JOURNAL FREE ACCESS

Show abstractHide abstract

Despite the promise and benefits offered by 3D integration, testing remains a major obstacle that hinders its widespread adoption. Test techniques and design-for-testability (DfT) solutions for 3D ICs are now being studied in the research community, and experts in industry have identified a number of hard problems related to the lack of probe access for wafers, test access in stacked dies, yield enhancement, and new defects arising from unique processing steps. We describe a number of testing and DfT challenges, and present some of the solutions being advocated for these challenges. Techniques highlighted in this paper include: (i) pre-bond testing of TSVs and die logic, including probing and non-invasive test using DfT; (ii) post-bond testing and DfT innovations related to the optimization of die wrappers, test scheduling, and access to dies and inter-die interconnects; (iii) interconnect testing in interposer-based 2.5D ICs; (iv) fault diagnosis and TSV repair; (v) cost modeling and test-flow selection.

View full abstract

Download PDF (2157K)
Energy-efficient High-level Synthesis for HDR Architecture with Multi-stage Clock Gating

Hiroyuki Akasaka, Shin-ya Abe, Masao Yanagisawa, Nozomu Togawa

Article type: Short Paper
Subject area: Behavioral Synthesis
2014 Volume 7 Pages 74-80
Published: 2014
Released on J-STAGE: August 04, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.74

JOURNAL FREE ACCESS

Show abstractHide abstract

With the miniaturization and high performance of current and future LSIs, demand for portable devices has much more increased. Especially the problems of battery runtime and device overheating have occurred. In addition, with the downsize of the LSI design process, the ratio of an interconnection delay to a gate delay has continued to increase. High-level synthesis to estimate the interconnection delays and reduce energy consumption is essential. In this paper, we propose a high-level synthesis algorithm based on HDR architectures (huddle-based distributed register architectures) utilizing multi-stage clock gating. By increasing the number of clock gating stages in each huddle, we increase the number of the control steps at which we can apply the clock gating to registers. We can determine the configuration of the clock gating with optimized energy consumption. The experimental results demonstrate that our proposed algorithm reduced energy consumption by up to 27.7% compared with conventional algorithms.

View full abstract

Download PDF (611K)
A Delay-variation-aware High-level Synthesis Algorithm for RDR Architectures

Yuta Hagio, Masao Yanagisawa, Nozomu Togawa

Article type: Regular Paper
Subject area: Behavioral Synthesis
2014 Volume 7 Pages 81-90
Published: 2014
Released on J-STAGE: August 04, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.81

JOURNAL FREE ACCESS

Show abstractHide abstract

As device feature size drops, interconnection delays often exceed gate delays. We have to incorporate interconnection delays even in high-level synthesis. Using RDR architectures is one of the effective solutions to this problem. At the same time, process and delay variation also becomes a serious problem which may result in several timing errors. How to deal with this problem is another key issue in high-level synthesis. In this paper, we propose a delay-variation-aware high-level synthesis algorithm for RDR architectures. We first obtain a non-delayed scheduling/binding result and, based on it, we also obtain a delayed scheduling/binding result. By adding several extra functional units to vacant RDR islands, we can have a delayed scheduling/binding result so that its latency is not much increased compared with the non-delayed one. After that, we similarize the two scheduling/binding results by repeatedly modifying their results. We can finally realize non-delayed and delayed scheduling/binding results simultaneously on RDR architecture with almost no area/performance overheads and we can select either one of them depending on post-silicon delay variation. Experimental results show that our algorithm successfully reduces delayed scheduling/binding latency by up to 42.9% compared with the conventional approach.

View full abstract

Download PDF (1028K)
Reinforcing Random Testing of Arithmetic Optimization of C Compilers by Scaling up Size and Number of Expressions

Eriko Nagai, Atsushi Hashimoto, Nagisa Ishiura

Article type: Regular Paper
Subject area: Software Testing
2014 Volume 7 Pages 91-100
Published: 2014
Released on J-STAGE: August 04, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.91

JOURNAL FREE ACCESS

Show abstractHide abstract

This paper presents an enhanced method of testing validity of arithmetic optimization of C compilers using randomly generated programs. Its bug detection capability is improved over an existing method by 1) generating longer arithmetic expressions and 2) accommodating multiple expressions in test programs. Undefined behavior in long expressions is successfully eliminated by modifying problematic subexpressions during computation of expected values for the expressions. A new method for including floating point operations into compiler random testing is also proposed. Furthermore, an efficient method for minimizing error inducing test programs is presented, which utilizes binary search. Experimental results show that a random test system based on our method has higher bug detection capability than existing methods; it has detected more bugs than previous method in earlier versions of GCCs and has revealed new bugs in the latest versions of GCCs and LLVMs.

View full abstract

Download PDF (407K)
A Sophisticated Routing Algorithm in 3D NoC with Fixed TSVs for Low Energy and Latency

Xin Jiang, Lian Zeng, Takahiro Watanabe

Article type: Regular Paper
Subject area: System-Level Optimization
2014 Volume 7 Pages 101-109
Published: 2014
Released on J-STAGE: August 04, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.101

JOURNAL FREE ACCESS

Show abstractHide abstract

With rapid progress in Integrated Circuit technologies, Three-Dimensional Network-on-Chips (3DNoCs) have become a promising solution for achieving low latency and low power. Under the constraint of the TSV number used in 3DNoCs, designing a proper routing algorithm with fewer TSVs is a critical problem for network performance improvement. In this work, we design a novel fully adaptive routing algorithm in 3D NoC. The algorithm consists of two parts: one is a vertical node assignment in inter-layer routing, which is a TSV selection scheme in a limited quantity of TSVs in the NoC architecture, and the other is a 2D fully adaptive routing algorithm in intra-layer routing, which combines the optimization of routing distance, network traffic condition and diversity of the path selection. Simulation results show that our proposed routing algorithm can achieve lower latency and energy consumption compared with other traditional routing algorithms.

View full abstract

Download PDF (2042K)
An Area Efficient Regular Expression Matching Engine Using Partial Reconfiguration for Quick Pattern Updating

Yoichi Wakaba, Shin'ichi Wakabayashi, Shinobu Nagayama, Masato Inagi

Article type: Regular Paper
Subject area: Architectural Design
2014 Volume 7 Pages 110-118
Published: 2014
Released on J-STAGE: August 04, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.110

JOURNAL FREE ACCESS

Show abstractHide abstract

This paper proposes a method using partial reconfiguration to realize a compact regular expression matching engine, which can update a pattern quickly. In the proposed method, a set of partial circuits, each of which handles a different class of regular expressions, are provided in advance. When a regular expression pattern is given, a compact matching engine dedicated to the pattern is implemented on FPGA by combining the partial circuits according to the given pattern using partial reconfiguration. The method can update a pattern quickly, since it does not need re-design of a circuit. Experimental results show that the proposed method reduces 60% circuit size compared with the previous method without increasing the pattern updating time significantly.

View full abstract

Download PDF (1618K)
Forwarding Unit Generation for Loop Pipelining in High-level Synthesis

Shingo Kusakabe, Kenshu Seto

Article type: Regular Paper
Subject area: System-Level Synthesis
2014 Volume 7 Pages 119-124
Published: 2014
Released on J-STAGE: August 04, 2014

DOIhttps://doi.org/10.2197/ipsjtsldm.7.119

JOURNAL FREE ACCESS

Show abstractHide abstract

In the loop pipelining of high-level synthesis, the reduction of initiation intervals (IIs) is very important. Existing loop pipelining techniques, however, pessimistically assumes that dependences whose occurrences can be determined only at runtime always occur, resulting in increased IIs. To address this issue, recent work achieves reduced II by a source code transformation which introduces runtime dependence analysis and performs pipeline stalls when the dependences actually occur. Unfortunately, the recent work suffers from the increased execution cycles by frequent pipeline stalls under the frequent occurrences of the dependences. In this paper, we propose a technique to reduce IIs in which data written to memories are also written to registers for such dependences of read-after-write (RAW) type. In our technique, registers which are faster than memories are accessed when the RAW dependences occur. Since the proposed technique achieved the reduction of the execution cycles by 34% with 15% gate count increase on average for three examples compared to the state-of-the-art technique, the proposed technique is effective for synthesizing high-speed circuits with loop pipelining.

View full abstract

Download PDF (523K)

Register with J-STAGE for free!