Improving Small-Delay Fault Coverage of On-Chip Delay Measurement by Segmented Scan and Test Point Insertion*

Wenpo ZHANG†a), Student Member, Kazuteru NAMBA†, Member, and Hideo ITO†, Fellow

SUMMARY With IC design entering the nanometer scale integration, the reliability of VLSI has declined due to small-delay defects, which are hard to detect by traditional delay fault testing. To detect small-delay defects, on-chip delay measurement, which measures the delay time of paths in the circuit under test (CUT), was proposed. However, our pre-simulation results show that when using on-chip delay measurement method to detect small-delay defects, test generation under the single-path sensitization is required. This constraint makes the fault coverage very low. To improve fault coverage, this paper introduces techniques which use segmented scan and test point insertion (TPI). Evaluation results indicate that we can get an acceptable fault coverage, by combining these techniques for launch off shift (LOS) testing under the single-path sensitization condition. Specifically, fault coverage is improved 27.02∼47.74% with 6.33∼12.35% of hardware overhead.

key words: small-delay defects, fault coverage, segmented scan, control point, observation point

1. Introduction

As technology scales to 45nm and below, semiconductor device scaling has significantly improved performance and circuit integration density. With the increasing speed of integrated circuits, violations of the performance specifications are becoming a major factor affecting the product-quality level [1]. Delay defects that degrade performance and cause timing related failures are emerging as a major problem in nanometer technologies. Several delay fault models and delay test methods have been proposed. Transition fault and path delay fault are two prevalent fault models [2]. With the growing complexity of designs, scan-based techniques of testing are becoming very popular. However, it is inherently limited by the accuracy of the provided test clock frequency with the external automatic test equipment (ATE), which can be affected by factors such as parasitic capacitance, resistance of probe and tester skew [3].

Small-delay defects are known to degrade in operation and cause early life failure. A small-delay defect has a defect size that is not large enough to cause a timing failure under the system clock cycle. Small-delay defects represent a significant reliability concern when resistive defects are present in a technology. For example, a resistive open caused by a minimally connecting via can become a complete open in operation due to metal migration. Recently, increasing random process variations contribute to significant timing variability, which is indistinguishable from the effects of small-delay defects. Such variations can be beyond the performance variations caused by resistive small-delay defects. Therefore, these process variations need to be detected to improve the reliability of chips [4]. Since they might escape detection during traditional Pass-Fail delay fault testing with functional clock, small-delay defects have become a significant problem and it is essential to detect such defects during manufacturing tests [5]. Such manufacturing flaws have traditionally been eliminated through burn-in stress testing. Burn-in fallout can be as high as 0.5-1% (5,000-10,000 defect parts per million (DPPM) for large complex die, making stress testing essential for high end parts such as microprocessors. Unfortunately, traditional burn-in is becoming extremely expensive for delicate nanometer technologies; it also appears to be losing effectiveness in accelerating certain types of early life failures [6]. As a result, industry is looking for alternate low-cost methods for eliminating latent defects [7], [8].

To screen small-delay fault, small-delay defect screening with criteria based on statistical analysis is proposed [8]. In this technique, small-delay defects are detected as outliers. Delay distributions for each path can be obtained by measuring many chips of the same design. If a path delay time is beyond a specified time such as the three-sigma limit (users can set the specified time freely by taking into consideration the trade-off between yield and dependability), even if it is not beyond the system clock cycle, we regard it as a faulty path. Some previous works presented on-chip path delay time measurements based on this strategy [9]–[14]. By measuring delay time of the path under measurement (PUM), not only the gross and small-delay faults can be detected but also the amount of timing violation in the failing paths can be obtained under certain environment conditions [15], [16].

This paper points out a considerable issue of testing using on-chip path delay measurement: in the measurement, PUMs are sensitized by delay fault test patterns. However, this paper reveals that the robust test patterns are not suitable for on-chip delay measurement. Specifically, they require test generation under the single-path sensitization condition, which causes its small-delay fault coverage to be very low. Thus, a method improving fault coverage is strongly required. This paper uses the transition fault coverage as the
small-delay fault coverage because the aim of this paper is to detect increases of gate and line delays caused by resistive faults and other reasons (for example process variation) to reduce early life-failure.

This paper gives evidence that, for improving small-delay fault coverage of on-chip delay measurement, the use of segmented scan and test point insertion (TPI) is efficient.

The rest of the paper is organized as follows. Section 2 introduces some terminology and related works, including on-chip delay measurement. Section 3 analyzes the constraint of single-path sensitization and the reasons for low small-delay fault coverage. Section 4 explains methods for improving fault coverage. Section 5 introduces the introduced methods. Finally, Sect. 6 concludes the paper.

2. Preliminaries

This section explains some terminology related to scan-based delay testing. We also introduce some related works for small-delay testing and the on-chip delay measurement method.

2.1 Transition Fault and Path Delay Fault

Transition fault and path delay fault are two prevalent fault models. The transition fault model targets each gate output in the design for a slow-to-rise and slow-to-fall delay fault while the path delay fault model targets the cumulative delay through the entire list of gates in a pre-defined path. Transition fault model is more widely used than path delay fault because it tests for at-speed failures at all nets in the design and the total fault list is equal to twice the number of nets. On the other hand, there are billions of paths in a modern design to be tested for path delay fault leading to high analysis effort [7]. This paper uses the transition fault coverage as the small-delay fault coverage because the aim of this paper is to detect increases of gate and line delays caused by resistive faults to reduce early life-failure. The transition fault is detected if a transition occurs at the fault site and if a sensitized path extends from the fault site to a primary output.

Transition faults are detected if a transition occurs at the fault site and if a sensitized path extends from the fault site to any primary output. From this reason, a path through the fault should be sensitized for testing the transition delay fault. In path delay fault testing, a signal is an on-input of path $p$ if it is on path $p$. If a gate $g$ is on path $p$ and an input line of the gate $g$ is not on $p$, the line is called an off-input of $p$ [1],[17]. A logic value is the controlling value to a gate if it determines the output value of the gate regardless of the values on the other inputs to the gate. If the value on some input is the complement of the controlling value, the input is said to have a non-controlling value. Typically, there are three classes of path delay faults according to the sensitization criteria: single-path sensitization, robust sensitization and non-robust sensitization. Consider an AND gate with two inputs, this gate is a part of a target path $p$ with one input as the on-input and the other input as the off-input for $p$. Table 1 shows the sensitization versus pairs of initial and final values in on and off-input of an AND gate. Symbol $S_0$ ($S_1$) represents a stable 0 (1) value on some signal under the initial value and final value. Symbol $T_0$ ($T_1$) represents a 1 to 0 (0 to 1) transition. $H_0$ ($H_1$) represents a 0 (1) value that might have hazard.

<table>
<thead>
<tr>
<th>Sensitization</th>
<th>On-input</th>
<th>Off-input</th>
</tr>
</thead>
<tbody>
<tr>
<td>Single path sensitization</td>
<td>$T_1$</td>
<td>$S_1$</td>
</tr>
<tr>
<td>Robust</td>
<td>$T_0$</td>
<td>$S_1$</td>
</tr>
<tr>
<td>Non-robust</td>
<td>$T_1$ or $H_1$</td>
<td>$S_1$, $T_1$ or $H_1$</td>
</tr>
<tr>
<td></td>
<td>$T_0$ or $H_0$</td>
<td>$S_1$, $T_1$ or $H_1$</td>
</tr>
</tbody>
</table>

2.2 Related Works

Recently, various methods for small-delay defect detection have been proposed. Methods using faster-than-at-speed have been proposed to detect delay faults [18],[19]. These methods use multiple clock frequencies that are higher than the system clock. These methods have a drawback that the test time is very long and it causes the test cost high. To detect small-delay faults, methods with delay fault testing using a ring oscillator have been proposed [20]–[22]. In these, the PUM is made a part of ring oscillator, delay of the target path can be translated into the oscillation period. However, the timing resolution is not very good. Some time-to-voltage converter (TVC) based schemes have been proposed [23]–[25]. The delay of the PUM is converted to a certain voltage, and by comparing the converted voltage with the reference voltage, the delay of the PUM can be obtained. These techniques give good timing resolution. However, the calibration is difficult.

Some on-chip path delay time measurement methods using embedded delay measurement were proposed [9]–[14]. In these, delay times of paths are measured. Datta et al. proposed a modified vernier delay line (VDL) technique for path delay measurement [9]. High-resolution delay measurement capability can be achieved by using this method. The paper [10] presented modified boundary scan cells in which a time-to-digital converter (TDC) is embedded. Tsai et al. proposed a built-in delay measurement (BIDM) circuit consisting of coarse and fine blocks, which is an extension of the modified VDL technique. A built-in-self delay testing methodology based on BIDM and self-calibration methods can be developed [11]. Pei et al. also proposed an area-efficient version of the modified VDL [12]. The feature of this method is delay range of each stage. The delay ranges increase by a factor of two gradually, which reduces the required stages. Thus, without decreasing delay measurement resolution, this method expands delay measurement range much more easily with significantly less hardware overhead. The authors’ group proposed a measurement system, which
is different from the VDL method, to improve the accuracy of the measured value [13]. This method measures delay times of two paths: a path which includes the PUM, and the extra path whose length is almost equal to the redundant line of the path, is measured previously. The difference between the delay times gives the delay of the PUM. This method is able to give a precise measurement. In addition, a method with smaller execution time and circuit area has been proposed [14].

2.3 On-Chip Delay Measurement Method

Figure 1 shows the architecture of the on-chip path delay measurement method of [14]. The on-chip delay measurement system measures the delay of each path including a PUM; the input (output) of the PUM is the output (input) of a flip-flop (FF) in the left (right) scan chain. The delay measurement system consists of delay value measurement circuit (DVMC), stop signal generator (SSG, which is an N-to-1 multiplexer), and circuit under test (CUT). The embedded delay measurement circuit DVMC has two input lines, start line and stop line. The DVMC, which is a class of TDC, consists of a delay chain and an n-bit up counter. DVMC measures the time difference between the transition signals sent to start line and stop line. The clock line of the CUT clk is directly connected to start of the DVMC; the transition of start triggers the measurement. The DVMC starts the measurement when a positive transition of clk is sent to start. The input line of the FF, in the right scan chain, ssgin, is connected to the SSG; the SSG detects the transition at ssgin, and sends the transition to stop of the DVMC, by setting the corresponding control data of SSG. The input line clk is the clock signal of CUT. The line clk, is the clock line of FF. The input of FF is connected to ssgout through ssgin, and the SSG. The system measures a path including one clock line clk, a PUM pi, and some redundant lines ssgin, and ssgout. For example, after the measurement of the path p = clk-pi-ssgin-ssgout, by comparing the measured delay time with the expected delay time, small-delay defects on clk, and pi, can be detected [14]. In this paper, we insert one DVMC circuit in one CUT. Thus, only one path is selected to be measured for each test.

Before the measurement, a test pattern (which sensitizes pi) is assigned to the primary inputs and FFs of CUT. SSG is controlled to send the transition propagating to the output of pi, to ssgout. Then, we start the measurement by launching a positive transition to start. The transition propagates to the start of DVMC after the rising edge of clk, and DVMC starts the measurement. At the moment the clock rising edge reaches FF, a transition is launched to pi. The transition reaches stop of DVMC through pi, ssgin, and ssgout. Then, DVMC stops the measurement. The measured delay time of p contains the delay of redundant lines ssgin, and ssgout. By using the criterion of [8], we can detect small-delay defects occurring on pi and the segments of clock trees. This method has a good measurement resolution enough to detect defects even if the path delay is short [13]. However, there is an important point for small-delay fault test. When the intended transition reaches the stop of DVMC through SSG, DVMC stops the measurement. If the transition on the off-input of a PUM affects measured delay time, an incorrect measurement result will be recorded. The incorrect measurement result leads to false error indications or test escapes.

3. Constraint of Single-Path Sensitization

In this section, we introduce the necessity of single-path sensitization in on-chip delay measurement using an example. We give some simulation results to prove that the robust test is not appropriate for on-chip delay measurement. We also analyze the reasons for low small-delay fault coverage.

3.1 Necessity of Single-Path Sensitization

Unlike the traditional delay fault test, for the on-chip delay measurement method we must consider the single-path sensitization condition. As already known, in traditional delay fault test, we have to test sensitizing PUM robustly and not non-robustly to detect delay faults on PUM regardless of the delay time to off-inputs. The same is true for the delay measurement method. However, this is not sufficient, which is explained below using the example of Fig. 2 and the simulation data in Figs. 3 and 4.

In Fig. 2, PUM is the path p1-p2-G-p4, denoted with a bold line. Assume that all sub-paths p1, p2 and p4 are sensitized, and the values of on-input Ion and the off-input Ioff of the AND gate G are T1. From Table 1, PUM is robustly tested, but is not single-path sensitized. Here, we
set $T_{ON} = t_1 + t_2 + t_{ON} + t_4$ is the delay time of the PUM, and $T_{OFF} = t_1 + t_3 + t_{OFF} + t_4$ is the delay time of the path $p_1-p_3-G-p_4$, where $t_i$ is the delay times of $p_i$ and, $t_{ON}$ and $t_{OFF}$ are the gate delay of $G$ from $T_{ON}$ and $T_{OFF}$. By measuring the delay time from the input to the output, we obtain the following time:

$$T = \max(T_{ON}, T_{OFF}).$$

If the expected delay time of $t_3$ is longer than $t_2$, we need to set the expected path delay to $T_{OFF} + 3\sigma$. In other words, we should compare $T$ with $T_{OFF} + 3\sigma$. We cannot detect a resistive fault on $p_2$ with statistical analysis of $T$, unless the PUM has a sufficiently delay fault such that $T_{ON} \geq T_{OFF} + 3\sigma$. It is likely to bring fault escapes in manufacturing testing, as a PUM typically has plenty of off-inputs. Else if the expected delay time of $t_2$ is longer than $t_3$, we need to set the expected path delay to $T_{ON} + 3\sigma$. We can detect small-delay defects on the PUM, unless the off-input has a delay fault that causes $T_{OFF} \geq T_{ON} + 3\sigma$. It is noted that in both of the two cases, we cannot detect small-delay defects on the PUM when $T_{OFF} > T_{ON}$. Even there has just one off-input of the PUM makes $t_3$ longer than $t_2$.

To prove that the robust test is not appropriate for on-chip delay measurement, we show the relationship of $T_{OFF}$ and $T_{ON}$ using some experimental data. We use the test set (robust but not single-path sensitization) of s5378 and s9234; Figures 3 and 4 show results of s5378 and s9234, respectively. In these figures the x axis shows the delay time difference of $T_{OFF}$ and $T_{ON}$ using number of inverter, and the y axis shows the numbers of faults. The results show that in most cases (over than 90%) $T_{OFF} > T_{ON}$. It causes incorrect measurement result, and defects can be masked. These results demonstrate that, in the actual testing, it is difficult to ensure that the transition of on-input earlier than the transitions of all off-inputs. In other words, it is difficult to detect small-delay defects on the PUM under robust but not single-path sensitization. To guarantee that faults on the PUM are detected, we must make the measured time include $t_2$ regardless of $t_3$. We can satisfy this by using the single-path sensitization condition, setting $S_1$ to $T_{OFF}$.

### 3.2 Small-Delay Fault Coverage

Because of the constraint of single-path sensitization, small-delay fault coverage of the test method using on-chip delay measurement is very low. Table 2 shows the small-delay fault coverage of ISCAS89 benchmark circuits of LOS test by using on-chip delay measurement. For example fault coverage of s38584 is less than 31%. From this, a method for improving small-delay fault coverage using on-chip delay measurement is strongly required.

#### 4. Techniques for Improving Fault Coverage

This section introduces some techniques for improving fault coverage. These techniques are available for improving small-delay fault coverage of on-chip delay measurement. For higher fault coverage with an acceptable area overhead, we propose a procedure for using these techniques at the same time. The proposed method is a class of DFTs (designs for test), which facilitate testing. Although DFT increase area resulting in increase in the probability of defects, detecting small-delay faults with the DFT accomplishes shipment of dependable chips, which are free from manufacturing defects bringing about early-die.

#### 4.1 Segmented Scan

Segmented scan is one of the techniques for improving the fault coverage [26]. Consider the part of a sequential circuit shown in Fig. 5, assume that the FFs are connected in the order of their numerical indices. Assume that a slow-to-fall small-delay fault occurs on line $a$. Under the single-path sensitization condition, the path (which includes the line $a$) cannot be sensitized since the initialization condition $FF_1 = FF_2 = 1$ implies $FF_3 = 1$ by 1 bit shift during the launch cycle. Thus, the off-input of the OR gate (line $c$) is set to controlling value, and the fault is blocked from being propagated to $FF_3$ during the capture cycle. Consider again, assume that the scan chain is partitioned into two segments.
and each segment is connected to independent scan enable signals SE1 and SE2 as shown in Fig. 6. The initialization pattern \((FF_1, FF_2, FF_3, FF_4) = (1, 1, 0, X)\) (X = do not care) is scanned in with both the scan enables SE1 and SE2 set 1. In the next cycle, we set SE1 = 1, SE2 = 0, then FF3 contains the value of “0” instead of set to “1”. Therefore, the fault effect is propagated to FF4 and single-path sensitization is achieved. The example demonstrates that the path cannot be sensitized under the single-path sensitization condition. However, it can be achieved by using two scan segments. Thus, segmented scan technique can be utilized to improve small-delay fault coverage of on-chip delay measurement method.

4.2 Test Point Insertion

Test point insertion (TPI) for improving fault coverage is popular on design. There are two types of TPI methods, namely observation point insertion and control point insertion [27]. As shown in Fig. 7, observation point insertion involves making a node observable by making it a primary output or connecting it to the SSG. Control point insertion involves a selector with an activation signal where the activation signal just like other scan FFs. In this section, we investigate a strategy for maximizing the fault coverage from a small number of observation points and control points, respectively.

(1) Observation Point Insertion

Consider the sequential circuit shown in Fig. 5, assume that a slow-to-fall small-delay fault occurs on line \(b\). Under the single-path sensitization condition, the path (which includes this fault) cannot be sensitized since the initialization condition FF1 = FF2 = 1 implies FF3 = 1 by 1 bit shift during the launch cycle. Thus, the off-input of the OR gate (line \(c\)) is set to controlling value, and the fault is blocked from being propagated to FF3 during the capture cycle. Here we insert an observation point after gate B (on line \(a\)) to observe the fault. We just need to ensure the values of FF1 and FF2 are (1, 1) and (0, 1) in the initialization pattern and launch pattern, respectively. By inserting the observation point, the transition on line \(b\) can reach stop of the DVMC, the sub-path (which includes the fault) is sensitized under the single-path sensitization condition. Thus, by comparing the measured delay time with the expected delay time, the transition fault on line \(b\) can be detected with criteria based on statistical analysis.

We place observation points in order to detect faults that are not detected by LOS test under the single-path sensitization condition. For keeping the discussion general, we assume a set of all faults in one circuit denoted \(F\), and the set of faults, which are detected by LOS under the single-path sensitization condition, denoted \(F_{\text{SINGLE}}\). We denote the set of target faults as \(U = F - F_{\text{SINGLE}}\).

We determine the placement of observation points for \(U\) as follows. For illustration, we show an example of part of a CUT in Fig. 8. Assume that there are a fault on line \(G_1\) and a fault on line \(G_2\). Target to the fault \(G_1\), because the off-input is controlling value, the fault is blocked at Gate 4 and Gate 5. Thus, the path including the fault \(G_1\) cannot

![Fig. 5 Example circuit.](image)

![Fig. 6 Segmented scan.](image)

![Fig. 7 Example of the test point insertion.](image)
be sensitized under the single-path sensitization condition. The fault effects to lines $G_2$, $G_3$, $G_4$ and $G_5$. An observation point on any one of these lines allows the fault to be detected under the single-path sensitization condition. However, observation points on some of these lines may allow other faults to be detected. For example, the fault on line $G_2$ can be detected by inserting an observation point on line $G_4$. Thus, we should find the line which allows the most faults to be detected.

If we insert a test point on a line $l$ that lies on a critical path, then inserting a test point on $l$ may degrade performance. In order to prevent any possible performance degradation due to this, we do not insert any test point into signal lines on a critical path. Before the test point insertion procedure, we identify all signal lines that lie on a critical path. We delete these signal lines from the potential test point set.

We select a minimal subset of observation points applied for $U$ by using a greedy covering procedure. The procedure for observation point insertion is given next as Procedure 1. To achieve higher fault coverage with an acceptable hardware overhead, the number of inserted observation point ($N_O$) is decided by results of hundreds of pre-simulation tests.

**Procedure 1: Observation Point Insertion**

1. Let $U$ be the set of target faults. Let $OB$ be an empty set, it is the set of lines used to insert observation points.
2. For every line $g_i$, let $OBS(g_i)$ be an empty fault set. Find the set of faults $OBS(g_i)$ such that can be detected with an observation point inserted on the line $g_i$.
3. Select a line $g_j$ such that $OBS(g_j)$ has the largest number of faults $tf_j \in U$.
4. Add the line $g_j$ to $OB$. Remove faults $tf_j \in OBS(g_j)$ from $U$.
5. If $U = \emptyset$, or the number of observation points $= \text{pre-set value } N_O$, stop; else go to Step 3.

(2) Control Point Insertion

Consider the circuit shown in Fig. 5, assume that a slow-to-fall small-delay fault occurs on line $a$. Under the single-path sensitization condition, the path (which includes this fault) cannot be sensitized since the initialization condition $FF_1 = FF_2 = 1$ implies $FF_3 = 1$ by 1 bit shift during the launch cycle. Thus, the off-input of the OR gate (line $c$) is set to controlling value, and the fault is blocked from being propagated to $FF_4$ during the capture cycle. Here we insert a control point after gate $A$ (on line $c$). Under the single-path sensitization condition, to detect the slow-to-fall delay fault of line $a$, we just need to set the value of the of-input of OR gate (line $c$) as non-controlling value ($S_0$) in both the initialization pattern and launch pattern.

We place control points in order to detect faults that are not detected by LOS test under the single-path sensitization condition. We select a minimal subset of control points by using a greedy covering procedure. As the same with observation point insertion, to achieve higher fault coverage with minimal control point, we should consider the number of faults that can be detected by one control point insertion. In other words, control point which detects the largest number of faults will be first inserted. In order to prevent any possible performance degradation, we do not insert any test point into signal lines that are on a critical path. The procedure for control point insertion is given next as Procedure 2. To achieve higher fault coverage with an acceptable hardware overhead, the number of inserted observation point ($N_C$) is decided by results of hundreds of pre-simulation tests.

**Procedure 2: Control Point Insertion**

1. Let $U$ be the set of target faults. Let $CO$ be an empty set, it is the set of lines used to insert control points.
2. For every line $g_i$, let $COS(g_i)$ be an empty fault set. Find the set of faults $COS(g_i)$ such that can be detected with a control point inserted on the line $g_i$.
3. Select a line $g_j$ such that $COS(g_j)$ has the largest number of faults $tf_j \in U$.
4. Add the line $g_j$ to $CO$. Remove faults $tf_j \in COS(g_j)$ from $U$.
5. If $U = \emptyset$, or the number of control points $= \text{pre-set value } N_C$, stop; else go to Step 3.

(3) Procedure for Segmented Scan and Test Point Insertion

Based on Procedure 1 and Procedure 2, test points are inserted according to the number of faults that can be detected. In other words, test point which detects the largest number of faults will be first inserted. Thus, after a number of test points inserted, the effect for the coverage improvement will be not very notable. Hence, by using only one of the introduced techniques, we may not be able to get an ideal fault coverage under the single-path sensitization condition. For higher fault coverage, we use the segmented scan and test point insertion at the same time. The procedure for using all the introduced techniques is given next as Procedure 3. The values of $N_S$, $N_C$ and $N_O$ are decided by results of hundreds of pre-simulation tests. We can use these values to get a higher coverage with an acceptable hardware overhead.

**Procedure 3: Segmented scan and test point insertion**

1. Let $L_0$ be the length of CUT’s scan chain, $U$ be the set
of undetectable faults under the single-path sensitization condition, and \( N_F \) be the number of scan segments. We divide the scan chain to \( N_S \) segments, the length \( L \) of these segments is calculated by \( L = L_0/N_S \). Each segment is controlled by a corresponding scan enable signal. After this step, let \( U = U - P_S \), where \( P_S \) is the set of new detectable faults.

2. Let \( N_C \) be the number of control point will be inserted. Insert these control points using procedure 2.

3. Let \( N_O \) be the number of observation points will be inserted. Insert these observation points using procedure 1.

This procedure inserts control points before observation point insertions. This is because that the effect of the control point is better than the observation point in term of area overhead; the results of the pre-simulation tests which confirm this fact will be shown in the next section (Fig. 12).

5. Evaluation

In this section, we study the effects of these introduced techniques on the set of faults undetectable by LOS test under the single-path sensitization condition. The corresponding hardware overhead also will be evaluated. In this evaluation, we use ISCAS89 benchmark circuits. The results of cell area are obtained by synthesis with Synopsys design compiler using Rohm 180 \( \mu m \) process [28]. We also reported the maximal core utilization after layout with Synopsys IC Compiler. The maximal core utilization means the maximal value of core utilization that the IC Compiler can make layout (placement and routing) without errors. The chip area depends on both the maximal core utilization and cell area. The smaller value of maximal core utilization means larger chip area resulting from more complex routing. Figures 9–11 show the relation between maximal core utilization (in s5378 and s9234) and the numbers of scan segments, inserted observation points and inserted control points, respectively. These figures are explained later. The test patterns are generated with in-house ATPG based on what is used in [14]. First, we evaluate the effect of segmented scan. Next, we evaluate the effect of test point insertion (observation point insertion and control point insertion, respectively). We also compared the effects of control point insertion and observation point insertion for the pre-simulation tests to explain why the proposed procedure inserts control points before observation point insertions (as noted in the last paragraph of the previous section). For higher fault coverage, we evaluate the effect of segmented scan and test point insertion. Tables 3–6 show the evaluation results. In these Tables, the column Circuit shows the circuit name. The columns [14] and Proposed show the evaluation results of the method of [14] and the methods using introduced fault coverage improving techniques. The column \( N_F \) gives the number of the test pattern pairs (As only one path is selected to be measured for each test pattern pair, \( N_F \) also gives the number of measurements). Columns \( C_0(\%) \) and \( C_1(\%) \) report the fault coverage of [14] and methods using introduced fault coverage improving techniques, respectively. Columns \( S_0(\text{mm}^2) \) and \( S_1(\text{mm}^2) \) report the cell area of [14] and methods using introduced fault coverage improving techniques, respectively. The column \( C_{\text{uni}} \) reports the maximal core utilization after layout. Column \( N_S \), \( N_C \) and \( N_O \) show the numbers of scan segments, inserted control points and inserted observation points, respectively. The column \( V \) shows the test data volume in \( 10^5 \) bit. The column \( T \) shows the test application time in \( 10^5 \) clocks. The column \( C_{\text{IMP}}(\%) \) reports the effect of fault coverage improvement by using fault coverage improving techniques. The column \( AO(\%) \) reports the area overhead, which is calculated by \( AO = (S_1 - S_0)/S_0 \times 100(\%) \).

In addition, to achieve a still higher fault coverage with the same overall hardware overhead, we implemented
the proposed procedure several times with the same overall hardware overhead (by changing the area ratio of control point insertion and observation point insertion in the same overall hardware overhead). Figures 13 and 14 show the results of s5378 and s9234.

5.1 Effect of Segmented Scan

We improved the fault coverage 12.67~22.31% by only using 8 scan segments. However, with the increasing of scan segment’s numbers, the effect for the coverage improvement is not very notable for some circuits. For example, as shown in Table 3 in the circuit s35932, we used 8 scan segments, the coverage improvement was 27.55%. However, when we used 64 scan segments, the fault coverage is just improved 0.64% compared to 8 scan segments. For higher fault coverage, we must consider to use other techniques. Figure 9 shows that the maximal core utilization became smaller with increasing the number of scan segments. It means that increasing the number of scan segments increases the number of redundant lines and makes routing more difficult. As a result, it causes the larger total area.

5.2 Effect of Test Point Insertion

We improved the fault coverage with 4.83~22.06% by inserting 100~400 observation points. The fault coverage can be improved with 8.21~38.29% by inserting 40~100 con-

<table>
<thead>
<tr>
<th>Table 3</th>
<th>Effect of segmented scan.</th>
</tr>
</thead>
<tbody>
<tr>
<td>Circuit</td>
<td>Proposed</td>
</tr>
<tr>
<td></td>
<td>N₇</td>
</tr>
<tr>
<td>s5378</td>
<td>160</td>
</tr>
<tr>
<td>s9234</td>
<td>199</td>
</tr>
<tr>
<td>s13207</td>
<td>330</td>
</tr>
<tr>
<td>s38584</td>
<td>3778</td>
</tr>
<tr>
<td>s35932</td>
<td>3213</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Table 4</th>
<th>Effect of observation point insertion.</th>
</tr>
</thead>
<tbody>
<tr>
<td>Circuit</td>
<td>Proposed</td>
</tr>
<tr>
<td></td>
<td>N₇</td>
</tr>
<tr>
<td>s5378</td>
<td>160</td>
</tr>
<tr>
<td>s9234</td>
<td>199</td>
</tr>
<tr>
<td>s13207</td>
<td>330</td>
</tr>
<tr>
<td>s38584</td>
<td>3778</td>
</tr>
<tr>
<td>s35932</td>
<td>3213</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Table 5</th>
<th>Effect of control point insertion.</th>
</tr>
</thead>
<tbody>
<tr>
<td>Circuit</td>
<td>Proposed</td>
</tr>
<tr>
<td></td>
<td>N₇</td>
</tr>
<tr>
<td>s5378</td>
<td>160</td>
</tr>
<tr>
<td>s5378</td>
<td>160</td>
</tr>
<tr>
<td>s9234</td>
<td>199</td>
</tr>
<tr>
<td>s13207</td>
<td>330</td>
</tr>
<tr>
<td>s38584</td>
<td>3778</td>
</tr>
<tr>
<td>s35932</td>
<td>3213</td>
</tr>
</tbody>
</table>

<table>
<thead>
<tr>
<th>Table 6</th>
<th>Effect of segmented scan and test point insertion.</th>
</tr>
</thead>
<tbody>
<tr>
<td>Circuit</td>
<td>Proposed</td>
</tr>
<tr>
<td></td>
<td>N₇</td>
</tr>
<tr>
<td>s5378</td>
<td>160</td>
</tr>
<tr>
<td>s9234</td>
<td>199</td>
</tr>
<tr>
<td>s13207</td>
<td>330</td>
</tr>
<tr>
<td>s38584</td>
<td>3778</td>
</tr>
<tr>
<td>s35932</td>
<td>3213</td>
</tr>
</tbody>
</table>
can get similar results in control point insertion in Table 5. We found that the decrease is smaller in large circuits when we insert the same (or more in large circuit) number of test points. This means that the increases of the redundant lines and difficulty in routing caused by our DFT design are small in large circuits. In other words, the effect on layout of our proposed method is less serious in large circuits.

To decide whether observation point insertion follows control point insertion, we also compared the effects of control point insertion and observation point insertion with the same hardware overhead. Figure 12 presents the results of s13207. In Fig. 12, the y axis and the x axis show the fault coverage(%) and the hardware overhead(%). Here, we set the hardware overhead of test point insertion from 1% to 10%. From the experiment result, we found that the effect of the control point insertion is better than the observation point. For example, the fault coverage is improved more than 36% by using control point insertion with only 3% hardware overhead, while the improvement of the fault coverage is less than 6% by using observation point insertion with the same hardware overhead. Therefore, control points are inserted before observation points in Procedure 3.

5.3 Effect of Segmented Scan and Test Point Insertion

To achieve higher fault coverage with an acceptable hardware overhead, we use segmented scan, observation point insertion and control point insertion at the same time by using procedure 3. The evaluation results are shown in Table 6. As shown in the results, we got an acceptable fault coverage by using these techniques at the same time. Fault coverage can be improved 27.02∼47.74% with 6.33∼12.35% of hardware overhead.

5.4 Effective Fault Coverage by the Same Overall Hardware Overhead

To achieve a still higher fault coverage with the same overall hardware overhead, we implemented the proposed procedure several times with the same overall hardware overhead (by changing the area ratio of control point insertion and observation point insertion in the same overall hardware overhead). Figures 13 and 14 show the results of s5378 and s9234.

The y axis in Figs. 13 and 14 shows the fault coverage, and the x axis shows the area ratio of the overhead for the observation point insertion to the whole overhead. For example, in the case of the overall hardware overhead is 10% and the area ratio for the observation point insertion is 40%, we insert control points and observation points with area ratio of 6:4. Here, we set the overall hardware overhead as 5%, 10%, 15% and 20%. Figure 13 shows the result of s5378 using 32 scan segments, and Fig. 14 shows the result of s9234 using 64 scan segments.

From the experiment result, we can achieve a still higher fault coverage with the same hardware overhead. For example, for s5378 using 32 scan segments, we can achieve...
the most effective fault coverage of 91.57% when the observation point insertion occupies 50% of the overall hardware overhead (when the overall hardware overhead is 10%). For s9234 using 64 scan segments, we can achieve the most effective fault coverage of 91.22% when the observation point insertion occupies 30% of the overall hardware overhead (when the overall hardware overhead is 20%). From the experiment result of Figs. 13 and 14, to achieve a still higher fault coverage with the same hardware overhead, we should set the area of observation point insertion occupies 30~50% of the overall hardware overhead.

6. Conclusion

Our pre-simulation results show that single-path sensitization is required when using on-chip delay measurement method. This constraint makes the fault coverage very low. To improve small-delay fault coverage under the single-path sensitization condition, this paper introduced techniques using segmented scan and test point insertion. For higher fault coverage, we propose a procedure for using these techniques at the same time. As the evaluation results, the proposed procedure improved the fault coverage 27.02~47.74% with 6.33~12.35% of hardware overhead. To achieve a still higher fault coverage with the same hardware overhead, we should set the area of observation point to occupy 30~50% of the overall hardware overhead.

In this paper, we focus only on the LOS test. However, the proposed procedure can be extended to improve fault coverage for other test designs. As our future work, for higher fault coverage and smaller hardware overhead, we will apply the proposed procedure on LOC test and LOS+LOC. Our future work also includes test data and test application time reduction.

Acknowledgments

This work was supported by VLSI Design and Education Center (VDEC), the University of Tokyo in collaboration with Synopsys, Inc., Cadence Design Systems, Inc. and Mentor Graphics, Inc.

References

Wenpo Zhang received B.E. degree from University of Electronic Science and Technology of China in 2004, and the M.E. degree from Chiba University in 2012. Currently, he is working toward the Ph.D. degree at Chiba University. His research interests include very large scale integration (VLSI) testing and design for testability. He is a student member of the IEEE.

Kazuteru Namba received B.E., M.E. and Ph.D. from Tokyo Institute of Technology in 1997, 1999 and 2002, respectively. He joined Chiba University in 2002. He is currently an Assistant Professor of Graduate School of Advanced Integration Science, Chiba University. His current research interests include dependable computing. He is a member of the IEEE and the IPSJ.

Hideo Ito was born in Chiba, Japan, on June 1, 1946. He received the B.E. degree from Chiba University in 1969 and the D.E. degree from Tokyo Institute of Technology in 1984. He joined Nippon Electric Co. Ltd. in 1969 and Kisarazu Technical College in 1971. He had been a member of Chiba University from 1973 until March 2012. He is currently a Professor Emeritus of Graduate School of Advanced Integration Science. He had been a member of the IEEE and the IPSJ.