Contents lists available at ScienceDirect INTEGRATION, the VLSI journal journal homepage: www.elsevier.com/locate/vlsi Refresh re-use based transparent test for detection of in-ﬁeld permanent faults in DRAMs Bibhas Ghoshal a, ⁎ , Chittaranjan Mandal b , Indranil Sengupta b a Department of Information Technology, Indian Institute of Information Technology, Allahabad 211012, India b Department of Computer Science and Engineering, IIT Kharagpur, Kharagpur, India ARTICLE INFO Keywords: DRAM Permanent faults Transparent test March test Refresh-reuse ABSTRACT In this paper, a transparent test technique for testing permanent faults developed during ﬁeld operation of DRAMs has been proposed. A three pronged approach has been taken in this work. First, a word oriented transparent March test generation algorithm has been proposed that avoids signature based prediction phase; next the proposed transparent March test is structured in a way that facilitates its implementation during refresh cycles of the DRAM; ﬁnally the on-chip refresh circuit is modiﬁed to allow its re-use during implementation of the proposed transparent March test on DRAM. Re-use of refresh cycles for test purpose ensures periodic testing of DRAM without interruption. Thus, faults are not allowed to accumulate. Moreover, wait for idle cycles of the processor to perform the test are avoided and test ﬁnishes within a deﬁnite time. Re- using the refresh circuit for test purpose overcomes requirement of additional Design-For-Testability hardware and brings down the area overhead. Both analytic predictions and simulation results for the method proposed here indicate real estate beneﬁts and test time savings in comparison to other reported techniques. The proposed refresh re-use based transparent test technique provides a cost eﬀective solution by providing facility for periodic tests of DRAM without requiring additional test hardware. 1. Introduction The run-time faults which occur in Dynamic Random Access Memories (DRAMs) are either transient or intermittent [1]. A lot of research has been devoted in devising eﬃcient run time fault detection techniques for DRAMs. The detection techniques of these run-time faults reported in literature have mainly focussed on detection of transient faults (soft errors). However, for deeply scaled CMOS based DRAMs, the occasional run-time faults in DRAMs which are a result of physical eﬀects such as environmental susceptibility, aging and low supply voltage, are intermittent faults [2]. These intermittent faults usually exhibit a relatively high occurrence rate and eventually tend to become permanent [2]. Moreover, wearout of DRAM can also cause intermittent faults to become frequent enough to be classiﬁed as permanent [3]. Studies on DRAM failures in ﬁeld ([4–6]) provided evidence that DRAMs experience both transient (soft) faults and permanent (hard) faults in ﬁeld. Thus, for a DRAM based system which develops both soft errors and hard faults during in-ﬁeld operation, using only software based detection mechanisms (memory diagnostic software programs used to check for memory failures on a computer) such as ECC [7], Chipkill [8] or memory scrubbers [9] may not be suﬃcient as suggested by the results in [10]. Sridharan et al. in [10] reported that a commonly used ECC technique such as SEC-DED ECC results in undetected errors (causing silent data corruption) at a rate of up to 20 FIT per DRAM device (unacceptably high rate for many enterprise data centers and high- performance computing systems), thus making it poorly suited to modern DRAM subsystems. Memory scrubbing with ECC may be an alternative. However, there are a few drawbacks in implementing scrubbers (in both hardware and software) for DRAM testing as mentioned in [11]. Implementing scrubber as state machine in memory controller increases the hardware complexity and causes performance penalty as memory becomes inaccessible during the scrubbing period. Software implementa- tion requires generation of interrupt to activate a ﬁrmware which executes on the processor to perform scrubbing. However, due to limited number of interrupt request signals or vectors in some systems, an interrupt for scrubbing often is not available. As result, the software based solution is often infeasable. Thus, for a system which develops both soft errors and hard faults during in-ﬁeld operation, the most preferred solution is to apply a http://dx.doi.org/10.1016/j.vlsi.2017.06.011 Received 9 February 2017; Received in revised form 21 April 2017; Accepted 24 June 2017 ⁎ Corresponding author. E-mail addresses: bibhas.ghoshal@iiita.ac.in (B. Ghoshal), chitta@iitkgp.ac.in (C. Mandal), isg@iitkgp.ac.in (I. Sengupta). INTEGRATION the VLSI journal xxx (xxxx) xxx–xxx 0167-9260/ © 2017 Elsevier B.V. All rights reserved. Please cite this article as: Ghoshal, B., INTEGRATION the VLSI journal (2017), http://dx.doi.org/10.1016/j.vlsi.2017.06.011