Contents lists available at ScienceDirect







journal homepage: www.elsevier.com/locate/mejo

# SLIM-ADC: Spin-based Logic-In-Memory Analog to Digital Converter leveraging SHE-enabled Domain Wall Motion devices

### Soheil Salehi\*, Ronald F. DeMara

Department of Electrical and Computer Engineering, University of Central Florida, USA

| ARTICLE INFO                                                                                                                           | A B S T R A C T                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
|----------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Keywords:<br>Signal conversion<br>Boolean logic<br>In-memory computation<br>Beyond-CMOS devices<br>Spin-Hall Effect Domain Wall Motion | This paper devises a novel Analog to Digital Converter (ADC) framework for energy-aware acquisition of analog signals with Logic-in-Memory capabilities. The beyond-CMOS hardware architecture has been designed to minimize the overall cost of signal acquisition. Spin-Hall Effect driven Domain Wall Motion (SHE-DWM) devices are utilized to realize the proposed framework called <i>Spin-based Logic-In-Memory ADC (SLIM-ADC)</i> . Our simulation results indicate that the proposed SLIM-ADC offers ~ 200 fJ energy consumption on average for each analog conversion or logic operation with up to 1 GHz speed. Furthermore, our results indicate that the proposed SLIM-ADC designs by offering ~ 5.45 mW improved power dissipation on average. Additionally, a Majority Gate (MG)-based Full-Adder (MG-FA) is implemented using the proposed SLIM-ADC. Our results show that the proposed MG-FA offers ~ 2.9-fold reduced power dissipation on average and ~ 1.7-fold reduced delay on average compared to the state of the art Full-Adder designs reported herein. |

#### 1. Introduction

Spin-based devices have been extensively researched as promising companions to CMOS devices. As CMOS scaling trends continue, the need to identify viable approaches for reducing leakage power increases [1]. With attributes of non-volatility, near-zero standby energy, and high density, the Magnetic Tunnel Junction (MTJ) has emerged as a promising alternative post-CMOS technology for embedded memory and logic applications [2-4]. Recent studies have shown that conventional Von-Neumann computing architectures, in which the storing elements are distinct from computing elements, incur challenges created by interconnection and busing demands [5]. These challenges include, but are not limited to, increased static energy consumption, large access latencies, and limited scalability. Recent studies have offered in-memory computing paradigms as a potential solution to these challenges. Use of non-volatile memory devices such as spin-based devices have enabled researchers to design non-Von-Neumann architectures, where processing and memory are integrated [5,6].

Furthermore, there is an increasing demand for energy and area efficient Analog to Digital Converters (ADCs) as the need for integrating the signal acquisition and processing as well as rapid parallel data conversion in sensor nodes has increased [7–9]. One of the

main challenges of designing such sensors in CMOS is integrating ADCs in each sensor due to the large area of analog circuits, especially in applications such as image processing where each pixel sensor requires a compact ADC [10,11]. Moreover, another main challenge is the increased static energy consumption due to transistor scaling. Additionally, decreased reliability caused by high process variation can become another major challenge in scaled technology nodes [12].

The above-mentioned challenges have motivated us to devise a framework for efficient acquisition of analog signals utilizing emerging spin-based devices. Herein, we propose a spin-based intermittent quantizer with logic computation capabilities. The proposed architecture, called Spin-based Logic-In-Memory ADC (SLIM-ADC), utilizes Spin-Hall Effect driven Domain Wall Motion (SHE-DWM) devices to provide fast quantization of analog signals in a novel energy-efficient fashion as well as realizing intrinsic logic operations. By leveraging non-volatility, SLIM-ADC can reduce energy consumption via instant off/on operation without the use of backup storage.

The remainder of this paper is organized as follows: A general background and related work are provided in Section 2 and a realization of the acquisition method utilizing the SHE-DWM devices is presented in Section 3. Section 4 provides the simulation results. Finally, Section 5 concludes the paper.

\* Corresponding author. E-mail addresses: soheil.salehi@knights.ucf.edu (S. Salehi), ronald.demara@ucf.edu (R.F. DeMara).

https://doi.org/10.1016/j.mejo.2018.09.012

Received 15 June 2018; Received in revised form 29 August 2018; Accepted 28 September 2018 Available online 1 October 2018 0026-2692/© 2018 Elsevier Ltd. All rights reserved.



Fig. 1. SHE-enabled Domain Wall Motion device structure.



Fig. 2. The proposed SLIM-ADC device in (a) 000, (b) 100, (c) 110, and (d) 111 modes.

#### 2. Background and related work

In recent studies, researchers have exploited the use of emerging devices for signal processing applications. In particular, they have explored designing ADCs using emerging devices such as SHE-MTJ [13], Domain Wall Motion (DWM) [14,15], and Racetrack Memory [16]. The basic concept of spin-based devices is to control the spin of electrons in a ferromagnetic solid-state nano-device. Fig. 1 shows a SHE-DWM device [17,18]. The non-volatile MTJ consists of a Ferromagnetic (FM) layer, which is called the fixed-layer, a FM nano-wire layer, which is called the free-layer, a tunneling oxide layer between the fixed-layer and the free-layer, and a heavy metal to realize Spin-Hall assisted switching. FM layers could be aligned in two different magnetization configurations according to the position of the Domain Wall (DW), Parallel (P) and Anti-Parallel (AP). Accordingly, the MTJ exhibits low resistance  $(R_p)$  or high resistance  $(R_{AP})$  states, respectively [17]. The read and write process for the memory cells are like SHE devices. Based on Spin-Transfer Torque (STT) switching principles, the P or AP state of the SHE-DWM device is configured by means of the bidirectional current that passes through the Spin-Hall heavy Metal (SHM) from terminal T1 to terminal T3,  $I_{SHE}$ . When the  $I_{SHE}$  is applied, a strong spin-orbit coupling is generated, which results in generation of a spin current, Ispin, along the z-axis of the Cartesian coordinate system and perpendicular to the  $I_{SHE}$  current [18]. The DW will move if  $I_{SHE}$  exceeds the critical current,  $I_C$ .

Additionally, in order to read the data stored in these devices, a Sense Amplifier (SA) [2] is used to sense the difference between the resistance of the SHE-DWM device that is used to store the data and a reference MTJ device with a known resistance. Terminals T2 and T3 are used during the read operation, meaning that the read and write paths are separate, which is one of the reasons for high reliability of these devices regarding read disturbance failures. The main reason for using SHE-DWM devices is due to slow and high energy switching of DWM devices alone. Utilizing SHE approach will help reduce the write energy consumption and increase the DW velocity. Conventionally the DWM was achieved using STT which could switch the domain wall due to the coupling between local magnetic moments of the DW and spin-polarized currents. However, it has been practically shown in recent studies that SHE-DWM devices offer significantly lower energy consumption and faster switching [17]. The spin current,  $I_{Spin}$ , generated due to the charge current applied through the SHM,  $I_{SHE}$ , can be described using the following equations:

$$I_{Spin} = \theta_{SHM} \times \frac{A_{Spin}}{A_{SHM}} \times I_{SHE} \times \sigma$$
(1)

$$A_{Spin} = L_{DW} \times W_{DW} \tag{2}$$

$$A_{SHM} = W_{SHM} \times t_{SHM} \tag{3}$$

$$\sigma = 1 - sech(\frac{t_{SHM}}{\lambda_{sf}}) \tag{4}$$

where,  $L_{DW}$  and  $W_{DW}$  are the length and width of the DW free-layer,  $W_{SHM}$  and  $t_{SHM}$  are the width and thickness of the SHM,  $\theta_{SHM}$  is the spin-Hall angle, and  $\lambda_{sf}$  is the spin flip length.



Fig. 3. The proposed write circuit for the SLIM-ADC device.

#### Table 1

The Signaling of the SLIM-ADC device for read and write operations.

| Operation | WSL                | RSL     | WE | RE | BL |
|-----------|--------------------|---------|----|----|----|
| Reset     | From write circuit | Hi-Z    | 1  | 0  | 1  |
| Write     | From write circuit | Hi-Z    | 1  | 0  | 0  |
| Read      | 0                  | From SA | 0  | 1  | 0  |

 Table 2

 The SLIM-ADC's bit encoding for ADC operation.

| MTJ0/L0 | MTJ1/L1 | MTJ2/L2 | Encoded Bits |
|---------|---------|---------|--------------|
| 0       | 0       | 0       | 0 0          |
| 1       | 0       | 0       | 0 1          |
| 1       | 1       | 0       | 1 0          |
| 1       | 1       | 1       | 1 1          |

#### Table 3

The truth table for the 3-input logic operations.

| Α | В | С | F0=OR(A,B,C) | F1 = MG(A,B,C) | F2 = AND(A,B,C) |
|---|---|---|--------------|----------------|-----------------|
| 0 | 0 | 0 | 0            | 0              | 0               |
| 0 | 0 | 1 | 1            | 0              | 0               |
| 0 | 1 | 0 | 1            | 0              | 0               |
| 0 | 1 | 1 | 1            | 1              | 0               |
| 1 | 0 | 0 | 1            | 0              | 0               |
| 1 | 0 | 1 | 1            | 1              | 0               |
| 1 | 1 | 0 | 1            | 1              | 0               |
| 1 | 1 | 1 | 1            | 1              | 1               |
|   |   |   |              |                |                 |

Table 4

The truth table for the 2-input logic operations.

| А | В | C/Bias | F0=OR(A,B) | F1 = AND(A,B) |
|---|---|--------|------------|---------------|
| 0 | 0 | 0      | 0          | 0             |
| 0 | 1 | 0      | 1          | 0             |
| 1 | 0 | 0      | 1          | 0             |
| 1 | 1 | 0      | 1          | 1             |

## 3. Spin-based Logic-In-Memory Analog to Digital Converter (SLIM-ADC)

Our proposed SLIM-ADC design leveraging SHE-DWM devices is shown in Fig. 2. Our proposed dual-mode device is capable of implementing ADC operations as well as logical operations. The MTJ0, MTJ1, and MTJ2 each can represent a quantization level referred to as L0, L1, and L2 as shown in Fig. 2, in order to quantize the analog input signal. Additionally, each of these MTJ devices can represent a different function such as 3-input OR gate, 3-input Majority Gate (MG), and 3input AND gate, shown in Fig. 2 as F0, F1, and F2, respectively. The proposed write circuit used for the SLIM-ADC device is illustrated in Fig. 3. Since there are notches [19] in the magnetic domain, in order to move the DW, the input requires an appropriate current magnitude and direction. Furthermore, the signals used to activate the read and write operations of the SLIM-ADC device are listed in Table 1. One of the main contributions of the proposed SLIM-ADC devices is their tolerance to intermittency which enables these devices to save energy by going to standby mode when there is no ADC or logic operation requested. Moreover, the instant-on feature of these devices allows them to resume normal operation without loss of data stored in the MTJs.

#### 3.1. ADC mode

As depicted in Fig. 3, when the Func signal for MUX0 is set to 1, the device will be in ADC mode. ADC mode has three simple steps: 1) reset, 2) conversion, and 3) read-out. During the reset state, the Rst signal of MUX1 will be set to 1 and the DW will be pushed all the way to the beginning of the magnetic domain (leftmost location) to reset and prepare the device for the conversion state. As shown in Fig. 3, during the conversion state, the Smpl signal will be enabled for a short period to sample the analog input signal and the Rst signal of MUX1 will be set to 0 to allow the sampled analog input signal to move the DW, depending on the input signal's magnitude. During the read-out state, using the read operation and SAs presented in Ref. [2], we can read the values stored in all 3 MTJs, and based on their resistance values, find the digital output encoded using 3 levels to realize a 2-bit ADC operation as shown in Fig. 2, where L0, L1, and L2 refer to Level 0, Level 1, and Level 2, respectively. During the read operation one of the 000, 100, 110, or 111 states will be achieved that can be encoded into two bits as shown in Table 2.

#### 3.2. Logic-In-Memory mode

Furthermore, when the Func signal for MUX0 is set to 0, the device will be in logic operation mode, as illustrated in Fig. 3. The logic operation has also three steps: 1) reset, 2) computation, and 3) read-out. During the reset state, which is the same as ADC mode, the Rst signal of MUX1 will be set to 1 and the DW will be pushed all the way to the beginning of the magnetic domain (leftmost location) to reset and prepare the device for the computation state. As depicted in Fig. 3, after the reset state, the input currents of A, B, and C will be applied during the computation state for logic operation. Based on the current magnitude applied through the inputs A, B, and C, the DW will move. Finally, in the read-out state, the output of each MTJ will provide a different function as shown in Fig. 2. The proposed device is designed so that F0 provides a 3-input OR gate, OR(A,B,C), F1 provide a 3-input Majority gate, MG(A,B,C), and F2 provides a 3-input AND gate, AND(A,B,C), as listed in Table 3. Additionally, one of the inputs can be used as a bias to achieve 2-input OR and AND gates. Herein, if we consider input C as the bias and connect it to logic 0, then MTJ0 will provide OR(A,B) as F0, and MTJ1 will provide AND(A,B) as F1, as listed in Table 4.



Fig. 4. The proposed SA circuit for the SLIM-ADC device  $(i = \{0, 1, 2\})$ .

#### Table 5

Circuit parameters and constants with their corresponding values for the SHE-DWM device model. The values are taken from Refs. [17,20,21].

| Parameter/Constant            | Description                      | Default Value                           |
|-------------------------------|----------------------------------|-----------------------------------------|
| M <sub>s</sub>                | Saturation Magnetization         | $6.8 \times 10^{5} \text{A/m}$          |
| K <sub>u</sub>                | Initial Interfacial PMA energy   | $3.5 \times 10^5 \text{J/m}^3$          |
| α                             | Gilbert Damping Factor           | 0.03                                    |
| t <sub>ox</sub>               | Oxide-layer Thickness            | 1 nm                                    |
| A <sub>ex</sub>               | Exchange Stiffness               | $1.1 \times 10^{-11} \text{J/m}$        |
| ρ                             | Resistivity of Magnet            | 170 Ωnm                                 |
| RA                            | MTJ Resistance Area Product      | $2.38 \Omega\mu m^2$                    |
| $(L \times W)_{MTJ}$          | MTJ Dimensions                   | $20 \times 20 \mathrm{nm^2}$            |
| $(L \times W \times t)_{DW}$  | DW Nano-wire Dimensions          | $100 \times 20 \times 2.8  \text{nm}^3$ |
| $(L \times W \times t)_{SHM}$ | SHM dimensions                   | $120 \times 20 \times 2.8  \text{nm}^3$ |
| MTJ <sub>i</sub>              | MTJ Resistance in [P, AP] States | [3.2, 6.4] KΩ                           |
| Ref <sub>i</sub>              | Reference Cell Resistance        | 4.8 ΚΩ                                  |
| $\rho_{SHM}$                  | Resistivity of SHM (W)           | $200 \mu\Omega cm^2$                    |
| $\theta_{SHM}$                | Initial Spin-Hall angle          | 0.3                                     |
| TMR <sub>AP</sub>             | Tunnel Magneto Resistance        | 100%                                    |
| Р                             | Spin Polarization                | 0.6                                     |
| $\lambda_{sf}$                | Spin Flip Length                 | 1.5 nm                                  |

#### 3.3. Sense Amplifier (SA) circuit for the read operation

The Sense Amplifier (SA) circuit shown in Fig. 4 is used to read the data of the three MTJ devices, namely MTJ0, MTJ1, and MTJ2, simultaneously. The read operation is comprised of two steps: pre-charge and sensing. During the pre-charge step, the RE signal is connected to the logic 0, which turns on MP0 and MP3 transistors and causes Transmission Gates (TGs), TG0, TG1, and TG2, to turn off and as a result the output nodes of each SA, referred to as  $OUT_i$  and  $\overline{OUT_i}$  in Fig. 4, are pre-charged to VDD. During the sensing step, the RE signal is connected to logic 1, which causes TG0, TG1, and TG2 to turn on and as a result the output nodes of each SA, referred to as  $OUT_i$  and  $\overline{OUT_i}$  in Fig. 4, start to discharge to the ground. According to the difference in the resistance states of the MTJ<sub>i</sub> and Ref<sub>i</sub> in each SA, one of the two output nodes,  $OUT_i$  and  $\overline{OUT_i}$ , discharges more rapidly, leading the other output to charge to VDD [2]. All of the reference cells, Ref<sub>0</sub>, Ref<sub>1</sub>, and  $Ref_2$ , share the same dimensions and resistance values. The dimensions are set so that the resistance values of the reference cells hold a value between the P and AP states of the MTJs used with the SHE-DW. All of the reference cells, Ref<sub>0</sub>, Ref<sub>1</sub>, and Ref<sub>2</sub>, share the same dimensions and resistance values. The dimensions are set so that the resistance values of the reference cells hold a value between the P and AP states of the MTJs used with the SHE-DW in order to achieve a sufficient sensing margin during the read operation.

#### 4. Simulation framework, results, and analysis

In order to accurately simulate the behavior of the proposed SLIM-ADC design, we have extracted the values used in Refs. [17,20,21]. The Domain Wall Simulator presented in Ref. [20] are used along with SPICE simulation with the 22 nm Predictive Technology Model (PTM) [22], in order to analyze the behavior of the SHE-DWM devices proposed herein utilizing the parameters listed in Table 5.

The modeling of the SHE-DWM can be realized through modifying the Landau-Lifshitz-Gilbert (LLG) equations as shown below [21]:

$$\frac{d\vec{m}}{dt} = -\gamma \vec{m} \times \vec{H}_{eff} + \alpha \vec{m} \times \frac{d\vec{m}}{dt} + \vec{\tau}_{stt} + \vec{\tau}_{sot},$$
(5)

where,  $\vec{m}$  is the magnetization vector of the DW's free-layer  $\{m_x, m_y, m_z\}$ ,  $\gamma$  is the gyromagnetic ratio,  $\alpha$  is the Gilbert damping factor,  $\vec{H}_{eff}$  is the effective magnetic field vector derived from the energy density of the system,  $\tau_{stt}$  is the Spin-Transfer Torque (STT) factor, and  $\tau_{sot}$  is the Spin-Orbit Torque (SOT) factor. Additionally,  $H_{eff}$  can

be described as [21]:



Fig. 5. Simulation waveforms for the proposed SLIM-ADC device.

Table 6

Comparison with prior low-resolution ADC designs. (N/A: Data Not Available in the referenced manuscript).

| Design   | Technology    | Resolution in Bits | Power     | Maximum Bandwidth Frequency | Energy per Sample |
|----------|---------------|--------------------|-----------|-----------------------------|-------------------|
| [23]     | CMOS          | 4-bit              | 30 mW     | 20 MHz                      | 5 pJ              |
| [24]     | CMOS          | 3-bit              | 3.1 mW    | 2 GHz                       | 0.27 pJ           |
| [25]     | SHE-MTJ       | 3-bit              | 1.9 mW    | 500 MHz                     | 0.48 pJ           |
| [14]     | DWM           | 5-bit              | 3.4 mW    | 500 MHz                     | 3.5 pJ            |
| [15]     | DWM           | 3-bit              | 0.22 mW   | 200 MHz                     | N/A               |
|          |               |                    | 1.44 mW   | 500 MHz                     | N/A               |
|          |               |                    | 6.56 mW   | 1 GHz                       | N/A               |
| [16]     | Racetrack DWM | 8-bit              | 96.5 μW   | 20 MHz                      | 21 fJ             |
| SLIM-ADC | SHE-DWM       | 2-bit              | 285.87 μW | 500 MHz                     | 79.71 fJ          |
|          |               |                    | 549.51 μW | 1 GHz                       | 79.52 fJ          |

$$\epsilon_{DM} = -D[m_z \nabla \cdot \vec{m} - (\vec{m} \cdot \nabla)m_z] \quad \text{if } t_{DW} \ll L_{DW} \& W_{DW}, \tag{7}$$

where,  $\mu_0$  is the vacuum permeability,  $M_s$  is the Saturation Magnetization,  $\epsilon_{DM}$  is the Dzyaloshinskii-Moriya Interaction (DMI) energy density, and *D* is the DMI intensity parameter. As a result, the effective DMI field can be described as below [21]:

$$\vec{H}_{DM} = \frac{-2D}{\mu_0 M_s} \times \left[\frac{\partial m_z}{\partial x} \vec{u}_x + \frac{\partial m_z}{\partial y} \vec{u}_y - \left(\frac{\partial m_x}{\partial x} \vec{u}_x + \frac{\partial m_y}{\partial y} \vec{u}_y\right) \vec{u}_z\right].$$
(8)

Furthermore, assuming that  $\hbar$  is the reduced Planck constant, P is the STT polarization factor,  $j_a$  is the driving current density, e is the elementary electron charge,  $\mu_B$  is the Bohr magneton,  $\theta$  is the initial spin-Hall angle,  $\eta$  is the non-adiabatic Rashba term,  $\xi$  is the dimensionless non-adiabatic parameter,  $\alpha_R$  is the Rashba parameter, and  $\overrightarrow{H}_R$  is the effective Rashba field, the STT and SOT factors can be described as below [21]:

$$\vec{\tau}_{stt} = (j_a \frac{\mu_B P}{eM_s}) \times (\vec{u}_x \cdot \nabla) \vec{m} - (j_a \frac{\mu_B P}{eM_s}) \times \xi \vec{m} \times (\vec{u}_x \cdot \nabla) \vec{m}, \tag{9}$$

$$\vec{\tau}_{sot} = -\gamma \vec{m} \times \vec{H}_R + \eta \gamma \xi \vec{m} \times (\vec{m} \times \vec{H}_R) - \gamma \vec{m} \times (\vec{m} \times H_{SH} \vec{u}_y), \tag{10}$$

$$\vec{H}_R = \frac{\alpha_R P}{\mu_0 \mu_B M_s} (\vec{u}_z \times \vec{j}_a) = \frac{\alpha_R P j_a}{\mu_0 \mu_B M_s} \vec{u}_y, \tag{11}$$

$$H_{SH} = \frac{\hbar \theta_{SH} j_a}{\mu_0 2eM_s t_{DW}} = \frac{\mu_B \theta_{SH} j_a}{\gamma 2eM_s t_{DW}},\tag{12}$$

Authors in Ref. [20] have utilized the modeling approach described in Ref. [21] to implement a standalone one-dimensional DWM simulator. This model takes STT, SOT, and DMI fields into account [20]. According to our results, if  $j_a \simeq 0.75 \times 10^{12} \text{ A/m}^2$  is applied, the DW will move to the first notch within 1 ns, if  $j_a \simeq 1.44 \times 10^{12} \text{ A/m}^2$  is applied, the DW will move to the second notch within 1 ns, and if  $j_a \simeq 2.08 \times 10^{12} \text{ A/m}^2$  is applied, the DW will move all the way to the end of the magnetic domain within 1 ns. Fig. 5 depicts sample simulation waveforms for the proposed SLIM-ADC device. According to our results, the energy consumption of each ADC or logic operation on average is ~ 201.48 fJ, which on average includes ~ 117.94 fJ for the reset operation, ~ 79.70 fJ for the sampling/computing operation, and



Fig. 6. Proposed 1-bit MG-FA circuit implemented utilizing the SLIM-ADC devices.

 $\sim 3.84\,\rm fJ$  for the read operation. Considering 0.6ns for the reset operation, 1ns for the sample/compute operation, and 0.4ns for the read operation, the overall operation time is 2 ns, which means the proposed SLIM-ADC device can perform ADC or logic operations with 500 MHz frequency.

Faster ADC and logic operations can be achieved by increasing the input current corresponding to reset and sample/compute operations, however this will elevate the power dissipation. In order to increase the speed of the proposed SLIM-ADC device to be able to perform ADC or logic operations with 1 GHz frequency, the reset operation is required



**Fig. 7.** Simulation waveforms for the proposed 1-bit MG-FA circuit implemented utilizing the SLIM-ADC devices with inputs A = 1, B = 0, Cin = 1.

| Table /                                                                         |     |
|---------------------------------------------------------------------------------|-----|
| Comparison with prior Full-Adder designs. (*The values are taken from Ref. [26] | ]). |

| Design   | Technology    | Power     | Delay         | Energy per bit |
|----------|---------------|-----------|---------------|----------------|
| [27]*    | CMOS          | 2 mW      | 2.2ns         | 4 pJ           |
| [27]*    | STT-MTJ       | 2.1 mW    | 10.2ns        | 4.2 pJ         |
| [26]*    | SHE-MTJ       | 0.71 mW   | 7ns           | 4.3 pJ         |
| [28]     | DWM           | 1.364 mW  | 2.54ns        | 1.4 pJ         |
| [29]     | STT-MTJ       | 0.315 mW  | 2.1ns         | 6.3 pJ         |
| [30]     | Racetrack DWM | 0.432 mW  | 3.03ns        | 1.3 pJ         |
| SLIM-ADC | SHE-DWM       | 589.95 µW | 2ns (1 GHz)   | 0.6 pJ         |
|          |               | 302.22 µW | 4ns (500 MHz) | 0.6 pJ         |

to be done in 0.3 ns, sample/compute operation is required to be done in 0.5 ns, and read operation requires to be done in 0.2 ns. Furthermore, according to our results, if  $j_a \simeq 1.57 \times 10^{12} \text{ A/m}^2$  is applied, the DW will move to the first notch within 0.5 ns, if  $j_a \simeq 2.85 \times 10^{12} \text{ A/m}^2$  is applied, the DW will move to the second notch within 0.5 ns, and if  $j_a \simeq 4.1 \times 10^{12} \text{ A/m}^2$  is applied, the DW will move to the magnetic domain within 0.5 ns. In this case, the energy consumption of each ADC or logic operation is equal to ~ 196.65 fJ on average, which includes ~ 117.1 fJ for the reset operation, ~ 79.52 fJ for the sampling/computing operation, and ~ 0.03 fJ for the read operation.

Table 7

In Table 6, we compare the performance of the developed SLIM-ADC device with other low-resolution ADC architectures that utilize CMOS or emerging spin-based technologies. It can be observed that the proposed SLIM-ADC device provides fast and energy-efficient analog to digital conversion compared to state of the art ADC designs. In particular, the proposed SLIM-ADC operating in 1 GHz frequency, in most cases outperform other designs listed in Table 6 in terms of power dissipation by  $\sim 5.3$  mW on average. Moreover, the proposed SLIM-ADC design improves the power dissipation of the sampling in 500 MHz frequency by  $\sim 5.6$  mW on average compared to most of the designs provided in Table 6.

Moreover, a 1-bit MG-based Full-Adder (MG-FA) circuit is implemented utilizing the proposed SLIM-ADC devices using the circuit shown in Fig. 6. According to our results, the power dissipation of the proposed SLIM-ADC-based MG-FA in 1 GHz frequency is equal to  $589.95\,\mu\text{W}$  and the result of the addition will be ready within 2 ns. Furthermore, the power dissipation of the proposed SLIM-ADC-based MG-FA in 500 MHz frequency is equal to  $302.22 \,\mu\text{W}$  and the result of the addition will be ready within 4 ns. The output waveform of the 1bit MG-FA circuit using the proposed SLIM-ADC devices is shown in Fig. 7.Table 7 compares the performance of the developed SLIM-ADCbased MG-FA with other FA designs that utilize CMOS or emerging spin-based technologies. It can be observed that the proposed SLIM-ADC-based MG-FA operating in 1 GHz frequency outperforms the other FA designs listed in Table 7 in terms of power dissipation by 2.7-fold on average. Additionally, the proposed SLIM-ADC MG-FA offers faster FA operation by 3.2-fold on average compared to other FA designs listed in Table 7. Furthermore, the proposed SLIM-ADC MG-FA offers  $\sim$  2fold and ~ 3.8-fold reduced power dissipation on average in 1 GHz and 500 MHz operating speeds, respectively, and provides  $\sim$  2.3-fold and ~ 1.13-fold delay improvement on average in 1 GHz and 500 MHz operating speeds, respectively, compared to other emerging spin-based FA designs listed in Table 7.

Previous results indicate that for a conventional 2-bit CMOS ADC requires 49 transistors [31] and 1-bit CMOS FA requires 42 transistors [27], while our proposed SLIM-ADC can perform 2-bit ADC and 1-bit FA operation with 90 total transistors and 6 total MTJs. Thus, the device count for implementing a single ADC and FA circuit is comparable with the conventional CMOS-based approaches. However, compared to the conventional Logic In Memory (LIM) and ADC approaches, the area of the proposed SLIM-ADC is reduced, since an array of SAs is shared among the entire column of SLIM-ADC devices and an array of write circuits is shared among the entire row of SLIM-ADC devices. Hence, there

is no need for a distinct SA and write circuit per device. Furthermore, the proposed SLIM-ADC device is capable of both logic and ADC operations while conventional approaches are only capable of performing one operation, either logic or ADC. Additionally, according to our results, the proposed SLIM-ADC consumes  $\sim 0.2 \,\mu\text{W}$  leakage power which is negligible compared to CMOS designs which is around  $\sim 1 \,\text{nW}$  [27].

#### 5. Conclusion

To advance energy-sparing sampling methods, a novel framework for efficient and intelligent computing approach through the integration of resource allocation and spin-based devices is introduced. The utility of SHE-DWM devices within the proposed SLIM-ADC architecture is demonstrated to realize rapid and more energy-efficient sampling while achieving reduced area footprint compared to conventional CMOS designs. Moreover, SLIM-ADC takes a step towards the realization of non-Von-Neumann architectures via in-memory computation utilizing SHE-DWM devices. According to our simulation results, the proposed SLIM-ADC offers  $\sim 200 \, \text{fJ}$  energy consumption on average for each analog conversion or logic operation with up to 1 GHz speed. Furthermore, our results indicate that the proposed SLIM-ADC outperforms other state of the art spin-based ADC designs by offering  $\sim 5.5 \,\mathrm{mW}$  improved power dissipation on average. Additionally, a Majority Gate (MG)-based Full-Adder (MG-FA) is implemented using the proposed SLIM-ADC. Our results show that the proposed MG-FA offers  $\sim$  2-fold and  $\sim$  3.8-fold reduced power dissipation on average in 1 GHz and 500 MHz operating speeds, respectively, compared to the state of the art Full-Adder designs reported herein. Additionally, according to our results, the proposed MG-FA provides  $\sim$  2.3-fold and  $\sim$  1.13-fold reduced delay on average in 1 GHz and 500 MHz operating speeds, respectively, compared to the state of the art Full-Adder designs reported herein.

#### Acknowledgment

This work was supported in part by the National Science Foundation (NSF) through CCSS 1810256.

#### References

- [1] S. Salehi, R.F. DeMara, Energy and area analysis of a floating-point unit in 15nm CMOS process technology, in: SoutheastCon 2015, IEEE, 2015, pp. 1–5, https:// doi.org/10.1109/SECON.2015.7132972, http://ieeexplore.ieee.org/document/ 7132972/.
- [2] S. Salehi, R.F. DeMara, Process variation immune and energy aware sense amplifiers for resistive non-volatile memories, in: Proceedings of IEEE International Symposium on Circuits & Systems (ISCAS-2017), IEEE, Baltimore, MD, USA, 2017.
- [3] S. Salehi, D. Fan, R.F. Demara, Survey of STT-MRAM cell design strategies: taxonomy and sense amplifier tradeoffs for resiliency, ACM J. Emerg. Technol. Comput. Syst. 13 (3) (2017) 1–16, https://doi.org/10.1145/2997650, http://dl.acm.org/citation.cfm?doid=3051701.2997650.
- [4] R. Zand, A. Roohi, S. Salehi, R.F. DeMara, Scalable adaptive spintronic reconfigurable logic using area-matched MTJ design, IEEE Trans. Circuits Syst. II: Express Briefs 63 (7) (2016) 678–682, https://doi.org/10.1109/TCSII.2016. 2532099.

- [5] Y. Wang, H. Yu, L. Ni, G.-B. Huang, M. Yan, C. Weng, W. Yang, J. Zhao, An energy-efficient nonvolatile in-memory computing architecture for extreme learning machine by domain-wall nanowire devices, IEEE Trans. Nanotechnol. 14 (6) (2015) 998–1012, https://doi.org/10.1109/TNANO.2015.2447531, http:// ieeexplore.ieee.org/document/7128727/.
- [6] P. Chi, S. Li, C. Xu, T. Zhang, J. Zhao, Y. Liu, Y. Wang, Y. Xie, PRIME: a novel processing-in-memory architecture for neural network computation in ReRAM-based main memory, in: 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA), IEEE, 2016, pp. 27–39, https://doi. org/10.1109/ISCA.2016.13, http://ieeexplore.ieee.org/document/7551380/.
- [7] S. Kleinfelder, SukHwan Lim, Xinqiao Liu, A. El Gamal, A 10000 frames/s CMOS digital pixel sensor, IEEE J. Solid State Circ. 36 (12) (2001) 2049–2059, https://doi.org/10.1109/4.972156, http://ieeexplore.ieee.org/document/972156/.
  [8] A. Zaeemzadeh, M. Joneidi, N. Rahnavard, Adaptive non-uniform compressive
- [8] A. Zaeemzadeh, M. Joneidi, N. Rahnavard, Adaptive non-uniform compressive sampling for time-varying signals, in: 51st Annual Conference on Information Sciences and Systems, CISS 2017, 2017, https://doi.org/10.1109/CISS.2017. 7926148, https://arxiv.org/abs/1703.03340.
- [9] M.B. Mashhadi, N. Salarieh, E.S. Farahani, F. Marvasti, Level crossing speech sampling and its sparsity promoting reconstruction using an iterative method with adaptive thresholding, IET Signal Process. 11 (6) (2017) 721–726, https://doi.org/ 10.1049/iet-spr.2016.0569, http://digital-library.theiet.org/content/journals/10. 1049/iet-spr.2016.0569.
- [10] X. Li, B. Taylor, Y. Chien, L.T. Pileggi, Adaptive post-silicon tuning for analog circuits: concept, analysis and optimization, in: Proceedings of the 2007 IEEE/ACM International Conference on Computer-aided Design, ICCAD '07, IEEE Press, Piscataway, NJ, USA, 2007, pp. 450–457. http://dl.acm.org/citation.cfm?id=1326073.1326167.
- [11] B. Shahrasbi, N. Rahnavard, Model-based nonuniform compressive sampling and recovery of natural images utilizing a wavelet-domain universal hidden Markov model, IEEE Trans. Signal Process. 65 (1) (2017) 95–104, https://doi.org/10. 1109/TSP.2016.2614654.
- [12] M. Sharad, Deliang Fan, K. Roy, Low power and compact mixed-mode signal processing hardware using spin-neurons, in: International Symposium on Quality Electronic Design (ISQED), IEEE, 2013, pp. 189–195, https://doi.org/10.1109/ ISQED.2013.6523609, http://ieeexplore.ieee.org/document/6523609/.
- [13] Y. Jiang, Y. Lv, M. Jamali, J.-P. Wang, Spin analog-to-digital convertor using magnetic tunnel junction and spin Hall Effect, IEEE Electron. Device Lett. 36 (5) (2015) 511–513, https://doi.org/10.1109/LED.2015.2416689, http://ieeexplore. ieee.org/document/7070706/.
- [14] K. Yogendra, M.-C. Chen, X. Fong, K. Roy, Domain wall motion-based low power hybrid spin-CMOS 5-bit Flash Analog Data Converter, in: Sixteenth International Symposium on Quality Electronic Design, IEEE, 2015, pp. 604–609, https://doi. org/10.1109/ISQED.2015.7085496, http://ieeexplore.ieee.org/document/ 7085496/.
- [15] Y.K. Upadhyaya, M.K. Gupta, M. Hasan, S. Maheshwari, High-density magnetic flash ADC using domain-wall motion and pre-charge sense amplifiers, IEEE Trans. Magn. 52 (6) (2016) 1–10, https://doi.org/10.1109/TMAG.2015.2505662, http:// ieeexplore.ieee.org/document/7347436/.
- [16] Q. Dong, K. Yang, L. Fick, D. Fick, D. Blaauw, D. Sylvester, Low-power and compact analog-to-digital converter using spintronic Racetrack memory devices, IEEE Trans. Very Large Scale Integr. (VLSI) Syst. 25 (3) (2017) 907–918, https:// doi.org/10.1109/TVLSI.2016.2622224, http://ieeexplore.ieee.org/document/ 7765085/.
- [17] S. Angizi, Z. He, F. Parveen, D. Fan, RIMPA: a new reconfigurable dual-mode in-memory processing architecture with spin Hall effect-driven domain wall motion device, in: 2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), IEEE, 2017, pp. 45–50, https://doi.org/10.1109/ISVLSI.2017.18, http:// ieeexplore.ieee.org/document/7987493/.

- [18] J. Torrejon, J. Kim, J. Sinha, S. Mitani, M. Hayashi, M. Yamanouchi, H. Ohno, Interface control of the magnetic chirality in CoFeB/MgO heterostructures with heavy-metal underlayers, Nat. Commun. 5 (2014) 4655, https://doi.org/10.1038/ ncomms5655, http://www.nature.com/doifinder/10.1038/ncomms5655.
- [19] S.J. Noh, Y. Miyamoto, M. Okuda, N. Hayashi, Y. Keun Kim, Effects of notch shape on the magnetic domain wall motion in nanowires with in-plane or perpendicular magnetic anisotropy, J. Appl. Phys. 111 (7) (2012), https://doi.org/10.1063/1. 3677340. 07D123 http://aip.scitation.org/doi/10.1063/1.3677340.
- [20] H. Kim, S.W. Heo, C.-Y. You, Implementation of one-dimensional domain wall dynamics simulator, AIP Adv. 7 (12) (2017) 125231, https://doi.org/10.1063/1. 4996029, http://aip.scitation.org/doi/10.1063/1.4996029.
- [21] E. Martinez, S. Emori, N. Perez, L. Torres, G.S.D. Beach, Current-driven dynamics of Dzyaloshinskii domain walls in the presence of in-plane fields: full micromagnetic and one-dimensional analysis, J. Appl. Phys. 115 (21) (2014) 213909, https://doi.org/10.1063/1.4881778, http://aip.scitation.org/doi/10. 1063/1.4881778.
- [22] Arizona State University (ASU). 22nm Predictive Technology Model (PTM). (Accessed 20 March 2018). Available at: http://ptm.asu.edu/.
- [23] T.-F. Wu, C.-R. Ho, M.S.-W. Chen, A flash-based non-uniform sampling ADC with hybrid quantization enabling digital anti-aliasing filter, IEEE J. Solid State Circ. 52 (9) (2017) 2335–2349, https://doi.org/10.1109/JSSC.2017.2718671, http:// ieeexplore.ieee.org/document/7994711/.
- [24] Chang-Joon Park, H.M. Geddada, A.I. Karsilayan, J. Silva-Martinez, M. Onabajo, A current-mode flash ADC for low-power continuous-time sigma delta modulators, in: 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013), IEEE, 2013, pp. 141–144, https://doi.org/10.1109/ISCAS.2013.6571802, http:// ieeexplore.ieee.org/document/5571802/.
- [25] Z. He, D. Fan, A low power current-mode flash ADC with spin Hall Effect based multi-threshold comparator, in: Proceedings of the 2016 International Symposium on Low Power Electronics and Design, ACM, 2016, pp. 314–319.
- [26] A. Roohi, R. Zand, D. Fan, R.F. DeMara, Voltage-based concatenatable full adder using spin Hall Effect switching, IEEE Trans. Comput. Aided Des. Integrated Circ. Syst. 36 (12) (2017) 2134–2138, https://doi.org/10.1109/TCAD.2017.2661800, http://ieeexplore.ieee.org/document/7837669/.
- [27] S. Matsunaga, J. Hayakawa, S. Ikeda, K. Miura, H. Hasegawa, T. Endoh, H. Ohno, T. Hanyu, Fabrication of a nonvolatile full adder based on logic-in-memory architecture using magnetic tunnel junctions, Appl. Phys. Express 1 (9) (2008) 91301.
- [28] A. Roohi, R. Zand, R.F. DeMara, A tunable majority gate-based full adder using current-induced domain wall nanomagnets, IEEE Trans. Magn. 52 (8) (2016) 1–7, https://doi.org/10.1109/TMAG.2016.2540600.
- [29] E. Deng, Y. Zhang, W. Kang, B. Dieny, J.-O. Klein, G. Prenat, W. Zhao, Synchronous 8-bit non-volatile full-adder based on spin transfer Torque magnetic tunnel junction, IEEE Trans. Circuits Syst. I: Reg. Pap. 62 (7) (2015) 1757–1765, https://doi.org/10.1109/TCSI.2015.2423751, http://ieeexplore.ieee.org/ document/7124537/.
- [30] D. Ravelsona, C. Chappert, J.-O. Klein, H.-P. Trinh, Y. Zhang, W. Zhao, Domain wall motion based magnetic adder, Electron. Lett. 48 (17) (2012) 1049–1051, https://doi.org/10.1049/el.2012.1577, http://digital-library.theiet.org/content/ journals/10.1049/el.2012.1577.
- [31] V. Bhatia, M. Goel, S. Gupta, P. Iswerya, N. Pandey, A. Bhattacharyya, Low power delay proficient current mode ADC design, in: 2012 2nd International Conference on Power, Control and Embedded Systems, IEEE, 2012, pp. 1–4, https://doi.org/ 10.1109/ICPCES.2012.6508082, http://ieeexplore.ieee.org/document/6508082/.