# CS3351-DIGITAL PRINCIPLES AND COMPUTER ORGANIZATION #### **UNITICOMBINATIONAL CIRCUITS:** - -Combinational Circuits - Karnaugh Map Analysis and Design Procedures - Binary Adder Subtractor - Decimal Adder - Magnitude Comparator - Decoder - Encoder - Multiplexers - Demultiplexers #### **INTRODUCTION:** The digital system consists of two types of circuits, namely - (i) Combinational circuits - (ii) Sequential circuits **Combinationalcircuit**consistsoflogicgates whose output at anytime is determined from the present combination of inputs. The logic gate is the most basic building block of combinational logic. The logical function performed by a combinational circuit is fully defined by a set of Boolean expressions. **Sequential logic circuit** comprises both logic gates and the state of storageelementssuchasflip- flops. As a consequence, the output of a sequential circuit depends not only on present value of inputs but also on the past state of inputs. Intheprevious chapter, we have discussed binary numbers, codes, Boolean alge braands implification of Boolean function and logic gates. In this chapter, formulation and analysis of various systematic designs of combinational circuits will be discussed. Acombinational circuit consists of input variables, logic gates, and output variables. The logic gates accept signals from inputs and output signals are generated according to the logic circuits employed in it. Binary information from the given data transforms to desired output data in this process. Both input and output are obviously the binary signals, *i.e.*, both the input and output signals are of two possible states, logic landlogic 0. Blockdiagramofacombinationallogiccircuit For n number of input variables to a combinational circuit, $2^n$ possible combinations of binary input states are possible. For each possible combination, there is one and only one possible output combination. A combinational logic circuit can be described by m Boolean functions and each output can be expressed in terms of n input variables. #### **DESIGNPROCEDURE:** Anycombinational circuit can be designed by the following steps of design procedure. - 1. The problem is stated. - 2. Identifytheinputandoutputvariables. - 3. Theinputandoutputvariablesareassignedlettersymbols. - 4. Construction of a truth table to meet in put-output requirements. - 5. WritingBooleanexpressionsforvariousoutputvariablesintermsofinp utvariables. - 6. The simplified Boolean expression is obtained by any method of minimization—algebraic method, Karnaughmap method, or tabulation method. - 7. Alogicdiagramisrealizedfromthesimplifiedbooleanexpressionusinglog icgates. The following guidelines should be followed while choosing the preferred form for hard ware implementation: - 1. Theimplementationshouldhavetheminimumnumberofgates, with the gate sused having the minimum number of inputs. - 2. Thereshouldbeaminimumnumberofinterconnections. - 3. Limitationonthedrivingcapabilityofthegatesshouldnotbeignored. #### ARITHMETICCIRCUITS-BASICBUILDINGBLOCKS: Inthissection, we will discuss those combinational logic building blocks that can be used to perform addition and subtraction operations on binary numbers. Additionand subtraction are the two most commonly used arithmetic operations, as the other two, namely multiplication and division, are respectively the processes of repeated additionand repeated subtraction. The basic building blocks that form the basis of all hardware used to perform the a rithmetic operations on binary numbers are half-adder, full adder, half-subtractor, full-subtractor. #### **Half-Adder:** A half-adder is a combinational circuit that can be used to add two binary bits. Ithas two inputs that represent the two bits to be added and two outputs, with oneproducingtheSUMoutputandtheotherproducingtheCARRY. #### **Blockschematicofhalf-adder** Outputs 0 The truth table of a half-adder, showing all possible in put combinations and the corresponding outputs are shown below. Inputs | | $\mathbf{A}$ | В | Carry(C | Sum(S) | | |----|--------------|-------|----------------------------------------------|--------|--| | | | | ) | | | | | 0 | 0 | 0 | 0 | | | | 0 | 1 | 0 | 1 | | | | For Carry | | . В | or Sum | | | ΑÌ | B<br>\ 0 | 1 | A C | 1 | | | 1 | 0 0 | 0 | 0 0 | 1 | | | 1 | 0 1 | | 1 [1 | 0 | | | | Cany | = A.B | $\mathbf{Sum} = \mathbf{AB'} + \mathbf{A'B}$ | | | | | | r | $= \mathbf{A} \oplus \mathbf{B}$ | | | | | 1 | 0 | 0 | 1 | | **Truthtableofhalf-adder** # **K-mapsimplificationforcarryandsum:** The Boolean expressions for the SUM and CARRY outputs are given by the equations, Sum, $$\hat{S} = A'B+AB'=A \oplus B$$ Carry,C=A.B The first one representing the SUM output is that of an EX- OR gate, the second one representing the CARRY output is that of an AND gate. The logicdiagram of the half adderis, Full- Adder: A full adder is a combinational circuit that forms the arithmetic sum of three input bits. It consists of 3 inputs and 2 outputs. Two of the input variables, represent the significant bits to be added. The thirdinputrepresents the carryfrom previous lowers ignificant position. The block diagram of full adderisgiven by, #### **Blockschematicoffull-adder** The full adder circuit overcomes the limitation of the half-adder, which can be used to add two bits only. As there are three input variables, eight different input combinations are possible. The truthtable is shown below, #### TruthTable: | | Input | | Output | | | |--------------|-------|----|--------|----------|--| | | S | | | S | | | $\mathbf{A}$ | В | Ci | Sum(S) | Carry(Co | | | | | n | | ut) | | | 0 | 0 | 0 | 0 | 0 | | | 0 | 0 | 1 | 1 | 0 | | | 0 | 1 | 0 | 1 | 0 | | | 0 | 1 | 1 | 0 | 1 | | | 1 | 0 | 0 | 1 | 0 | | | 1 | 0 | 1 | 0 | 1 | | | 1 | 1 | 0 | 0 | 1 | | | 1 | 1 | 1 | 1 | 1 | | To derive the simplified Boolean expression from the truth table, the Karnaugh map method is adopted as, | | For Sum | | | | | | | | |------|---------|----|----|----|--|--|--|--| | A BC | 00 | 01 | 11 | 10 | | | | | | 0 | 0 | 1 | 0 | 1 | | | | | | 1 | 1 | 0 | 1 | 0 | | | | | Carry, $C_{out} = AB + AC_{in} + BC_{in}$ Sum, $S = A'B'C_{in} + A'BC'_{in} + AB'C'_{in} + ABC_{in}$ The Boolean expressions for the SUM and CARRY outputs are given by the equations, Thelogic diagram for the above functions is shown as, #### $\underline{Implementation of full-adderin Sum of Products}$ Thelogicdiagram of the full adder can also be implemented with two half-adders and one OR gate. The Soutput from the second half adder is the exclusive OR of Cinand the output of the first half-adder, giving $$Sum = Cin \oplus (A \oplus B)$$ $$= Cin \oplus (A'B + AB')$$ $$= C'in(A'B + AB') + Cin(A'B + AB')'$$ $$= C'in(A'B + AB') + Cin(AB + A'B')$$ $$= C'in(A'B + AB') + Cin(AB + A'B')$$ andthecarryoutputis, #### Carry, Cout=AB+Cin(A'B+AB') =AB+A'BCin+AB'Cin =AB(Cin+1)+A'BCin+AB'Cin [Cin+1=1] =ABCin+AB+A'BCin+AB'Cin =AB+ACin(B+B')+A'BCin =AB+ACin+A'BCin =AB(Cin+1)+ACin+A'BCin [Cin+1=1] =ABCin+AB+ACin+A'BCin =AB+ACin+BCin(A+A') =AB+ACin+BCin. #### $\underline{Implementation of full adder with two half-adders and an OR gate}$ # Half-Subtractor:Blockschematicofhalf-subtractor # Ahalf- $subtractor is a combinational circuit that can be used to subtract one binary digit from an other top roduce a DIFFERENCE output and a BORROW output. The BORROW output here specifies whether a\_1 'has been borrowed toper form the subtraction. The trut htable of half-$ subtractor, showing all possible in put combinations and the corresponding outputs are shown below. | I i | nput | Output | | | | |-----|------|-------------------|--------------|--|--| | A | В | Difference(<br>D) | Borrow(Bout) | | | | 0 | 0 | 0 | 0 | | | | 0 | 1 | 1 | 1 | | | | 1 | 0 | 1 | 0 | | | | 1 | 1 | 0 | 0 | |---|---|---|---| #### <u>K-mapsimplificationforhalfsubtractor</u>: The Boolean expressions for the DIFFERENCE and BORROW outputs are given by the equations, The first one representing the DIFFERENCE (**D**)output is that of an exclusive-ORgate, the expression for the BORROW output (**Bout**) is that of an AND gate with input Acomplementedbeforeitisfedtothegate. The logicdiagramofthehalf adderis, #### <u>LogicImplementationofHalf-Subtractor</u> Comparing a half-subtractor with a half-adder, we find that the expressions forthe SUM and DIFFERENCE outputs are just the same. The expression for BORROW in the case of the half-subtractor is also similar to what we have for CARRY in the case of the half-adder. If the input A, ie., the minuend is complemented, an AND gate can beusedtoimplementtheBORROWoutput. #### FullSubtractor: Afullsubtractorperformssubtractionoperationontwobits,aminuendandasubtra hend, and alsotakes intoconsideration whethera <u>l</u>' has alreadybeen borrowedbythepreviousadjacentlowerminuendbitornot. As a result, there are three bits to be handled at the input of a full subtractor, namely the two bits to be subtracted and a borrow bit designated as Bin. There are twooutputs, namely the DIFFERENCE output Dandthe BORROW output Bo. The BORROW output bit tells whether the minuend bit needs to borrow a \_1' from the nextpossiblehigherminuendbit. #### **Block schematicof full-adder** Thetruthtableforfull-subtractoris, | | Input | | Output | | | |--------------|-------|----|--------------|-----------|--| | | S | | S | | | | $\mathbf{A}$ | В | Bi | Difference(D | Borrow(Bo | | | | | n | ) | ut) | | | 0 | 0 | 0 | 0 | 0 | | | 0 | 0 | 1 | 1 | 1 | | | 0 | 1 | 0 | 1 | 1 | | | 0 | 1 | 1 | 0 | 1 | | | 1 | 0 | 0 | 1 | 0 | | | 1 | 0 | 1 | 0 | 0 | | | 1 | 1 | 0 | 0 | 0 | | | 1 | 1 | 1 | 1 | 1 | | # **K-map simplification for full-subtractor:** A 00 01 11 10 0 0 0 1 0 Difference, $D = A'B'B_{in} + A'BB'_{in} + AB'B'_{in} + ABB_{in}$ Borrow, $B_{out} = A'B + A'B_{in} + BB_{in}$ The Boolean expressions for the DIFFER ENCE and BORROW outputs are given by the equations, #### Difference,D =A'B'Bin+A'BB'in+AB'B'in+ABBin Borrow, Bout = A'B+A'Cin+BBin. Thelogic diagram for the above functions is shown as, Implementationoffull-adderinSumofProducts The logic diagram of the full-subtractor can also be implemented with two half- subtractors and one OR gate. The difference, Doutput from the second half subtractor is the difference of differenheexclusive-ORofBinandtheoutputofthefirsthalf-subtractor, giving Difference,D=Bin $$\oplus$$ (A $\oplus$ B) [x $\oplus$ y=x'y+xy'] =Bin(A'B+AB') =B'in(A'B+AB')+Bin(A'B+AB')' [(x'y+xy')'=(xy+x'y')] =B'in(A'B+AB')+Bin(AB+A'B') = A'BB'in+AB'B'in+ABBin+A'B'Bin. and the borrow output is, =A'BBin+A'B+BBin+A'B'Bin Therefore, we can implement full-subtractor using two half-subtractors and OR gate as, # $\frac{Implementation of full-subtractor with two half-subtractors and an OR gate}{}$ #### Four -bit BinaryAdder(ParallelAdder): The 4-bit bit naryadder using full adder circuits is capable of adding two 4-bit numbers resulting in a 4-bit sum and a carry output as shown in figure below. 4-bitbinaryparallelAdder Since all the bits of augendand addendare fed into the adder circuits simultaneously and the additions in each positionare taking place at the same time, this circuit is known as parallel adder. Letthe4-bitwordstobeaddedberepresentedby, A3A2A1A0=1111andB3B 2B1B0=0011. ``` Significant place 4 3 2 1 Input carry 1 1 1 0 Augend word A: 1 1 1 1 Addend word B: 0 0 1 1 1 0 0 1 0 ← Sum Output Carry ``` Thebitsareaddedwithfulladders, starting from the least significant position, to form the sumitand carrybit. The input carry C0 in the least significant position must be 0. The carry output of the lower order stage is connected to the carry input of the next higher order stage. Hence this type of adderiscalled ripple-carryadder. In the least significant stage, A0, B0 and C0 (which is 0) are added resulting insumS0andcarryC1. This carryC1 becomes the carry input to the second stage. Similarly in the second stage, A1, B1 and C1 are added resulting in sum S1 and carry C2, in the third stage, A2, B2 and C2 are added resulting in sum S2 and carry C3, in the third stage, A3, B3 and C3 are added resulting in sum S3 and C4, which is the output carry. Thus the circuit results in a sum(S3S2S1S0) and acarry output(Cout). Though the parallel binary adder is said to generate its output immediately afterthe inputs are applied, its speed of operation is limited by the carry propagation delaythrough all stages. However, there are several methods to reduce this delay. One of the methods of speeding up this process is look-ahead carry additionwhicheliminatestheripple-carrydelay. #### **BinarySubtractor(ParallelSubtractor):** The subtraction of unsigned binary numbers can be done most conveniently bymeans of complements. The subtraction A-B can be done by taking the 2's complementofBandaddingittoA.The2'scomplementcanbeobtainedbytakingthe1 'scomplementandadding1totheleastsignificantpairofbits. The 1's complement can be implemented within verters and a 1 can be added to the sum through the input carry. ThecircuitforsubtractingA- Bconsistsofanadderwithinvertersplacedbetween each data input B and the corresponding input of the full adder. The inputcarryC0mustbeequalto1whenperformingsubtraction. Theoperation thus performed becomes A, plus the 1's complement of B, plus 1. This is equal to A plus the 2'scomplement of B. 4-bitParallelSubtractor #### 4-Bit ParallelAdder/Subtractor: The addition and subtraction operation can be combined into one circuit withoue common binary adder. This is done by including an exclusive-OR $gate\ with\ each full adder. A 4-bit adder Subtractor circuit is shown below.$ The mode input M controls the operation. When M=0, the circuit is an adder and when M=1, the circuit becomes a Subtractor. Each exclusive- ORgatereceivesinputMand one of the inputs of B. When M=0, we have B 0= B. The full adders receive thevalueofB,theinputcarryis0,andthecircuitperformsAplusB.WhenM=1,wehave B 1= B' and C0=1. The B inputs are all complemented and a 1 is added through theinput carry. The circuit performs the operation A plus the 2's complement of B. Theexclusive-ORwithoutputVisfordetectinganoverflow. # $\underline{DecimalAdder(BCDAdder):}$ The digital system handles the decimal number in the form of binary coded decimal numbers (BCD). ABCD adder is a circuit that adds two BCD bits and produces a sum digital so in BCD. Consider the arithmetic addition of two decimal digits in BCD, together with an input carry from a previous stage. Since each input digit does not exceed 9, the output sum cannot be greater than 9+9+1 = 19; the 1 is the sum being an input carry. The adder will form the sum in binary and produce a result that ranges from 0 through 19. These binary numbers are labeled by symbols K, Z8, Z4, Z2, Z1, K is the carry. The columns under the binary sum list the binary values that appear in the outputs of the 4-bit binary adder. The outputs umof the two decimal digits must be represented in BCD. | BinarySum | | | BCD Sum | | | | | | | | |-----------|---|------------|---------------------|------------|---|-----------|---|------|-----------|---------| | K | | <b>Z</b> 4 | <b>Z</b> 2 <b>Z</b> | <b>Z</b> 1 | C | <b>S8</b> | | S2 S | <b>S1</b> | Decimal | | | 8 | | | | | | 4 | | | | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 2 | | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 3 | | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 4 | | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 5 | | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 6 | | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 7 | | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 8 | | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 9 | | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 10 | | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 11 | | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 12 | | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | 13 | | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 0 | 14 | | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 15 | | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 16 | | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 17 | | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 18 | | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 19 | In examining the contents of the table, it is apparent that when the binary sum isequal to or less than 1001, the corresponding BCD number is identical, and therefore no conversion is needed. When the binarysum is greater than 9(1001), we obtain a non-valid BCD representation. The addition of binary 6(0110) to the binary sum converts it to the correct BCD representation and also produces an output carry as required. The logic circuit to detect sum greater than 9 can be determined by simplifying the boolean expression of the given truth table. | | Inj | Output | | | |-----------------------|-------|------------|-------|---| | <b>S</b> <sub>3</sub> | $S_2$ | <b>S</b> 1 | $S_0$ | Y | | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 0 | 1 | 0 | | 0 | 0 | 1 | 0 | 0 | | 0 | 0 | 1 | 1 | 0 | | 0 | 1 | 0 | 0 | 0 | | 0 | 1 | 0 | 1 | 0 | | 0 | 1 | 1 | 0 | 0 | | 0 | 1 | 1 | 1 | 0 | | 1 | 0 | 0 | 0 | 0 | | 1 | 0 | 0 | 1 | 0 | | 1 | 0 | 1 | 0 | 1 | | 1 | 0 | 1 | 1 | 1 | | 1 | 1 | 0 | 0 | 1 | | 1 | 1 | 0 | 1 | 1 | | 1 | 1 | 1 | 0 | 1 | | 1 | 1 | 1 | 1 | 1 | | S <sub>3</sub> S <sub>2</sub> S <sub>1</sub> | S <sub>0</sub> | 01 | 11 | 10 | |----------------------------------------------|----------------|----|----|----| | 00 | 0 | 0 | 0 | 0 | | 01 | 0 | 0 | 0 | 0 | | 11 | 1 | 1 | 1 | 1 | | 10 | 0 | 0 | 1 | 1 | $Y = S_3S_2 + S_3S_1$ # ToimplementBCDadder werequire: - 4-bitbinaryadderforinitialaddition - Logiccircuittodetectsumgreaterthan9and - Onemore 4-bitadder to add 01102 in the sum if the sum is greater than 9 or carry is 1. The twodecimal digits, togetherwith the inputcarry, are firstadded in the top4-bit binary adder to provide the binary sum. When the output carry is equal to zero,nothing is added to the binarysum. When it equal to one, binary0110 is added to the binary sum through the bottom 4-bit adder. The output carry generated from thebottom adder can be ignored, since it supplies information already available at theoutputcarryterminal. Theoutputcarryfromonestagemust beconnected to the input carryofthen exthigher-order stage. # **MAGNITUDECOMPARATOR:** A *magnitude comparator* is a combinational circuit that compares two givennumbers(AandB)anddetermineswhetheroneisequalto,lessthanorgreatertha ntheother. Theoutputisintheformofthreebinary variables #### **Blockdiagramofn-Bit magnitudecomparator** representing the conditions A=B, A>B and A<B, if A and B are the two numbers being compared. Forcomparison of two n-bit numbers, the classical method to achieve the Boolean expressions requires a truth table of $2^{2n}$ entries and becomes too lengthy and cumbersome. #### 1-Bit Magnitude Comparator: A comparator used to compare two bits is called a single-bit comparator. It consists of two inputs each for two single-bit numbers and three outputs to generate less than, equal to, and greater than between two binary numbers. The truth table for a 1-bit comparator is given below: | Α | В | A <b< th=""><th>A=B</th><th>A&gt;B</th></b<> | A=B | A>B | |---|---|----------------------------------------------|-----|-----| | 0 | 0 | 0 | 1 | 0 | | 0 | 1 | 1 | 0 | 0 | | 1 | 0 | 0 | 0 | 1 | | 1 | 1 | 0 | 1 | 0 | From the above truth table logical expressions for each output can be expressed as follows: A>B: AB' A < B: A'B A=B: A'B' + AB From the above expressions we can derive the following formula: ``` ( A<B)+(A>B) = A'B+AB' Taking complement both sides ( (A<B) + (A>B) )' = ( A'B + AB')' ( (A<B) + (A>B) )' = ( A'B)' ( AB')' ( (A<B) + (A>B) )' = ( A + B') (A' +B ) ( (A<B) + (A>B) )' = ( AA' + AB + A'B' +BB') " " = ( AB + A'B' ) Thus, ( (A<B) + (A > B) )' = (A = B) ``` By using these Boolean expressions, we can implement a logic circuit for this comparator as given below: #### **2-bitMagnitudeComparator:** Thetruthtableof2-bitcomparatorisgivenintablebelow— # Truthtable: | | | put | Outputs | | | | |--------|----|-----|------------|-----|-----|-------------------| | A<br>3 | A2 | A1 | <b>A</b> 0 | A>B | A=B | A <b< th=""></b<> | | 0 | 0 | 0 | 0 | 0 | 1 | 0 | | 0 | 0 | 0 | 1 | 0 | 0 | 1 | | 0 | 0 | 1 | 0 | 0 | 0 | 1 | | 0 | 0 | 1 | 1 | 0 | 0 | 1 | | 0 | 1 | 0 | 0 | 1 | 0 | 0 | | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | 0 | 1 | 1 | 0 | 0 | 0 | 1 | | 0 | 1 | 1 | 1 | 0 | 0 | 1 | | 1 | 0 | 0 | 0 | 1 | 0 | 0 | | 1 | 0 | 0 | 1 | 1 | 0 | 0 | | 1 | 0 | 1 | 0 | 0 | 1 | 0 | | 1 | 0 | 1 | 1 | 0 | 0 | 1 | | 1 | 1 | 0 | 0 | 1 | 0 | 0 | | 1 | 1 | 0 | 1 | 1 | 0 | 0 | | 1 | 1 | 1 | 0 | 1 | 0 | 0 | | 1 | 1 | 1 | 1 | 0 | 1 | 0 | # K-mapSimplification: | $B_1$ | B <sub>0</sub> | For A>B | | | | |----------|----------------|---------|----|----|--| | $A_1A_0$ | 00 | 01 | 11 | 10 | | | 00 | 0 | 0 | 0 | 0 | | | 01 | 1 | 0 | 0 | 0 | | | 11 | 1 | 1 | 0 | 1 | | | 10 | 1 | 1 | 0 | 0 | | | $B_1$ | B <sub>0</sub> | For | <u>A=B</u> | | |----------|----------------|-----|------------|----| | $A_1A_0$ | 00 | 01 | 11 | 10 | | 00 | 1 | 0 | 0 | 0 | | 01 | 0 | 1 | 0 | 0 | | 11 | 0 | 0 | 1 | 0 | | 10 | 0 | 0 | 0 | 1 | $$A>B = A_0B_1'B_0' + A_1B_1' + A_1A_0B_0'$$ $$\begin{array}{c} A{=}B = A_1{'}A_0{'}B_1{'}B_0{'}{+} \ A_1{'}A_0B_1{'}B_0{+} \\ A_1A_0B_1B_0{+} \ A_1A_0{'}B_1B_0{'} \end{array}$$ $= A_1'B_1' \left( A_0'B_0' + A_0B_0 \right) + A_1B_1 \left( A_0B_0 + A_0'B_0' \right)$ $= (\mathbf{A}_0 \odot \mathbf{B}_0) (\mathbf{A}_1 \odot \mathbf{B}_1)$ For A<B $A < B = A_1'A_0'B_0 + A_0'B_1B_0 + A_1'B_1$ 2- <u>bitMagnitudeComparator</u> #### **ENCODERS:** An encoder is a digital circuit that performs the inverse operation of a decoder. Hence, the opposite of the decoding process is called encoding. An encoder is a combinational circuit that converts binary information from 2<sup>n</sup> input lines to a maxim umof\_n'unique output lines. The general structure of encoder circuit is- It has 2<sup>n</sup> input lines, only one which 1 is active at any time and \_n' output lines. Itencodes one of the active inputs to a coded binary output with \_n' bits. In an encoder,thenumberofoutputsislessthanthenumberofinputs. #### **Octal-to-BinaryEncoder:** It has eight inputs (one for each of the octal digits) and the three outputs that generate the corresponding binary number. It is assumed that only one input has avalueof1atanygiventime. | | | ( | Outpu | ts | | | | | | | |---|---|---|-------|----|---|---|---|---|---|---| | D | D | D | D | D | D | D | D | A | В | C | | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | | | | | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | The encoder can be implemented with OR gates whose inputs are determined directly from the truth table. Output z is equal to 1, when the input octal digit is 1 or 3or5or7.Outputyis 1foroctaldigits2,3,6,or7andtheoutputis1fordigits4,5,6or 7. The seconditions can be expressed by the following output Boolean functions: The encoder can be implemented with three OR gates. The encoder defined in the below table, has the limitation that only one input can be active at any given time. If two inputs are active simultaneously, the output produces an undefined combination. For eg., if D3 and D6 are 1 simultaneously, the output of the encoder may be 111. This does not represent either D6 or D3. To resolve this problem, encoder circuits must establish an input priority to ensure that only one input is encoded. If we establish a higher priority for inputs with higher subscript numbers and if D3 and D6 are 1 at the same time, the output will be 110 because D6 has higher priority than D3. #### Octal-to-BinaryEncoder Another problem in the octal-to-binary encoder is that an output with all 0's isgenerated when all the inputs are 0; this output is same as when D0is equal to 1. The discrepancy can be resolved by providing one more output to indicate that at least one input is equal to 1. #### **PriorityEncoder:** A priority encoder is an encoder circuit that includes the priority function. Inpriorityencoder, if two or more inputs are equal to 1 at the same time, the input having the highest priority will take precedence. In addition to the two outputs x and y, the circuit has a third output, V (valid bitindicator). It is set to 1 whenoneormore inputs are equal to 1. If all inputs are 0, there is no valid input and Visequal to 0. The higher the subscript number, higher the priority of the input. Input D3, has the highest priority. So, regardless of the values of the other inputs, when D3 is 1, the output for xy is 11. D2 has the next priority level. The output is 10, if D2 = 1 provided D3 = 0. The output for P1 is generated only if higher priority in puts are 0, and so on down the priority levels. #### **Truthtable:** | | Inj | out<br>s | ( | Outpu | ts | | |---|-----|----------|---|-------|----|---| | D | D | D | D | X | y | V | | 0 | 1 | 2 | 3 | | | | | 0 | 0 | 0 | 0 | X | X | 0 | | 1 | 0 | 0 | 0 | 0 | 0 | 1 | | X | 1 | 0 | 0 | 0 | 1 | 1 | | X | X | 1 | 0 | 1 | 0 | 1 | | X | X | X | 1 | 1 | 1 | 1 | Although the above table has only five rows, when each don't care condition is replaced first by 0 and then by 1, we obtain all 16 possible input combinations. Forexample, the thirdrow in the table with X100 represents minterms 0100 and 1100. The don't care condition is replaced by 0 and 1 as shown in the table below. # **ModifiedTruthtable:** | | _ | put<br>s | ( | Outpu | ts | | |---|---|----------|---|-------|----|---| | D | D | D | D | X | y | V | | 0 | 1 | 2 | 3 | | | | | 0 | 0 | 0 | 0 | X | X | 0 | | 1 | 0 | 0 | 0 | 0 | 0 | 1 | | 0 | 1 | 0 | 0 | 0 | 1 | 1 | | 1 | 1 | 0 | 0 | 0 | 1 | 1 | | 0 | 0 | 1 | 0 | | | | | 0 | 1 | 1 | 0 | 1 | 0 | 1 | | 1 | 0 | 1 | 0 | 1 | 0 | 1 | | 1 | 1 | 1 | 0 | | | | | 0 | 0 | 0 | 1 | | | | | 0 | 0 | 1 | 1 | | | | | 0 | 1 | 0 | 1 | | | | | 0 | 1 | 1 | 1 | 1 | 1 | 1 | | 1 | 0 | 0 | 1 | | | | | 1 | 0 | 1 | 1 | | | | | 1 | 1 | 0 | 1 | | | | | 1 | 1 | 1 | 1 | | | | # K-mapSimplification: The priority encoder is implemented according to the above Boolean functions. # **InputPriorityEncoder** #### **DECODERS:** A decoder is a combinational circuit that converts binary information from $\_n$ 'input lines to a maximum of $\_2n$ 'unique output lines. The general structure of decoder circuit is— # $\underline{General structure of decoder}$ Theencodedinformationispresentedas\_n'inputsproducing\_ $2^n$ 'possibleout puts. The $2^n$ output values are from 0 through $2^n$ -1. A decoder is provided withenable inputs to activate decoded output based on data inputs. When any one enableinputisunasserted, alloutputs of decoder are disabled. #### **BinaryDecoder(2to4decoder):** A binary decoder has \_n' bit binary input and a one activated output out of $2^n$ outputs. A binary decoder is used when it is necessary to activate exactly one of $2^n$ outputsbasedonann-bitinputvalue. #### **2-to-4Linedecoder** Here the 2 inputs are decoded into 4 outputs, each output representing one of the minterms of the two input variables. | I | Output<br>s | | | | | | |-------|-------------|---|---|---|---|---| | Enabl | A | В | Y | Y | Y | Y | | e | | | 3 | 2 | 1 | 0 | | 0 | X | X | 0 | 0 | 0 | 0 | | 1 | 0 | 0 | 0 | 0 | 0 | 1 | | 1 | 0 | 1 | 0 | 0 | 1 | 0 | | 1 | 1 | 0 | 0 | 1 | 0 | 0 | | 1 | 1 | 1 | 1 | 0 | 0 | 0 | As shown in the truth table, if enable input is 1 (EN=1) only one of the outputs (Y0-Y3), is active for a given input. The output Y0 is active, ie., Y0 = 1 when inputs A=B=0,Y1isactivewheninputs, A=0andB=1, Y2isactive, when input A=1 and B=0, Y3isactive, wheninputsA=B=1. #### 3 to-8LineDecoder: A3-to-8linedecoderhasthreeinputs(A,B,C)andeightoutputs(Y0- Y7).Basedonthe3inputsoneoftheeightoutputsisselected. Thethreeinputsaredecodedintoeightoutputs, each output representing one of the minterms of the 3-input variables. This decoder is used for binary-to-octal conversion. The input variables may represent a binary number and the outputs will represent the eight digits in the octal number system. The output variables are mutually exclusive because only one output can be equal to 1 at anyone time. The output line whose value is equal to 1 represents the mintermequivalent of the binar ynumber presently available in the input lines. | | Input<br>s | | | | | Out | t <b>put</b> | | | | |---|------------|---|---|--------|--------|--------|--------------|--------|--------|-----| | A | В | C | Y | Y<br>1 | Y<br>2 | Y<br>3 | Y<br>4 | Y<br>5 | Y<br>6 | Y 7 | | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 3-to-8linedecoder # **BCDto7-SegmentDisplayDecoder:** A seven-segment display isnormally usedfordisplaying any one of the decimaldigits,0through9.ABCD-to-sevensegmentdecoderacceptsadecimaldigitinBCDandgeneratesthecorrespondingseven-segmentcode. $$\begin{array}{c|c} a \\ f & g \\ e & d & c \end{array}$$ Each segment is made up of a material that emits light when current is passed through it. The segment sactivated during each digit displayare tabulated as — | Dig<br>it | Displa<br>y | Segments<br>Activated | |-----------|-------------------------------------------------------------------------------------------------|-----------------------| | 0 | $ \begin{array}{c c} \underline{a} \\ f & b \\ e & c \end{array} $ | a,b,c, d,e,f | | 1 | | b,c | | 2 | | a, b, d, e, g | | 3 | <u>a</u> b <u>d</u> c | a, b,c,d,g | | 4 | f g b | b,c,f, g | | 5 | $ \begin{array}{c c} & \underline{a} \\ f & \underline{g} \\ & \underline{d} \\ \end{array} $ | a,c,d,f,g | | 6 | $ \begin{array}{c c} & \underline{a} \\ f & \underline{g} \\ e & \underline{d} \end{array} $ | a,c,d,e,f,g | |---|------------------------------------------------------------------------------------------------|-----------------| | 7 | b<br> c | a,b,c | | 8 | $ \begin{array}{c c} a \\ f & g \\ \hline e & c \end{array} $ | a, b,c,d,e,f, g | | 9 | $ \begin{array}{c c} & \underline{a} \\ & \underline{g} \\ & \underline{d} \\ \end{array} $ | a,b,c,d,f,g | # **Truthtable:** | | | BCDcode | | | 7- | | | | | | | | |------|---|---------|---|---|----|-------------|---|---|---|---|---|--| | | | | | | | Segmentcode | | | | | | | | Digi | A | В | C | D | a | b | c | d | e | f | g | | | t | | | | | | | | | | | | | | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | | | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | | | 2 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | | | 3 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | | | 4 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | | | 5 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | 1 | | | 6 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | | | 7 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | | | 8 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | | | 9 | 1 | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 1 | | ### **K-mapSimplification:** | \CI | | <u>For</u> | <u>(e)</u> | | |-----|-------|------------|------------|----| | AB | 00 | 01 | 11 | 10 | | 00_ | 1 | 0 | 0 | 1 | | 01 | 0 | 0 | 0 | 1 | | 11 | Х | X | X | X | | 10 | 1 | 0 | X | x | | , | e= B' | D'+ C | D' | ' | **BCDto7-segmentdisplaydecoder** # **Applications of decoders:** - 1. Decodersareusedincountersystem. - 2. Theyareusedinanalogtodigitalconverter. - ${\it 3. \ Decoder outputs} can be used to drive a display system.$ A *multiplexer* or MUX, is a combinational circuit with more than one input line,oneoutputlineandmorethanoneselectionline. Amultiplexerselects binary information present from one of many input lines, depending upon the logic status of the selection inputs, and routes it to the output line. Normally, there are $2^n$ input lines and n selection lines whose bit combinations determine which input is selected. The multiplexer is often labeled as MUX in block diagrams. A multiplexer is also called a **data selector**, since it selects one of many inputsandsteersthebinaryinformation to the output line. $\underline{Block diagram of Multiplexer}$ #### **2-to-1-lineMultiplexer**: $The circuit has two data in put lines, one output line and one selection line, S.\ When S=0, the upper AND gate is enabled and I0 has a path to the output.$ When S=1,thelowerAND gateisenabledandI1has apathtotheoutput. The multiple xeract slike an electronic switch that selects one of the two sources. #### **Truthtable:** | S | Y | |---|-------| | 0 | $I_0$ | | 1 | $I_1$ | #### **4-to-1-lineMultiplexer**: A 4-to-1-line multiplexer has four $(2^n)$ input lines, two (n) select lines and one output line. It is the multiplexer consisting of four input channels and information of one of the channels can be selected and transmitted to an output line according to theselect inputs combinations. Selection of one of the four input channel is possible by two selection inputs. EachofthefourinputsI0throughI3,isappliedtooneinputofANDgate.Selection lines S1and S0are decoded to select a particular AND gate. The outputs of theANDgateareappliedtoasingleORgatethatprovidesthe1-lineoutput. 4-to-1-LineMultiplexer #### **Functiontable:** | S <sub>1</sub> | S <sub>0</sub> | Y | |----------------|----------------|----| | 0 | 0 | I0 | | 0 | 1 | I1 | | 1 | 0 | I2 | | 1 | 1 | I3 | To demonstrate the circuit operation, consider the case when S1S0= 10. The ANDgateassociatedwithinputI2hastwoofitsinputsequalto1andthethirdinputconnecte dtoI2. Theotherthree ANDgateshaveatleastone input equalto0, which makes their output sequal to 0. The OR output is now equal to the value of I2, providing a path from the selected in put to the output. The data output is equal to I0 only if S1=0 and S0=0; Y= I0S1'S0'. The data output is equal to I1 only if S1=0 and S0=1; Y= I1S1'S0. The data output is equal to I2 only if S1=1 and S0=0; Y= I2S1S0'. The data output is equal to I3only if S1=1 and S0=1; Y= I3S1S0. When these terms are ORed, the total expression for the data output is, ## Y=I0S1'S0'+I1S1'S0+I2S1S0'+I3S1S0. As in decoder, multiplexers may have an enable input to control the operation of the unit. When the enable input is in the inactive state, the outputs are disabled, andwhenit is in the active state, the circuit functions as a normal multiplexer. ## **Quadruple 2-to-1LineMultiplexer:** This circuit has four multiplexers, each capable of selecting one of two inputlines. Output Y0 can be selected to come from either A0 or B0. Similarly, output Y1 mayhave the value of A1 or B1, and so on. Input selection line, S selects one of the lines ineachofthefourmultiplexers. The enable input Emust be active for normal operation. Although the circuit contains four 2-to-1-Line multiplexers, it is viewed as acircuit that selects one of two 4-bit sets of data lines. The unit is enabled when E= 0.ThenifS=0,thefourAinputshaveapathtothefouroutputs.Ontheotherhand,ifS=1, the four B inputs are applied to the outputs. The outputs have all 0's when E= 1,regardlessofthevalueofS. ## **Application**: Themultiplexerisaveryuseful MSI function and has various ranges of applications in data communication. Signal routing and data communication are the important applications of amultiplexer. It is useful for constructing a common bus system. One of the general properties of a multiplexer is that Boolean functions can be implemented by this device. ## ImplementationofBooleanFunctionusingMUX: AnyBooleanorlogicalexpressioncanbeeasilyimplementedusingamultiplex er. If a Boolean expression has (n+1) variables, then \_n' of these variables canbe connected to the select lines of the multiplexer. The remaining single variable alongwith constants 1 and 0 is used as the input of the multiplexer. For example, if C is the singlevariable, then the input softhemultiplexers are C, C', 1 and 0. By this method a nylogical expression can be implemented. In general, a Boolean expression of (n+1) variables can be implemented using amultiplexerwith2<sup>n</sup>inputs. # ${\bf 1.} \ \ {\bf Implement the following boolean}$ functionusing 4:1 multiplexer, $F(A,B,C) = \sum m(1,3,5,6)$ . ## **Solution:** ``` Variables,n=3(A,B,C)Se lectlines=n-1=2(S1,S0) 2n- 1toMUXi.e.,22to1=4to1MUXInputli nes=2<sup>n-1</sup>=2<sup>2</sup>=4(D0,D1,D2,D3) ``` ## **Implementationtable:** Applyvariables A and B to the select lines. The procedures for implementing the function are: - i. Listtheinputofthemultiplexer - ii. Listunderthemallthemintermsintworowsasshownbelow. The first half of the minterms is associated with A' and the second half with A. The given function is implemented by circling the minterms of the function and applying the following rules to find the values for the input softhemultiplexer. 1. Ifboththemintermsinthecolumnarenotcircled,apply0tothecorrespondingin put. - 2. Ifboththemintermsinthecolumnarecircled,apply1tothecorrespondinginput. - $3. \ If the bottom mintermiscircled and the top is not circled, apply Ctothe input.$ - 4. Ifthetopmintermiscircledandthebottomis notcircled,applyC'totheinput. ## **MultiplexerImplementation**: $$2.F(x,y,z)=\sum m(1,2,6,7)$$ # **Solution:** ## **Implementationtable:** | | $\mathbf{D}_0$ | D <sub>1</sub> | $\mathbf{D}_{2}$ | <b>D</b> <sub>3</sub> | |---|----------------|----------------|------------------|-----------------------| | Z | 0 | 1 | 2 | 3 | | Z | 4 | 5 | 6 | 7 | | | 0 | Z | 1 | z | # **Multiplexer Implementation:** $$3.F(A,B,C) = \sum m(1,2,4,5)$$ # **Solution:** Variables,n=3(A,B,C)Selectli nes=n-1=2(**S1,S0**) 2n- $1_{toMUXi.e.,2} \\ 2_{to1} \\ = \\ 4_{to1} \\ MUXInputli$ $nes=2^{n-1}=2^2=4(\mathbf{D0},\mathbf{D1},\mathbf{D2},\mathbf{D3})$ # **Implementationtable:** | | $\mathbf{D}_0$ | D <sub>1</sub> | $\mathbf{D}_2$ | $D_3$ | |-------------------------|----------------|----------------|-------------------------|-------| | $\overline{\mathbf{c}}$ | 0 | 1 | 2 | 3 | | С | (4) | (b) | 6 | 7 | | | С | 1 | $\overline{\mathbf{C}}$ | 0 | ## $4.F(P,Q,R,S)=\sum m(0,1,3,4,8,9,15)$ ## **Solution:** Variables,n=4(P,Q,R,S)Sel ectlines=n-1=3(**S2,S1,S0**) 2<sup>n-1</sup>toMUXi.e.,2<sup>3</sup>to1=8to1MUX Inputlines=2<sup>n-1</sup>=2<sup>3</sup>=8(**D0,D1,D2,D3,D4,D5,D6,D7**) # **Implementationtable:** | | $\mathbf{D}_0$ | D <sub>1</sub> | $\mathbf{D}_{2}$ | <b>D</b> <sub>3</sub> | $D_4$ | <b>D</b> 5 | <b>D</b> 6. | <b>D</b> <sub>7</sub> | |-------------------------|----------------|----------------|------------------|-------------------------|-------------------------|------------|-------------|-----------------------| | $\overline{\mathbf{S}}$ | 0 | 1 | 2 | (3) | 4 | 5 | 6 | 7 | | S | $\odot$ | (3) | 10 | 11 | 12 | 13 | 14 | 15 | | | 1 | 1 | 0 | $\overline{\mathbf{S}}$ | $\overline{\mathbf{s}}$ | 0 | 0 | s | ## **Multiplexer Implementation:** 5. ImplementtheBooleanfunctionusing8:1andalsousing4:1multiplexer $F(A,B,C,D)=\sum m(0,1,2,4,6,9,12,14)$ # **Solution:** Variables,n=4(A,B,C,D)Se lectlines=n-1=3(**S2,S1,S0**) $2^{n-1}$ toMUXi.e., $2^3$ to1=8to1MUX Inputlines= $2^{n-1}=2^3=8(\mathbf{D0},\mathbf{D1},\mathbf{D2},\mathbf{D3},\mathbf{D4},\mathbf{D5},\mathbf{D6},\mathbf{D7})$ | | | $\mathbf{D}_{0}$ | D <sub>1</sub> | <b>D</b> <sub>2</sub> | <b>D</b> <sub>3</sub> | D <sub>4</sub> | <b>D</b> 5 | <b>D</b> 6. | <b>D</b> <sub>7</sub> | |---|-------------------------|-------------------------|----------------|-------------------------|-----------------------|----------------|------------|-------------|-----------------------| | | $\overline{\mathbf{D}}$ | 0 | 1 | (2) | 3 | 4 | 5 | (9) | 7 | | | D | 8 | (3) | 10 | 11 | 12 | 13 | (14) | 15 | | : | | $\overline{\mathrm{D}}$ | 1 | $\overline{\mathrm{D}}$ | 0 | 1 | 0 | 1 | 0 | **Implementationtable:** MultiplexerImplementation(Using8:1MUX): Using4:1MUX: $6.F(A,B,C,D)=\sum m(1,3,4,11,12,13,14,15)$ ## **Solution:** Variables,n=4(A,B,C,D)Se lectlines=n-1=3(**S2,S1,S0**) $2^{n-1}$ toMUXi.e., $2^3$ to1=8to1MUX Inputlines= $2^{n-1}=2^3=8(\textbf{D0,D1,D2,D3,D4,D5,D6,D7})$ # <u>Implementationtable</u>: | | $\mathbf{D}_{0}$ | D <sub>1</sub> | $\mathbf{D}_{2}$ | <b>D</b> <sub>3</sub> | D <sub>4</sub> | <b>D</b> <sub>5</sub> | <b>D</b> <sub>6.</sub> | <b>D</b> <sub>7</sub> | |-------------------------|------------------|-------------------------|------------------|-----------------------|----------------|-----------------------|------------------------|-----------------------| | $\overline{\mathrm{D}}$ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | | D | 8 | 9 | 10 | (11) | 12 | 13 | (14) | (15) | | | 0 | $\overline{\mathbf{D}}$ | 0 | 1 | 1 | D | D | D | # **Multiplexer Implementation:** $7. \ Implement the Boolean function using 8:1 multiple xer.\\$ $$F(A,B,C,D)=A'BD'+ACD+B'CD+A'C'D.$$ ## **Solution:** ConvertintostandardSOPform, ## **Implementationtable:** | | $\mathbf{D}_0$ | D <sub>1</sub> | $\mathbf{D}_{2}$ | $\mathbf{D}_3$ | D <sub>4</sub> | <b>D</b> 5 | <b>D</b> <sub>6.</sub> | <b>D</b> <sub>7</sub> | |-------------------------|----------------|-------------------------|------------------|----------------|-------------------------|-------------------------|-------------------------|-----------------------| | $\overline{\mathbf{D}}$ | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | | D | 8 | 9 | 10 | (11) | 12 | 13 | 14 | (15) | | | 0 | $\overline{\mathrm{D}}$ | 0 | 1 | $\overline{\mathrm{D}}$ | $\overline{\mathrm{D}}$ | $\overline{\mathbf{D}}$ | D | ## **Multiplexer Implementation:** $8. \ Implement the Boolean function using 8:1 multiple xer.\\$ $$F(A,B,C,D)=AB'D+A'C'D+B'CD'+AC'D.$$ ## **Solution:** ConvertintostandardSOPform, - =AB'D(C'+C)+A'C'D(B'+B)+B'CD'(A'+A)+AC'D(B'+B) - $= \underline{AB'C'D} + AB'CD + A'B'C'D + A'BC'D + A'B'CD' + \underline{AB'C}$ $\underline{'D} + ABC'D$ - =AB'C'D+AB'CD+A'B'C'D+A'BC'D+A'B'CD'+AB'CD'+ABC'D - =m9+m11+m1+m5+m2+m10+m13 - $=\sum m (1,2,5,9,10,11,13).$ ## <u>ImplementationTable:</u> | | $\mathbf{D}_{0}$ | D <sub>1</sub> | <b>D</b> <sub>2</sub> | <b>D</b> <sub>3</sub> | $D_4$ | <b>D</b> 5 | <b>D</b> 6, | <b>D</b> <sub>7</sub> | |-------------------------|------------------|----------------|-----------------------|-----------------------|-------|------------|-------------|-----------------------| | $\overline{\mathrm{D}}$ | 0 | 1 | 2 | 3 | 4 | (5) | 6 | 7 | | D | 8 | (3) | 10 | (11) | 12 | 13 | 14 | 15 | | | 0 | 1 | 1 | D | 0 | 1 | 0 | 0 | ## **Multiplexer Implementation:** 9. ImplementtheBooleanfunctionusing8:1andalsousing4:1multiplexer $F(w,x,y,z)=\sum m(1,2,3,6,7,8,11,12,14)$ # **Solution:** Implementationtable: | × | ii ca o i | <u> </u> | | | | | | | | |---|-----------|----------------|----------------|------------------|----------------|-------|----------------|-------------|-----------------------| | | | $\mathbf{D}_0$ | D <sub>1</sub> | $\mathbf{D}_{2}$ | $\mathbf{D}_3$ | $D_4$ | $\mathbf{D}_5$ | <b>D</b> 6, | <b>D</b> <sub>7</sub> | | | Z | 0 | 1 | 2 | 3 | 4 | 5 | (0) | 7 | | | Z | $\odot$ | 9 | 10 | (1) | 12 | 13 | 14) | 15 | | • | | Z | z | Z | 1 | Z | 0 | 1 | Z | # $\frac{MultiplexerImplementation(Using 8:1 MUX):}{\mid z\mid}$ # (Using4:1MUX): 10. ImplementtheBooleanfunctionusing8:1multiplexer $F(A,B,C,D)=\prod m(0,3,5,8,9,10,12,14)$ # **Solution:** Variables,n=4(A,B,C,D)Se lectlines=n-1=3(**S2,S1,S0**) $2^{n\text{-}1} to MUX i.e., 2^3 to 1 = 8 to 1 MUX$ Inputlines= $2^{n-1}=2^3=8($ **D0,D1,D2,D3,D4,D5,D6,D7**) ## **Implementationtable:** | | $\mathbf{D}_{0}$ | D <sub>1</sub> | $\mathbf{D}_{2}$ | $\mathbf{D}_3$ | $D_4$ | <b>D</b> <sub>5</sub> | <b>D</b> 6, | $\mathbf{D}_7$ | |-------------------------|------------------|-------------------------|-------------------------|----------------|-------------------------|-----------------------|-------------------------|----------------| | $\overline{\mathrm{D}}$ | 0 | 1 | 2 | 3 | (4) | 5 | 6 | (7) | | D | 8 | 9 | 10 | (11) | 12 | 13 | 14 | 15 | | | 0 | $\overline{\mathbf{D}}$ | $\overline{\mathrm{D}}$ | D | $\overline{\mathbf{D}}$ | D | $\overline{\mathrm{D}}$ | 1 | ## **Multiplexer Implementation:** 11. ImplementtheBooleanfunctionusing8:1multiplexer $F(A,B,C,D)=\sum m(0,2,6,10,11,12,13)+d(3,8,14)$ ## **Solution:** Variables,n=4(A,B,C,D)Se lectlines=n-1=3(**S2,S1,S0**) $2^{n-1}$ toMUXi.e., $2^3$ to1=8to1MUX Inputlines= $2^{n-1}=2^3=8(\mathbf{D0},\mathbf{D1},\mathbf{D2},\mathbf{D3},\mathbf{D4},\mathbf{D5},\mathbf{D6},\mathbf{D7})$ ## **ImplementationTable:** | | $\mathbf{D}_{0}$ | D <sub>1</sub> | $\mathbf{D}_{2}$ | <b>D</b> <sub>3</sub> | D <sub>4</sub> | <b>D</b> <sub>5</sub> | <b>D</b> <sub>6.</sub> | <b>D</b> <sub>7</sub> | |----------------------|------------------|----------------|------------------|-----------------------|----------------|-----------------------|------------------------|-----------------------| | $\overline{ ext{D}}$ | $\odot$ | 1 | 2 | 3 | 4 | 5 | (O) | 7 | | D | ⊗ | 9 | (10) | (11) | 12 | 13 | 14 | 15 | | | 1 | 0 | 1 | 1 | D | D | 1 | 0 | # **Multiplexer Implementation:** 12. An8×1multiplexer hasinputsA,BandC connectedtotheselectioninputsS2,S1,andS0respectively.ThedatainputsI0t o I7areasfollows I1 = I2 = I7 = 0; I3 = I5 = 1; I0 = I4 = D and I6 = D'. Determine the Boolean function that the multiple xerimple ments. # **Multiplexer Implementation:** # **Implementationtable:** | | $I_{0}$ | Iı | ** | ~> | *4 | -0 | <b>~</b> u . | -, | |-------------------------|---------|----|----|------|----|----|-------------------------|----| | $\overline{\mathbf{D}}$ | 0 | 1 | 2 | 3 | 4 | 5 | (%) | 7 | | D | 8 | 9 | 10 | (11) | 12 | 13 | 14 | 15 | | | D | 0 | 0 | 1 | D | 1 | $\overline{\mathrm{D}}$ | 0 | $F(A,B,C,D)=\sum m(3,5,6,8,11,12,13).$ ## **DEMULTIPLEXER:** Demultiple x means one into many. Demultiple xing is the process of taking information from one input and transmitting the same over one of several outputs. $A demultiple xeris a combination allogic circuit that receives information on a single input and transmits the same information over one of several (2^n) output lines. \\$ **Blockdiagramofdemultiplexer** The block diagram of a demultiplexer which is opposite to a multiplexer in its operation is shown above. The circuit has one input signal, \_n' select signals and 2<sup>n</sup>output signals. The select inputs determine to which output the data input will beconnected. As the serial data is changed to parallel data, i.e., the input caused to appear on one of the noutput lines, the demultiplexer is also called a — data distributer or or $--serial\text{-}to\text{-}parallel converter}\|.$ ## **1-to-4Demultiplexer:** A1-to- 4demultiplexerhasasingleinput, Din, four outputs (Y0toY3) and two selectinputs (S1andS0). ## **LogicSymbol** The input variable Dinhas a path to all four outputs, but the input information is directed to only one of the output lines. The truth table of the 1-to-4 demultiplexer is shown below. | Enabl | S1 | S <sub>0</sub> | Din | Y0 | Y1 | Y2 | <b>Y3</b> | |-------|----|----------------|-----|----|----|----|-----------| | e | | | | | | | | | 0 | X | X | X | 0 | 0 | 0 | 0 | | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | Truthtableof1-to-4demultiplexer Fromthetruthtable,itisclearthatthedatainput,Dinisconnectedtotheoutput Y0, when S1=0 and S0=0 and the data input is connected to output Y1 when S1=0 and S0=1. Similarly, the data input is connected to output Y2 and Y3 when S1=1 and S0=0 and when S1=1 and S0=1, respectively. Also, from the truth table, the expression foroutputscanbewrittens follows, Y0= **S1'S0'DinY1=** S1'S0DinY2=S1S0 'Din Y3=S1S0Din ## Logicdiagramof1-to-4demultiplexer Now, using the above expressions, a 1-to-4 demultiplexer can be implemented using four 3-input AND gates and two NOT gates. Here, the input data line Din, is connected to all the AND gates. The two select lines \$1,50 enable only one gate at a time. and the data that appears on the input line passes through the selected gate to the associated output line. ## **1-to-8Demultiplexer:** A1-to- 8demultiplexerhasasingleinput, $D_{in}$ , eightoutputs ( $Y_0toY_7$ ) and three selectinputs ( $S_2$ , $S_1$ and $S_0$ ). It distributes one input line to eightout put lines based on the selectin puts. The truth table of 1-to-8 demultiplexer is shown below. | Din | S | S <sub>1</sub> | S <sub>0</sub> | Y | Y | Y | Y | Y | Y | Y | Y | |-----|---|----------------|----------------|---|---|---|---|---|---|---|---| | | 2 | | | 7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 | | 0 | X | X | X | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | | 1 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ## **Truthtableof1-to-8demultiplexer** From the above truth table, it is clear that the data input is connected with one of the eight outputs based on these lectin puts. Now from this truth table, the expression for eight outputs can be written as follows: Y0=S2'S1'S0'Din Y1=S2'S1'S0Din Y2=S2'S1S0'Din Y3=S2'S1S0Din Y4= S2S1'S0'Din $Y_5 = S_2S_1$ 'SODin #### Y7=S2S1S0Din Now using the above expressions, the logic diagram of a 1-to-8 demultiplexer can bedrawn as shown below. Here, the single data line, Din is connected to all the eight ANDgates, but only one of the eight AND gates will be enabled by the select input lines. For example, if S2S1S0=000, then only AND gate-0 will be enabled and thereby the datainput, Dinwillappearat Y0. Similarly, the different combinations of the selectinp uts, the input Dinwillappearat the respective output. ## <u>Logicdiagramof1-to-8demultiplexer</u> 1. Design1:8demultiplexerusingtwo1:4DEMUX. # 2. Implementfullsubtractorusingdemultiplexer. | | Input<br>s | | Output<br>s | | | |---|------------|---|--------------|-----------|--| | A | B Bi | | Difference(D | Borrow(Bo | | | | | n | ) | ut) | | | 0 | 0 | 0 | 0 | 0 | | | 0 | 0 | 1 | 1 | 1 | | | 0 | 1 | 0 | 1 | 1 | | | 0 | 1 | 1 | 0 | 1 | | | 1 | 0 | 0 | 1 | 0 | | | 1 | 0 | 1 | 0 | 0 | | | 1 | 1 | 0 | 0 | 0 | | | 1 | 1 | 1 | 1 | 1 | | #### UNITII-SYNCHRONOUS SEQUENTIAL LOGIC #### Introduction $In {\it combinational logic circuits}, the output satany instant of time depend only on the input signal spresent at that time. For a change in input, the output occurs immediately.$ In *sequential logic circuits*, it consists of combinational circuits to whichstorage elements are connected to form a feedback path. The storage elements are devices capable of storing binary information either 1 or 0. The information stored in the memory elements at any given time defines the present state of these quential circuit. The present state and the external circuit determine the output and the next state of sequential circuits. #### SequentialCircuit-BlockDiagram Thusins equential circuits, the output variables depend not only on the past history of input variables. Therotarychannelselectedknobonanold-fashionedTVislikeacombinational. Its output selects a channel based only on its current input – theposition of the knob. The channel-up and channel-down push buttons on a TV is likeasequentialcircuit.Thechannelselectiondependsonthepastsequenceofup/downpus hes.Thecomparisonbetweencombinationalandsequentialcircuitsisgivenintablebelow. | S.No | Combinationallogic | Sequentiallogic | |------|-------------------------------------------------------------------------|--------------------------------------------------------------------------------------------| | 1 | Theoutputvariable,atalltimesd ependsonthecombination of inputvariables. | Theoutputvariabledependsnotonlyo nthepresentinputbutalsodepend uponthepasthistoryofinputs. | | 2 | Memoryunitisnotrequired | Memoryunitisrequiredtostorethe pasthistoryofinputvariables. | | 3 | Fasterinspeed | Slowerthancombinationalcircuits. | | 4 | Easy to design | Comparativelyhardertodesign. | | 5 | Eg.Paralleladder | Eg.Serialadder | These quential circuits can be classified depending on the timing of their signals: - Synchronoussequentialcircuits - Asynchronoussequential circuits. In synchronous sequential circuits, signals can affect the memory elementsonlyatdiscreteinstantsoftime. In a synchronous sequential circuits change in input signals can affect memory element at any instant of time. The memory elements used in both circuits are Flip-Flops, which are capable of storing 1-bitinformation. | S.No | Synchronoussequentialcircuits | Asynchronoussequentialcircuits | | | |------|--------------------------------------------------------------------------------|----------------------------------------------------------------------------|--|--| | 1 | Memoryelements are clocked Flip-Flops | Memoryelementsareeitherunclocked Flip-Flopsortimedelayelements. | | | | 2 | Thechangeininputsignalscan affect memory element uponactivationofclocksig nal. | Thechangeininputsignalscanaffectmem oryelementatanyinstantoftime. | | | | 3 | Themaximumoperatingspeedofc lockdependsontimedelays involved. | Becauseoftheabsenceofclock,itcanoper ate faster than synchronous circuits. | | | | 4 | Easiertodesign | Moredifficulttodesign | | | #### LATCHES: Latches and Flip-Flops are the basic building blocks of the most sequentialcircuits.Latchesareusedforasequentialdevicethatchecksallofitsinputscontinu ouslyandchangesitsoutputsaccordinglyatanytimeindependentofclockingsignal.Enables ignalisprovidedwiththelatch.Whenenablesignal isactive output changes occur as the input changes. But when enable signal is notactivatedinputchangesdonotaffecttheoutput. Flip-Flop is used for a sequential device that normally samples its inputs andchangesitsoutputsonlyattimesdeterminedbyclockingsignal. #### SRLatch: The simplest type of latch is the set-reset (SR) latch. It can be constructed fromeithertwoNORgatesortwoNANDgates. #### **SRlatchusingNORgates:** The two NOR gates are cross-coupled so that the output of NOR gate 1 isconnected to one of the inputs of NOR gate 2 and vice versa. The latch has twooutputsQandQ'andtwoinputs,setandreset. Before going to analyse the SR latch, we recall that a logic 1 at any input of aNORgateforcesitsoutputtoalogic0.Letusunderstandtheoperationofthiscircuitforvariou sinput/outputpossibilities. #### Case1:S=0andR=0 Initially,Q=1andQ'=0 Letus assumethatinitiallyQ=1 and Q'=0. WithQ'=0,both inputs to NORgate 1 are at logic 0. So, its output, Q is at logic 1. With Q=1, one input of NOR gate 2 isatlogic 1. Hence its output, $Q^\prime$ is at logic 0. This shows that when S and R both arelow,theoutputdoesnotchange. Initially,Q=0andQ'=1 WithQ'=1,oneinputofNORgate1isatlogic1,henceitsoutput,Qisatlogic 0.WithQ=0,bothinputstoNORgate2areatlogic 0. So, its output Q'is at logic 1. In this case also there is no change in the output state. #### Case2:S=0andR=1 Inthiscase, Rinputofthe NORgate 1 is at logic 1, hence its output, Qisatlogic 0. Both inputs to NO Rgate2arenowatlogic 0.Sothatitsoutput,Q'is atlogic 1. #### Case3:S=1andR=0 In thiscase, Sinputofthe NORgate 2isat logic1, henceitsoutput, Q isat logic0. Bothinputs toNORgate1arenow atlogic0.Sothatits output,Qisatlogic 1. #### Case4:S=1andR=1 When R and S both are at logic 1, they force the outputs of both NOR gates to the low state, i.e., (Q=0 and Q'=0). So, we call this an indeterminate or prohibited state, and represent this condition in the truth table as an asterisk (\*). This condition has a violates the basic definition of a latch that requires Q to be complement of Q'. Thus innormal operation this condition must be avoided by making sure that 1's are not applied to both the inputs simultaneously. We can summarize the operation of SR latch as follows: - When S=0 and R=0, the output, $Q_{n+1}$ remains in its present state, $Q_n$ . - WhenS=0andR=1,thelatchisresetto0. - WhenS=1andR=0,thelatchissetto1. - WhenS=1andR=1,theoutputofbothgateswillproduce0.i.e., $\mathbf{Q}_{n+1} = \mathbf{Q}_{n+1}' = \mathbf{0}$ . | S | R | Qn | Q <sub>n+1</sub> | State | |---|---|----|------------------|---------------| | 0 | 0 | 0 | 0 | No Change | | 0 | 0 | 1 | 1 | (NC) | | | | | | | | 0 | 1 | 0 | 0 | Reset | | 0 | 1 | 1 | 0 | | | | | | | | | 1 | 0 | 0 | 1 | Set | | 1 | 0 | 1 | 1 | | | | | | | | | 1 | 1 | 0 | X | Indeterminate | | 1 | 1 | 1 | X | * | | | | | | | #### **SRlatchusingNANDgates:** The SR latch can also be implemented using NAND gates. The inputs of this Latchare Sand R. Tounderstand how this circuit functions, recall that allow on any input to a NAND gate for cesits output high. WecansummarizetheoperationofSRlatchasfollows: - WhenS=0andR=0,theoutputofboth gateswillproduce0.i.e., $\mathbf{Q}_{n+1}=\mathbf{Q}_{n+1}'=\mathbf{1}$ . - WhenS=0andR=1,thelatchisresetto0. - WhenS=1andR=0,thelatchissetto1. - When S=1 and R=1, the output, $Q_{n+1}$ remains in its present state, $Q_n$ . | S | R | Qn | Q <sub>n+1</sub> | State | |---|---|----|------------------|---------------| | 0 | 0 | 0 | X | Indeterminate | | 0 | 0 | 1 | X | * | | | | | | | | 0 | 1 | 0 | 1 | Set | | 0 | 1 | 1 | 1 | | | | | | | | | 1 | 0 | 0 | 0 | Reset | | 1 | 0 | 1 | 0 | | | | | | | | | 1 | 1 | 0 | 0 | No Change | | 1 | 1 | 1 | 1 | (NC) | | | | | | | #### **GatedSRLatch**: In the SR latch, the output changes occur immediately after the input changes i.e., the latch is sensitive to its SandR input sall the time. A latch that is sensitive to the input sonly when an enable input is active. Such a latch with enable input is known as gated SR latch. ThecircuitbehaveslikeSRlatchwhenEN=1.ItretainsitspreviousstatewhenEN =0 #### **SRLatchwithenableinputusingNANDgates** ThetruthtableofgatedSRlatchisshowbelow. | O | | | | | | |----|---|---|----|------------------|---------------| | EN | S | R | Qn | Q <sub>n+1</sub> | State | | 1 | 0 | 0 | 0 | 0 | No Change(NC) | | 1 | 0 | 0 | 1 | 1 | | | | | | | | | | 1 | 0 | 1 | 0 | 0 | Reset | | 1 | 0 | 1 | 1 | 0 | | | | | | | | | | 1 | 1 | 0 | 0 | 1 | Set | | 1 | 1 | 0 | 1 | 1 | | | | | | | | | | 1 | 1 | 1 | 0 | X | Indeterminate | | 1 | 1 | 1 | 1 | X | * | | 1 | 1 | ı | 1 | ı | 1 | | 0 | X | x | 0 | 0 | No Change(NC) | |---|---|---|---|---|---------------| | 0 | X | x | 1 | 1 | | | | | | | | | When Sis HIGH and Ris LOW, a HIGH on the EN input sets the latch. When Sis LOW and Ris HIGH, a HIGH on the EN input resets the latch. #### **DLatch** In SR latch, when both inputs are same (00 or 11), the output either does notchange or it is invalid. In many practical applications, these input conditions are notrequired. These input conditions can be avoided by making them complement ofeachother. This modified SR latch is known as **Dlatch**. As shown in the figure, Dinput goes directly to the Sinput, and its complement is applied to the R input. Therefore, only two input conditions exists, either S=0 and R=1 or S=1 and R=0. The truth table for Dlatch is shown below. | EN | D | $\mathbf{Q}_{\mathbf{n}}$ | Q <sub>n+1</sub> | State | |----|---|---------------------------|---------------------------|---------------| | 1 | 0 | X | 0 | Reset | | 1 | 1 | X | 1 | Set | | 0 | X | X | $\mathbf{Q}_{\mathbf{n}}$ | No Change(NC) | | | | | | | | | | | | | As shown in the truth table, the Qoutput follows the Dinput. For this reason, Dlatch is called transparent latch. When D is HIGH and EN is HIGH. Q goes HIGH. When D is LOW and EN isHIGH,QgoesLOW.WhenENisLOW,thestateofthelatchisnotaffectedbytheDinput. #### **TRIGGERINGOFFLIP-FLOPS** ThestateofaFlip-Flopisswitchedbyamomentarychangeintheinputsignal. This momentary change is called a trigger and the transition it causes is said totrigger the Flip-Flop. Clocked Flip-Flops are triggered by pulses. A clock pulse startsfrom an initial value of 0, goes momentarily to 1 and after a short time, returns to itsinitial 0 value. Latches are controlled by enable signal, and they are level triggered, eitherpositive level triggered or negative level triggered. The output is free to changeaccording to the S and R input values, when active level is maintained at the enableinput.Flip-Flopsaredifferentfromlatches.Flip- Flopsarepulseorclockedgetriggeredinsteadofleveltriggered. #### **EDGETRIGGEREDFLIP-FLOPS** Flip-Flops are synchronous bistable devices (has two outputs Q and Q'). Inthiscase, the terms ynchronous means that the output changes state only at aspecified point on the triggering input called the clock (CLK), i.e., changes in the output occur in synchronization with the clock. An*edge-triggeredFlip-Flop* changesstateeitheratthepositiveedge(rising edge) or at the negative edge (falling edge) of the clock pulse and is sensitive to itsinputs only at this transition of the clock. The different types of edge-triggered Flip-Flopsare— - S-RFlip-Flop, - J-KFlip-Flop, - DFlip-Flop, - TFlip-Flop. Although the S-R Flip-Flop is not available in IC form, it is the basis for the DandJ-KFlip-Flops.Eachtypecanbeeitherpositiveedge-triggered(nobubbleatC input) or negative edge-triggered (bubble at C input). The key to identifying anedge-triggered Flip-Flop by its logic symbol is the small triangle inside the block attheclock(C)input.Thistriangleiscalledthedynamicinputindicator. #### S-RFlip-Flop The Sand Rinputs of the S-RFlip-Flopare called *synchronous* inputs becaused at a on these inputs are transferred to the Flip-Flop's output only on the triggering edge of the clock pulse. The circuit is similar to SR latch except enable signal is replaced by clock pulse (CLK). On the positive edge of the clock pulse, the circuit responds to the Sand Rinputs. WhenSisHIGHandRisLOW,theQoutputgoesHIGHonthetriggeringedgeoftheclockp ulse,andtheFlip-FlopisSET.WhenSisLOWandRisHIGH,theQ output goes LOW on the triggering edge of the clock pulse, and the Flip-Flop isRESET. When both S and R are LOW, the output does not change from its prior state.AninvalidconditionexistswhenbothSandRareHIGH. | CLK | S | R | Qn | Q <sub>n+1</sub> | State | |-----|---|---|----|------------------|---------------| | 1 | 0 | 0 | 0 | 0 | No Change(NC) | | 1 | 0 | 0 | 1 | 1 | | | | | | | | | | 1 | 0 | 1 | 0 | 0 | Reset | | 1 | 0 | 1 | 1 | 0 | | | | | | | | | | 1 | 1 | 0 | 0 | 1 | Set | | 1 | 1 | 0 | 1 | 1 | | | | | | | | | | 1 1 | 1 1 | 1<br>1 | 0<br>1 | X<br>X | Indeterminate<br>* | |-----|--------|--------|--------|--------|--------------------| | 0 | X<br>X | X<br>X | 0<br>1 | 0<br>1 | No Change(NC) | TruthtableforSRFlip-Flop J-KFlip-Flop: ## <u>JKFlipFlop</u> The data input J and the output Q' are applied o the first AND gate and $it soutput (JQ') is applied to the \underline{Sinput of SRF lip-Flop. Similarly, the data input Kand}$ the output Qare applied to the second AND gate and its output (KQ) is applied to the Rin put of SRF lip-Flop. #### J=K=0 (a) Using SR flipflop (b) Graphic symbol When J=K=0, both AND gates are disabled. Therefore clock pulse have no effect, hence the Flip-Flopout put is same as the previous output. #### J=0,K=1 When J=0 and K=1, AND gate 1 is disable di.e., S=0 and R=1. This condition will reset the eFlip-Flop to 0. #### J=1,K=0 When J=1 and K=0, AND gate 2 is disable di.e., S=1 and R=0. Therefore the Flip-Flopwill set on the application of a clock pulse. #### J=K=0 When J=K=1, it is possible to set or reset the Flip-Flop. If Qis High, AND gate 2 passes on a reset pulse to the next clock. When Q is low, AND gate 1 passes on a set pulse to the next clock. Eitherway, Q changes to the complement of the last state i.e., toggle. Toggle means to switch to the opposite state. The truth table of JKF lip-Flop is given below. | CLK | Inputs | | Output | State | |-----|--------|---|------------------|-----------| | | J | K | Q <sub>n+1</sub> | | | 1 | 0 | 0 | Qn | No Change | | 1 | 0 | 1 | 0 | Reset | | 1 | 1 | 0 | 1 | Set | | 1 | 1 | 1 | Q <sub>n</sub> ' | Toggle | InputandoutputwaveformsofJKFlip-Flop #### CharacteristictableandCharacteristicequation: The characteristic table for JK Flip-Flop is shown in the table below. From the table, K-map for the next state transition $(Q_{n+1})$ can be drawn and the simplified logicexpressionwhich represents the characteristic equation of JKFlip-Flop can be found. | Qn | J | K | Q <sub>n+1</sub> | |----|---|---|------------------| | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 0 | | 0 | 1 | 0 | 1 | | 0 | 1 | 1 | 1 | | 1 | 0 | 0 | 1 | | 1 | 0 | 1 | 0 | | 1 | 1 | 0 | 1 | | 1 | 1 | 1 | 0 | | | I | | | **Characteristictable** #### **K-mapSimplification:** Characteristic equation: $Q_{n+1} = JQ' + K'Q$ . #### **DFlip-Flop:** LikeinDlatch,inDFlip-FlopthebasicSRFlip-Flopisusedwithcomplementedinputs.TheDFlip-FlopissimilartoD-latchexceptclockpulseisusedinsteadofenableinput. #### **DFlip-Flop** ToeliminatetheundesirableconditionoftheindeterminatestateintheRSFlip-FlopistoensurethatinputsSandRareneverequalto1atthesametime.Thisis donebyD Flip-Flop.TheD (*delay*) Flip-Flophas one inputcalled delayinputandclockpulseinput.TheDFlip-FlopusingSRFlip-Flopisshownbelow. (a) Using SR flipflop (b) Graphic symbol The truth table of D Flip-Flop isgiven below. | Clock | D | Q <sub>n+1</sub> | State | |-------|---|------------------|-----------| | 1 | 0 | 0 | Reset | | 1 | 1 | 1 | Set | | 0 | X | Qn | No Change | **TruthtableforDFlip-Flop** InputandoutputwaveformsofclockedDFlip-Flop Looking at the truth table for D Flip-Flop we can realize that $Q_{n+1}$ function follows the D input at the positive going edges of the clock pulses. ## CharacteristictableandCharacteristicequation: $The characteristic table for DF lip-Flopshows that the next state of the Flip-Flop is independent of the present states in ceQ_{n+1} is equal to D. This means that an input pulse will transfer the value of input Dinto the output of the Flip-Flop is a constant of the present states are the present states and the present states are the present states and the present states are presen$ Flopindependentofthevalueoftheoutputbeforethepulsewasapplied. The characteristic equation is derived from K-map. | Qn | D | Q <sub>n+1</sub> | |----|---|------------------| | 0 | 0 | 0 | | 0 | 1 | 1 | | 1 | 0 | 0 | | 1 | 1 | 1 | | | | | | | | | **Characteristictable** K-map simplification #### TFlip-Flop The T (Toggle) Flip-Flop is a modification of the JK Flip-Flop. It is obtained from JKFlip- FlopbyconnectingbothinputsJandKtogether,i.e.,singleinput.Regardlessofthepresents tate,theFlip-FlopcomplementsitsoutputwhentheclockpulseoccurswhileinputT=1. #### TFlip-Flop When T= 0, $Q_{n+1}$ = $Q_n$ , ie., the next state is the sameas the present state and nochangeoccurs. When T=1, $Q_{n+1}=Q_n$ , i.e., the next state is the complement of the present state. ThetruthtableofTFlip-Flopisgivenbelow. | T | Q <sub>n+1</sub> | State | |-----|------------------------------------|------------------------| | 0 1 | Q <sub>n</sub><br>Q <sub>n</sub> ' | No<br>ChangeT<br>oggle | **TruthtableforTFlip-Flop** ## CharacteristictableandCharacteristicequation: ThecharacteristictableforTFlip- FlopisshownbelowandcharacteristicequationisderivedusingK-map. | Qn | T | Q <sub>n+1</sub> | |----|---|------------------| | 0 | 0 | 0 | | 0 | 1 | 1 | | 1 | 0 | 1 | | 1 | 1 | 0 | | | | | | | | | ## **K-mapSimplification:** Characteristic equation: $Q_{n+1} = TQ_n' + T'Q_n$ . ## Master-SlaveJKFlip-Flop Amaster-slaveFlip-FlopisconstructedusingtwoseparateJKFlip-Flops.The firstFlip-Flop is called the master. Itis driven by the positive edge of the clockpulse.ThesecondFlip-Flopiscalledtheslave.Itisdrivenby the negative edge oftheclockpulse.Thelogicdiagramofamaster-slaveJKFlip-Flopisshownbelow. **Logicdiagram** When the clock pulse has a positive edge, the master acts according to its J-K inputs, but the slave does not respond, since it requires a negative edge at the clockinput. When the clock input has a negative edge, the slave Flip-Flop copies themaster outputs. But the master does not respond since it requires a positive edge atitsclockinput. The clocked master-slave J-KFlip-Flopusing NAND gates is shown below. Master-SlaveJKFlip-Flop ## **APPLICATION TABLE (OR) EXCITATION TABLE:** $The \emph{characteristic table} is useful for \textbf{analysis} and for defining the operation of the Flip-Flop. Its pecifies the next state (Q_{n+1}) when the inputs and present state are known.$ $The \emph{excitation or application table} is useful for \emph{design} process. It is used to find the Flip-Flopin put conditions that will cause the required transition, when the present state (Q_n) and the next state (Q_{n+1}) are known.$ SR flipflop | Present<br>State | Inputs | | Next<br>State | |------------------|--------|---|------------------| | Qn | S | R | Q <sub>n+1</sub> | | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 0 | | 0 | 1 | 0 | 1 | | 0 | 1 | 1 | X | | 1 | 0 | 0 | 1 | | 1 | 0 | 1 | 0 | | 1 | 1 | 0 | 1 | | 1 | 1 | 1 | х | | Present<br>State | Next<br>State | Inputs | | Inp | outs | |------------------|------------------|--------|---|-----|------| | Qn | Q <sub>n+1</sub> | S | R | S | R | | 0 | 0 | 0 | 0 | 0 | Х | | 0 | 0 | 0 | 1 | U | A | | 0 | 1 | 1 | 0 | 1 | 0 | | 1 | 0 | 0 | 1 | 0 | 1 | | 1 | 1 | 0 | 0 | Х | 0 | | 1 | 1 | 1 | 0 | 21 | 3 | #### $\underline{CharacteristicTable}$ | Present<br>State | Next<br>State | Inputs | | |------------------|------------------|--------|---| | Qn | Q <sub>n+1</sub> | S | R | | 0 | 0 | 0 | X | | 0 | 1 | 1 | 0 | | 1 | 0 | 0 | 1 | | 1 | 1 | X | 0 | #### **ModifiedTable** xcitation Table The above table presents the excitation table for SR Flip-Flop. It consists of present state (Qn), next state (Qn+1) and a column for each input to show how the required transition is achieved. There are 4 possible transitions from present state to next state. The required Input conditions for each of the four transitions are derived from the information available in the characteristic table. The symbol 'x' denotes the don't care condition, it does not matter whether the input is 0 or 1. # JKFlip-Flop: | Present<br>State | Inputs | | Next<br>State | |------------------|--------|---|------------------| | Qn | J | K | Q <sub>n+1</sub> | | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 0 | | 0 | 1 | 0 | 1 | | 0 | 1 | 1 | 1 | | 1 | 0 | 0 | 1 | | 1 | 0 | 1 | 0 | | 1 | 1 | 0 | 1 | | 1 | 1 | 1 | 0 | | Present<br>State | Next<br>State | Inputs | | Inp | outs | |------------------|------------------|--------|---|-----|------| | Qn | Q <sub>n+1</sub> | J | K | J | K | | 0 | 0 | 0 | 0 | 0 | X | | 0 | 0 | 0 | 1 | | | | 0 | 1 | 1 | 0 | 1 | Х | | 0 | 1 | 1 | 1 | 1 | | | 1 | 0 | 0 | 1 | Х | 1 | | 1 | 0 | 1 | 1 | Х | 1 | | 1 | 1 | 0 | 0 | Х | 0 | | 1 | 1 | 1 | 0 | Λ | U | $\underline{CharacteristicTable}$ **ModifiedTable** | Present<br>State | Next<br>State | Inputs | | |------------------|------------------|--------|---| | Qn | Q <sub>n+1</sub> | J | K | | 0 | 0 | 0 | Х | | 0 | 1 | 1 | X | | 1 | 0 | X | 1 | | 1 | 1 | Х | 0 | **Excitation Table** # DFlip-Flop | Present | Input | Next | |---------|-------|-------| | State | Input | State | | Qn | D | Qn+1 | | 0 | 0 | 0 | | 0 | 1 | 1 | | 1 | 0 | 0 | | 1 | 1 | 1 | ## $\underline{CharacteristicTable}$ # TFlip-Flop | Present<br>State | Input | Next<br>State | |------------------|-------|---------------| | Qn | T | Qn+1 | | 0 | 0 | 0 | | 0 | 1 | 1 | | 1 | 0 | 1 | | 1 | 1 | 0 | | | | | | | | | **CharacteristicTable** | Present<br>State | Next<br>State | Input | |------------------|------------------|-------| | Qn | Q <sub>n+1</sub> | D | | 0 | 0 | 0 | | 0 | 1 | 1 | | 1 | 0 | 0 | | 1 | 1 | 1 | ### $\underline{ExcitationTable}$ | Present<br>State | Next<br>State | Input | |------------------|------------------|-------| | Qn | Q <sub>n+1</sub> | T | | 0 | 0 | 0 | | 0 | 1 | 1 | | 1 | 0 | 1 | | 1 | 1 | 0 | **ModifiedTable** ## REALIZATIONOFONEFLIP-FLOPUSINGOTHERFLIP-FLOPS It is possible to convert one Flip-Flop into another Flip-Flop with someadditional gates or simply doing some extra connection. The realization of one Flip-Flop using other Flip-Flops is implemented by the use of characteristic tables and excitation tables. Let us see few conversions among Flip-Flops. - \* SRFlip-FloptoDFlip- - FlopSRFlip-FloptoJKFlip- - \* FlopSRFlip-FloptoTFlip- - \* FlopJKFlip-FloptoTFlip- - \* FlopJKFlip-FloptoDFlip- - FlopDFlip-FloptoTFlip-Flop - \* TFlip-FloptoDFlip-Flop ## SRFlip-FloptoDFlip-Flop: - WritethecharacteristictableforrequiredFlip-Flop(DFlip-Flop). - WritetheexcitationtableforgivenFlip-Flop(SRFlip-Flop). - Determine the expression for the given Flip-Flop inputs (SandR) by using K-map. - DrawtheFlip-FlopconversionlogicdiagramtoobtaintherequiredFlip-Flop(D Flip-Flop)byusingtheaboveobtainedexpression. The excitation table for the above conversion is | RequiredFlip-Flop(D) | | GivenFli<br>(SI | | | |----------------------|--------------|------------------|-----------|--------| | Input | Presentstate | Nextstate | Flip-Flop | Inputs | | D | Qn | Q <sub>n+1</sub> | S | R | | 0 | 0 | 0 | 0 | Х | | 0 | 1 | 0 | 0 | 1 | | 1 | 0 | 1 | 1 | 0 | | 1 | 1 | 1 | X | 0 | ## K-map simplification ## Logic diagram # SRFlip-FloptoJKFlip-Flop The excitation table for the above conversion is, | Inp | outs | Presentstate | Nextstate | | Flop<br>put | |-----|------|--------------|------------------|---|-------------| | J | K | Qn | Q <sub>n+1</sub> | S | R | | 0 | 0 | 0 | 0 | 0 | Х | | 0 | 0 | 1 | 1 | X | 0 | | 0 | 1 | 0 | 0 | 0 | X | | 0 | 1 | 1 | 0 | 0 | 1 | | 1 | 0 | 0 | 1 | 1 | 0 | | 1 | 0 | 1 | 1 | X | 0 | | 1 | 1 | 0 | 1 | 1 | 0 | | 1 | 1 | 1 | 0 | 0 | 1 | ### K-map simplification ## 2.7.3 SRFlip-FloptoTFlip-Flop Theexcitationtablefortheaboveconversionis | Input | Presentstate | Nextstate | Flip-Flop<br>Inputs | | |-------|--------------|-----------|---------------------|---| | T | Qn | Qn+1 | S | R | | 0 | 0 | 0 | 0 | Х | | 0 | 1 | 1 | X | 0 | | 1 | 0 | 1 | 1 | 0 | | 1 | 1 | 0 | 0 | 1 | K-map simplification Logic diagram # 3.7.4JKFlip-FloptoTFlip-Flop Theexcitationtablefortheaboveconversionis | Input | Presentstate | Nextstate | Flip<br>Inj | -Flop<br>outs | |-------|--------------|-----------|-------------|---------------| | T | Qn | Qn+1 | J | K | | 0 | 0 | 0 | 0 | X | | 0 | 1 | 1 | X | 0 | | 1 | 0 | 1 | 1 | X | | 1 | 1 | 0 | X | 1 | K-map simplification Logic diagram # JKFlip-FloptoDFlip-Flop The excitation table for the above conversion is | Input | Presentstate | Nextstate | Flip-Flop<br>Inputs | | |-------|--------------|------------------|---------------------|---| | D | Qn | Q <sub>n+1</sub> | J | K | | 0 | 0 | 0 | 0 | X | | 0 | 1 | 0 | X | 1 | | 1 | 0 | 1 | 1 | X | | 1 | 1 | 1 | X | 0 | Logic diagram # DFlip-FloptoTFlip-Flop The excitation table for the above conversion is | Input | Presentstate | Nextstate | Flip-Flop<br>Input | | | |-------|--------------|------------------|--------------------|--|--| | T | Qn | Q <sub>n+1</sub> | D | | | | 0 | 0 | 0 | 0 | | | | 0 | 1 | 1 | 1 | | | | 1 | 0 | 1 | 1 | | | | 1 | 1 | 0 | 0 | | | ## TFlip-FloptoDFlip-Flop The excitation table for the above conversion is | Input | Presentstate | Nextstate | Flip-Flop<br>Input | | | |-------|--------------|------------------|--------------------|--|--| | D | Qn | Q <sub>n+1</sub> | Т | | | | 0 | 0 | 0 | 0 | | | | 0 | 1 | 0 | 1 | | | | 1 | 0 | 1 | 1 | | | | 1 | 1 | 1 | 0 | | | $\mathbf{T} = \mathbf{D}\overline{\mathbf{Q}}_n + \overline{\mathbf{D}}\mathbf{Q}_n$ $= \mathbf{D} \oplus \mathbf{Q}_n$ # CLASSIFICATIONOFSYNCHRONOUSSEQUENTIALCIRCUIT: In synchronous or clocked sequential circuits, clocked Flip-Flops are used asmemory elements, which change their individual states in synchronism with theperiodic clock signal. Therefore, the change in states of Flip-Flop and change in state of the entire circuits occurat the transition of the clock signal. The synchronous or clocked sequential networks are represented by twomodels. - **Mooremodel:**TheoutputdependsonlyonthepresentstateoftheFlip-Flops. - Mealymodel: The output depends on both the present state of the Flip-Flopsandontheinputs. #### Mooremodel: IntheMooremodel,theoutputsareafunctionofthepresentstateoftheFlip-Flops only. The output depends only on present state of Flip-Flops, it appears onlyafter theclock pulse isapplied, i.e.,it variesinsynchronism with the clockinput. ## Mealymodel: $In the {\tt Mealy model}, the outputs are functions of both the present state of the {\tt Flip-Flops} and inputs.$ #### **Mealymodel** # DifferencebetweenMooreandMealymodel | Sl.No | Mooremodel | Mealymodel | |-------|-----------------------------------------------------------|-----------------------------------------------------------| | 1 | Itsoutputisafunctionofpresent stateonly. | Itsoutputisafunctionofpresentstate aswellaspresentinput. | | 2 | Inputchangesdoesnotaffectthe output. | Inputchangesmayaffecttheoutputof thecircuit. | | 3 | Itrequiresmorenumberofstates forimplementingsamefunction. | Itrequireslessnumberofstatesfor implementingsamefunction. | ## ANALYSISOFSYNCHRONOUSSEQUENTIALCIRCUIT: The behavior of a sequential circuit is determined from the inputs, outputsand the state of its Flip-Flops. The outputs and the next state are both a function of the inputs and the present state. The analysis of a sequential circuit consists o fobtaining at able or diagram from the time sequence of inputs, outputs and internal states. Before going to see the analysis and design examples, we first understand the state diagram, state table. ### StateDiagram Statediagramisapictorialrepresentation of abehavior of asequential circuit. - Inthestatediagram, a state is represented by a circle and the transition between states is indicated by directed lines connecting the circles. - A directed line connecting a circle with circle with itself indicates that next state is same as present state. - Thebinarynumberinsideeach circleidentifies thestate representedbythecircle. - The directed lines are labeled with two binary numbers separated by a symbol '/'. The input value that causes the state transition is labeled first and the output value during the present state is labeled after the symbol '/'. Incase of Moore circuit, the directed lines are labeled with only one binary number represe nting the state of the input that causes the state transition. The output state is indicated within the circle, below the present state because output state depends only on present state and not on the input. StateTable #### Statetablerepresents relationshipbetweeninput, output and Flip-Flop states. - Itconsistsofthreesectionslabeledpresentstate,nextstateandoutput. - ThepresentstatedesignatesthestateofFlip-Flopsbeforetheoccurrence of a clock pulse, and the output section gives the values oftheoutputvariablesduringthepresentstate. - o Both the next state and output sections have two columns representing two possible input conditions: X=0 and X=1. | Presentstate | Nexts | state | Output | | | |--------------|-------|-------|--------|-----|--| | | X=0 | X=1 | X=0 | X=1 | | | AB | AB | AB | Y | Y | | | a | a | С | 0 | 0 | | | b | b | a | 0 | 0 | | | С | d | С | 0 | 1 | | | d | b | d | 0 | 0 | | Incase of Moore circuit, the output section has only one columns ince output does not depend on input. | Presentstate | Nexts | Output | | |--------------|-------|--------|---| | | X=0 | X=1 | Y | | AB | AB | AB | | | a | a | С | 0 | | b | b | a | 0 | | С | d | С | 1 | | d | b | d | 0 | #### 2.9.3 StateEquation ItisanalgebraicexpressionthatspecifiestheconditionforaFlip-Flopstatetransition. The Flip-Flopsmay beof any typeandthe logicdiagrammay ormaynotincludecombinationalcircuitgates. #### **ANALYSIS PROCEDURE** The synchronous sequential circuit analysis is summarizes as given below: - 1. AssignastatevariabletoeachFlip-Flopinthesynchronoussequentialcircuit. - 2. WritetheexcitationinputfunctionsforeachFlip-FlopandalsowritetheMoore/Mealyoutputequations. - $3. \ \ Substitute the excitation input functions into the bistable equations for the Flip-Flops to obtain the next state output equations.$ - 4. Obtainthestatetableandreducedformofthestatetable. - 5. Drawthestatediagrambyusingthesecondformofthestatetable. #### **Analysis of Mealy Model** 1. A sequential circuit has two JKFlip-Flops A and B, one input(x) and one output(y). the Flip-Flop input functions are, $$J_A=B+x$$ $J_B=A'+x'$ $K_A=1$ $K_B=1$ and the circuit output function, Y=xA'B. - a) DrawthelogicdiagramoftheMealycircuit, - b) Tabulatethestatetable, - c) Drawthestatediagram. ### Soln: ### Statetable: To obtain the next-state values of a sequential circuit with JKF lip-Flops, use the JKF lip-Flop characteristic stable. | Prese | ntstate | Input | Flip-FlopInputs Nextstate | | | Output | | | | |-------|---------|-------|---------------------------|-------------------|-------------|-------------------|--------|--------|--------| | A | В | X | J <sub>A</sub> =B+x | K <sub>A</sub> =1 | $J_B=A'+x'$ | K <sub>B</sub> =1 | A(t+1) | B(t+1) | Y=xA'B | | | | | | | | | | | | | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 1 | | 1 | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | | 1 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | | Presentstate | | | Nex | Output | | | | |--------------|---|-----|-----|--------|---|-----|-----| | | | x=0 | | x=1 | | x=0 | x=1 | | A | В | A | В | A | В | у | у | | 0 | 0 | 0 | 1 | 1 | 1 | 0 | 0 | | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | | 1 | 1 | 0 0 | | 0 0 | | 0 | 0 | | | | | | | | | | Second form ofstatetable ## **State Diagram:** # 2. Asequentialcircuitwithtwo'D'Flip- FlopsAandB,oneinput(x)andoneoutput(y).theFlip-Flopinputfunctionsare: $D_A=Ax+Bx$ $D_B=A'x$ and the circuit output function is, Y=(A+B)x'. - (a) Drawthe logicdiagram of the circuit, - (b) Tabulatethestatetable, - (c) Drawthestatediagram. #### Soln: # **State Table:** | Prese | ntstate | Input | Flip-FlopInputs | | Next | state | Output | |-------|---------|-------|-----------------------------|---|--------|--------|-----------| | A | В | X | $D_A = D_B = A'x$ $Ax + Bx$ | | A(t+1) | B(t+1) | Y=(A+B)x' | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | | <b>.</b> | | | Nex | | Output | | | |--------------|---|-----|-----|-----|--------|-----|-----| | Presentstate | | x=0 | | x=1 | | x=0 | x=1 | | A | В | A | В | A | В | Y | Y | | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | | 0 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | | 1 | 1 | 0 | 0 | 1 | 0 | 1 | 0 | Secondform of statetable # StateDiagram: $3. \ \ Analyze the synchronous Mealy machine and obtain its state diagram.$ ### Soln: The given synchronous Mealy machine consists of two D Flip-Flops, one inputs and one output. TheFlip-Flopinputfunctionsare, $$D_A = Y_1'Y_2X'D$$ $$B = X + Y_1'Y_2$$ The circuit output function is, $Z = Y_1Y_2X$ ## **State Table:** | Presei | ntstate | Input | Flip-Flo | pInputs | Next | state | Output | |-----------------------|-----------------------|-------|-----------------|---------------------|----------------------|----------------------|-------------| | <b>Y</b> <sub>1</sub> | <b>Y</b> <sub>2</sub> | X | $D_A=Y_1'Y_2X'$ | $D_B = X + Y_1'Y_2$ | Y <sub>1</sub> (t+1) | Y <sub>2</sub> (t+1) | $Z=Y_1Y_2X$ | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | | Presentstate | | | Nex | Output | | | | |----------------|-----------------------|-----------------------|-----------------------|-----------------------|-----------------------|-----|-----| | | | X=0 | | X=1 | | X=0 | X=1 | | Y <sub>1</sub> | <b>Y</b> <sub>2</sub> | <b>Y</b> <sub>1</sub> | <b>Y</b> <sub>2</sub> | <b>Y</b> <sub>1</sub> | <b>Y</b> <sub>2</sub> | Z | Z | | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | | | | | | | | | | ### Secondform ofstatetable # StateDiagram: 4. AsequentialcircuithastwoJKFlop- FlopsAandB,twoinputsxandyandoneoutput z.The Flip-Flop inputequation and circuit output equations are $J_A = Bx + B'y'$ $J_B = A'x$ Z = Ax'y' + Bx'y' $K_A = B'xy'$ $K_B = A + xy'$ - (a) Drawthelogicdiagramofthecircuit - (b) Tabulatethestatetable. - (c) Derivethestate equation. ## Statediagram: #### **Statetable:** To obtain the next-state values of a sequential circuit with JKF lip-Flop, use the JKF lip-Flop characteristic table, | | esent | Input | | Flip-FlopInputs | | | | Next | state | Output | |---|-------|-------|---|-----------------------------|---------------------------|-------------------------|---------------------------|--------|--------|--------| | A | В | х | y | J <sub>A</sub> =<br>Bx+B'y' | K <sub>A</sub> =<br>B'xy' | J <sub>B</sub> =<br>A'x | K <sub>B</sub> =<br>A+xy' | A(t+1) | B(t+1) | Z | | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | 0 | | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 1 | | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | | 1 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | 0 | ## **State Equation:** | | | For $B(t+1)$ | | | | | | | | |-------|----|--------------|----|----|--|--|--|--|--| | AB\xy | 00 | 01 | 11 | 10 | | | | | | | 00 | 0 | 0 | 1 | 1 | | | | | | | 01 | 0 | 0 | 1 | 1 | | | | | | | 11 | 0 | 0 | 0 | 0 | | | | | | | 10 | 0 | 0 | 0 | 0 | | | | | | $\mathbf{A} \ (\mathbf{t+1}) = \mathbf{A}\mathbf{x'} + \mathbf{A}\mathbf{y} + \mathbf{B}\mathbf{x} + \mathbf{A'}\mathbf{B'}\mathbf{y'}$ $\mathbf{B}(\mathbf{t+1}) = \mathbf{A}'\mathbf{x}$ 5. As equential circuit has two JKF lip-Flop A and B. the Flip-Flop in put functions are: $J_A = B$ $J_B = x'$ $K_A=Bx'$ $K_B=A\oplus x$ . - (a) Drawthe logicdiagram of the circuit, - (b) Tabulatethestatetable, - (c) Drawthestatediagram. # Logic diagram: Theoutputfunctionisnotgivenintheproblem. Theoutputof the Flip-Flopsmay beconsidered as the output of the circuit. ### Statetable: To obtain the next-state values of a sequential circuit with JKF lip-Flop, use the JKF lip-Flop characteristic table. | Presei | ntstate | Input | | Flip-FlopInputs | | | Nextstate | | |--------|---------|-------|-------------------|---------------------|--------------------|---------------------|-----------|--------| | A | В | Х | J <sub>A</sub> =B | K <sub>A</sub> =Bx' | J <sub>B</sub> =x' | K <sub>B</sub> =A x | A(t+1) | B(t+1) | | 0 | 0 | 0 | 0 | 0 | 1 | 0 _ | 0 | 1 | | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 1 | 1 | | 0 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 0 | | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 1 | 1 | | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 1 | | | | | | Nex | tstate | | |----|------|---------|----|-----|--------|----| | Pr | esei | ntstate | X= | =0 | X | =1 | | P | A | В | A | В | A | В | | ( | ) | 0 | 0 | 1 | 0 | 0 | | ( | ) | 1 | 1 | 1 | 1 | 0 | | 1 | l | 0 | 1 | 1 | 1 | 0 | | 1 | l | 1 | 0 | 0 | 1 | 1 | | | | | | | | | ### **Secondformofstatetable** ## StateDiagram: assigned variable $Y_1 and \ Y_2 for the two JKF lip-Flops, we can$ write the four excitation input equations and the Moore output equat $J_A=Y_2X$ ; $K_A=Y_2'$ ionasfollows $J_B=X$ ; $K_B=X'$ and outputfunction, $Z=Y_1Y_2'$ Analys isofMo oreMo del 6. Analyzethe synchronou sMoorecirc uitandobtai nitsstatedia gram. ### Soln: Using the ## **Secondformofstatetable** Statetable: | Presei | ntstate | Input | | Flip-FlopInputs | | | Nextstate | | Output | |-----------------------|-----------------------|-------|------------|-----------------|-------------------|--------------------|----------------------|----------------------|-------------| | <b>Y</b> <sub>1</sub> | <b>Y</b> <sub>2</sub> | X | $J_A=Y_2X$ | $K_A=Y_2'$ | J <sub>B</sub> =X | K <sub>B</sub> =X' | Y <sub>1</sub> (t+1) | Y <sub>2</sub> (t+1) | $Z=Y_1Y_2'$ | | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | | 0 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | | 0 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 1 | 1 | | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | | _ | | | Output | | | | |-----------------------|-----------------------|-----------------------|-----------------------|-------------------------------|---|---| | Presei | ntstate | X= | =0 | X: | v | | | <b>Y</b> <sub>1</sub> | <b>Y</b> <sub>2</sub> | <b>Y</b> <sub>1</sub> | <b>Y</b> <sub>2</sub> | Y <sub>1</sub> Y <sub>2</sub> | | Y | | 0 | 0 | 0 | 0 | 0 | 1 | 0 | | 0 | 1 | 0 | 0 | 1 | 1 | 0 | | 1 | 0 | 0 | 0 | 0 | 1 | 1 | | 1 | 1 | 1 | 0 | 1 | 1 | 0 | ## **StateDiagram:** Here the output depends on the present state only and is independent of theinput. The two values in side each circle separated by as lashare for the present state and output. $7. \ \ A sequential circuit has two TF lip-Flop A and B. The Flip-Flop input functions are:$ $$T_A=Bx$$ $T_B=x$ $y=AB$ - (a) Drawthe logicdiagram of the circuit, - (b) Tabulatethestatetable, - (c) Drawthestatediagram. ## Soln: ## Statetable | Presei | ntstate | Input | Flip-Flo | pInputs | Next | state | Output | |--------|---------|-------|--------------------|-------------------|--------|--------|--------| | A | В | X | T <sub>A</sub> =Bx | T <sub>B</sub> =x | A(t+1) | B(t+1) | y=AB | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | | 0 | 1 | 1 | 1 | 1 | 1 | 0 | 0 | | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | | 1 | 1 | 1 | 1 | 1 | 0 | 0 | 1 | | _ | | | Nex | tstate | | Output | | | |-------|---------|----|-----|--------|---|--------|-----|--| | Prese | ntstate | X= | =0 | x=1 | | x=0 | x=1 | | | A | В | A | В | A | В | у | y | | | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | | | 0 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | | | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | | | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 1 | | **Secondformofstatetable** #### StateDiagram: # STATEREDUCTION/MINIMIZATION The state reduction is used to avoid the redundant states in the sequential circuits. The reduction in redundant states reduces the number of required Flip-Flopsandlogic gates, reducing the cost of the final circuit. The two states are said to be redundant or equivalent, if every possible set ofinputs generateexactlysameoutputand samenextstate. When two states are equivalent, one of them can be removed without altering the input-output relationship. Since'n'Flip- $Flops produced 2^n state, are duction in the number of states may result in a reduction in the number of Flip-Flops.\\$ Theneedforstatereductionorstateminimizationisexplainedwithoneexample. ${\color{red} \underline{State diagram}} \\ Step 1: Determine the state table for given state diagram$ | Presentstate | Nexts | tate | Output | | | |--------------|-------|------|--------|-----|--| | | X=0 | X=1 | X=0 | X=1 | | | a | b | С | 0 | 0 | | | b | d | е | 1 | 0 | | | С | С | d | 0 | 1 | | | d | a | d | 0 | 0 | | | e | С | d | 0 | 1 | | Statetable ## **Step2:Findequivalentstates** From the above state table c and e generate exactly same next state and same output for every possible set of inputs. The state c and e got on ext states c and d and have outputs 0 and 1 for x=0 and x=1 respectively. Therefore state e can be removed and replaced by c. The final reduced state table is shown below | .Presentstate | Nextstate | | Output | | | |---------------|-----------|-----|--------|-----|--| | | X=0 | X=1 | X=0 | X=1 | | | a | b | С | 0 | 0 | | | b | d | С | 1 | 0 | | | С | С | d | 0 | 1 | | | d | a | d | 0 | 0 | | $\label{lem:consists} The state diagram for the reduced table consists of only four states and is shown below. \\ \underline{\textbf{Reduced state diagram}}$ $1. \ \ Reduce the number of states in the following state table and tabulate the reduced state table \\ e.$ | Presentstate | Nextstate | | Output | | | |--------------|-----------|-----|--------|-----|--| | | X=0 | X=1 | X=0 | X=1 | | | a | a | b | 0 | 0 | | | b | С | d | 0 | 0 | | | С | a | d | 0 | 0 | | | d | е | f | 0 | 1 | | | e | a | f | 0 | 1 | | | f | g | f | 0 | 1 | | | g | a | f | 0 | 1 | | #### Soln: From the above state table e and g generate exactly same next state and same output for every possible set of inputs. The state e and g got on ext states a and f and have out puts 0 and 1 for x=0 and x=1 respectively. Therefore state g can be removed and replaced by e. Thereduced state table-1 is shown below. | Presentstate | Nextstate | | Output | | | |--------------|-----------|---|--------|-----|--| | | X=0 X=1 | | X=0 | X=1 | | | a | a | b | 0 | 0 | | | b | С | d | 0 | 0 | | | С | a | d | 0 | 0 | | | d | е | f | 0 | 1 | | | e | a | f | 0 | 1 | |---|---|---|---|---| | f | e | f | 0 | 1 | #### Reducedstatetable-1 Nowstates dand fare equivalent. Both states got othes amenext state (e,f) and have esame output (0,1). Therefore one state can be removed; fis replaced by **d**. The final reduced state table - 2 is shown below. | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | a | a | b | 0 | 0 | | b | С | d | 0 | 0 | | С | a | d | 0 | 0 | | d | e | d | 0 | 1 | | e | a | d | 0 | 1 | Reducedstatetable-2 Thus7statesarereducedinto5states. ### 2. Determineaminimalstatetableequivalentfurnishedbelow | Presentstate | Nextstate | | | |--------------|-----------|------|--| | | X=0 | X=1 | | | 1 | 1, 0 | 1, 0 | | | 2 | 1, 1 | 6, 1 | | | 3 | 4, 0 | 5, 0 | | | 4 | 1, 1 | 7, 0 | | | 5 | 2, 0 | 3, 0 | | | 6 | 4, 0 | 5, 0 | | | 7 | 2, 0 | 3, 0 | | #### Soln: | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | 1 | 1 | 1 | 0 | 0 | | 2 | 1 | 6 | 1 | 1 | | 3 | 4 | 5 | 0 | 0 | | 4 | 1 | 7 | 1 | 0 | | 5 | 2 | 3 | 0 | 0 | | 6 | 4 | 5 | 0 | 0 | | 7 | 2 | 3 | 0 | 0 | From the above state table, 5 and 7 generate exactly same next state and same output for every possible set of inputs. The state 5 and 7 go to next states 2 and 3 and have outputs 0 and table to the state 5 and 7 go to next states 2 and 3 and have outputs 0 and table to the state 5 and 7 go to next states 2 and 3 and have outputs 0 and table to the state 5 and 7 go to next states 2 and 3 and have outputs 0 and table to the state 5 and 7 go to next states 2 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table to the state 3 and 3 and have outputs 0 and table table to the state 3 and 3 and have outputs 0 and table tabl 0forx=0andx=1respectively.Thereforestate7canberemovedandreplacedby5. Similarly,3and6generateexactlysamenextstateandsameoutputforeverypossi blesetofinputs Thestate3and6gotonextstates4and5andhayeoutputs0and 0 for x=0 blesetofinputs. The state $\bf 3$ and $\bf 6$ goton ext states $\bf 4$ and $\bf 5$ and have outputs $\bf 0$ and $\bf 0$ for $\bf x=0$ and $\bf x=1$ respectively. Therefore state $\bf 6$ can be removed and replaced by $\bf 3$ . The final reduced state table is shown below. | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | 1 | 1 | 1 | 0 | 0 | | 2 | 1 | 3 | 1 | 1 | | 3 | 4 | 5 | 0 | 0 | | 4 | 1 | 5 | 1 | 0 | | 5 | 2 | 3 | 0 | 0 | Reducedstatetable Thus7statesarereducedinto5states. ### 3. Minimizethefollowing statetable. | Presentstate | Nextstate | | | |--------------|-----------|-----|--| | | X=0 | X=1 | | | A | D,0 | C,1 | | | В | E,1 | A,1 | | | С | Н, 1 | D,1 | | | D | D,0 | C,1 | | | Е | В,0 | G,1 | | | F | Н, 1 | D,1 | | | G | A,0 | F,1 | | | Н | C,0 | A,1 | | | I | G,1 | Н,1 | | #### Soln: | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | A | D | С | 0 | 1 | | В | Е | Α | 1 | 1 | | С | Н | D | 1 | 1 | | D | D | С | 0 | 1 | | Е | В | G | 0 | 1 | | F | Н | D | 1 | 1 | | G | Α | F | 0 | 1 | | Н | С | Α | 0 | 1 | | I | G | Н | 1 | 1 | From the above state table, $\mathbf{A}$ and $\mathbf{D}$ generate exactly same next state and same output for every possible set of inputs. The state $\mathbf{A}$ and $\mathbf{D}$ go to next states $\mathbf{D}$ and $\mathbf{C}$ and have outputs 0 and 1 for x=0 and x=1 respectively. Therefore state $\mathbf{D}$ can ermoved and replaced by $\mathbf{A}$ . Similarly, $\mathbf{C}$ and $\mathbf{F}$ generate exactly same next state and same output for every possible set of inputs. The state $\mathbf{C}$ and $\mathbf{F}$ go to next states $\mathbf{H}$ and $\mathbf{D}$ and have outputs 1 and 1 for x=0 and x=1 respectively. Therefore state $\mathbf{F}$ can be removed and replaced by $\mathbf{C}$ . Thereduced state table-1 is shown below. | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | A | Α | С | 0 | 1 | | В | Е | Α | 1 | 1 | | С | Н | Α | 1 | 1 | | Е | В | G | 0 | 1 | | G | A | С | 0 | 1 | | Н | С | Α | 0 | 1 | | I | G | Н | 1 | 1 | Reducedstatetable-1 From the above reduced state table-1, $\bf A$ and $\bf G$ generate exactly same nextstate and same output for every possible set of inputs. The state $\bf A$ and $\bf G$ go to nextstates $\bf A$ and $\bf C$ and have outputs 0 and 1 for x=0 and x=1 respectively. Thereforestate $\bf G$ can be removed and replaced by $\bf A$ . The final reduced state table-2 is shownbelow. | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | A | A | С | 0 | 1 | | В | Е | Α | 1 | 1 | | С | Н | Α | 1 | 1 | | Е | В | A | 0 | 1 | | Н | С | A | 0 | 1 | | I | A | Н | 1 | 1 | Reducedstatetable-2 Thus9statesarereducedinto6states. #### 4. Reducethefollowingstatediagram. #### Soln: | Presentstate Nextstate Output | Presentstate | Nextstate | Output | |-------------------------------|--------------|-----------|--------| |-------------------------------|--------------|-----------|--------| | | X=0 | X=1 | X=0 | X=1 | |---|-----|-----|-----|-----| | a | a | b | 0 | 0 | | b | С | d | 0 | 0 | | С | a | d | 0 | 0 | | d | e | f | 0 | 1 | | e | a | f | 0 | 1 | | f | g | f | 0 | 1 | | g | a | f | 0 | 1 | #### **Statetable** From the above state table e and g generate exactly same next state and same output for every possible set of inputs. The state e and g got on ext state s a and f and have outputs 0 a nd 1 for x=0 and x=1 respectively. Therefore state g can be removed and replaced by e. The reduced state table -1 is shown below. | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | а | a | b | 0 | 0 | | b | С | d | 0 | 0 | | С | a | d | 0 | 0 | | d | е | f | 0 | 1 | | e | a | f | 0 | 1 | | f | е | f | 0 | 1 | #### Reducedstatetable-1 Nowstates dand fare equivalent. Both states got othes amenext state (e,f) and have esame output (0,1). Therefore one state can be removed; fis replaced by $\mathbf{d}$ . The final reduced state table - 2 is shown below | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | а | a | b | 0 | 0 | | b | С | d | 0 | 0 | | С | a | d | 0 | 0 | | d | e | d | 0 | 1 | | e | a | d | 0 | 1 | Reducedstatetable-2 Thus 7 states are reduced into 5 states. The statediagram for the reduced state table - 2 is, Reducedstatediagram ## **DESIGNOF SYNCHRONOUS SEQUENTIAL CIRCUITS:** A synchronous sequential circuit is made up of number ofFlip-Flops and combinational gates. The design of circuit consists of choosing the Flip-Flops and then finding a combinational gates tructure to gether with the Flip-Flops. The number of Flip-Flops is determined from the number of states needed in the circuit. The combinational circuit is derived from the state table. ### **Designprocedure:** - 1. The given problem is determined with a statedia gram. - 2. Fromthestatediagram, obtain the state table. - 3. Thenumberofstatesmaybereducedbystatereductionmethods(ifappl icable). - 4. Assignbinaryvaluestoeachstate(BinaryAssignment)ifthestatetablecont ainslettersymbols. - 5. Determine the number of Flip-Flops and assignal ettersymbol (A,B,C,...) to each. - 6. ChoosethetypeofFlip-Flop(SR,JK,D,T)tobeused. - 7. Fromthestatetable, circuit excitation and output tables. - 8. UsingK-maporanyothersimplificationmethod,derivethecircuitoutputfunctionsan dtheFlip-Flopinputfunctions. - 9. Drawthelogicdiagram. The type of Flip-Flop to be used may be included in the design specificationsormaydependwhatisavailabletothedesigner.Manydigitalsystemsareco nstructed with JK Flip-Flops because they are the most versatile available. Theselection of inputsis given as follows. | Application | |---------------------------------| | GeneralApplicationsApplications | | requiringtransferofdata | | (Ex: Shift | | Registers)Applicationinvolving | | complementation | | (Ex:BinaryCounters) | | | | | ### **Excitation Tables:** Beforegoingtothedesignexamplesfortheclockedsynchronoussequentialcircuitswere viseFlip-Flopexcitationtables. | Present | Next | Inputs | | |---------|------------------|--------|---| | State | State | - | | | Qn | Q <sub>n+1</sub> | S R | | | 0 | 0 | 0 x | | | 0 | 1 | 1 | 0 | | 1 | 0 | 0 | 1 | | 1 | 1 | X | 0 | ExcitationtableforSRFlip-Flop | Present<br>State | Next<br>State | Inputs | | |------------------|------------------|--------|---| | Qn | Q <sub>n+1</sub> | J K | | | 0 | 0 | 0 | X | | 0 | 1 | 1 | X | | 1 | 0 | X | 1 | | 1 | 1 | X | 0 | ExcitationtableforJKFlip-Flop | Present<br>State | Next<br>State | Input | |------------------|---------------|-------| | Qn | $Q_{n+1}$ | T | | 0 | 0 | 0 | | 0 | 1 | 1 | | 1 | 0 | 1 | | 1 | 1 | 0 | Excitationtable forTFlip-Flop | Present<br>State | Next<br>State | Input | |------------------|---------------|-------| | Qn | $Q_{n+1}$ | D | | 0 | 0 | 0 | |---|---|---| | 0 | 1 | 1 | | 1 | 0 | 0 | | 1 | 1 | 1 | ExcitationtableforDFlip-Flop #### **Problems** 1. A sequential circuit has one input and one output. The state diagram is shownbelow. Design the sequential circuit with a) D-Flip-Flops, b) T Flip-Flops, c) RSFlip-Flopsandd)JKFlip-Flops. ## **Solution**: #### **State Table:** The state table for the state diagram is, | Presentstate | | Nextstate | | Output | | |--------------|---|-----------|-----|--------|-----| | | | X=0 | X=1 | X=0 | X=1 | | A | В | AB | AB | Y | Y | | 0 | 0 | 00 | 10 | 0 | 1 | | 0 | 1 | 11 | 00 | 0 | 0 | | 1 | 0 | 10 | 01 | 1 | 0 | | 1 | 1 | 00 | 10 | 1 | 0 | #### **Statereduction:** As seen from the state table there is no equivalent states. Therefore, no reduction in the state diagram. The state table shows that circuit goes through four states, therefore werequire 2 Flip-Flops (number of states $= 2^m$ , where m = number of Flip-Flops). Since two Flip-Flops are required first is denoted as A and second is denoted as B. #### i) <u>DesignusingDFlip-Flops</u>: #### **Excitationtable:** Using the excitation table for TF lip-Flop, we can determine the excitation table for the given circuitas, | PresentState | NextState | Input | |--------------|------------------|-------| | Qn | Q <sub>n+1</sub> | D | | 0 | 0 | 0 | | 0 | 1 | 1 | | 1 | 0 | 0 | | 1 | 1 | 1 | ExcitationtableforDFlip-Flop | Presentstate | | Input | Nextstate | | | Flop<br>outs | Output | |--------------|---|-------|-----------|---|----------------|----------------|--------| | A | В | X | A | В | D <sub>A</sub> | D <sub>B</sub> | Y | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | | 0 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | | 1 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 0 | | | | | | | | | 1 | Circuitexcitationtable # K-mapSimplification: WiththeseFlip- Flopinputfunctions and circuit output function we can draw the logic diagram as follows. # ii) DesignusingTFlip-Flops: Using the excitation table for TFlip- Flop, we can determine the excitation table for the given circuitas, | PresentState | NextState | Input | |--------------|------------------|-------| | Qn | Q <sub>n+1</sub> | T | | 0 | 0 | 0 | | 0 | 1 | 1 | | 1 | 0 | 1 | | 1 | 1 | 0 | **Excitation tablefor TFlip-Flop** | Presentstate | | Input | Nextstate | | state Flip-Flop<br>Inputs | | Output | |--------------|---|-------|-----------|---|---------------------------|----------------|--------| | A | В | X | A | В | $T_A$ | T <sub>B</sub> | Y | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | | 0 | 1 | 0 | 1 | 1 | 1 | 0 | 0 | | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | | Sī | nchron | ousSea | mentia <sup>*</sup> | lCir | cuits | |----|--------|--------|---------------------|------|-------| | J | | UUSSEY | lutiilia. | ш | cuits | | 3 | | 4 | Я | |---|---|---|---| | | - | - | | | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | | |---|---|---|---|---|---|---|---|--| | 1 | U | U | 1 | U | U | U | 1 | | | 1 | 0 | 1 | 0 | 1 | 1 | 1 | 0 | | | 1 | 1 | 0 | 0 | 0 | 1 | 1 | 1 | | | 1 | 1 | 1 | 1 | 0 | 0 | 1 | 0 | | | | | | | | | | | | **Circuitexcitationtable** ## K-mapSimplification: Therefore, input functions for, $$T_A = B \oplus x$$ and $T_B = AB + AX + B$ $X$ Circuitoutputfunction,Y=XA'B'+X'A ### WiththeseFlip- Flop input functions and circuit output function we can draw the logic diagram as follows. $\underline{Logic diagram of given sequential circuit using TF lip-Flop}$ ### iii) DesignusingSRFlip-Flops: UsingtheexcitationtableforRSFlip- Flop, we can determine the excitation table for the given circuitas, | PresentState | NextState | Inputs | | | |--------------|------------------|--------|---|--| | Qn | Q <sub>n+1</sub> | S | R | | | 0 | 0 | 0 | Х | | | 0 | 1 | 1 | 0 | | | 1 | 0 | 0 | 1 | | | 1 | 1 | X | 0 | | **ExcitationtableforSRFlip-Flop** | Pres | sent | Input | Nexts | tate | Flip-FlopInputs | | | Output | | |------|------|-------|-------|------|-----------------|---------------------------|---------|---------------------------|---| | sta | ite | | | | | | | | | | A | В | X | A | В | $S_A$ | $\mathbf{R}_{\mathbf{A}}$ | $S_{B}$ | $\mathbf{R}_{\mathbf{B}}$ | Y | | 0 | 0 | 0 | 0 | 0 | 0 | X | 0 | X | 0 | | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 0 | X | 1 | | 0 | 1 | 0 | 1 | 1 | 1 | 0 | X | 0 | 0 | | 0 | 1 | 1 | 0 | 0 | 0 | X | 0 | 1 | 0 | | 1 | 0 | 0 | 1 | 0 | X | 0 | 0 | X | 1 | | 1 | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | 0 | | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 1 | | 1 | 1 | 1 | 1 | 0 | X | 0 | 0 | 1 | 0 | Circuitexcitationtable ## K-mapSimplification: #### For Flip-flop A #### For Flip-flop B WiththeseFlip- Flopinputfunctionsandcircuitoutputfunctionwecandrawthelogicdiagramasfollows. # iii) DesignusingJKFlip-Flops: $Using the excitation table for JKF lip-\\Flop, we can determine the excitation table for the given circuit as,$ | PresentState | NextState | Inp | uts | |--------------|------------------|-----|-----| | Qn | Q <sub>n+1</sub> | J | К | | 0 | 0 | 0 | Х | | 0 | 1 | 1 | Х | | 1 | 0 | X | 1 | | 1 | 1 | Х | 0 | ExcitationtableforJKFlip-Flop | Pre<br>t<br>sta | | Inpu<br>t | Next | state | Flip-FlopInputs | | | | Outpu<br>t | |-----------------|---|-----------|------|-------|-----------------|--------|----|--------|------------| | A | В | X | A | В | JA | K<br>A | Jв | K<br>B | Y | | 0 | 0 | 0 | 0 | 0 | 0 | Х | 0 | Х | 0 | | 0 | 0 | 1 | 1 | 0 | 1 | X | 0 | X | 1 | | 0 | 1 | 0 | 1 | 1 | 1 | X | X | 0 | 0 | | 0 | 1 | 1 | 0 | 0 | 0 | X | X | 1 | 0 | | 1 | 0 | 0 | 1 | 0 | X | 0 | 0 | X | 1 | | 1 | 0 | 1 | 0 | 1 | X | 1 | 1 | X | 0 | | 1 | 1 | 0 | 0 | 0 | X | 1 | X | 1 | 1 | | 1 | 1 | 1 | 1 | 0 | X | 0 | X | 1 | 0 | Circuitexcitationtable # K-mapSimplification: # For Flip-flop A Theinputfunctionsfor, $$J_A=BX'+B'X J_B=AX$$ $$=B\oplus X$$ $$K_A=BX'+B'X$$ $=B\oplus X$ $K_B=A+X$ Circuitoutputfunction,Y=AX'+A'B'X WiththeseFlip- $Flop input functions and circuit output function we can draw the logic diagram as follows. \\ {\color{blue} {\bf Logic diagram of given sequential circuit using JKF lip-Flop}}$ 2. DesignaclockedsequentialmachineusingJKFlip-Flopsforthestatediagramshowninthefigure. Usestate reduction if possible. Make properstateassignment. Soln: **State Table:** | Presentstate | Nextstate | | tate Nextstate O | | | put | |--------------|-----------|-----|------------------|-----|--|-----| | | X=0 | X=1 | X=0 | X=1 | | | | a | a | b | 0 | 0 | | | | b | С | b | 0 | 0 | | | | С | a | b | 0 | 1 | | | | d | a | b | 0 | 0 | | | From the above state table $\mathbf{a}$ and $\mathbf{d}$ generate exactly same next state and same output for every possible set of inputs. The state $\mathbf{a}$ and $\mathbf{d}$ go to next states $\mathbf{a}$ and $\mathbf{b}$ and have outputs 0 and 0 for $\mathbf{x}$ =0 and $\mathbf{x}$ =1 respectively. Therefore state $\mathbf{d}$ can be removed and replaced by $\mathbf{a}$ . The final reduced state table is shown below. | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | a | a | b | 0 | 0 | | b | С | b | 0 | 0 | | С | a | b | 0 | 1 | ReducedStatetable ### **BinaryAssignment:** Now each state is assigned with binary values. Since there are three states, number of Flip- $Flops required is two and 2 binary numbers are assigned to the states. a = 00; b = 0; \\ and c = 10$ Thereducedstatediagramisdrawnas ReducedStateDiagram 11 x х 10 0 1) # K-mapSimplification: 0 # For Flip-flop B # WiththeseFlip- Flop input functions and circuit output function we can draw the logic diagram as follows. 3. DesignaclockedsequentialmachineusingTFlip-Flopsforthefollowingstatediagram. Use statereduction ifpossible.Alsouse straightbinary stateassignment. ## Soln: # **State Table:** Statetableforthegivenstatediagramis, | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | a | a | b | 0 | 0 | | b | d | С | 0 | 0 | | С | a | b | 1 | 0 | | d | b | a | 1 | 1 | Even though a and care having same next states for input X=0 and X=1, as the output same not same state reduction is not possible. # **StateAssignment:** Use straight binary as signments as a = 00, b = 01, c = 10 and d = 11, the transition table eis, | Input | Presen | tstate | Nexts | Nextstate Flip-Flop<br>Inputs | | Nextstate | | Output | |-------|--------|--------|-------|-------------------------------|----------------|----------------|---|--------| | X | A | В | A | В | T <sub>A</sub> | T <sub>B</sub> | Y | | | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | | 0 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | | | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 | | | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 1 | | | 1 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | | | 1 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | | | 1 | 1 | 0 | 0 | 1 | 1 | 1 | 0 | | | 1 | 1 | 1 | 0 | 0 | 1 | 1 | 1 | | # K-map simplification: # LogicDiagram: ## **STATEASSIGNMENT:** Insequential circuits, the behavior of the circuit is defined in terms of its inputs, presents tates, next states and outputs. To generate desired next state at particular present state and inputs, it is necessary to have specific Flip-Flop inputs. These Flip-Flop inputs are described by a set of Boolean functions called Flip-Flop input functions. To determine the Flip-Flop functions, it is necessary to represent states in the state diagram using binaryvalues instead of alphabets. This procedure is known as state assignment. Reducedstatediagramwithbinarystates # Rulesforstateassignments Therearetwobasicrulesformakingstateassignments. ### Rule1: States havingthesameNEXTSTATES for a given in put conditions hould have as sign ments which can be grouped into logically adjacent cells in a K-map. ### Rule2: States that are the NEXTSTATE Sofasing lest at eshould have as sign ment which can be grouped into logically adjacent cells in a K-map. | Presentstate | Nextstate | | Out | put | |--------------|-----------|-----|-----|-----| | | X=0 | X=1 | X=0 | X=1 | | 00 | 01 | 10 | 0 | 0 | | 01 | 11 | 10 | 1 | 0 | | 10 | 10 | 11 | 0 | 1 | | 11 | 00 | 11 | 0 | 0 | **Statetablewithassignmentstates** # **StateAssignmentProblem:** 1. Design a sequential circuit for a state diagram shown below. Use stateassignmentrulesforassigningstatesandcomparetherequiredcombination alcircuitwithrandomstateassignment. Usingrandomstateassignmentweassign,a=0 00,b=001,c=010,d=011ande=100. The excitation table with these assignments is given as, | P | resentstat | e | Input | | Nextstate | | Output | |----|----------------|----|-------|------------------|------------------|------------------|--------| | An | B <sub>n</sub> | Cn | X | A <sub>n+1</sub> | B <sub>n+1</sub> | C <sub>n+1</sub> | Z | | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | | 0 | 0 | 1 | 0 | 0 | 1 | 1 | 0 | | 0 | 0 | 1 | 1 | 1 | 0 | 0 | 0 | | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 0 | | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | | 0 | 1 | 1 | 1 | 0 | 0 | 0 | 1 | # Synchronous Sequential Circuits 3.59 | | • | | | | | | | | |---|---|---|---|---|---|---|---|--| | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | | | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | | | 1 | 0 | 1 | 0 | X | X | X | X | | | 1 | 0 | 1 | 1 | X | X | X | X | | | 1 | 1 | 0 | 0 | X | X | X | X | | | 1 | 1 | 0 | 1 | X | X | X | X | | | 1 | 1 | 1 | 0 | X | X | X | X | | | 1 | 1 | 1 | 1 | X | X | X | X | | # $\underline{K\text{-}mapSimplification:}$ | $\setminus C_n$ | X | | | | | | |-------------------------------------------------------------------------------------|----|----|----|----|--|--| | $A_nB_n$ | 00 | 01 | 11 | 10 | | | | 00 | 0 | 1 | 0 | 1 | | | | 01 | 0 | 1 | 0 | 0 | | | | 11 | Χ | Χ | Χ | Χ | | | | 10 | 0 | 0 | χ | X | | | | $D_{p} = \overline{A}_{p} \overline{C}_{p} X + \overline{B}_{p} C_{p} \overline{X}$ | | | | | | | | Cn | X | | | | | | | |-------------------------------------------------------------------------------------|----|----|----|----|--|--|--| | $A_nB_n$ | 00 | 01 | 11 | 10 | | | | | 00 | 1 | 0 | 0 | 1 | | | | | 01 | 0 | 1 | 0 | 0 | | | | | 11 | Χ | X | Χ | Х | | | | | 10 | 0 | 0 | Χ | Χ | | | | | $D_{C} = \overline{A}_{n} \overline{B}_{n} \overline{X} + B_{n} \overline{C}_{n} X$ | | | | | | | | Therandomassignmentsrequire: 7threeinputANDfunctions1t woinputANDfunction 4twoinputORfunctions 12gateswith31inputs Now, we will apply the state assignment rules and compare the results. StatediagramafterapplyingRules1and2 Rule1saysthat: eanddmustbeadjacent,and bandcmustbeadjacent. Rule2saysthat: eanddmustbeadjacent,and bandcmustbeadjacent. Applying Rule 1, Rule 2 to the state diagram we get the state assignment as, | P | resentstat | e | Input | | Nextstate | | | |----|------------|----|-------|------------------|------------------|------------------|---| | An | Bn | Cn | X | A <sub>n+1</sub> | B <sub>n+1</sub> | C <sub>n+1</sub> | Z | | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | | 0 | 0 | 0 | 1 | 0 | 1 | 1 | 0 | | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | | 0 | 0 | 1 | 1 | 1 | 1 | 1 | 0 | | 0 | 1 | 0 | 0 | X | X | X | X | | 0 | 1 | 0 | 1 | X | X | X | X | | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 0 | | 0 | 1 | 1 | 1 | 1 | 0 | 1 | 0 | | 1 | 0 | 0 | 0 | X | X | X | X | | 1 | 0 | 0 | 1 | X | X | X | X | | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | | 1 | 1 | 0 | 0 | X | X | X | X | | 1 | 1 | 0 | 1 | X | X | X | X | | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 1 | | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | | | | | | | | | | ### **K-mapSimplification:** | \C <sub>n</sub> } | X. | | | | | |-------------------------------|----------------------|----------------------|-------------------------------------------------------|----------------------------------------------------------------------------------|---| | A <sub>n</sub> B <sub>n</sub> | 00 | 01 | 11 | 10 | | | 00 | 0 | 1 | 1 | 0 | | | 01 | X | X | 0 | 1 | | | 11 | X | X | 0 | 0 | | | 10 | X | X | 0 | 0 | | | | B <sub>n+1</sub> = ] | $D_B = \overline{A}$ | $_{n}\overline{\overline{\mathbf{B}}}_{n}\mathbf{X}+$ | $\overline{\mathbf{A}}_{\mathbf{n}}\mathbf{B}_{\mathbf{n}}\overline{\mathbf{y}}$ | C | | $A_nB_n^{C_n}$ | X<br>00 | 01 | 11 | 10 | |----------------|---------|----|----|----| | 00 | 1 | 1 | 1 | 1 | | 01 | x | х | 1 | 1 | | 11 | х | Х | 0 | _ | | | | | U | 0 | | 10 | X | X | 0 | 0 | ThestateassignmentsusingRule1and2require:4thre einputANDfunctions 1twoinputANDfunction2t woinputORfunctions ----- 7gateswith18inputs Thus by simply applying Rules 1 and 2 good results have been achieved. ### **SYNCHRONOUSCOUNTERS** Flip-Flops can be connected together to perform counting operations. Such agroup of Flip- Flops is a **counter**. The number of Flip-Flops used and the way inwhich they are connected determine the number of states (called the modulus) and also the specific sequence of states that the countergoes through during each complete cycle. Countersareclassified into two broadcategories according to the way they are clocked: **4** Asynchronous counters,Synchronousc ounters. In asynchronous (ripple) counters, the first Flip-Flop is clocked by the external clockpulse and then each successive Flip- FlopisclockedbytheoutputoftheprecedingFlip-Flop. In synchronous counters, the clock input is connected to all of the Flip-Flops sothat they are clocked simultaneously. Within each of these two categories, countersare classified primarily by the type of sequence, the number of states, or the number of Flip-Flops in the counter. Theterm'synchronous'referstoeventsthathaveafixedtimerelationshipwith each other. In synchronous counter, the clock pulses are applied to all Flip-Flopssimultaneously.Hencethereisminimumpropagationdelay. | S.No | Asynchronous(ripple)counter | Synchronouscounter | |------|--------------------------------------------------------------------------------------------|-----------------------------------------------| | 1 | AlltheFlip-Flopsare not clockedsimultaneously. | All the Flip-Flopsare clocked simultaneously. | | 2 | The delay times of all Flip-Flopsareadded.Thereforet hereisconsiderable propagation delay. | Thereisminimumpropagationdelay. | | 3 | Speedofoperationislow | Speedofoperation ishigh. | | 4 | Logiccircuitisverysimple | Design involvescomplexlogiccircuit | | | evenformorenumberofstates. | asnumberofstateincreases. | |---|-----------------------------------------|----------------------------------------------------------| | 5 | Minimumnumbersoflogic devicesareneeded. | Thenumber of logic devices is more than ripple counters. | | 6 | Cheaperthansynchronous counters. | Costlierthanripplecounters. | # 2-BitSynchronousBinaryCounter Inthiscountertheclocksignalisconnected in parallel to clock inputs of both the Flip-Flops (FF<sub>0</sub> and FF<sub>1</sub>). The output of FF<sub>0</sub> is connected to $J_1$ and $K_1$ inputs of the second Flip-Flop (FF<sub>1</sub>). ### 2-BitSynchronousBinaryCounter Assume that the counter is initially in the binary 0 state: i.e., both Flip-Flopsare RESET. When the positive edge of the first clock pulse is applied, $FF_0$ will togglebecause $J_0=k_0=1$ , whereas $FF_1$ output will remain 0 because $J_1=k_1=0$ . After the firstclockpulseQ<sub>0</sub>=1andQ<sub>1</sub>=0. When the leading edge of CLK2 occurs, FF<sub>0</sub>will toggle and Q<sub>0</sub>will go LOW.Since $FF_1$ has a HIGH ( $Q_0$ = 1) on its $J_1$ and $K_1$ inputs at the triggering edge of this clockpulse, the Flip-Floptog gles and $Q_1$ goes HIGH. Thus, after CLK2, $Q_0$ =0 and $Q_1$ =1. When the leading edge of CLK3 occurs, FF<sub>0</sub> again to gglest othe SET state ( $Q_0$ = 1), and FF<sub>1</sub> remains SET ( $Q_1$ = 1) because its J<sub>1</sub> and K<sub>1</sub> inputs are both LOW ( $Q_0$ = 0). After this triggering edge, $Q_0$ = 1 and $Q_1$ = 1. Finally, at the leading edge of CLK4, $Q_0$ and $Q_1$ go LOW because they bothhave a toggle condition on their $J_1$ and $K_1$ inputs. The counter has now recycled to itsoriginal state, $Q_0 = Q_1 = 0$ . # 3-BitSynchronousBinaryCounter A 3 bit synchronous binary counter is constructed with three JK Flip-Flopsand an AND gate. The output of $FF_0$ ( $Q_0$ ) changes on each clock pulse as the counterprogresses from its original state to its final state and then back to its original state. To produce this operation, $FF_0$ must be held in the toggle mode by constant HIGH, on $IF_0$ and $IF_0$ in puts. ### 3-BitSynchronousBinaryCounter The output of $FF_1(Q_1)$ goes to the opposite state following each time $Q_0$ = 1.This change occurs at CLK2, CLK4, CLK6, and CLK8. The CLK8 pulse causes the counter to recycle. To produce this operation, $Q_0$ is connected to the $J_1$ and $K_1$ inputsof $FF_1$ . When $Q_0$ = 1 and a clock pulse occurs, $FF_1$ is in the toggle mode and therefore changes state. When $Q_0$ = 0, $FF_1$ is in the no-change mode and remains in its presentstate. The output of $FF_2$ ( $Q_2$ ) changes state both times; it is preceded by the uniquecondition in which both $Q_0$ and $Q_1$ are HIGH. This condition is detected by the ANDgateandappliedtotheJ2andK2inputsofFF3.Wheneverbothoutputs $Q_0=Q_1=1$ , the output of the AND gate makes the $J_2$ = $K_2$ = 1 and $FF_2$ toggles on the following clock pulse. Otherwise, the $J_2$ and $K_2$ inputs of $FF_2$ are held LOW by the AND gateoutput, $FF_2$ does not change state. | CLOCKPulse | $\mathbf{Q}_2$ | $\mathbf{Q}_{1}$ | $\mathbf{Q}_0$ | |------------|----------------|------------------|----------------| | | | | | | Initially | Λ | Λ | Λ | |-------------|---|---|---| | initially | 0 | 0 | 0 | | 1 | Ü | 0 | 1 | | 2 | 0 | 1 | 0 | | 3 | 0 | 1 | 1 | | 4 | 1 | 0 | 0 | | 5 | 1 | 0 | 1 | | 6 | 1 | 1 | 0 | | 7 | 1 | 1 | 1 | | 8(recycles) | 0 | 0 | 0 | | | | | ĺ | # 4-BitSynchronousBinaryCounter This particular counter is implemented with negative edge-triggered Flip-Flops. The reasoning behind the J and K input control for the first three Flip-Flops is the same as previously discussed for the 3-bit counter. For the fourth stage, the Flip-Flop has to change the state when $Q_0 = Q_1 = Q_2 = 1$ . This condition is decoded by $ANDgateG_3$ . Therefore, when $Q_0$ = $Q_1$ = $Q_2$ = 1, Flip-Flop FF $_3$ toggles and for all other times itisinano- change condition. Points where the AND gate outputs are HIGH are indicated by the shaded areas. # **4-BitSynchronousDecadeCounter:**(BCDCounter): BCDdecadecounterhasasequencefrom0000to1001(9).After1001stateitmust recycle back to 0000 state. This counter requires four Flip-Flops and AND/ORlogicasshownbelow. 4-Bit Synchronous DecadeCounter | CLOCKPulse | $\mathbf{Q}_3$ | $\mathbf{Q}_2$ | $\mathbf{Q}_1$ | $\mathbf{Q}_{0}$ | |--------------|----------------|----------------|----------------|------------------| | Initially | 0 | 0 | 0 | 0 | | 1 | 0 | 0 | 0 | 1 | | 2 | 0 | 0 | 1 | 0 | | 3 | 0 | 0 | 1 | 1 | | 4 | 0 | 1 | 0 | 0 | | 5 | 0 | 1 | 0 | 1 | | 6 | 0 | 1 | 1 | 0 | | 7 | 0 | 1 | 1 | 1 | | 8 | 1 | 0 | 0 | 0 | | 9 | 1 | 0 | 0 | 1 | | 10(recycles) | 0 | 0 | 0 | 0 | $\bullet \quad First, notice that FF_0(Q_0) toggles on each clock pulse, so the logic equation for its J_0 and K_0 i \\ nputs is$ ### $I_0 = K_0 = 1$ $This equation is implemented by connecting J_0 and K_0 to a constant HIGH level.\\$ • Next,noticefromtable,that $FF_1(Q_1)$ changes on the next clock pulse each time Q0=1 and Q3=0, so the logic equation for the $J_1$ and $J_2$ in puts is $$J_1=K_1=Q_0Q_3'$$ This equation is implemented by AND in gQ0 and Q3 and connecting the gate output to the J1 and K1 inputs of FF1. • Flip-Flop2( $Q_2$ )changesonthenextclockpulseeachtimeboth $Q_0$ = $Q_1$ =1.This requiresaninputlogicequationasfollows: $$J_2=K_2=Q_0Q_1$$ $This equation is implemented by ANDing Q_0 and Q_1 and connecting the gate output to the J_2 and K_2 inputs of FF_3\\$ • Finally, $FF_3(Q_3)$ changes to the opposite state on the next clock pulse each time $Q_0 = 1$ , $Q_1 = 1$ , and $Q_2 = 1$ (state 7), or when $Q_0 = 1$ and $Q_1 = 1$ (state 9). The equation for this is a solution of the content $$J_3 = K_3 = Q_0Q_1Q_2 + Q_0Q_3$$ $This function is implemented with the AND/OR logic connected to the J_3 and K_3 inputs of FF_3. \\$ **Timingdiagram** # SynchronousUP/DOWNCounter An up/down counter is a bidirection alcounter, capable of progressing in either direction through a certain sequence. A 3- bit bin ary counter that advances upward through its sequence (0,1,2,3,4,5,6,7) and then can be reversed so that it goesthroughthesequenceintheoppositedirection(7,6,5,4,3,2,1,0)isanillustrationofup/downsequentialoperation. The complete up/down sequence for a 3-bit binary counter is shown in tablebelow. The arrows in dicate the state-to- statemovementofthecounterforbothits UP and its DOWN modes of operation. An examination of $Q_0$ for both the up and downsequences shows that $FF_0$ toggles on each clock pulse. Thus, the $J_0$ and $K_0$ inputs of $FF_0$ are, $J_0 = K_0 = 1$ | CLOCK PULSE | UP | $\mathbf{Q}_2$ | Q <sub>1</sub> | $\mathbf{Q}_0$ | DOWN | |-------------|-----|----------------|----------------|----------------|-------| | 0 | 70 | 0 | 0 | 0 | 1) | | 1 | > | 0 | 0 | 1 | ₹\ | | 2 | { > | 0 | 1 | 0 | - ₹ } | | 3 | > | 0 | 1 | 1 | ₹ | | 4 | > | 1 | 0 | 0 | - } | | 5 | | 1 | 0 | 1 | 7 | | 6 | \ > | 1 | 1 | 0 | ₹ / | | 7 | 10 | 1 | 1 | 1 | 2) | Toform a synchronous UP/DOWN counter, the control input (UP/DOWN) is used to allow either the normal output or the inverted output of one Flip-Flop totheJandKinputsofthenextFlip- Flop.WhenUP/DOWN=1,theMOD8counterwillcountfrom 000to111andUP/DOWN=0,it willcount from 111 to 000. WhenUP/DOWN=1,itwillenableANDgates1and 3 and disable ANDgates 2 and 4. This allows the Q0 and Q1 outputs through the AND gates to the J andKinputsofthefollowingFlip-Flops,sothecounter countsupaspulsesareapplied. WhenUP/DOWN=0,thereverseactiontakesplace. $$J_1=K_1=(Q_0.UP)+(Q_0'.DOWN)$$ $$J_2=K_2=(Q_0.Q_1.UP)+(Q_0'.Q_1'.DOWN)$$ 3-bitUP/DOWNSynchronousCounter ## **MODULUS-N-COUNTERS** Thecounterwith'n'Flip- FlopshasmaximumMODnumber2<sup>n</sup>.Findthenumber of Flip-Flops (n) required for the desired MOD number (N) using theequation, #### $2^{n} \ge N$ (i) Forexample,a3bitbinarycounterisaMOD8counter.Thebasiccountercanbe modified to produce MOD numbers less than 2<sup>n</sup> by allowing the counter toskinthosearenormallypart of counting sequence. n=3 N=8 2<sup>n</sup>=2<sup>3</sup>=8=N ### (ii) MOD5Counter: 2<sup>n</sup>=N 2<sup>n</sup>=5 2<sup>2</sup>=4lessthanN. 2<sup>3</sup>=8>N(5) Therefore, 3 Flip-Flopsare required. # (iii) MOD10Counter: 2<sup>n</sup>=N=10 2<sup>3</sup>=8lessthanN. 2<sup>4</sup>=16>N(10). To construct any MOD-N counter, the following methods can be used. 1. FindthenumberofFlip- Flops (n) required for the desired MOD number (N) using the equation, 2<sup>n</sup>≥N. - 2. ConnectalltheFlip-Flopsasarequiredcounter. - 3. FindthebinarynumberforN. - 4. ConnectallFlip-FlopoutputsforwhichQ=1whenthecountisN,asinputs toNANDgate. - 5. ConnecttheNANDgateoutputtotheCLRinputofeachFlip-Flop. When the counter reaches N<sup>th</sup>state, the output of the NAND gate goes LOW,resettingallFlip-Flopsto0.Thereforethecountercountsfrom0throughN-1. Forexample, MOD- $10 counterreaches state 10 (1010). i.e., Q_3Q_2Q_1Q_0 = 1010. The outputs Q_3 and Q_1 are connected to the NAND gate and the output of the NAND gategoes LOW and resetting all Flip-Flops to zero. Therefore MOD-10 counter counts from 0000 to 1001. And then recycles to the zero value. \\$ The MOD-10 counter circuitis shown below. ## **SHIFTREGISTERS:** A register is simply a group of Flip-Flops that can be used to store a binarynumber. Theremust be one Flip- Flopforeachbitinthebinarynumber.Forinstance,aregister usedtostorean8-bitbinary number musthave8Flip-Flops. TheFlip- Flopsmustbeconnected such that the binary number can be entered (shifted) into the register and possibly shifted out. A group of Flip-Flops connected to provide either or both of these functions is called a *shift register*. Thebitsinabinarynumber(data)canberemovedfromoneplacetoanother in either of two ways. The first method involves shifting the data one bit at a time ina serial fashion, beginning with either the most significant bit (MSB) or the leastsignificant bit (LSB). This technique isreferred to as *serialshifting*. The second method involves shifting all the databits simultaneously and is referred to a sparallel shifting. There are two ways to shift into a register (serial or parallel) and similarly twoways to shift the data out of the register. This leads to the construction of four basicregistertypes— - i. Serialin-serialout, - ii. Serialin-parallelout, - iii. Parallelin-serialout, - iv. Parallelin-parallelout. # (iii)<u>Serialin-parallelout</u> Serial-InSerial-OutShiftRegister: ## (iv)Parallelin-parallelout Theserialin/serialoutshiftregisteracceptsdataserially,i.e.,onebitatatimeonasi ngleline.Itproducesthestoredinformationonitsoutputalsoinserialform. The entry of the four bits 1010 into the register is illustrated below, beginningwith the right-most bit. The register is initially clear. The 0 is put onto the datainputline, making D=0 for FF $_0$ . When the first clock pulse is applied, FF $_0$ is reset, thus storing the 0. $Next the second bit, which is a 1, is applied to the data input, making D=1 for FF_0 and D=0 for FF_1 because the Dinput of FF_1 is connected to the Q_0 output. When the property of pr$ the second clock pulse occurs, the 1 on the data input is shifted into FF0, causing FF0 to set; and the 0 that was in FF0 is shifted into FF1. The third bit, a 0, is now put onto the data-input line, and a clock pulse isapplied. The 0 is entered into $FF_0$ , the 1 stored in $FF_0$ is shifted into $FF_1$ , and the 0storedin $FF_1$ is shifted into $FF_2$ . The last bit, a 1,is nowapplied to the data input, and a clock pulseis applied. This timethe 1 isentered into FF0, the 0 stored in FF0 is shifted into FF1, the 1 stored in FF1 is shifted into FF2, and the 0 stored in FF2 is shifted into FF3. This completes the serial entry of the four bits into the shift register, where they can be stored for any length of time as long as the Flip-Flops have do power. $\underline{Fourbits (1010) being entered serially into the register}$ To get the data out of the register, the bits must be shifted out serially andtaken offtheQ3 output. AfterCLK4, the right-most bit, 0, appears on theQ3 output. When clock pulse CLK5 is applied, the second bit appears on the Q3 output. Clock pulse CLK6 shifts the third bit to the output, and CLK7 shifts the fourth bit to the output. While the original four bits are being shifted out, more bits can be shifted in. All zeros are shown being shifted out, more bits can be shifted in. Serial-InParallel-OutShiftRegister: In this shift register, data bits are entered into the register in the same asserial-inserial- outshiftregister.Buttheoutputistakeninparallel.Oncethedataarestored,eachbitappea rsonitsrespectiveoutputlineandallbitsareavailablesimultaneouslyinsteadofonabitby-bit. # Serial-Inparallel-OutShiftRegister Fourbits(1111)beingseriallyenteredintotheregister ## Parallel-InSerial-OutShiftRegister: In this type, the bits are entered in parallel i.e., simultaneously into their respective stages on parallellines. A 4-bit parallel-in serial-out shift register is illustrated below. There are fourdata input lines, $X_0$ , $X_1$ , $X_2$ and $X_3$ for entering data in parallel into the register. SHIFT/LOAD input is the control input, which allows four bits of data to **load** inparallelintotheregister. When SHIFT/LOADis LOW, gates $G_1$ , $G_2$ , $G_3$ and $G_4$ are enabled, allowing each data bit to be applied to the D input of its respective Flip-Flop. When a clockpulse is applied, the Flip-Flops with D = 1 will **set** and those with D = 0 will **reset**, thereby storing all four bits simultaneously. When SHIFT/LOAD is HIGH, gates $G_1$ , $G_2$ , $G_3$ and $G_4$ are enabled, allowing the data bits to shift right from one stage to the next. The OR gates allow either the normal shifting operation or the parallel data-entry operation, depending on which AND gates are enabled by the level on the SHIFT/LOAD input. # Parallel-InParallel-OutShiftRegister: In this type, there is simultaneous entry of all data bits and the bits appear onparalleloutputssimultaneously. ### **UNIVERSALSHIFTREGISTERS** If the register has shift and parallel load capabilities, then it is called a shiftregister with parallel load or *universal shift register*. Shift register can be used forconverting serial data to parallel data, and vice-versa. If a parallel load capability isaddedtoashiftregister, the data entered in parallel can be taken out in serial fashion by shifting the data stored in the register. Thefunctionsofuniversalshiftregisterare: - Aclearcontroltocleartheregisterto0. - Aclockinputtosynchronizetheoperations. - Ashiftrightcontroltoenabletheshiftrightoperationandtheserialinputandoutputlines associatedwiththeshiftright. - Ashiftleftcontroltoenabletheshiftleftoperationandtheserialinputandoutputlinesassociat edwiththeshiftleft. - Aparallelloadcontroltoenableaparalleltransferandtheninputlinesassociatedwiththepar alleltransfer. - 'n'paralleloutputlines. - Acontrollinethatleavestheinformationintheregisterunchangedeventhoughthe clockpulsesrecontinuouslyapplied. It consists of four D-Flip-Flops and four 4 input multiplexers (MUX). $S_0$ and $S_1$ are the two selection in puts are used to select one of the four inputs of each multiplexer. The input 0 in each MUX is selected when $S_1S_0$ = 00 and input 1 is selectedwhen $S_1S_0$ = 01. Similarly inputs 2 and 3 are selected when $S_1S_0$ = 10 and $S_1S_0$ = 11 respectively. The inputs S1 and S0 control the mode of the operation of the register. 4-BitUniversalShiftRegister When $S_1S_0$ = 00, the present value of the register is applied to the D-inputs of the Flip-Flops. This is done by connecting the output of each Flip-Flop to the 0 input of the respective multiplexer. The next clock pulse transfers into each Flip-Flop, thebinary value is held previously, and hence no change of state occurs. When $S_1S_0$ = 01, terminal 1 of the multiplexer inputs has a path to the D inputs of the Flip-Flops. This causes a shift- rightoperationwiththelefterserialinputtransferredintoFlip-FlopFF<sub>3</sub>. When $S_1S_0$ = 10, a shift-left operation results with the right serial input going intoFlip-FlopFF<sub>1</sub>. Finallywhen $S_1S_0=11$ , the binary information on the parallel input lines ( $I_1$ , $I_2$ , $I_3$ and $I_4$ ) are transferred into the register simultaneously during the next clock pulse. The function table of bidirectionals hiftregister with parallel inputs and parallel outputs is shown below. | Mode | Control | Operation | |------------------|------------------|---------------------------------------------------------------| | S <sub>1</sub> | S <sub>0</sub> | | | 0<br>0<br>1<br>1 | 0<br>1<br>0<br>1 | No<br>changeShi<br>ft-<br>rightShift-<br>left<br>Parallelload | ### **BI-DIRECTION SHIFT REGISTERS:** A bidirectional shift register is one in which the data can be shifted either leftor right. It can be implemented by using gating logic that enables the transfer of adata bit from one stage to the next stage to the right or to the left depending on thelevelofacontrolline. A4-bitbidirectionalshiftregisterisshownbelow.AHIGHontheRIGHT/LEFT control input allows data bits inside the register to be shifted to theright, and aLOW enables data bits inside the register to be shifted to the left. When the RIGHT/LEFT control input is **HIGH**, gates $G_1$ , $G_2$ , $G_3$ and $G_4$ are enabled, and the state of the Q output of each Flip-Flop is passed through to the Dinput of the following Flip-Flop. When a clock pulse occurs, the data bits are shifted one place to the right. When the RIGHT/LEFT control input is LOW, gates $G_5$ , $G_6$ , $G_7$ and $G_8$ are enabled, and the Q output of each Flip-Flop is passed through to the D input of the preceding Flip-Flop. When a clock pulse occurs, the data bits are then shifted one place to the left. 4-bitbi-directionalshiftregister #### **UNIT III** #### COMPUTER FUNDAMENTALS ### FUNCTIONAL UNITS OF A DIGITAL COMPUTER: **Computer:** A computer is a combination of **hardware and software** resources which integrate together and provides various functionalities to the user. Hardware are the physical components of a computer like the processor, memory devices, monitor, keyboard etc. while software is the set of programs or instructions that are required by the hardware resources to function properly. There are a few basic components that aids the working-cycle of a computer i.e. the Input- Process- Output Cycle and these are called as the functional components of a computer. It needs certain input, processes that input and produces the desired output. The input unit takes the input, the central processing unit does the processing of data and the output unit produces the output. The memory unit holds the data and instructions during the processing. **Digital Computer:** A digital computer can be defined as a programmable machine which reads the binary data passed as instructions, processes this binary data, and displays a calculated digital output. Therefore, Digital computers are those that work on the digital data. ### **Details of Functional Components of a Digital Computer** - **Input Unit**: The input unit consists of input devices that are attached to the computer. These devices take input and convert it into binary language that the computer understands. Some of the common input devices are keyboard, mouse, joystick, scanner etc. - Central Processing Unit (CPU): Once the information is entered into the computer by the input device, the processor processes it. The CPU is called the brain of the computer because it is the control center of the computer. It first fetches instructions from memory and then interprets them so as to know what is to be done. If required, data is fetched from memory or input device. Thereafter CPU executes or performs the required computation and then either stores the output or displays on the output device. The CPU has three main components which are responsible for different functions Arithmetic Logic Unit (ALU), Control Unit (CU) and Memory registers - Arithmetic and Logic Unit (ALU): The ALU, as its name suggests performs mathematical calculations and takes logical decisions. Arithmetic calculations include addition, subtraction, multiplication and division. Logical decisions involve comparison of two data items to see which one is larger or smaller or equal. - Control Unit: The Control unit coordinates and controls the data flow in and out of CPU and also controls all the operations of ALU, memory registers and also input/output units. It is also responsible for carrying out all the instructions stored in the program. It decodes the fetched instruction, interprets it and sends control signals to input/output devices until the required operation is done properly by ALU and memory. - **Memory Registers**: A register is a temporary unit of memory in the CPU. These are used to store the data which is directly used by the processor. Registers can be of different sizes(16 bit, 32 bit, 64 bit and so on) and each register inside the CPU has - a specific function like storing data, storing an instruction, storing address of a location in memory etc. The user registers can be used by an assembly language programmer for storing operands, intermediate results etc. Accumulator (ACC) is the main register in the ALU and contains one of the operands of an operation to be performed in the ALU. - Memory: Memory attached to the CPU is used for storage of data and instructions and is called internal memory. The internal memory is divided into many storage locations, each of which can store data or instructions. Each memory location is of the same size and has an address. With the help of the address, the computer can read any memory location easily without having to search the entire memory, when a program is executed, it's data is copied to the internal memory and is stored in the memory till the end of the execution. The internal memory is also called the Primary memory or Main memory. This memory is also called as RAM, i.e. Random Access Memory. The time of access of data is independent of its location in memory, therefore this memory is also called Random Access memory (RAM). Read this for different types of RAMs - **Output Unit:** The output unit consists of output devices that are attached with the computer. It converts the binary data coming from CPU to human understandable form. The common output devices are monitor, printer, plotter etc. #### **Interconnection between Functional Components** A computer consists of input unit that takes input, a CPU that processes the input and an output unit that produces output. All these devices communicate with each other through a common bus. A bus is a transmission path, made of a set of conducting wires over which data or information in the form of electric signals, is passed from one component to another in a computer. The bus can be of three types – Address bus, Data bus and Control Bus. Following figure shows the connection of various functional components: The address bus carries the address location of the data or instruction. The data bus carries data from one component to another and the control bus carries the control signals. The system bus is the common communication path that carries signals to/from CPU, main memory and input/output devices. The input/output devices communicate with the system bus through the controller circuit which helps in managing various input/output devices attached to the computer. #### **Von-Neumann Architecture:** Historically there have been 2 types of Computers: - 1. **Fixed Program Computers** Their function is very specific and they couldn't be re-programmed, e.g. Calculators. - 2. **Stored Program Computers** These can be programmed to carry out many different tasks, applications are stored on them, hence the name. The modern computers are based on a stored-program concept introduced by John Von Neumann. In this stored-program concept, programs and data are stored in a separate storage unit called memories and are treated the same. This novel idea meant that a computer built with this architecture would be much easier to reprogram. The basic structure is like this, It is also known as **ISA** (Instruction set architecture) computer and is having three basic units: - 1. The Central Processing Unit (CPU) - 2. The Main Memory Unit - 3. The Input/Output Device Let's consider them in details. #### • Control Unit - A control unit (CU) handles all processor control signals. It directs all input and output flow, fetches code for instructions, and controls how data moves around the system. ### • Arithmetic and Logic Unit (ALU) - The arithmetic logic unit is that part of the CPU that handles all the calculations the CPU may need, e.g. Addition, Subtraction, Comparisons. It performs Logical Operations, Bit Shifting Operations, and Arithmetic operations. Figure – Basic CPU structure, illustrating ALU - Main Memory Unit (Registers) - 1. **Accumulator:** Stores the results of calculations made by ALU. - 2. **Program Counter (PC):** Keeps track of the memory location of the next instructions to be dealt with. The PC then passes this next address to Memory Address Register (MAR). - 3. **Memory Address Register (MAR):** It stores the memory locations of instructions that need to be fetched from memory or stored into memory. - 4. **Memory Data Register (MDR):** It stores instructions fetched from memory or any data that is to be transferred to, and stored in, memory. - 5. **Current Instruction Register (CIR):** It stores the most recently fetched instructions while it is waiting to be coded and executed. - 6. **Instruction Buffer Register (IBR):** The instruction that is not to be executed immediately is placed in the instruction buffer register IBR. - **Input/Output Devices** Program or data is read into main memory from the *input device* or secondary storage under the control of CPU input instruction. *Output devices* are used to output the information from a computer. If some results are evaluated by computer and it is stored in the computer, then with the help of output devices, we can present them to the user. - **Buses** Data is transmitted from one part of a computer to another, connecting all major internal components to the CPU and memory, by the means of Buses. Types: - 1. **Data Bus:** It carries data among the memory unit, the I/O devices, and the processor. - 2. Address Bus: It carries the address of data (not the actual data) between memory and processor. - 3. **Control Bus:** It carries control commands from the CPU (and status signals from other devices) in order to control and coordinate all the activities within the computer. #### Von Neumann bottleneck - Whatever we do to enhance performance, we cannot get away from the fact that instructions can only be done one at a time and can only be carried out sequentially. Both of these factors hold back the competence of the CPU. This is commonly referred to as the 'Von Neumann bottleneck'. We can provide a Von Neumann processor with more cache, more RAM, or faster components but if original gains are to be made in CPU performance then an influential inspection needs to take place of CPU configuration. This architecture is very important and is used in our PCs and even in Super Computers. #### **Basic Computer Instructions** The basic computer has 16-bit instruction register (IR) which can denote either memory reference or register reference or inputoutput instruction. 1. **Memory Reference** – These instructions refer to memory address as an operand. The other operand is always accumulator. Specifies 12-bit address, 3-bit opcode (other than 111) and 1-bit addressing mode for direct and indirect addressing. | | | Will out to | | | |---|-----|-------------|----------------------------------------|--| | I | OPC | ODE | MEMORY ADDRESS | | | - | | | III. III. III. III. III. III. III. III | | #### Example - IR register contains = 0001XXXXXXXXXXXXXXX, i.e. ADD after fetching and decoding of instruction we find out that it is a memory reference instruction for ADD operation. Hence, $DR \leftarrow M[AR]$ $$AC \leftarrow AC + DR, SC \leftarrow 0$$ 2. **Register Reference** – These instructions perform operations on registers rather than memory addresses. The IR(14 – 12) is 111 (differentiates it from memory reference) and IR(15) is 0 (differentiates it from input/output instructions). The rest 12 bits specify register operation. #### Example - IR register contains = 0111001000000000, i.e. CMA after fetch and decode cycle we find out that it is a register reference instruction for complement accumulator. Hence, $AC \leftarrow \sim AC$ 3. **Input/Output** – These instructions are for communication between computer and outside environment. The IR(14 – 12) is 111 (differentiates it from memory reference) and IR(15) is 1 (differentiates it from register reference instructions). The rest 12 bits specify I/O operation. | 15 | 14 | | 12 | 11 | | |----|----|---|----|----|------------------------| | 1 | 1 | 1 | 1 | | INPUT/OUTPUT OPERATION | ### Example - IR register contains = 1111100000000000, i.e. INP after fetch and decode cycle we find out that it is an input/output instruction for inputing character. Hence, INPUT character from peripheral device. The set of instructions incorporated in 16 bit IR register are: - 1. Arithmetic, logical and shift instructions (and, add, complement, circulate left, right, etc) - 2. To move information to and from memory (store the accumulator, load the accumulator) - 3. Program control instructions with status conditions (branch, skip) - 4. Input output instructions (input character, output character) | Symbol Symbol | Hexadeci | mal Code | Description | |---------------|----------|----------|-----------------------------------| | AND | 0xxx | 8xxx | And memory word to AC | | ADD | 1xxx | 9xxx | Add memory word to AC | | LDA | 2xxx | Axxx | Load memory word to AC | | STA | 3xxx | Bxxx | Store AC content in memory | | BUN | 4xxx | Cxxx | Branch Unconditionally | | BSA | 5xxx | Dxxx | Branch and Save Return Address | | ISZ | 6xxx | Exxx | Increment and skip if 0 | | CLA | 78 | 00 | Clear AC | | CLE | 74 | 00 | Clear E(overflow bit) | | CMA | 72 | 00 | Complement AC | | CME | 71 | 00 | Complement E | | CIR | 70 | 80 | Circulate right AC and E | | CIL | 70 | 40 | Circulate left AC and E | | INC | 70 | 20 | Increment AC | | SPA | 70 | 10 | Skip next instruction if AC > 0 | | SNA | 70 | 08 | Skip next instruction if AC < 0 | | SZA | 70 | 04 | Skip next instruction if $AC = 0$ | | Symbol | Hexadecimal Code | Description | | |--------|------------------|----------------------------------|--| | SZE | 7002 | Skip next instruction if $E = 0$ | | | HLT | 7001 | Halt computer | | | INP | F800 | Input character to AC | | | OUT | F400 | Output character from AC | | | SKI | F200 | Skip on input flag | | | SKO | F100 | Skip on output flag | | | ION | F080 | Interrupt On | | | IOF | F040 | Interrupt Off | | ### **Instruction Formats (Zero, One, Two and Three Address Instruction):** A computer performs a task based on the instruction provided. Instruction in computers comprises groups called fields. These fields contain different information as for computers everything is in 0 and 1 so each field has different significance based on which a CPU decides what to perform. The most common fields are: - Operation field specifies the operation to be performed like addition. - Address field which contains the location of the operand, i.e., register or memory location. - Mode field which specifies how operand is to be founded. Instruction is of variable length depending upon the number of addresses it contains. Generally, CPU organization is of three types based on the number of address fields: - 1. Single Accumulator organization - 2. General register organization - 3. Stack organization In the first organization, the operation is done involving a special register called the accumulator. In second on multiple registers are used for the computation purpose. In the third organization the work on stack basis operation due to which it does not contain any address field. Only a single organization doesn't need to be applied, a blend of various organizations is mostly what we see generally. Based on the number of address, instructions are classified as: Note that we will use X = (A+B)\*(C+D) expression to showcase the procedure. #### 1. Zero Address Instructions - A stack-based computer does not use the address field in the instruction. To evaluate an expression first it is converted to reverse Polish Notation i.e. Postfix Notation. Expression: X = (A+B)\*(C+D) Postfixed : X = AB + CD + \* TOP means top of stack M[X] is any memory location PUSH A TOP = A PUSH B TOP = B ADD TOP = A+B PUSH C TOP = C PUSH D TOP = D ADD TOP = C+D MUL TOP = (C+D)\*(A+B) POP X M[X] = TOP ### 2.One Address Instructions - This uses an implied ACCUMULATOR register for data manipulation. One operand is in the accumulator and the other is in the register or memory location. Implied means that the CPU already knows that one operand is in the accumulator so there is no need to specify it. | opcode | operand/address of operand | mode | |--------|----------------------------|------| |--------|----------------------------|------| Expression: X = (A+B)\*(C+D) AC is accumulator M[] is any memory location M[T] is temporary location LOAD $A ext{ } AC = M[A]$ ADD B AC = AC + M[B] STORE T M[T] = AC LOAD $C \quad AC = M[C]$ ADD $$D ext{ AC} = AC + M[D]$$ $$MUL$$ $T$ $AC = AC * M[T]$ STORE $$X M[X] = AC$$ #### 3.Two Address Instructions - This is common in commercial computers. Here two addresses can be specified in the instruction. Unlike earlier in one address instruction, the result was stored in the accumulator, here the result can be stored at different locations rather than just accumulators, but require more number of bit to represent address. | opcode | Destination address | Source address | mode | | |--------|---------------------|----------------|------|--| | | | | | | Here destination address can also contain operand. Expression: X = (A+B)\*(C+D) R1, R2 are registers M[] is any memory location MOV R1, A $$R1 = M[A]$$ ADD R1, B $$R1 = R1 + M[B]$$ MOV R2, C $$R2 = C$$ ADD R2, D $$R2 = R2 + D$$ MUL R1, R2 $$R1 = R1 * R2$$ $$MOV \quad X, R1 \quad M[X] = R1$$ #### 4. Three Address Instructions - This has three address field to specify a register or a memory location. Program created are much short in size but number of bits per instruction increase. These instructions make creation of program much easier but it does not mean that program will run much faster because now instruction only contain more information but each micro operation (changing content of register, loading address in address bus etc.) will be performed in one cycle only. | opcode Destination address Source address Source address m | mode | ource address | Source | Source address | Destination address | opcode | | |------------------------------------------------------------|------|---------------|--------|----------------|---------------------|--------|--| |------------------------------------------------------------|------|---------------|--------|----------------|---------------------|--------|--| Expression: X = (A+B)\*(C+D) R1, R2 are registers M[] is any memory location ADD R1, A, B $$R1 = M[A] + M[B]$$ ADD R2, C, D $$R2 = M[C] + M[D]$$ MUL $$X, R1, R2$$ $M[X] = R1 * R2$ #### INSTRUCTION SET ARCHITECTURE In this article, we look at what an *Instruction Set Architecture (ISA)* is and what is the difference between an 'ISA' and *Microarchitecture*. An ISA is defined as the design of a computer from the *Programmer's Perspective*. This basically means that an ISA describes the **design of a Computer** in terms of the **basic operations** it must support. The ISA is not concerned with the implementation-specific details of a computer. It is only concerned with the set or collection of basic operations the computer must support. For example, the AMD Athlon and the Core 2 Duo processors have entirely different implementations but they support more or less the same set of basic operations as defined in the x86 Instruction Set. Let us try to understand the Objectives of an ISA by taking the example of the MIPS ISA. MIPS is one of the most widely used ISAs in education due to its simplicity. 1. The ISA defines the **types of instructions** to be supported by the processor. Based on the type of operations they perform MIPS Instructions are classified into 3 types: • Arithmetic/Logic Instructions: These Instructions perform various Arithmetic & Logical operations on one or more operands. • Data Transfer Instructions: These instructions are responsible for the transfer of instructions from memory to the processor registers and vice versa. • Branch and Jump Instructions: These instructions are responsible for breaking the sequential flow of instructions and jumping to instructions at various other locations, this is necessary for the implementation of *functions* and *conditional statements*. 2. The ISA defines the **maximum length** of each type of instruction. Since the MIPS is a 32 bit ISA, each instruction must be accommodated within 32 bits. 3. The ISA defines the **Instruction Format** of each type of instruction. The **Instruction Format** determines how the entire instruction is encoded within 32 bits There are 3 types of Instruction Formats in the MIPS ISA: - R-Instruction Format - I-Instruction Format - J-Instruction Format If we look at the Abstraction Hierarchy: Figure – The Abstraction Hierarchy We note that the **Microarchitectural** level lies just below the **ISA** level and hence is concerned with the implementation of the basic operations to be supported by the Computer as defined by the **ISA**. Therefore we can say that the AMD Athlon and Core 2 Duo processors are based on the same ISA but have different microarchitectures with different performance and efficiencies. Now one may ask the need to distinguish between **Microarchitecture** and **ISA**? The answer to this lies in the need to standardize and maintain the compatibility of programs across different hardware implementations based on the same **ISA**. Making different machines compatible with the same set of basic instructions (The ISA) allows the same program to run smoothly on many different machines thereby making it easier for the programmers to document and maintain code for many different machines simultaneously and efficiently. This Flexibility is the reason we first define an ISA and then design different microarchitectures complying with this ISA for implementing the machine. The design of a lower-level ISA is one of the major tasks in the study of Computer Architecture. Instruction Set Architecture The ISA is responsible for defining the set of instructions to be supported by the processor. For example, some of the instructions defined by the ARMv7 ISA are given below. The Branch of **Computer Architecture** is more inclined towards the Analysis and Design of Instruction Set Architecture. For Example, Intel developed the *x86* architecture, ARM developed the *ARM* architecture, & AMD developed the *amd64* architecture. The RISC-V ISA developed by UC Berkeley is an example of an Open Source ISA. #### Microarchitecture The Microarchitecture is more concerned with the lower level implementation of how the instructions are going to be executed and deals with concepts like Instruction Pipelining, Branch Prediction, Out of Order Execution. On the other hand, the Branch of **Computer Organization** is concerned with the implementation of a particular ISA deals with various hardware implementation techniques, i.e. is the Microarchitecture level. For Example, ARM licenses other companies like Qualcomm, Apple for using ARM ISA, but each of these companies have their own implementations of this ISA thereby making them different in performance and power efficiency. The *Krait* cores developed by Qualcomm have a different microarchitecture and the Apple A-series processors have a different microarchitecture. The *x86* was developed by Intel, but we see that almost every year Intel comes up with a new generation of i-series processors. The *x86* architecture on which most of the Intel Processors are based essentially remains the same across all these generations but, where they differ is in the underlying Microarchitecture. They differ in their implementation and hence are claimed to have improved Performance. These various Microarchitectures developed by Intel are codenamed as 'Nehalem', 'Sandybridge', 'Ivybridge', and so on. Therefore, in conclusion, we can say that different machines may be based on the same ISA but have different Microarchitectures. #### **INSTRUCTIONS & INSTRUCTION SEQUENCING** The tasks carried out by a computer program consist of a sequence of small steps, such as adding two numbers, testing for a particular condition, reading a character from the keyboard, or sending a character to be displayed on a display screen. #### A computer must have instructions capable of performing 4 types of operations: - 1) Data transfers between the memory and the registers (MOV, PUSH, POP, XCHG). - 2) Arithmetic and logic operations on data (ADD, SUB, MUL, DIV, AND, OR, NOT). - 3) Program sequencing and control(CALL.RET, LOOP, INT). - 4) I/0 transfers (IN, OUT). #### **REGISTER TRANSFER NOTATION (RTN)** Here we describe the transfer of information from one location in a computer to another. Possible locations that may be involved in such transfers are memory locations, processor registers, or registers in the I/O subsystem. Most of the time, we identify such locations symbolically with convenient names. - The possible locations in which transfer of information occurs are: - 1) Memory-location - 2) Processor register & - 3) Registers in I/O device. | Location | Hardware Binary Address | Example | Description | | | | |---------------|-------------------------|------------------|----------------------------------------------------------------------|--|--|--| | Memory | LOC, PLACE, NUM | R1 ← [LOC] | Contents of memory-location LOC<br>are transferred into register R1. | | | | | Processor | R0, R1 ,R2 | [R3] ← [R1]+[R2] | Add the contents of register R1 &R2<br>and places their sum into R3. | | | | | I/O Registers | DATAIN, DATAOUT | R1 ← DATAIN | Contents of I/O register DATAIN are<br>transferred into register R1. | | | | #### ASSEMBLY LANGUAGE NOTATION • To represent machine instructions and programs, assembly language format is used. | Assembly Language Format | Description | |--------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Move LOC, R1 | Transfer data from memory-location LOC to register R1. The contents of LOC are unchanged by the execution of this instruction, but the old contents of register R1 are overwritten. | | Add R1, R2, R3 | Add the contents of registers R1 and R2, and places their sum into register R3. | #### **BASIC INSTRUCTION TYPES** | Instruction Syntax Sypte | | Example | Description | Instructions<br>for<br>Operation<br>C<-[A]+[B] | | |--------------------------|--------------------------------------|-----------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------|--| | Three<br>Address | Opcode Source1, Source2, Destination | Add A,B,C | Add the contents of<br>memory-locations A & B.<br>Then, place the result into<br>location C. | (62) 622 | | | Two Address | Opcode Source, Destination | Add A,B | Add the contents of memory-locations A & B. Then, place the result into location B, replacing the original contents of this location. Operand B is both a source and a destination. | Move B, C<br>Add A, C | | | One Address | Opcode Source/Destination | Load A | Copy contents of memory-<br>location A into accumulator. | Load A<br>Add B | | | | | Add B | Add contents of memory-<br>location B to contents of<br>accumulator register &<br>place sum back into<br>accumulator. | Store C | | | 24040000 | | Store C | Copy the contents of the<br>accumulator into location C. | 05542554071311540 | | | Zero<br>Address | Opcode [no Source/Destination] | Push | Locations of all operands<br>are defined implicitly.<br>The operands are stored in<br>a pushdown stack. | Not possible | | ## INSTRUCTION EXECUTION & STRAIGHT LINE SEQUENCING - The program is executed as follows: - 1) Initially, the address of the first instruction is loaded into PC (Figure 2.8). - 2) Then, the processor control circuits use the information in the PC to fetch and execute instructions, one at a time, in the order of increasing addresses. This is called Straight-Line sequencing. - 3) During the execution of each instruction, PC is incremented by 4 to point to next instruction. - There are 2 phases for Instruction Execution: - 1) Fetch Phase: The instruction is fetched from the memory-location and placed in the IR. - 2) Execute Phase: The contents of IR is examined to determine which operation is to be performed. The specified-operation is then performed by the processor. ## **Program Explanation** - Consider the program for adding a list of n numbers (Figure 2.9). - The Address of the memory-locations containing the n numbers are symbolically given as NUM1, NUM2.....NUMn. - Separate Add instruction is used to add each number to the contents of register R0. - After all the numbers have been added, the result is placed in memory-location SUM. #### **ADDRESSING MODES:** The different ways in which the location of an operand is specified in an instruction are referred to as #### 1)Immediate Mode - The operand is given explicitly in the instruction. - For example, the instruction Move #200, R0 ;Place the value 200 in register R0. - Clearly, the immediate mode is only used to specify the value of a source-operand. #### 2)Register Mode - The operand is the contents of a register. - The name (or address) of the register is given in the instruction. - Registers are used as temporary storage locations where the data in a register are accessed. - For example, the instruction Move R1, R2 ;Copy content of register R1 into register R2. ## 3)Absolute (Direct) Mode - The operand is in a memory-location. - The address of memory-location is given explicitly in the instruction. - The absolute mode can represent global variables in the program. - For example, the instruction Move LOC, R2 ;Copy content of memory-location LOC into register R2. #### 4)INDIRECTION AND POINTERS - Instruction does not give the operand or its address explicitly. - Instead, the instruction provides information from which the new address of the operand can be determined. - This address is called Effective Address (EA) of the operand. #### **Indirect Mode** - The EA of the operand is the contents of a register(or memory-location). - The register (or memory-location) that contains the address of an operand is called a Pointer. - We denote the indirection by $\rightarrow$ name of the register or $\rightarrow$ new address given in the instruction. E.g. Add (R1), R0; The operand is in memory. Register R1 gives the effective-address (B) of the operand. The data is read from location B and added to contents of register R0. • To execute the Add instruction in fig 2.11 (a), the processor uses the value which is in register R1, as the EA of the operand. - It requests a read operation from the memory to read the contents of location B. The value read is the desired operand, which the processor adds to the contents of register R0. - Indirect addressing through a memory-location is also possible as shown in fig 2.11(b). In this case, the processor first reads the contents of memory-location A, then requests a second read operation using the value B as an address to obtain the operand. ## **Program Explanation** - In above program, Register R2 is used as a pointer to the numbers in the list, and the operands are accessed indirectly through R2. - The initialization-section of the program loads the counter-value n from memory-location N into R1 and uses the immediate addressing-mode to place the address value NUM1, which is the address of the first number in the list, into R2. Then it clears R0 to 0. - The first two instructions in the loop implement the unspecified instruction block starting at LOOP. - The first time through the loop, the instruction Add (R2), R0 fetches the operand at location NUM1 and adds it to R0. - The second Add instruction adds 4 to the contents of the pointer R2, so that it will contain the address value NUM2 when the above instruction is executed in the second pass through the loop. #### 5)INDEXING AND ARRAYS - · A different kind of flexibility for accessing operands is useful in dealing with lists and arrays. Index mode - The operation is indicated as X(Ri) where X=the constant value which defines an offset(also called a displacement). Ri=the name of the index register which contains address of a new location. - The effective-address of the operand is given by EA=X+[Ri] - The contents of the index-register are not changed in the process of generating the effective address. - The constant X may be given either $\rightarrow$ as an explicit number or $\rightarrow$ as a symbolic-name representing a numerical value. #### 6)Base with Index Mode - Another version of the Index mode uses 2 registers which can be denoted as(Ri, Rj) - Here, a second register may be used to contain the offset X. - The second register is usually called the base register. - ullet The effective-address of the operand is given by EA=[Ri]+[Rj] - This form of indexed addressing provides more flexibility in accessing operands because both components of the effective-address can be changed. #### 7)Base with Index & Offset Mode - Another version of the Index mode uses 2 registers plus a constant, which can be denoted as X(Ri, Rj) - The effective-address of the operand is given by EA=X+[Ri]+[Ri] - •This added flexibility is useful in accessing multiple components inside each item in a record, where the beginning of an item is specified by the (Ri, Rj) part of the addressing-mode. In other words, this mode implements a 3-dimensional array. #### 8) RELATIVE MODE - This is similar to index-mode with one difference: The effective-address is determined using the PC in place of the general purpose register Ri. - The operation is indicated as X(PC). - X(PC) denotes an effective-address of the operand which is X locations above or below the current contents of PC. - Since the addressed-location is identified "relative" to the PC, the name Relative mode is associated with this type of addressing. - This mode is used commonly in conditional branch instructions. - An instruction such as Branch > 0 LOOP ;Causes program execution to go to the branch target locationidentified by name LOOP if branch condition is satisfied. #### 9) Auto Increment Mode Effective-address of operand is contents of a register specified in the instruction (Fig: 2.16). After accessing the operand, the contents of this register are automatically incremented to point to the next item in a list. Implicitly, the increment amount is 1. This mode is denoted as (Ri)+; where Ri=pointer-register. 10) Auto Decrement Mode The contents of a register specified in the instruction are first automatically decremented and are then used as the effective-address of the operand. This mode is denoted as -(Ri) ; where Ri=pointer-register. These 2 modes can be used together to implement an important data structure called a stack. ## **ENCODING OF MACHINE INSTRUCTIONS** - To be executed in a processor, an instruction must be encoded in a binary-pattern. Such encoded instructions are referred to as Machine Instructions. - The instructions that use symbolic-names and acronyms are called assembly language instructions. - We have seen instructions that perform operations such as add, subtract, move, shift, rotate, and branch. These instructions may use operands of different sizes, such as 32-bit and 8-bit numbers. - Let us examine some typical cases. The instruction Add R1, R2; Has to specify the registers R1 and R2, in addition to the OP code. If the processor has 16 registers, then four bits are needed to identify each register. Additional bits are needed to indicate that the Register addressing-mode is used for each operand. The instruction Move 24(R0), R5; Requires 16 bits to denote the OP code and the two registers, and some bits to express that the source operand uses the Index addressing mode and that the index value is 24. - In all these examples, the instructions can be encoded in a 32-bit word (Fig 2.39). - The OP code for given instruction refers to type of operation that is to be performed. - Source and destination field refers to source and destination operand respectively. - The "Other info" field allows us to specify the additional information that may be needed such as an index value or an immediate operand. - Using multiple words, we can implement complex instructions, closely resembling operations in high level programming languages. The term complex instruction set computers (CISC) refers to processors that use - CISC approach results in instructions of variable length, dependent on the number of operands and the type of addressing modes used. - In RISC (reduced instruction set computers), any instruction occupies only one word. - The RISC approach introduced other restrictions such as that all manipulation of data must be done on operands that are already in registers. Ex: Add R1,R2,R3 - In RISC type machine, the memory references are limited to only Load/Store operations.Ro Figure 2.39 Encoding instructions into 32-bit words. ## **Rotate operations:** - In shift operations, the bits shifted out of the operand are lost, except for the last bit shifted out which is retained in the Carry-flag C. - To preserve all bits, a set of rotate instructions can be used. - They move the bits that are shifted out of one end of the operand back into the other end. - Two versions of both the left and right rotate instructions are usually provided. In one version, the bits of the operand is simply rotated. In the other version, the rotation includes the C flag. #### SHIFT AND ROTATE INSTRUCTIONS - There are many applications that require the bits of an operand to be shifted right or left some specified number of bit positions. - The details of how the shifts are performed depend on whether the operand is a signed number or some more general binary-coded information. - For general operands, we use a logical shift. For a number, we use an arithmetic shift, which preserves the sign of the number. ## LOGICAL SHIFTS - Two logical shift instructions are - 1) Shifting left (LShiftL) & - 2) Shifting right (LShiftR). # Interaction between Assembly language and high levellanguage High level languages such as Python, C, C++, Java, C# allows the programmer towrite the alpha numeric codes. The compiler converts the alpha numeric codes into assembly code. Assembly language codes example: ADD R1, R2, R3 ADDi R1, R2, 20 LOAD R1, 20(R2) STORE R1, 20(R2) The assembler intern converts the assembly language code into binary code(machine code). # Difference between assembly language and high level language | Parameters | Assembly Language | High-Level Language | |------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------| | Conversion | The assembly language requires an assembler for the process of conversion. | A high-level language requires an interpreter/ compiler for the process of conversion. | | Process of Conversion | We perform the conversion of an assembly language into a machine language. | We perform the conversion of a high-level language into an assembly language and then into a machine-level language for the computer. | | Machine<br>Dependency | The assembly language is a machine-dependent type of language. | A high-level language is a machine-<br>independent type of language. | | Codes | It makes use of the mnemonic codes for operation. | It makes use of the English statements for operation. | | Operation of Lower Level | It provides support for various low-level operations. | It does not provide any support for low-level languages. | | Access to Hardware Component | Accessing the hardware component is very easy in this case. | Accessing the hardware component is very difficult in this case. | | Compactness in Code | The code is more compact in this case. | No code compactness is present in this case. | | Type of Processor | The program that we write for one processor in an assembly language will not run on any other processor type. It means that it is processor-dependent. | This language is processor-independent. It means that the programs that we write using high-level languages can easily run on any processor independent of its type. | | Accuracy | It has better accuracy. | Accuracy is much lesser in this case. | | Performance | An assembly language performs better than any high-level language, in general. | The performance is comparatively not so good. | | Length of | It is shorter in assembly language. | It is larger in a high-level language. | |------------------------------|------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------| | Executable Code | | | | Time Taken in Code Execution | Execution of code takes less time in this case because the code is not very large. | It takes up more time for execution because it needs to execute a large code. | | Efficiency | It is way more efficient because of the shorter executable codes. | It is comparatively less efficient because<br>the executable codes are comparatively<br>longer in length. | | Reading of<br>Pointers | We can do that directly at a physical address in the case of an assembly language. | It is not possible to do so in the case of a high-level language. | | Extra Instructions | We don't need that in the case of an assembly language. | This language must give some extra instructions for running any code on the computer. | | Ease of<br>Understanding | It is very difficult to debug and understand the code of an assembly language. | It is very easy to debug and understand the code of an assembly language. | ## UNIT -IV PROCESSOR ## **INSTRUCTION EXECUTION** Steps in Instruction Execution by CPU: Six steps are involved in execution of an instruction by CPU. However, not all of them are required for all instructions. - 1. Fetch instruction - 2. Decode information - 3. Perform ALU operation - 4. Access memory - 5. Update register file - 6. Update the Program Counter (PC) ## **Step 1: Fetch instruction** Execution cycle starts with fetching instruction from main memory. The instruction at the current <u>program counter (PC)</u> will be fetched and will be stored in <u>instruction</u> register (IR). # **Step 2: Decode instruction** During this cycle the encoded instruction present in the IR (instruction register) is interpreted by the decoder. # **Step 3: Perform ALU operation** ALU (<u>Arithmetic Logic Unit</u>) is where two operands in the instruction will be operated on given operator in the instructions. Such as, if the instruction was to add two numbers, then here the addition will happen. ALU take two values and output one, the result of the operation. **Step 4: Access memory** There are only two kind of instructions that access memory: LOAD and STORE. LOAD copies a value from memory to a register and STORE copies a register value to memory. Any other instruction skips this step. # **Step 5: Update Register File** In this step, the output/result of the ALU is written back to the register file to update the register file. The result could also be due to a LOAD from memory. Some instructions don't have results to store. For example, BRANCH and JUMP instructions do not have any results to store. # **Step 6: Update the PC (Program Counter)** Ultimately, at the end of the execution of the current instruction, we need to update the program counter (PC) to the address of the next instruction, so that we can go back to step 1 where the CPU will fetch instruction. However, the program counter might need to be set to other memory address than the next one if the instruction was BRANCH or JUMP # **BUILDING A DATAPATH** # Datapath: It is a processor components that are used to perform arithmetic operations and elements that holds the data. Main elements of data path are Instruction memory, Program Counter (PC), ALU adder. ## Building a data path MIPS processor data path can be built incrementally by only considering the subsets of instruction. # Types of Elements in the Datapath State element: - · A memory element, i.e., it contains a state - E.g., program counter, instruction memory Combinational element: - · Elements that operate on values - · Eg adder ALU E.g. adder, ALU Elements required by the different classes of instructions - · Arithmetic and logical instructions - · Data transfer instructions - · Branch instructions ## R-Format ALU Instructions - · E.g., add \$t1, \$t2, \$t3 - · Perform arithmetic/logical operation - · Read two register operands and write register result # Register file: - · A collection of the registers - · Any register can be read or written by specifying the number of the register - · Contains the register state of the computer ## **ALU** - · Takes two 32 bit input and produces a 32 bit output - · Also, sets one-bit signal if the results is 0 - The operation done by ALU is controlled by a 4 bit control signal input. This is set according to the instruction . Portion of data path used for fetching instructions and incrementing the program counter # Register type instructions ## ADD \$t1, \$t2, \$t3 a. Registers b. ALU ## Use of sign extension unit It is used to convert the 16 bit constant input into 32 bit constant output. Addi \$R1, \$R2, 20 Here 20 is the 16 bit constant. Sign extension unit is used to convert this 16 bit constant into 32 bit output and given to the ALU unit as a input ## Use of shift left 2 Shift left 2 operation is used in conditional branch instruction Beq \$t0, \$t1, 250 It is used to find the exact byte address from the branch address specified in word (here 250 is word no.) Control branch : PC = PC + 250 If we perform the shift left 2 operation on 250 we can obtain 1000. The control branch address is, PC = PC + 1000 Creating a single data path Here all the operations are included in single data path. Operations such as R-Type, I-Type, J-type are included in this datapath. ----- ## **DESIGNING OF CONTROL UNIT** Control Unit is the part of the computer's central processing unit (CPU), which directs the operation of the processor. It was included as part of the Von Neumann Architecture by John von Neumann. It is the responsibility of the Control Unit to tell the computer's memory, arithmetic/logic unit and input and output devices how to respond to the instructions that have been sent to the processor. It fetches internal instructions of the programs from the main memory to the processor instruction register, and based on this register contents, the control unit generates a control signal that supervises the execution of these instructions. A control unit works by receiving input information to which it converts into control signals, which are then sent to the central processor. The computer's processor then tells the attached hardware what operations to perform. The functions that a control unit performs are dependent on the type of CPU because the architecture of CPU varies from manufacturer to manufacturer. Examples of devices that require a CU are: - Control Processing Units(CPUs) - Graphics Processing Units(GPUs) ## **Functions of the Control Unit –** - 1. It coordinates the sequence of data movements into, out of, and between a processor's many sub-units. - 2. It interprets instructions. - 3. It controls data flow inside the processor. - 4. It receives external instructions or commands to which it converts to sequence of control signals. - 5. It controls many execution units(i.e. ALU, data buffers and registers) contained within a CPU. - 6. It also handles multiple tasks, such as fetching, decoding, execution handling and storing results. # Simple datapath with the control unit Truth table for ouptut control lines and ALUOp | * | | | | | | 19 | | - | | 7 | _ | THE PERSON NAMED IN | | | | |-------------|---|----------------|---|---|---|----|------------|-----|--------------|----------------|----------|---------------------|------------|------------|---| | Instruction | | Input Op [5:0] | | | | | Reg<br>Dst | ALU | Mem<br>toReg | Reg Mem<br>W R | Mem<br>W | Branch | ALU<br>Op1 | ALU<br>Op2 | | | | 5 | 4 | 3 | 2 | 1 | 0 | | | | | | | | | | | R-format | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | | lw | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | | 5W | 1 | 0 | 1 | 0 | 1 | 1 | × | 1 | × | 0 | 0 | 1 | 0 | 0 | 0 | | beq | 0 | 0 | 0 | 1 | 0 | 0 | × | 0 | × | 0 | 0 | 0 | 1 | 0 | 1 | ## **Types of Control Unit –** There are two types of control units: Hardwired control unit and Microprogrammable control unit. ## HARDWIRED CONTROL METHOD It is a hardware based method to design the control unit. - A Hard-wired Control consists of two decoders, a sequence counter, and a number of logic gates. - An instruction fetched from the memory unit is placed in the instruction register (IR). - The component of an instruction register includes; I bit, the operation code, and bits 0 through 11. - o The operation code in bits 12 through 14 are coded with a 3 x 8 decoder. - o The outputs of the decoder are designated by the symbols D0 through D7. - The operation code at bit 15 is transferred to a flip-flop designated by the symbol I. - The operation codes from Bits 0 through 11 are applied to the control logic gates. - o The Sequence counter (SC) can count in binary from 0 through 15 Control Unit of a Basic Computer In the Hardwired control unit, the control signals that are important for instruction execution control are generated by specially designed hardware logical circuits, in which we can not modify the signal generation method without physical change of the circuit structure. The operation code of an instruction contains the basic data for control signal generation. In the instruction decoder, the operation code is decoded. The instruction decoder constitutes a set of many decoders that decode different fields of the instruction opcode. As a result, few output lines going out from the instruction decoder obtains active signal values. These output lines are connected to the inputs of the matrix that generates control signals for execution units of the computer. This matrix implements logical combinations of the decoded signals from the instruction opcode with the outputs from the matrix that generates signals representing consecutive control unit states and with signals coming from the outside of the processor, e.g. interrupt signals. The matrices are built in a similar way as a programmable logic arrays. Block diagram of a hardwired control unit of a computer Control signals for an instruction execution have to be generated not in a single time point but during the entire time interval that corresponds to the instruction execution cycle. Following the structure of this cycle, the suitable sequence of internal states is organized in the control unit. A number of signals generated by the control signal generator matrix are sent back to inputs of the next control state generator matrix. This matrix combines these signals with the timing signals, which are generated by the timing unit based on the rectangular patterns usually supplied by the quartz generator. When a new instruction arrives at the control unit, the control units is in the initial state of new instruction fetching. Instruction decoding allows the control unit enters the first state relating execution of the new instruction, which lasts as long as the timing signals and other input signals as flags and state information of the computer remain unaltered. A change of any of the earlier mentioned signals stimulates the change of the control unit state. This causes that a new respective input is generated for the control signal generator matrix. When an external signal appears, (e.g. an interrupt) the control unit takes entry into a next control state that is the state concerned with the reaction to this external signal (e.g. interrupt processing). The values of flags and state variables of the computer are used to select suitable states for the instruction execution cycle. The last states in the cycle are control states that commence fetching the next instruction of the program: sending the program counter content to the main memory address buffer register and next, reading the instruction word to the instruction register of computer. When the ongoing instruction is the stop instruction that ends program execution, the control unit enters an operating system state, in which it waits for a next user directive. ## Pros(advantage) of Hardwired Control Unit - Hardwired Control Unit is quick due to the usage of combinational circuits to generate signals. - The amount of delay that can occur in the creation of control signals is dependent on the number of gates. - It can be tweaked to get the fastest mode of operation. - Quicker than a micro-programmed control unit. # Cons(disadvantage) of Hardwired Control Unit - As it require additional control signals to be created, the design becomes morecomplex (need for more encoders or decoders). - Changes to control signals are challenging since they necessitate rearrangingwires in the hardware circuit. - It's difficult and time-consuming to add a new feature. - It's difficult to evaluate and fix flaws in the initial design. # MICRO PROGRAMMED CONTROL METHOD It is programming method to design control unit. It consists of micro program stored in control memory called ROM. Micro program consists of large number of micro instructions. When executing a micro instruction it will generate a appropriate control signal. Microprogrammed control unit of a Basic Computer Next address generator examine the instruction within Instruction Register(IR) and find the address of micro instruction within micro program to create the corresponding control signal. Send the micro instruction address to control address register. Now control address register contain the address of microinstruction. Then the appropriate microinstruction is pick from the control memory (ROM). (Control memory contains the micro program) The micro instruction is then shifted to control data register. The data processor executes the microinstruction to produce the control signal. While executing the micro instruction, next address generator generate the address of the micro instruction needed to execute next. The address of micro instruction stored control address register(micro instruction address register) is decoded then it given as input to control memory(control store) Micro instructions operation part is decoded and produce the control signals. Micro instructions control part and address part is useful to generate the next micro instruction address and provide this information to next address generator. The fundamental difference between these unit structures and the structure of the hardwired control unit is the existence of the control store that is used for storing words containing encoded control signals mandatory for instruction execution. In microprogrammed control units, subsequent instruction words are fetched into the instruction register in a normal way. However, the operation code of each instruction is not directly decoded to enable immediate control signal generation but it comprises the initial address of a microprogram contained in the control store. ## • With a single-level control store: In this, the instruction opcode from the instruction register is sent to the control store address register. Based on this address, the first microinstruction of a microprogram that interprets execution of this instruction is read to the microinstruction register. This microinstruction contains in its operation part encoded control signals, normally as few bit fields. In a set microinstruction field decoders, the fields are decoded. The microinstruction also contains the address of the next microinstruction of the given instruction microprogram and a control field used to control activities of the microinstruction address generator. The last mentioned field decides the addressing mode (addressing operation) to be applied to the address embedded in the ongoing microinstruction. In microinstructions along with conditional addressing mode, this address is refined by using the processor condition flags that represent the status of computations in the current program. The last microinstruction in the instruction of the given microprogram is the microinstruction that fetches the next instruction from the main memory to the instruction register. ## With a two-level control store: In this, in a control unit with a two-level control store, besides the control memory for microinstructions, a nano-instruction memory is included. In such a control unit, microinstructions do not contain encoded control signals. The operation part of microinstructions contains the address of the word in the nano-instruction memory, which contains encoded control signals. The nano-instruction memory contains all combinations of control signals that appear in microprograms that interpret the complete instruction set of a given computer, written once in the form of nano-instructions. In this way, unnecessary storing of the same operation parts of microinstructions is avoided. In this case, microinstruction word can be much shorter than with the single level control store. It gives a much smaller size in bits of the microinstruction memory and, as a result, a much smaller size of the entire control memory. The microinstruction memory contains the control for selection of consecutive microinstructions, while those control signals are generated at the basis of nano-instructions. In nano-instructions, control signals are frequently encoded using 1 bit/ 1 signal method that eliminates decoding. # Micro programmed Control Unit Pros(advantage) - It allows for a more methodical control unit design. - It's easier to troubleshoot and modify. - It can keep the control function's fundamental structure. - It can make the control unit's design easier. As a result, it is less expensive and less prone to errors or glitches. - It has the ability to design in a methodical and ordered manner. - It is used to control software-based functions rather than hardware-based functions. - It's more adaptable. - It is used to do complex functions with ease. # Microprogrammed Control Unit Cons(disadvantage) - Adaptability comes at a higher price. - It is comparatively slower than a control unit that is hardwired. # Differences between hardwired control and micro programmed control | Hardwired control | Micro programmed control | |------------------------------------------|--------------------------------------| | i.It is a hardware based method | It is programming method to design | | to design control unit | control unit | | ii. Here control logic circuits generate | Here micro instructions generate the | | the control signals | control signals | | iii. It is difficult one to modify the | It is easier one to modify | | method of generating control signals | | | iv. Higher speed than micro programmed | It is slower than hardwired control | | control method | | | v. Costlier than micro program method | cheaper than hardwired method | ## **PIPELINING** Pipelining defines the temporal overlapping of processing. Pipelines are emptiness greater than assembly lines in computing that can be used either for instruction processing or, in a more general method, for executing any complex operations. It can be used efficiently only for a sequence of the same task, much similar to assembly lines. A basic pipeline processes a sequence of tasks, including instructions, as per the following principle of operation – Each task is subdivided into multiple successive subtasks as shown in the figure. For instance, the execution of register-register instructions can be broken down into instruction fetch, decode, execute, and writeback. A pipeline phase related to each subtask executes the needed operations. A similar amount of time is accessible in each stage for implementing the needed subtask. All pipeline stages work just as an assembly line that is, receiving their input generally from the previous stage and transferring their output to the next stage. Finally, it can consider the basic pipeline operates clocked, in other words synchronously. This defines that each stage gets a new input at the beginning of the clock cycle, each stage has a single clock cycle available for implementing the needed operations, and each stage produces the result to the next stage by the starting of the subsequent clock cycle. Five phases of instruction execution - 1. Instruction Fetch - 2. Instruction decode - 3. Instruction execution - 4. Memory access - 5. Write back Pipelining take this instruction execution phases into pipelining stages. ## Pipelining stages - i. Instruction Fetch(IF) - ii. Instruction Decode (ID) - iii. Instruction Execution (ALU) - iv. Memory Access (MA) - v. Write Back(WB) Pipelining is used to improve the program execution speed. It never leave the processor components in idle position. Each stage in pipeline is executed in one clock cycle of CPU. **Pipelined Execution** $$\label{eq:time_poly} \mbox{Time between instructions}_{\mbox{pipelined}} = \frac{\mbox{Time between instructions}_{\mbox{nonpipelined}}}{\mbox{Number of pipe stages}}$$ Exact time required to finish each stage of pipeline in MIPS processor is given as follows | Instruction class | Instruction<br>fetch | Register read | ALU operation | Data<br>access | Register<br>write | Total<br>time | |-----------------------------------|----------------------|---------------|---------------|----------------|-------------------|---------------| | Load word (1w) | 200 ps | 100 ps | 200 ps | 200 ps | 100 ps | 800 ps | | Store word (SW) | 200 ps | 100 ps | 200 ps | 200 ps | | 700 ps | | R-format (add, sub, AND, OR, slt) | 200 ps | 100 ps | 200 ps | | 100 ps | 600 ps | | Branch (beq) | 200 ps | 100 ps | 200 ps | | | 500 ps | Following diagram shows time between instruction in non pipelined execution and pipelined execution. Non-Pipelined Execution ## Types of pipelines There are two types of pipelines in computer processing. ## Instruction pipeline The instruction pipeline represents the stages in which an instruction is moved through the various segments of the processor, starting from fetching and then buffering, decoding and executing. One segment reads instructions from the memory, while, simultaneously, previous instructions are executed in other segments. Since these processes happen in an overlapping manner, the <a href="throughput">throughput</a> of the entire system increases. The pipeline's efficiency can be further increased by dividing the instruction cycle into equal-duration segments. Pipeline processing can occur not only in the data stream but in the instruction stream as well. Most of the digital computers with complex instructions require instruction pipeline to carry out operations like fetch, decode and execute instructions. In general, the computer needs to process each instruction with the following sequence of steps. - 1. Fetch instruction from memory. - 2. Decode the instruction. - 3. Calculate the effective address. - 4. Fetch the operands from memory. - 5. Execute the instruction. - 6. Store the result in the proper place. Each step is executed in a particular segment, and there are times when different segments may take different times to operate on the incoming information. Moreover, there are times when two or more segments may require memory access at the same time, causing one segment to wait until another is finished with the memory. The organization of an instruction pipeline will be more efficient if the instruction cycle is divided into segments of equal duration. One of the most common examples of this type of organization is a **Four-segment instruction pipeline**. A **four-segment instruction** pipeline combines two or more different segments and makes it as a single one. For instance, the decoding of the instruction can be combined with the calculation of the effective address into one segment. The following block diagram shows a typical example of a four-segment instruction pipeline. The instruction cycle is completed in four segments. Segment 1: The instruction fetch segment can be implemented using first in, first out (FIFO) buffer. ## Segment 2: The instruction fetched from memory is decoded in the second segment, and eventually, the effective address is calculated in a separate arithmetic circuit. ## Segment 3: An operand from memory is fetched in the third segment. ## Segment 4: The instructions are finally executed in the last segment of the pipeline organization. ## Arithmetic pipeline Arithmetic Pipelines are mostly used in high-speed computers. They are used to implement floating-point operations, multiplication of fixed-point numbers, and similar computations encountered in scientific problems. To understand the concepts of arithmetic pipeline in a more convenient way, let us consider an example of a pipeline unit for floating-point addition and subtraction. The inputs to the floating-point adder pipeline are two normalized floating-point binary numbers defined as: $$X = A * 2^a = 0.9504 * 10^3$$ $Y = B * 2^b = 0.8200 * 10^2$ Where **A** and **B** are two fractions that represent the mantissa and **a** and **b** are the exponents. The combined operation of floating-point addition and subtraction is divided into four segments. Each segment contains the corresponding suboperation to be performed in the given pipeline. The suboperations that are shown in the four segments are: - 1. Compare the exponents by subtraction. - 2. Align the mantissas. - 3. Add or subtract the mantissas. - 4. Normalize the result. The following block diagram represents the suboperations performed in each segment of the pipeline. #### Pipeline organization for floating point addition and subtraction: Note: Registers are placed after each suboperation to store the intermediate results. # 1. Compare exponents by subtraction: The exponents are compared by subtracting them to determine their difference. The larger exponent is chosen as the exponent of the result. The difference of the exponents, i.e., 3 - 2 = 1 determines how many times the mantissa associated with the smaller exponent must be shifted to the right. # 2. Align the mantissas: The mantissa associated with the smaller exponent is shifted according to the difference of exponents determined in segment one. $$X = 0.9504 * 10^3$$ $$Y = 0.08200 * 10^3$$ ## 3. Add mantissas: The two mantissas are added in segment three. $$Z = X + Y = 1.0324 * 10^3$$ ## 4. Normalize the result: After normalization, the result is written as: $$Z = 0.1324 * 10^4$$ ## **Advantages of Pipelining** - The cycle time of the processor is decreased. It can improve the instruction throughput. Pipelining doesn't lower the time it takes to do an instruction. Rather than, it can raise the multiple instructions that can be processed together ("at once") and lower the delay between completed instructions (known as 'throughput'). - If pipelining is used, the CPU Arithmetic logic unit can be designed quicker, but more complex. - Pipelining increases execution over an un-pipelined core by an element of the multiple stages (considering the clock frequency also increases by a similar factor) and the code is optimal for pipeline execution. - Pipelined CPUs frequently work at a higher clock frequency than the RAM clock frequency, (as of 2008 technologies, RAMs operate at a low frequency correlated to CPUs frequencies) increasing the computer's global implementation. # Disadvantages of Pipelining Designing of the pipelined processor is complex. Instruction latency increases in pipelined processors. The throughput of a pipelined processor is difficult to predict. The longer the pipeline, worse the problem of hazard for branch instructions. ## PIPELINE HAZARD There are situations in pipelining when the next instruction cannot execute in the following clock cycle. These events are called *hazards*, and there are three different types. Dependencies between instructions in pipeline is called hazards .Three types - i. Structural hazard - ii. Data hazard - iii. Control hazard ## Structural hazard Hardware dependencies between instructions in pipeline are called as structural hazard. Here hardware means memory, ALU, a register. ## Example: Here instruction 1 and instruction 4 are use the memory at the same. So memory conflict will occur. At a time memory process only one work. Not able to do more than one work at a time (mutual exclusion property). ## Pipeline stall Delay in execution of instruction in pipeline is called pipeline stall. It is also called as bubble –nickname. ## Data hazard Data hazard occur when the pipeline must be stalled because one step must Wait for another to complete. Also called as pipeline data hazard. When a planned instruction can not execute in proper clock cycle because data that is need to execute the instruction is not yet available Consider following two instructions ``` add $s0, $t0, $t1 sub $t2, $s0, $t3 ``` Subtract operation needs the value of S0. But S0 value available write at WB stage. ## **Forwarding** The primary solution is based on the observation that we don't need to wait for the instruction to complete before trying to resolve the data hazard. For the code sequence above, as soon as the ALU creates the sum for the add, we can supply it as an input for the subtract. Adding extra hardware to retrieve the missing item early from the internal resources is called **forwarding** or **bypassing**. Forwarding technique is useful to forward the results from internal hardware units. Forward another example: #### **Control hazard** Control hazard arising from the need to make a decision based on the results of one instruction while others are executing. In the above example , if 1 = 2 then go to the branch address PC=PC + 40x4 But here next load word instruction, 1w \$3, 300(\$0) is fetched from memory. Suppose branch is taken then the pipeline wrongly fetch and execute the load word instruction #### HANDLING DATA HAZARDS & CONTROL HAZARDS Hazards: Prevent the next instruction in the instruction stream from executing during its designated clock cycle. - \* Hazards reduce the performance from the ideal speedup gained by pipelining. 3 classes of hazards: - Ø Structural hazards: arise from resource conflicts when the hardware cannot support all possible combinations of instructions simultaneously in overlapped execution. - Ø Data hazards: arise when an instruction depends on the results of a previous instruction in a way that is exposed by the overlapping of instructions in the pipeline. - Ø Control hazards: arise from the pipelining of branches and other instructions that change the PC. # **Performance of Pipelines with Stalls** \* A stall causes the pipeline performance to degrade from the ideal performance. Speedup from pipelining = [1/(1+pipeline stall cycles per instruction)] \* Pipeline #### **Structural Hazards** - \* When a processor is pipelined, the overlapped execution of instructions requires pipelining of functional units and duplication of resources to allow all possible combinations of instructions in the pipeline. - \* If some combination of instructions cannot be accommodated because of resource conflicts, the processor is said to have a structural hazard. - \* Instances: - § When functional unit is not fully pipelined, Then a sequence of instructions using that unpipelined unit cannot proceed at the rate of one per clock cycle. - § when some resource has not been duplicated enough to allow all combinations of instructions in the pipeline to execute. - \* To Resolve this hazard, - § Stall the the pipeline for 1 clock cycle when the data memory access occurs. A stall is commonly called a pipeline bubble or just bubble, since it floats through the pipeline taking space but carrying no useful work. #### **Data Hazards** - \* A major effect of pipelining is to change the relative timing of instructions by overlapping their execution. This overlap introduces data and control hazards. - \* Data hazards occur when the pipeline changes the order of read/write accesses to operands so that the order differs from the order seen by sequentially executing instructions on an unpipelined processor. # **Minimizing Data Hazard Stalls by Forwarding** - \* The problem solved with a simple hardware technique called forwarding (also called bypassing and sometimes short-circuiting). Forwards works as: - § The ALU result from both the EX/MEM and MEM/WB pipeline registers is always fed back to the ALU inputs. - § If the forwarding hardware detects that the previous ALU operation has written the register corresponding to a source for the current ALU operation, control logic selects the forwarded result as the ALU input rather than the value read from the register file. # **Data Hazards Requiring Stalls** - \* The load instruction has a delay or latency that cannot be eliminated by forwarding alone. Instead, we need to add hardware, called a pipeline interlock, to preserve the correct execution pattern. - \* A pipeline interlock detects a hazard and stalls the pipeline until the hazard is cleared. - \* This pipeline interlock introduces a stall or bubble. The CPI for the stalled instruction increases by the length of the stall. #### **Branch Hazards** - \* Control hazards can cause a greater performance loss for our MIPS pipeline . When a branch is executed, it may or may not change the PC to something other than its current value plus 4. - \* If a branch changes the PC to its target address, it is a taken branch; if it falls through, it is not taken, or untaken. # **Reducing Pipeline Branch Penalties** - \* Simplest scheme to handle branches is to freeze or flush the pipeline, holding or deleting any instructions after the branch until the branch destination is known. - \* A higher-performance, and only slightly more complex, scheme is to treat every branch as not taken, simply allowing the hardware to continue as if the branch were not executed. The complexity of this scheme arises from having to know when the state might be changed by an instruction and how to "back out" such a change. - \* In simple five-stage pipeline, this predicted-not-taken or predicted untaken scheme is implemented by continuing to fetch instructions as if the branch were a normal instruction. - § The pipeline looks as if nothing out of the ordinary is happening. - § If the branch is taken, however, we need to turn the fetched instruction into a no-op and restart the fetch at the target address. - \* An alternative scheme is to treat every branch as taken. As soon as the branch is decoded and the target address is computed, we assume the branch to be taken and begin fetching and executing at the target**Performance of Branch Schemes** Pipeline speedup = Pipeline depth / [1+ Branch frequency × Branch penalty] Pipeline speedup = Pipeline depth 1+ Branch frequency × Branch penalty The branch frequency and branch penalty can have a component from both unconditional and conditional branches. #### **UNIT V** #### MEMORY AND I/O #### MEMORY HIERARCHY IN COMPUTER ARCHITECTURE In the design of the computer system, **a processor**, as well as a large amount of memory devices, has been used. However, the main problem is, these parts are expensive. So the **memory organization** of the system can be done by memory hierarchy. It has several levels of memory with different performance rates. But all these can supply an exact purpose, such that the access time can be reduced. The memory hierarchy was developed depending upon the behavior of the program. This article discusses an overview of the memory hierarchy in computer architecture. #### What is Memory Hierarchy? The memory in a computer can be divided into five hierarchies based on the speed as well as use. The processor can move from one level to another based on its requirements. The five hierarchies in the memory are registers, cache, main memory, magnetic discs, and magnetic tapes. The first three hierarchies are volatile memories which mean when there is no power, and then automatically they lose their stored data. Whereas the last two hierarchies are not volatile which means they store the data permanently. A memory element is the set of <u>storage devices</u> which stores the binary data in the type of bits. In general, <u>the storage of memory</u> can be classified into two categories such as volatile as well as non-volatile. #### **Memory Hierarchy in Computer Architecture** The **memory hierarchy design** in a computer system mainly includes different storage devices. Most of the computers were inbuilt with extra storage to run more powerfully beyond the main memory capacity. The following **memory hierarchy diagram** is a hierarchical pyramid for computer memory. The designing of the memory hierarchy is divided into two types such as primary (Internal) memory and secondary (External) memory. Memory Hierarchy #### **Primary Memory** The primary memory is also known as internal memory, and this is accessible by the processor straightly. This memory includes main, cache, as well as CPU registers. #### **Secondary Memory** The secondary memory is also known as external memory, and this is accessible by the processor through an input/output module. This memory includes an optical disk, magnetic disk, and magnetic tape. #### **Characteristics of Memory Hierarchy** The memory hierarchy characteristics mainly include the following. #### Performance Previously, the designing of a computer system was done without memory hierarchy, and the speed gap among the main memory as well as the CPU registers enhances because of the huge disparity in access time, which will cause the lower performance of the system. So, the enhancement was mandatory. The enhancement of this was designed in the memory hierarchy model due to the system's performance increase. #### **Ability** The ability of the memory hierarchy is the total amount of data the memory can store. Because whenever we shift from top to bottom inside the memory hierarchy, then the capacity will increase. #### **Access Time** The access time in the memory hierarchy is the interval of the time among the data availability as well as request to read or write. Because whenever we shift from top to bottom inside the memory hierarchy, then the access time will increase #### Cost per bit When we shift from bottom to top inside the memory hierarchy, then the cost for each bit will increase which means an internal Memory is expensive compared with external memory. #### **Memory Hierarchy Design** The memory hierarchy in computers mainly includes the following. #### **Registers** Usually, the register is a static RAM or SRAM in the processor of the computer which is used for holding the data word which is typically 64 or 128 bits. The program counter <u>register is the most important</u> as well as found in all the processors. Most of the processors use a status word register as well as an accumulator. A status word register is used for decision making, and the accumulator is used to store the data like mathematical operation. Usually, computers like **complex instruction set computers** have so many registers for accepting main memory, and **RISC- reduced instruction set** computers have more registers. #### **Cache Memory** Cache memory can also be found in the processor, however rarely it may be another **IC** (**integrated circuit**) which is separated into levels. The cache holds the chunk of data which are frequently used from main memory. When the processor has a single core then it will have two (or) more cache levels rarely. Present multi-core processors will be having three, 2-levels for each one core, and one level is shared. #### **Main Memory** The main memory in the computer is nothing but, the memory unit in the CPU that communicates directly. It is the main storage unit of the computer. This memory is fast as well as large memory used for storing the data throughout the operations of the computer. This memory is made up of RAM as well as ROM. #### **Magnetic Disks** The magnetic disks in the computer are circular plates fabricated of plastic otherwise metal by magnetized material. Frequently, two faces of the disk are utilized as well as many disks may be stacked on one spindle by read or write heads obtainable on every plane. All the disks in computer turn jointly at high speed. The tracks in the computer are nothing but bits which are stored within the magnetized plane in spots next to concentric circles. These are usually separated into sections which are named as sectors. #### **Magnetic Tape** This tape is a normal magnetic recording which is designed with a slender magnetizable covering on an extended, plastic film of the thin strip. This is mainly used to back up huge data. Whenever the computer requires to access a strip, first it will mount to access the data. Once the data is allowed, then it will be un mounted. The access time of memory will be slower within magnetic strip as well as it will take a few minutes for accessing a strip. # **Advantages of Memory Hierarchy** The need for a memory hierarchy includes the following. - Memory distributing is simple and economical - Removes external destruction - Data can be spread all over - Permits demand paging & pre-paging - Swapping will be more proficient Thus, this is all about **memory hierarchy**. From the above information, finally, we can conclude that it is mainly used to decrease the bit cost, access frequency, and to increase the capacity, access time. #### **MEMORY MANAGEMENT** #### What do you mean by memory management? Memory is the important part of the computer that is used to store the data. Its management is critical to the computer system because the amount of main memory available in a computer system is very limited. At any time, many processes are competing for it. Moreover, to increase performance, several processes are executed simultaneously. For this, we must keep several processes in the main memory, so it is even more important to manage them effectively. Memory management plays several roles in a computer system. - Memory manager is used to keep track of the status of memory locations, whether it is free or allocated. It addresses primary memory by providing abstractions so that software perceives a large memory is allocated to it. - Memory manager permits computers with a small amount of main memory to execute programs larger than the size or amount of available memory. It does this by moving information back and forth between primary memory and secondary memory by using the concept of swapping. - The memory manager is responsible for protecting the memory allocated to each process from being corrupted by another process. If this is not ensured, then the system may exhibit unpredictable behavior. - Memory managers should enable sharing of memory space between processes. Thus, two programs can reside at the same memory location although at different times. #### **Memory management Techniques:** The Memory management Techniques can be classified into following main categories: - Contiguous memory management schemes - Non-Contiguous memory management schemes Classification of memory management schemes Contiguous memory management schemes: In a Contiguous memory management scheme, each program occupies a single contiguous block of storage locations, i.e., a set of memory locations with consecutive addresses. Single contiguous memory management schemes: The Single contiguous memory management scheme is the simplest memory management scheme used in the earliest generation of computer systems. In this scheme, the main memory is divided into two contiguous areas or partitions. The operating systems reside permanently in one partition, generally at the lower memory, and the user process is loaded into the other partition. #### Advantages of Single contiguous memory management schemes: - o Simple to implement. - Easy to manage and design. - o In a Single contiguous memory management scheme, once a process is loaded, it is given full processor's time, and no other processor will interrupt it. #### Disadvantages of Single contiguous memory management schemes: - Wastage of memory space due to unused memory as the process is unlikely to use all the available memory space. - The CPU remains idle, waiting for the disk to load the binary image into the main memory. - o It can not be executed if the program is too large to fit the entire available main memory space. - o It does not support multiprogramming, i.e., it cannot handle multiple programs simultaneously. #### Multiple Partitioning: The single Contiguous memory management scheme is inefficient as it limits computers to execute only one program at a time resulting in wastage in memory space and CPU time. The problem of inefficient CPU use can be overcome using multiprogramming that allows more than one program to run concurrently. To switch between two processes, the operating systems need to load both processes into the main memory. The operating system needs to divide the available main memory into multiple parts to load multiple processes into the main memory. Thus multiple processes can reside in the main memory simultaneously. #### The multiple partitioning schemes can be of two types: - Fixed Partitioning - o Dynamic Partitioning #### **Fixed Partitioning** The main memory is divided into several fixed-sized partitions in a fixed partition memory management scheme or static partitioning. These partitions can be of the same size or different sizes. Each partition can hold a single process. The number of partitions determines the degree of multiprogramming, i.e., the maximum number of processes in memory. These partitions are made at the time of system generation and remain fixed after that. #### **Advantages of Fixed Partitioning memory management schemes:** - Simple to implement. - o Easy to manage and design. #### Disadvantages of Fixed Partitioning memory management schemes: - o This scheme suffers from internal fragmentation. - o The number of partitions is specified at the time of system generation. ### **Dynamic Partitioning** The dynamic partitioning was designed to overcome the problems of a fixed partitioning scheme. In a dynamic partitioning scheme, each process occupies only as much memory as they require when loaded for processing. Requested processes are allocated memory until the entire physical memory is exhausted or the remaining space is insufficient to hold the requesting process. In this scheme the partitions used are of variable size, and the number of partitions is not defined at the system generation time. #### Advantages of Dynamic Partitioning memory management schemes: - Simple to implement. - Easy to manage and design. #### Disadvantages of Dynamic Partitioning memory management schemes: - o This scheme also suffers from internal fragmentation. - o The number of partitions is specified at the time of system segmentation. #### Non-Contiguous memory management schemes: In a Non-Contiguous memory management scheme, the program is divided into different blocks and loaded at different portions of the memory that need not necessarily be adjacent to one another. This scheme can be classified depending upon the size of blocks and whether the blocks reside in the main memory or not. #### What is paging? Paging is a technique that eliminates the requirements of contiguous allocation of main memory. In this, the main memory is divided into fixed-size blocks of physical memory called frames. The size of a frame should be kept the same as that of a page to maximize the main memory and avoid external fragmentation. #### **Advantages of paging:** - Pages reduce external fragmentation. - o Simple to implement. - Memory efficient. - Due to the equal size of frames, swapping becomes very easy. - o It is used for faster access of data. #### What is Segmentation? Segmentation is a technique that eliminates the requirements of contiguous allocation of main memory. In this, the main memory is divided into variable-size blocks of physical memory called segments. It is based on the way the programmer follows to structure their programs. With segmented memory allocation, each job is divided into several segments of different sizes, one for each module. Functions, subroutines, stack, array, etc., are examples of such modules. #### **CACHE MEMORY** Cache Memory is a special very high-speed memory. It is used to speed up and synchronize with high-speed CPU. Cache memory is costlier than main memory or disk memory but more economical than CPU registers. Cache memory is an extremely fast memory type that acts as a buffer between RAM and the CPU. It holds frequently requested data and instructions so that they are immediately available to the CPU when needed. Cache memory is used to reduce the average time to access data from the Main memory. The cache is a smaller and faster memory that stores copies of the data from frequently used main memory locations. There are various different independent caches in a CPU, which store instructions and data. #### Levels of memory: - Level 1 or Register It is a type of memory in which data is stored and accepted that are immediately stored in CPU. Most commonly used register is accumulator, Program counter, address register etc. - Level 2 or Cache memory It is the fastest memory which has faster access time where data is temporarily stored for faster access. - Level 3 or Main Memory It is memory on which computer works currently. It is small in size and once power is off data no longer stays in this memory. - Level 4 or Secondary Memory It is external memory which is not as fast as main memory but data stays permanently in this memory. **Cache Performance:** When the processor needs to read or write a location in main memory, it first checks for a corresponding entry in the cache. • If the processor finds that the memory location is in the cache, a **cache hit** has occurred and data is read from the cache. • If the processor **does not** find the memory location in the cache, a **cache miss** has occurred. For a cache miss, the cache allocates a new entry and copies in data from main memory, then the request is fulfilled from the contents of the cache. The performance of cache memory is frequently measured in terms of a quantity called **Hit ratio.** Hit ratio = hit / (hit + miss) = no. of hits/total accesses We can improve Cache performance using higher cache block size, and higher associativity, reduce miss rate, reduce miss penalty, and reduce the time to hit in the cache. **Cache Mapping:** There are three different types of mapping used for the purpose of cache memory which is as follows: Direct mapping, Associative mapping, and Set-Associative mapping. These are explained below. #### A. Direct Mapping The simplest technique, known as direct mapping, maps each block of main memory into only one possible cache line. or In Direct mapping, assign each memory block to a specific line in the cache. If a line is previously taken up by a memory block when a new block needs to be loaded, the old block is trashed. An address space is split into two parts index field and a tag field. The cache is used to store the tag field whereas the rest is stored in the main memory. Direct mapping's performance is directly proportional to the Hit ratio. i = j modulo m where i=cache line number j= main memory block number m=number of lines in the cache 1. For purposes of cache access, each main memory address can be viewed as consisting of three fields. The least significant w bits identify a unique word or byte within a block of main memory. In most contemporary machines, the address is at the byte level. The remaining s bits specify one of the 2<sup>s</sup> blocks of main memory. The cache logic interprets these s bits as a tag of s-r bits (most significant portion) and a line field of r bits. This latter field identifies one of the m=2<sup>r</sup> lines of the cache. Line offset is index bits in the #### **B.** Associative Mapping In this type of mapping, the associative memory is used to store content and addresses of the memory word. Any block can go into any line of the cache. This means that the word id bits are used to identify which word in the block is needed, but the tag becomes all of the remaining bits. This enables the placement of any word at any place in the cache memory. It is considered to be the fastest and the most flexible mapping form. In associative mapping the index bits are zero. #### C. Set-associative Mapping This form of mapping is an enhanced form of direct mapping where the drawbacks of direct mapping are removed. Set associative addresses the problem of possible thrashing in the direct mapping method. It does this by saying that instead of having exactly one line that a block can map to in the cache, we will group a few lines together creating a *set*. Then a block in memory can map to any one of the lines of a specific set. Set-associative mapping allows that each word that is present in the cache can have two or more words in the main memory for the same index address. Set associative cache mapping combines the best of direct and associative cache mapping techniques. In set associative mapping the index bits are given by the set offset bits. In this case, the cache consists of a number of sets, each of which consists of a number of lines. The relationships are m = v \* k $i = j \mod v$ where i=cache set number j=main memory block number v=number of sets m=number of lines in the cache number of sets k=number of lines in each set #### **Application of Cache Memory:** - 1. Usually, the cache memory can store a reasonable number of blocks at any given time, but this number is small compared to the total number of blocks in the main memory. - 2. The correspondence between the main memory blocks and those in the cache is specified by a mapping function. - 3. **Primary Cache** A primary cache is always located on the processor chip. This cache is small and its access time is comparable to that of processor registers. - 4. **Secondary Cache** Secondary cache is placed between the primary cache and the rest of the memory. It is referred to as the level 2 (L2) cache. Often, the Level 2 cache is also housed on the processor chip. - 5. **Spatial Locality of reference** This says that there is a chance that the element will be present in close proximity to the reference point and next time if again searched then more close proximity to the point of reference. - 6. **Temporal Locality of reference** In this Least recently used algorithm will be used. Whenever there is page fault occurs within a word will not only load word in main memory but complete page fault will be loaded because the spatial locality of reference rule says that if you are referring to any word next word will be referred to in its register that's why we load complete page table so the complete block will be loaded. #### **Cache Organization** Cache is close to CPU and faster than main memory. But at the same time is smaller than main memory. The cache organization is about mapping data in memory to a location in cache. A Simple Solution: One way to go about this mapping is to consider last few bits of long memory address to find small cache address, and place them at the found address. Problems With Simple Solution: The problem with this approach is, we lose the information about high order bits and have no way to find out the lower order bits belong to which higher order bits. **Solution is Tag:** To handle above problem, more information is stored in cache to tell which block of memory is stored in cache. We store additional information as **Tag** What is a Cache Block? Since programs have Spatial Locality (Once a location is retrieved, it is highly probable that the nearby locations would be retrieved in near future). So a cache is organized in the form of blocks. Typical cache block sizes are 32 bytes or 64 bytes. # The above arrangement is Direct Mapped Cache and it has following problem We have discussed above that last few bits of memory addresses are being used to address in cache and remaining bits are stored as tag. Now imagine that cache is very small and addresses of 2 bits. Suppose we use the last two bits of main memory address to decide the cache (as shown in below diagram). So if a program accesses 2, 6, 2, 6, 2, ..., every access would cause a hit as 2 and 6 have to be stored in same location in cache. **Solution to above problem – Associativity** What if we could store data at any place in cache, the above problem won't be there? That would slow down cache, so we do something in between. #### **VIRTUAL MEMORY** #### Virtual Memory Virtual Memory is a storage scheme that provides user an illusion of having a very big main memory. This is done by treating a part of secondary memory as the main memory. In this scheme, User can load the bigger size processes than the available main memory by having the illusion that the memory is available to load the process. Instead of loading one big process in the main memory, the Operating System loads the different parts of more than one process in the main memory. By doing this, the degree of multiprogramming will be increased and therefore, the CPU utilization will also be increased. #### How Virtual Memory Works? In modern word, virtual memory has become quite common these days. In this scheme, whenever some pages needs to be loaded in the main memory for the execution and the memory is not available for those many pages, then in that case, instead of stopping the pages from entering in the main memory, the OS search for the RAM area that are least used in the recent times or that are not referenced and copy that into the secondary memory to make the space for the new pages in the main memory. Since all this procedure happens automatically, therefore it makes the computer feel like it is having the unlimited RAM. #### **Demand Paging** Demand Paging is a popular method of virtual memory management. In demand paging, the pages of a process which are least used, get stored in the secondary memory. A page is copied to the main memory when its demand is made or page fault occurs. There are various page replacement algorithms which are used to determine the pages which will be replaced. We will discuss each one of them later in detail. #### Snapshot of a virtual memory management system Let us assume 2 processes, P1 and P2, contains 4 pages each. Each page size is 1 KB. The main memory contains 8 frame of 1 KB each. The OS resides in the first two partitions. In the third partition, 1<sup>st</sup> page of P1 is stored and the other frames are also shown as filled with the different pages of processes in the main memory. The page tables of both the pages are 1 KB size each and therefore they can be fit in one frame each. The page tables of both the processes contain various information that is also shown in the image. The CPU contains a register which contains the base address of page table that is 5 in the case of P1 and 7 in the case of P2. This page table base address will be added to the page number of the Logical address when it comes to accessing the actual corresponding entry. #### Advantages of Virtual Memory - 1. The degree of Multiprogramming will be increased. - 2. User can run large application with less real RAM. - 3. There is no need to buy more memory RAMs. #### Disadvantages of Virtual Memory - 1. The system becomes slower since swapping takes time. - 2. It takes more time in switching between applications. - 3. The user will have the lesser hard disk space for its use. #### **DIRECT MEMORY ACCESS** For the execution of a computer program, it requires the synchronous working of more than one component of a computer. For example, <u>Processors</u> – providing necessary control information, addresses...etc, buses – to transfer information and data to and from memory to I/O devices...etc. The interesting factor of the system would be the way it handles the transfer of information among processor, memory and I/O devices. Usually, processors control all the process of transferring data, right from initiating the transfer to the storage of data at the destination. This adds load on the processor and most of the time it stays in the ideal state, thus decreasing the efficiency of the system. To speed up the transfer of data between I/O devices and memory, DMA controller acts as station master. DMA controller transfers data with minimal intervention of the processor. #### What is a DMA Controller? The term DMA stands for direct memory access. The hardware device used for direct memory access is called the DMA controller. DMA controller is a control unit, part of I/O device's interface circuit, which can transfer blocks of data between I/O devices and main memory with minimal intervention from the processor. #### **DMA Controller Diagram in Computer Architecture** DMA controller provides an interface between the bus and the input-output devices. Although it transfers data without intervention of processor, it is controlled by the processor. The processor initiates the DMA controller by sending the starting address, Number of words in the data block and direction of transfer of data .i.e. from I/O devices to the memory or from main memory to I/O devices. More than one external device can be connected to the DMA controller. DMA controller contains an address unit, for generating addresses and selecting I/O device for transfer. It also contains the control unit and data count for keeping counts of the number of blocks transferred and indicating the direction of transfer of data. When the transfer is completed, DMA informs the processor by raising an interrupt. The typical block diagram of the DMA controller is shown in the figure below. # **Working of DMA Controller** DMA controller has to share the bus with the processor to make the data transfer. The device that holds the bus at a given time is called bus master. When a transfer from I/O device to the memory or vice verse has to be made, the processor stops the execution of the current program, increments the program counter, moves data over stack then sends a DMA select signal to DMA controller over the address bus. If the DMA controller is free, it requests the control of bus from the processor by raising the bus request signal. Processor grants the bus to the controller by raising the bus grant signal, now DMA controller is the bus master. The processor initiates the DMA controller by sending the memory addresses, number of blocks of data to be transferred and direction of data transfer. After assigning the data transfer task to the DMA controller, instead of waiting ideally till completion of data transfer, the processor resumes the execution of the program after retrieving instructions from the stack. MA controller now has the full control of buses and can interact directly with memory and I/O devices independent of CPU. It makes the data transfer according to the control instructions received by the processor. After completion of data transfer, it disables the bus request signal and CPU disables the bus grant signal thereby moving control of buses to the CPU. When an I/O device wants to initiate the transfer then it sends a DMA request signal to the DMA controller, for which the controller acknowledges if it is free. Then the controller requests the processor for the bus, raising the bus request signal. After receiving the bus grant signal it transfers the data from the device. For n channeled DMA controller n number of external devices can be connected. The DMA transfers the data in three modes which include the following. - a) **Burst Mode**: In this mode DMA handover the buses to CPU only after completion of whole data transfer. Meanwhile, if the CPU requires the bus it has to stay ideal and wait for data transfer. - b) **Cycle Stealing Mode**: In this mode, DMA gives control of buses to CPU after transfer of every byte. It continuously issues a request for bus control, makes the transfer of one byte and returns the bus. By this CPU doesn't have to wait for a long time if it needs a bus for higher priority task. - c) **Transparent Mode:** Here, DMA transfers data only when CPU is executing the instruction which does not require the use of buses. #### **Advantages and Disadvantages of DMA Controller** The advantages and disadvantages of DMA controller include the following. #### **Advantages** - DMA speedups the memory operations by bypassing the involvement of the CPU. - The work overload on the CPU decreases. - For each transfer, only a few numbers of clock cycles are required #### **Disadvantages** - Cache coherence problem can be seen when DMA is used for data transfer. - Increases the price of the system. DMA (<u>Direct Memory Access</u>) controller is being used in graphics cards, network cards, sound cards etc... DMA is also used for intra-chip transfer in multi-core processors. Operating in one of its three modes, DMA can considerably reduce the load of the processor. #### INTRODUCTION TO INPUT- OUTPUT INTERFACE: Input -Output Interface is used as an method which helps in transferring of information between the internal storage devices i.e. memory and the external peripheral device. A peripheral device is that which provide input and output for the computer, it is also called Input-Output devices. For Example: A keyboard and mouse provide Input to the computer are called input devices while a monitor and printer that provide output to the computer are called output devices. Just like the external hard-drives, there is also availability of some peripheral devices which are able to provide both input and output. Input-Output Interface In micro-computer base system, the only purpose of peripheral devices is just to provide **special communication links** for the interfacing them with the CPU. To resolve the differences between peripheral devices and CPU, there is a special need for communication links. The major differences are as follows: - 1. The nature of peripheral devices is electromagnetic and electro-mechanical. The nature of the CPU is electronic. There is a lot of difference in the mode of operation of both peripheral devices and CPU. - 2. There is also a synchronization mechanism because the data transfer rate of peripheral devices are slow than CPU. - 3. In peripheral devices, data code and formats are differ from the format in the CPU and memory. - 4. The operating mode of peripheral devices are different and each may be controlled so as not to disturb the operation of other peripheral devices connected to CPU. There is a special need of the additional hardware to resolve the differences between CPU and peripheral devices to supervise and synchronize all input and output devices. #### **Functions of Input-Output Interface:** - 1. It is used to synchronize the operating speed of CPU with respect to input-output devices. - 2. It selects the input-output device which is appropriate for the interpretation of the input-output device. - 3. It is capable of providing signals like control and timing signals. - 4. In this data buffering can be possible through data bus. - 5. There are various error detectors. - 6. It converts serial data into parallel data and vice-versa. - 7. It also convert digital data into analog signal and vice-versa. #### I/O Interface (Interrupt and DMA Mode) The method that is used to transfer information between internal storage and external I/O devices is known as I/O interface. The CPU is interfaced using special communication links by the peripherals connected to any computer system. These communication links are used to resolve the differences between CPU and peripheral. There exists special hardware components between CPU and peripherals to supervise and synchronize all the input and output transfers that are called interface units. #### **Mode of Transfer:** The binary information that is received from an external device is usually stored in the memory unit. The information that is transferred from the CPU to the external device is originated from the memory unit. CPU merely processes the information but the source and target is always the memory unit. Data transfer between CPU and the I/O devices may be done in different modes. Data transfer to and from the peripherals may be done in any of the three possible ways - 1. Programmed I/O. - 2. Interrupt- initiated I/O. - 3. Direct memory access( DMA). Now let's discuss each mode one by one. - 1. **Programmed I/O:** It is due to the result of the I/O instructions that are written in the computer program. Each data item transfer is initiated by an instruction in the program. Usually the transfer is from a CPU register and memory. In this case it requires constant monitoring by the CPU of the peripheral devices. - **Example of Programmed I/O:** In this case, the I/O device does not have direct access to the memory unit. A transfer from I/O device to memory requires the execution of several instructions by the CPU, including an input instruction to transfer the data from device to the CPU and store instruction to transfer the data from CPU to memory. In programmed I/O, the CPU stays in the program loop until the I/O unit indicates that it is ready for data transfer. This is a time consuming process since it needlessly keeps the CPU busy. This situation can be avoided by using an interrupt facility. This is discussed below. - 2. **Interrupt- initiated I/O:** Since in the above case we saw the CPU is kept busy unnecessarily. This situation can very well be avoided by using an interrupt driven method for data transfer. By using interrupt facility and special commands to inform the interface to issue an interrupt request signal whenever data is available from any device. In the meantime the CPU can proceed for any other program execution. The interface meanwhile keeps monitoring the device. Whenever it is determined that the device is ready for data transfer it initiates an interrupt request signal to the computer. Upon detection of an external interrupt signal the CPU stops momentarily the task that it was already performing, branches to the service program to process the I/O transfer, and then return to the task it was originally performing. **Note:** Both the methods programmed I/O and Interrupt-driven I/O require the active intervention of the processor to transfer data between memory and the I/O module, and any data transfer must transverse - a path through the processor. Thus both these forms of I/O suffer from two inherent drawbacks. - The I/O transfer rate is limited by the speed with which the processor can test and service a device. - The processor is tied up in managing an I/O transfer; a number of instructions must be executed for each I/O transfer. 3. **Direct Memory Access**: The data transfer between a fast storage media such as magnetic disk and memory unit is limited by the speed of the CPU. Thus we can allow the peripherals directly communicate with each other using the memory buses, removing the intervention of the CPU. This type of data transfer technique is known as DMA or direct memory access. During DMA the CPU is idle and it has no control over the memory buses. The DMA controller takes over the buses to manage the transfer directly between the I/O devices and the memory unit. High Impedance (disable) when BG is enable Figure - CPU Bus Signals for DMA Transfer **Bus Request :** It is used by the DMA controller to request the CPU to relinquish the control of the buses. **Bus Grant :** It is activated by the CPU to Inform the external DMA controller that the buses are in high impedance state and the requesting DMA can take control of the buses. Once the DMA has taken the control of the buses it transfers the data. This transfer can take place in many ways. # Types of DMA transfer using DMA controller: BurstTransfer DMA returns the bus after complete data transfer. A register is used as a byte count, being decremented for each byte transfer, and upon the byte count reaching zero, the DMAC will release the bus. When the DMAC operates in burst mode, the CPU is halted for the duration of the data transfer. Steps involved are: - 1. Bus grant request time. - 2. Transfer the entire block of data at transfer rate of device because the device is usually slow than the speed at which the data can be transferred to CPU. - 3. Release the control of the bus back to CPU So, total time taken to transfer the N bytes = Bus grant request time + (N) \* (memory transfer rate) + Bus release control time. Where, X μsec =data transfer time or preparation time (words/block) Y µsec =memory cycle time or cycle time or transfer time (words/block) % CPU idle (Blocked)=(Y/X+Y)\*100 % CPU Busy=(X/X+Y)\*100 #### **CyclicStealing:** An alternative method in which DMA controller transfers one word at a time after which it must return the control of the buses to the CPU. The CPU delays its operation only for one memory cycle to allow the direct memory I/O transfer to "steal" one memory cycle. Steps Involved are: - 4. Buffer the byte into the buffer - 5. Inform the CPU that the device has 1 byte to transfer (i.e. bus grant request) - 6. Transfer the byte (at system bus speed) - 7. Release the control of the bus back to CPU. Before moving on transfer next byte of data, device performs step 1 again so that bus isn't tied up and the transfer won't depend upon the transfer rate of device. So, for 1 byte of transfer of data, time taken by using cycle stealing mode (T). = time required for bus grant + 1 bus cycle to transfer data + time required to release the bus, it will be NxT In cycle stealing mode we always follow pipelining concept that when one byte is getting transferred then Device is parallel preparing the next byte. "The fraction of CPU time to the data transfer time" if asked then cycle stealing mode is used. Where. X μsec =data transfer time or preparation time (words/block) Y μsec =memory cycle time or cycle time or transfer time (words/block) % CPU idle (Blocked) = (Y/X)\*100 % CPU busy=(X/Y)\*100 **Interleaved mode:** In this technique, the DMA controller takes over the system bus when the microprocessor is not using it.An alternate half cycle i.e. half cycle DMA + half cycle processor. #### **ACCESSING I/O DEVICE** We can access the i/o device in two interfaces. - i. Serial interface - ii. Parallel interface The main difference between the serial and parallel interfaces is how they transmit data. In serial interface the data is sent or received one bit at a time over a series of clock pulses. In parallel mode the interface sends and receives 4 bits, 8 bits, or 16 bits of dataat a time over multiple transmission lines. These two interface modes will be explained in further detail below. #### Serial interface Serial interface send a byte by bit by bit in serial manner with defined clockpulse width (with parity). Serial interfaces consist of 3 types each with their own pins: - 1. I2C (Inter-Integrated Circuit): Serial Data In and Serial Clock - 2. 3/4-wire SPI (Serial Peripheral Interface): Consists of Serial Data Out, SerialData In, Serial Clock and an additional Chip Select pin for the 4-wire SPI - 3. Serial synchronous control and data lines: Serial Data In, Register Select,Reset, and Serial Clock Below is the serial interface connection example. | No. | Symbol | Description | Notes | |-----|---------|---------------------------------------------------|--------------------| | 1 | SCL | Serial Clock | Output from master | | 2 | CS | Chip Select, Low is active | Control Line | | 3 | SDI/SDA | Serial Data In | Data lines | | 4 | SDO | Serial Data Out | | | 5 | A0 | Register Select. 0: instruction, 1: data register | | | 6 | RES | External Reset, Low is active | | Serial Interface Pros: - -Less data pins - -Cheaper - -Easy setup #### **Parallel Interface** The parallel interface transmits 8-bits, or one byte, of data over multiple data bus lines over one clock pulse. This makes parallel transmission faster than serial but istypically more expensive and requires more data pins to be connected. The parallelinterface consists of 8 data pins and 3 control pins. The control pins are typically labeled: Register Select (RS), Enable (E), and Read/Write (R/W). Additional common parallel interface pins may include: Contrast adjust (V0), Chip Select (CS) and Parallel interface consists of 2 standard types: - 1. 8080 type: parallel 4-bit/8-bit data input with a write and a read line - 2. 6800 type: parallel 4/8-bit data input with write, read and enable lines #### Parallel Interface Pros - -Faster data transmission - -High performance #### INTERCONNECTION STANDARDS There are two interconnection standards - i. USB - ii. SATA (Serial ATA) #### **Universal Serial Bus (USB)** The **universal serial bus (USB)** is a standard interface for connecting a wide range of devices to the computer such as keyboard, mouse, smartphones, speakers, cameras etc. The USB was introduced for commercial use in the year 1995 at that time it has a data transfer speed of 12 megabits/s. With some improvement, a modified USB 2 was introduced which is also called a *highspeed USB* that transfers data at 480 megabits/s. With the evolution of I/O devices that require highspeed data transfer also leads to the development of USB 3 which is also referred to as *Superspeed USB* which transfers data at 5 gigabits/s. The recent version of USB can transfer data up to 20 gigabits/s. Key Objectives of Universal Serial Bus - The developed USB must be *simple* and a *low-cost* interconnection system that should be *easy* to use. - The developed USB must be compatible with all new I/O devices, their bit rates, internet connections and audio, video application. - The USB must support a *plug-and-play* mode of operation. - The USB must support *low power* implementation. - The USB must also provide support for *legacy hardware* and *software*. #### **USB** Architecture When multiple I/O devices are connected to the computer through USB they all are organized in a tree structure. Each I/O device makes a *point-to-point* connection and transfers data using the *serial transmission format* we have discussed serial transmission in our previous content 'interface circuit'. A tree structure has a **root, nodes** and **leaves.** The tree structure connecting I/O devices to the computer using USB has nodes which are also referred to as a **hub**. Hub is the intermediatory connecting point between the I/O devices and the computer. Every tree has a root here, it is referred to as the **root hub** which connects the entire tree to the hosting computer. The leaves of the tree here are nothing but the I/O devices such as a mouse, keyboard, camera, speaker. The USB works on the principle of polling. In **polling**, the processor keeps on checking whether the I/O device is ready for data transfer or not. So, the devices do not have to inform the processor about any of their statuses. It is the processor's responsibility to keep a check. This makes the USB simple and low cost. Whenever a new device is connected to the hub it is addressed as 0. Now at a regular interval the host computer polls all the hubs to get their status which lets the host know of I/O devices that are either detached from the system or are attached to the system. When the host becomes aware of the new device it gets to know about the capabilities of the device by reading the information present in the special memory of the device's USB interface. So that the host can use the appropriate device driver to communicate with the device. The host then assigns an address to this new device, this address is written to the register of the device interface register. With this mechanism, USB serves plug-and-play capability. The **plug and play** feature let the host recognize the existence of the new I/O device automatically when the device is plugged in. The host software determines the capabilities of the I/O devices and if it has any special requirement. The USB is **hot-pluggable** which means the I/O device can be attached or removed from the host system without performing any restart or shutdown. That means your system can keep running while the I/O device is plugged or removed. # **Types of USB Connectors** The USB has different types of ports and connectors. Usually, the upstream port and connector are always the USB type A the downstream port and connector differ depending on the type of device connected. We will discuss all types of the USB connector. **USB Type A:** This is the standard connector that can be found at one end of the USB cable and is also known as upstream. It has a flat structure and has four connecting lines as you can see in the image below. USB Type A **USB Type B:** This is an older standard cable and was used to connect the peripheral devices also referred to as downstream. It is approximately a square as you can see in the image below. This is now been replaced by the newer versions. USB Type B **Mini USB:** This type of USB is compatible with mobile devices. This type of USB is now superseded your micro-USB still you will get it on some devices. **Micro USB:** This type of USB is found on newer mobile devices. It has a compact 5 pin design. **USB Type C:** This type of USB is used for transferring both data and power to the attached peripheral or I/O device. The USB C does not have a fixed orientation as it is reversible i.e. you can plug it upside down or in reverse. **USB 3.0 Micro B:** This USB is a superspeed USB. This USB is used for a device that requires high-speed data transfer. You can find this kind of USB on portable hard drives. #### **Electrical Characteristics of USB** The standard USB has four lines of connection among which two carry power (one carry +5 V and one is for Ground). The other two lines of connection are for data transfer. USB also supply power to connected I/O device that requires very low power. Transferring of data over USB can be divided into two categories i.e., transferring data at low speed and transferring data at high speed. The low-speed transmission uses **single-ended signalling** where varying high voltage is transmitted over one of the two data lines to represent the signal bit 0 or 1. The other data line is connected to the reference voltage i.e., ground. The single-ended signalling is prone to noise. The high-speed data transmission uses the approach **differential signalling**. Here, the signal is transmitted over the two data lines that are twisted together. Here both the data lines are involved in carrying the signal no ground wire is required. The differential signalling is not prone to noise and uses low voltages as compared to single-ended transmission. #### **SATA** SATA is an interface that connects various storage devices such as hard disks, optical drives, SSD's, etc to the motherboard. SATA was introduced in the year 2000 to replace the long-standing PATA (Parallel ATA) interface. We all know, in serial mode, data is transferred bit by bit and in parallel, there are several streams that carry the data. Despite knowing this fact, there is a drawback in PATA. PATA is highly susceptible to outside interferences and hence allows SATA to operate at high speeds than PATA. SATA cables are thinner, more flexible and compact as compared to the PATA cables. There were several industry groups that began their development in SATA late in the 2000s. It was only in the year 2003 that SATA-IO (SATA International Organization) was formed and it laid out the first SATA specifications. # A SATA controller is a device that is used to connect the computer's motherboard to the storage drives. SATA operates on two modes – - 1. **IDE mode:** IDE stands for Integrated Drive Electronics. This is a mode which is used to provide backward compatibility with older hardware, which runs on PATA, at the cost of low performance. - 2. **AHCI mode:** AHCI is abbreviation for Advanced Host Controller Interface. AHCI is a high-performance mode that also provides support for hot-swapping. #### Characteristics of SATA - Low Voltage Requirement: SATA operates on 500mV (0.5V) peak-to-peak signaling. This help in promoting a much low interference and crosstalk between conductors. - **Simplified construction:** PATA cables had 40-pin/80-wire ribbon cable. This was complex in construction. In comparison, SATA had a single 7 pin data cable and a 15 pin power cable. This cable resulted in a higher signaling rate, which translates to faster throughput of data. - **Differential Signaling:** SATA uses differential signaling. Differential signaling is a technology which uses two adjacent wires to simultaneously the in-phase and out-of-phase signals. Thus, it is possible to transfer high-speed data with low operating voltage and low power consumption by detecting the phase difference between the two signals at the receiver's end. - **High data transfer rate:** SATA has a high data transfer rate of 150/300/600 MBs/second. This capability of SATA allows for faster program loading, better picture loading and fast document loading. #### Advantages of SATA - Faster data transfer rate as compared to PATA. - SATA cable can be of length upto 1 meter, whereas PATA cable can only have length of maximum 18 inches. - SATA cables are smaller in size. - Since, they are smaller in size, they take up less space inside the computer and increase the internal air flow. Increased air flow can decrease heat build-up and therefore increases the overall life of computer. - Most modern computer motherboards today have SATA ports more than PATA ports. - Low power consumption (0.5V). #### Disadvantages of SATA - Special device drivers are required sometimes to recognize and use the drive. However, a SATA hard drive can behave as a PATA drive. This eliminates the need for a specific driver to be installed. - SATA cable supports only one hard drive to connect at a time, whereas PATA cable allows up to two PATA drives per cable. - SATA is costlier as compared to PATA. #### SATA standards and revisions The nonprofit SATA-IO industry consortium authors the technical specifications governing Serial ATA device interfaces. The consortium revises SATA standards to reflect increased <u>data transfer rates</u>. These revisions include the following changes: - **SATA Revision 1.** These devices were widely used in personal desktop and office computers, configured from PATA drives daisy chained together in a primary/secondary configuration. SATA Revision 1 devices reached a top transfer rate of 1.5 Gbps. - **SATA Revision 2.** These devices doubled the transfer speed to 3.2 Gbps with the inclusion of port multipliers, port selectors and improved <u>queue depth</u>. - **SATA Revision 3.** These interfaces supported drive transfer rates up to 6 Gbps. Revision 3 drives are <u>backward-compatible</u> with SATA Revision 1 and Revision 2 devices, though with lower transfer speeds. - **SATA Revision 3.1.** This intermediate revision added final design requirements for SATA Universal Storage Module for consumer-based portable storage applications. - **SATA Revision 3.2.** This update added the SATA Express specification. It supports the simultaneous use of SATA ports and PCI Express (<u>PCIe</u>) lanes. - SATA Revision 3.3. This revision addressed the use of shingled magnetic recording - **SATA Revision 3.5.** This change promoted greater integration and interoperability with PCIe flash and other I/O protocols. #### COMPARISON: | | USB 2.0 | 1394 | Serial | |--------------------------------------|----------------|----------------|-------------------------| | Raw Interface<br>speed | 480 Mbps | 400 Mbps | 1.5 Gbps<br>(1500 Mbps) | | Benchmark<br>comparison<br>64K read | 31.6<br>MB/sec | 34.8<br>MB/sec | 56.4<br>MB/sec | | Benchmark<br>comparison<br>64K write | 26.5<br>MB/sec | 26.7<br>MB/sec | 54.2<br>MB/sec | | Burst Transfer<br>Rate | 33.5<br>MB/sec | 36.2<br>MB/sec | 111.3<br>MB/sec |