## INFORMATION TO USERS

This manuscript has been reproduced from the microfilm master. UMI films the text directly from the original or copy submitted. Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer.

The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleedthrough, substandard margins, and improper alignment can adversely affect reproduction.

In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion.

Oversize materials (e.g., maps, drawings, charts) are reproduced by sectioning the original, beginning at the upper left-hand comer and continuing from left to right in equal sections with small overlaps.

## University of Alberta

## A Combined Error Control and Constrained Sequence Code through Multimode Coding <br> Aaron Hughes <br> ©

A thesis submitted to the Faculty of Graduate Studies and Research in partial fulfillment of the requirements for the degree of Master of Science

Department of Electrical and Computer Engineering

Edmonton, Alberta
Fall 2005

Library and
Archives Canada
Published Heritage Branch

395 Wellington Street
Ottawa ONK1A ON4 Canada

Bibliothèque et Archives Canada

Direction du
Patrimoine de l'édition
395, rue Wellington
Ottawa ON K1A ON4
Canada

Your file Votre référence ISBN:
Our file Notre retérence ISBN:

## NOTICE:

The author has granted a nonexclusive license allowing Library and Archives Canada to reproduce, publish, archive, preserve, conserve, communicate to the public by telecommunication or on the Internet, loan, distribute and sell theses worldwide, for commercial or noncommercial purposes, in microform, paper, electronic and/or any other formats.

The author retains copyright ownership and moral rights in this thesis. Neither the thesis nor substantial extracts from it may be printed or otherwise reproduced without the author's permission.


#### Abstract

AVIS: L'auteur a accordé une licence non exclusive permettant à la Bibliothèque et Archives Canada de reproduire, publier, archiver, sauvegarder; conserver, transmettre au public par télécommunication ou par I'Internet, prêter, distribuer et vendre des thèses partout dans le monde, à des fins commerciales ou autres, sur support microforme, papier, électronique et/ou autres formats.


L'auteur conserve la propriété du droit d'auteur et des droits moraux qui protège cette thèse. Ni la thèse ni des extraits substantiels de celle-ci ne doivent être imprimés ou autrement reproduits sans son autorisation.

In compliance with the Canadian Privacy Act some supporting forms may have been removed from this thesis.

While these forms may be included in the document page count, their removal does not represent any loss of content from the thesis.

Conformément à la loi canadienne sur la protection de la vie privée, quelques formulaires secondaires ont été enlevés de cette thèse.

Bien que ces formulaires aient inclus dans la pagination, il n'y aura aucun contenu manquant.

## Canadä

To my wife Valerie, we did it baby!

Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.


#### Abstract

This thesis introduces a novel approach of integrating error control codes and constrained sequence codes into a single monolithic code. This is done in order to overcome drawbacks with the conventional manner in which these coding procedures are concatenated. The technique is based on the principle of multimode coding, where each source word is represented by a set of complementary error control code words. From this set the encoder selects the error control codeword that best meets the constrained sequence goals of the system. The decoding structure avoids the problem of error propagation during constrained sequence decoding by performing error correction before removing the effects of the constrained sequence code. Power spectra of encoded sequences generated by a hardware implementation have a null at 0 Hz , confirming that these coded sequences are dc-free. Bit error rate simulations demonstrate the superior performance of this combined error control and constrained sequence code on a dcconstrained noisy channel.


## Acknowledgements

I would like to thank the University of Alberta and TRLabs for giving me an exciting place to study that I always found challenging.

Thanks to Pranavi Anand, Tony Rapley and Shreeram Sigdel for being there through all of our classes.

Thanks to Yan Xin and Fengqin Zhai for always being able to help me solve my toughest problems, and locate hard to find papers.

Thanks to Steve Drake and Rick McGregor for letting me use the VLSI labs, oscilloscopes, power supplies, logic analyzers and many other little parts to make my project work.

Thanks to Marco Castellon for your VHDL expertise. Without you I might still be in the lab to this day.

Thanks to Dr. Witold Krzymien. You became a good friend and mentor and it was nice to know you were in my corner.

Thanks to Dr. Wayne Grover. You may not know this but I learned a surprising amount of communication theory from you.

Thanks to my good friend Shreeram Sigdel. Thanks for all the Tim Horton's coffees, thanks for studying with me, and mostly thanks for preparing me for my Qualcomm interview.

Thanks to my brother Duane for flying into Edmonton on numerous occasions to catch Oiler's hockey games. We smashed many joysticks and didn't make it home until 3am, and I wouldn't have had it any other way. I will never forget those times. Thanks!

Thanks to my Mother and Father for staying supportive all these years I wasn't working! This is what I was doing with my time.

Thanks to my best friend and wife Valerie. We spent almost four years walking over the windy high level bridge (even in $-40^{\circ} \mathrm{C}$ ), and we continued to support each other every day. When I had some low times and lost my confidence, you were there to remind me that I am worth something. Thank you for all of your support.

My biggest and most sincere thanks goes out to my supervisor Dr. Ivan Fair. You took a chance on me as a graduate student when I am certain no one else would, and you never gave up on me. You gave me skills and tools that I use everyday, taught me new ways to envision problems, and had the patience to weather my sometimes relentless questions.

Everyone needs a supervisor like you!

## Table of Contents

1. Introduction ..... 1
1.1. Thesis overview ..... 2
2. Error Control Coding and Constrained Sequence Coding Background ..... 4
2.1. Digital Communication System Basics ..... 4
2.2. Error Control Coding ..... 4
2.3. Goals of Error Control Coding ..... 6
2.4. Block Codes ..... 7
2.5. Parity Check Codes ..... 7
2.6. Generator Matrix ..... 9
2.7. Parity Check Matrix ..... 10
2.8. Syndrome Decoding ..... 11
2.9. Hamming bound ..... 12
2.10. Hamming Distance. ..... 12
2.11. Linear Code Properties ..... 13
2.12. Hamming Codes ..... 13
2.13. Cyclic Codes ..... 14
2.13.1. Polynomial representation of codes ..... 14
2.13.2. Addition and Subtraction ..... 15
2.13.3. Multiplication ..... 15
2.13.4. Division ..... 15
2.14. Encoding of Cyclic Codes ..... 15
2.15. Decoding of Cyclic Codes ..... 17
2.16. Advantages of Error Control Coding ..... 20
2.17. Drawbacks of Error Control Coding ..... 20
2.18. Constrained Sequence Coding ..... 21
2.19. Line Coding Basics ..... 21
2.20. Line Coding Goals ..... 25
2.21. Line Coding approaches ..... 26
2.22. Advantages of Constrained Sequence Coding ..... 29
2.23. Disadvantages of Constrained Sequence Coding ..... 29
2.24. Combining Error Control Coding and Constrained Sequence Coding ..... 29
3. Integrating EC coding and CS coding ..... 32
3.1. A Simple Scheme ..... 33
3.2. Improved scheme ..... 36
3.3. Tradeoffs ..... 42
3.4. Goals of the combined EC and CS system ..... 43
3.5. Generic Systems ..... 44
3.6. General notes on AddCWs ..... 50
3.6.1. AddSWs and hexadecimal notation ..... 50
3.6.2. Number of AddCWs can be any power of two ..... 50
3.6.3. MSbs and identification bits ..... 51
3.6.4. Good AddCW set ..... 51
3.6.5. Number of AddCW sets ..... 51
3.7. Combined EC and CS code summary ..... 53
4. Code Word Search and Analytical Results ..... 54
4.1. Evaluating CW Statistics ..... 54
4.2. Choosing the Best Code Word. ..... 57
4.3. Selection process used in this thesis ..... 59
4.4. Example of the CW selection process based on minimizing the RDS ..... 60
4.5. Comparing unconstrained and constrained transmission ..... 63
4.6. Effect of Different AddCWs ..... 64
4.7. Finite State Machines ..... 66
4.8. Markov Processes and Markov Chains ..... 69
4.8.1. Markov Chains ..... 69
4.8.2. Markov Chain Properties ..... 70
4.8.3. Long Term Behavior of Markov Chains ..... 72
4.8.4. Non-Regular Markov Chains ..... 74
4.8.5. State Diagrams ..... 75
4.9. Analyzing the Encoder as a Markov Chain ..... 76
4.9.1. Finding the invariant probability vector of the encoder. ..... 79
4.10. Finding Runlength Probabilities ..... 82
4.11. Analysis of runlength probabilities ..... 83
4.12. Code Word Search ..... 89
4.13. Analysis and CW Search on larger codes ..... 91
4.14. CW Search Results ..... 93
4.15. AddCW recommendations ..... 94
5. Power Spectral Density Results ..... 98
5.1. Relationship between the time and frequency domain ..... 98
5.2. Evaluating the Power Spectral Density ..... 100
5.3. Evaluating the Power Spectral Density of a Coded System ..... 101
5.4. Evaluating the Power Spectral Density using Matlab ..... 110
5.5. Understanding the Discrete-Like Nature of the PSD ..... 114
5.6. Spectra of various codes ..... 116
5.7. Spectra of larger codes ..... 122
5.8. PSD Summary ..... 126
6. Implementing the Multimode Encoder and Decoder in Hardware ..... 127
6.1. Pinouts ..... 130
6.2. SDATA and SCLK signals ..... 130
6.3. Controlling the FPGAs over the PC parallel port ..... 131
6.4. Transmitter FPGA State Machine ..... 133
6.5. PC Software: Controlling the TX FPGA ..... 145
6.6. Transmitting a File ..... 146
6.7. Debug mode of the FPGA ..... 150
6.8. Receiver State Machine ..... 152
6.9. PC Software: Controlling the RX FPGA ..... 156
6.10. Receiving a File ..... 157
6.11. Verify that the system could correct single bit errors ..... 159
6.12. The effect of more than single bit errors ..... 160
6.13. Equipment used to measure the PSD ..... 161
6.14. Testing discrete spectra $-50 \%$ duty cycle square wave ..... 161
6.15. Fourier Series Representation ..... 162
6.16. Accuracy of discrete PSD measurements ..... 167
6.17. Accuracy of continuous PSD measurements ..... 168
6.18. How the ResBW affects the PSD ..... 169
6.19. Comparing spectra of various codes ..... 171
6.20. $(15,8)$ Best AddCW sets ..... 172
6.21. $(15,8)$ Worst AddCW sets ..... 174
6.22. Spectra of larger codes ..... 177
6.23. Summary ..... 177
7. Bit Error Rate Performance ..... 178
7.1. Channel model ..... 178
7.2. Simulation ..... 182
7.3. BPSK Simulation results ..... 183
7.4. Effect of error control codes ..... 184
7.5. Simple multimode code ..... 185
7.6. Effect of 8 AddCWs with the $(15,8)$ code ..... 187
7.7. Performance of larger codes ..... 189
7.8. Effect of 8 AddCWs with the $(31,23)$ code ..... 191
7.9. Conclusions ..... 192
8. Conclusion ..... 194
8.1. Thesis summary ..... 194
8.2. Future work ..... 195
References ..... 196
Appendix A - Calculating Power Spectral Density ..... 198
A.1. Determining the Power Spectral Density ..... 198
A.2. How to calculate RMS for non-sinusoidal signals ..... 199
A.3. Comparing Fourier calculations to spectrum analyzer output ..... 200
A.4. Trigonometric Fourier Series ..... 201
A.5. Compact Fourier Series ..... 201
A.6. Exponential Fourier Series ..... 201
A.7. Comparing to a Spectrum Analyzer and Digital Oscilloscope ..... 204
A.8. Random Binary Signals ..... 209
A.9. Finding the PSD of a Coded Binary Signal ..... 217
A.10. Finite State Machines. ..... 217
A.11. Markov Chains ..... 220
A.12. Markov Chain Properties ..... 221
A.13. Partial Summary ..... 224
A.14. PSD of Coded Systems - Simplest Example ..... 225
A.15. PSD of Coded Systems - Real World Example ..... 235
Appendix B - AddSWs, AddCWs and Codes ..... 240
B.1. $(7, x)$ codes ..... 240
B.2. $(15, x)$ codes ..... 241
B.3. $(31, x)$ codes ..... 242

## List of Figures

Figure 2-1: Simple digital communication system ..... 4
Figure 2-2: Digital communication system with EC CODing ..... 5
Figure 2-3: Systematic CW ..... 7
Figure 2-4: Mapping of the 1:8 CODe ..... 7
Figure 2-5: Cyclic code encoder ..... 17
Figure 2-6: Meggitt decoder and operation correcting the $5^{\text {tit }}$ MSb ..... 20
Figure 2-7: On-Off Signaling Formats ..... 21
Figure 2-8: Polar Signaling Formats ..... 22
Figure 2-9: Simple high-impedance preamplifier design using a bipolar transistor ..... 23
Figure 2-10: 50\% duty cycle square wave through a dc blocking capacitor ..... 23
Figure 2-11: AC Coupled signal and baseline wander ..... 23
Figure 2-12: AC Coupling of an unbalanced sequence. ..... 24
Figure 2-13: Clock recovery circuit losing synchronization ..... 25
Figure 2-14: Digital communication system with CS Coding ..... 26
Figure 2-15: Typical configuration that incorporates EC coding and CS coding ..... 30
Figure 2-16: Desired configuration that incorporates EC coding and CS coding ..... 31
Figure 2-17: Combined approach that incorporates EC Coding and CS coding ..... 31
Figure 3-1: A simple EC + CS encoder using bimode Coding ..... 34
Figure 3-2: The effect of the bimode coding: balanced output ..... 34
Figure 3-3: A simple EC + CS decoder using two AddCWs ..... 35
Figure 3-4: A simple EC + CS Encoder using four AddCWs ..... 37
Figure 3-5: The effect of the (7,2) scheme is balanced output and numerous transitions ..... 37
Figure 3-6: A simple EC + CS decoder using four AddCWs ..... 39
Figure 3-7: Concatenation of CWs shows runs of four like valued bits ..... 39
Figure 3-8: A simple EC + CS encoder using eight AddCWs ..... 40
Figure 3-9: SW to CW mapping of this (7,1) Setup ..... 41
Figure 3-10: A simple EC + CS decoder using eight AddCWs ..... 42
Figure 3-11: Generic bimode complementary encoder using two AddCWs ..... 44
Figure 3-12: Generic bimode complementary decoder using two AddCWs ..... 44
Figure 3-13: AddCW format for a system using two AddCWs ..... 45
Figure 3-14: Format of the two AddCWs for proper decoder operation ..... 45
Figure 3-15: Generic multimode complementary encoder using four AddCWs ..... 47
Figure 3-16: Generic multimode complementary decoder using four AddCWs ..... 47
Figure 3-17: Choosing an AddCW set with four AddCWs ..... 48
Figure 3-18: Strict format of the four AddCWs ..... 48
FIGURE 3-19: GENERIC MULTIMODE COMPLEMENTARY ENCODER USING EIGHT ADDCWS ..... 49
FIGURE 3-20: GENERIC MULTIMODE COMPLEMENTARY DECODER USING EIGHT ADDCWS ..... 49
Figure 3-21: ChOOSING AN AdDCW SET WITH EIGHT AdDCWs ..... 49
Figure 3-22: Strict format of the eight AddCWs ..... 50
Figure 4-1: Counting Transitions ..... 54
Figure 4-2: Running Digital Sum of the code word 3Ah ..... 55
FIgURE 4-3: Minimum SQuared weight can take the internal RDS into account. ..... 56
Figure 4-4: Minimum squared weight criteria chooses the second CW ..... 57
Figure 4-5: SEQUENCE HAS MANY TRANSITIONS BUT IT IS UNBALANCED ..... 57
FIgure 4-6: SEQUENCE IS BALANCED BUT IT CONTAINS FEW TRANSITIONS ..... 58
Figure 4-7: $(7,2)$ encoder with AddCWs \{00H, 3AH, 45H, 7FH \} ..... 60
Figure 4-8: SimFPGA - Program for verifying the CW selection process ..... 60
FIGURE 4-9: UNCODED SEQUENCE IS UNBALANCED AND HAS A RUN OF 9. ..... 63
FIGURE 4-10: CODED SEQUENCE IS BALANCED BUT STILL HAS A RUN OF 5 LIKE VALUED bits ..... 64
FIGURE 4-11: CODED SEQUENCE IS BALANCED AND NOW HAS A RUN OF ONLY THREE LIKE VALUED BITS ..... 66
FIGURE 4-12: Partial state diagram for the (7,2) Encoder transmitting four SWs 0, 2, 3 and 3. This Causes transitions from state $(0,0)$, $\operatorname{TO} \operatorname{STATE}(-1,0)$, $\operatorname{TO} \operatorname{STATE}(0,1)$, TO STATE $(1,0)$, AND BACK TO STATE $(0,1)$ ..... 67
Figure 4-13: State Diagram For $(7,2)$ CODE With AddSWs $=\{0 \mathrm{H}, 4 \mathrm{H}, \mathrm{Bh}, \mathrm{FH}\}$ ..... 68
Figure 4-14: Markov Chain State Diagram ..... 76
Figure 4-15: Markov Chain State Diagram with State Probabilities ..... 81
Figure 4-16: Decomposed State diagram illustrating the probability of a SW/CW entering EACH STATE, AND PROBABILITY OF EACH CW LEAVING EACH STATE, ALLOWING STRAIGHTFORWARD CALCULATION OF RUNLENGTHS ..... 82
Figure 4-17: CONCATENATION OF Two CWS and the Portion of Interest ..... 83
Figure 4-18: Build_EuCu.exe Program that builds $\Theta_{l}, E_{l}$ and $C_{l /}$ Matrices ..... 84
Figure 4-19: Output states from Build_EuCu.exe ..... 85
Figure 4-20: Output $E_{U}$ and $C_{l /}$ MATRICES FROM Build_EuCu.EXE. ..... 85
Figure 4-21: CalcPIMandProbs.exe ..... 86
Figure 4-22: Output from CalcPIMandProbs.exe ..... 86
FIGURE 4-23: RUNLENGTH PROBABILITIES WITH SW 0 ..... 87
Figure 4-24: RunLengTh probabilities with SW 1 ..... 87
Figure 4-25: Runlength probabilities with SW 2 ..... 88
Figure 4-26: Runlength probabilities with SW 3 ..... 88
Figure 4-27: CW search program ..... 89
FIGURE 4-28: CW SEARCH RESULTS FOR THE $(7,2)$ ..... 90
Figure 4-29: SimFPGA keeps track of percentage of time each AddCW is used ..... 96
Figure 5-1: Demonstrating nulls at 0 Hz on both linear and logarithmic scales ..... 99
Figure 5-2: Low Frequency content of a signal ..... 99
Figure 5-3: High frequency content of a signal ..... 99
FIGURE 5-4: PSD of a SQUARE PULSE SHAPE PLOTTED ON LINEAR AND LOGARITHMIC SCALES ..... 109
FIGURE 5-5: InPut file for PSD CALCULATION ..... 110
Figure 5-6: Information required for PSD calculation ..... 111
Figure 5-7: Linear PSD of the multimode system (no pulse shape included) ..... 112
Figure 5-8: Showing that the PSD is periodic and an even function ..... 112
FIGURE 5-9: EFFECT OF THE SQUARE PULSE SHAPE PLOTTED ON A LINEAR AND LOGARITHMIC SCALE ..... 113
Figure 5-10: COMPARING THE SPECTRUM ANALYZER TO MATLAB SPECTRUM ..... 113
Figure 5-11: Simplest FSM model For the (7,1) CODE ..... 114
Figure 5-12: PSD of the multimode code shown in Figure 6-50 ..... 115
Figure 5-13: PSD of the periodic SEQuences \{6916H\} AND \{692CH\} ..... 116
Figure 5-14: PSD OF THE PERIODIC SEQUENCES \{5316H\} AND \{532CH\} ..... 116
FIGURE 5-15: AVERAGE OF THE SEVEN FREQUENCY COMPONENTS ..... 116
Figure 5-16: 90\% ONES - All four Best AddCW SETS ..... 117
Figure 5-17: 10\% ONES - All FOUR BEST AddCW SETS ..... 118
FIGure 5-18: PSD of $(15,8)$ CODES WITH A BALANCED SOURCE ..... 119
Figure 5-19: 90\% ONES - All four Worst AddCW SETS ..... 120
Figure 5-20: $10 \%$ ONES - All four Worst AddCW SETS ..... 121
FIGURE 5-21: 90\% ONES - BEST ADDCW SET FOLLOWED BY A WORST ADDCW SET ..... 121
FIGURE 5-22: $90 \%$ ONES - BEST AND WORST ADDCW SET ..... 122
Figure 5-23: Simulation for finding the PSD ..... 123
FIGURE 5-24: 90\% ONES - BEST ADDCW SET FOLLOWED bY A WORST ADDCW SET ..... 124
FIGURE 5-25: $10 \%$ ONES - TWO $(31,23)$ CODES. ..... 124
FIgure 5-26: Statistics of The $(31,23)$ CODE WITH THE $1^{\text {sT }}$ AdDCW SET ..... 125
FIGURE 5-27: Statistics of The $(31,23)$ CODE WITH THE $2^{\text {ND }}$ ADDCW SET. ..... 125
FIGURE 6-1: BļCK DIAGRAM OF THE FPGA COMMUNICATION SETUP ..... 127
FIGURE 6-2: FPGA TRANSMITTER AND RECEIVER COMMUNICATING. ..... 128
Figure 6-3: Digilent Digilab FPGA board ..... 129
FIGURE 6-4: DIO2 PERIPHERAL BOARD FOR INPUT AND OUTPUT ..... 129
Figure 6-5: The pinouts used on the FPGA boards ..... 130
FIGURE 6-6: SDATA SEQUENCE (TOP) AND SCLK SEQUENCE (BOTTOM) ..... 131
Figure 6-7: Parallel Port connector on the back of a PC ..... 131
Figure 6-8: The ‘TRANSIENT' NATURE OF THE PARALLEL PORT WHEN SWITCHING VALUES. ..... 132
Figure 6-9: FSM MODEL of THE PARALLEL PORT DEBOUNCE CIRCUIT ..... 132
Figure 6-10: Main Transmitter FPGA FSM ..... 133
Figure 6-11: LCD screen and corresponding DDRAM locations ..... 136
Figure 6-12: FSM For displaying strings and numbers on the LCD screen ..... 137
Figure 6-13: High-level block diagram of the running mode state FSM ..... 138
Figure 6-14: Testbench for SW packing ..... 139
Figure 6-15: 63h and 8Ah form the 9-bit SW C7h OR 011000111 B ..... 140
Figure 6-16: Testbench for the Hamming encoder ..... 140
Figure 6-17: Testbench for CW addition ..... 141
Figure 6-18: Testbench for the StatsGen module ..... 141
Figure 6-19: Testbench for the SelectBest module ..... 143
Figure 6-20: Testbench for the Parallel2Serial module ..... 145
Figure 6-21: Software that controls the TX FPGA ..... 145
Figure 6-22: Setting up the number of AddCWs ..... 146
Figure 6-23: Eight AddCWs have been selected for use with the (31,x) Code ..... 146
Figure 6-24: Reading the AddSWs from file to use with the $(31,23)$ CODE ..... 147
Figure 6-25: Encoder writes back the 8 AddCWs it is going to use ..... 147
Figure 6-26: Two ways to transmit the file into FPGA RAM ..... 148
Figure 6-27: Reading back the file Hello World, and what a possible error might look like. 149 Figure 6-28: Choosing the serial transmission speed ..... 149
Figure 6-29: Setting the amount of RAM that the FPGA could use ..... 150
Figure 6-30: Encoder in debug mode showing the CW selection process ..... 151
Figure 6-31: Winning location is CW5 or 5D5DDA81h ..... 152
Figure 6-32: Main receiver FPGA FSM ..... 153
Figure 6-33: High-Level block diagram of the running mode state FSM ..... 153
Figure 6-34: Testbench for the Serial2Parallel module ..... 154
Figure 6-35: Testbench for the Meggitt Decoder module ..... 155
Figure 6-36: Testbench for the CW addition module. ..... 155
Figure 6-37: Testbench for the SW Unpacking module ..... 156
Figure 6-38: Software that controllers the RX FPGA ..... 157
Figure 6-39: Dummy file read back from the RX FPGA RAM ..... 158
Figure 6-40: RAM CONTENTS READ back from the RX FPGA ..... 158
FIGURE 6-41: HOW THE SWITCHES ON THE DIO2 bOARD CONTROLLED THE TX FPGA ..... 159
Figure 6-42: TOP LINe is CWs with Single bit errors, bottom line is error free CWs ..... 160
Figure 6-43: "Hello World" file received with 2-bit errors per CW ..... 160
Figure 6-44: Digital oscilloscope and spectrum analyzer used ..... 161
FIgURE 6-45: 50\% DUTY CYCLE UNIPOLAR SQUARE WAVE ..... 161
FIGURE 6-46: PSD MEASURED ON THE DO aND SA ..... 166
Figure 6-47: Shows how the SA is not accurate below 9 KHz . ..... 167
FIGURE 6-48: PSD OF A RANDOM BINARY SQUARE WAVE, MEASURED VERSUS CALCULATED ..... 168
FIGURE 6-49: SAME PSD MEASURED WITH TWO DIFFERENT RESBW'S ..... 169
Figure 6-50: THE Effect of decreasing the Resbw ..... 170
Figure 6-51: BuILDRandomBitFile program For generating random data files ..... 171
FIGURE 6-52: ‘RANDOM_ONES_10_SIZEISIO24.BIN' SHOWS THE FILE IS MOSTLY LOGIC OS ..... 171
Figure 6-53: 90\% ones - All four Best AddCW sets. Compare with Figure 5-16 ..... 172
Figure 6-54: Comparing the 90\% ones best AddCW set laA on the 5dB/div scale ..... 173
Figure 6-55: 10\% ones - All four Best AddCW sets. Compare with Figure 5-17 ..... 173
FIGURE 6-56: 50\% ONES SHOWS LITTLE DIFFERENCE BETWEEN CODES - COMPARE WITH FIGURE 5-18 ..... 174
Figure 6-57: $90 \%$ ONES - Two Of the Worst AddCW sets. Compare with Figure 5-19 ..... 174
Figure 6-58: $10 \%$ ONES - TWO OF THE WORSt AddCW sets. COMPARE WITH Figure 5-20 ..... 175
Figure 6-59: Comparing the 90\% ones worst AddCW set 108 ON THE 5dB/Div sCale. ..... 175
Figure 6-60: $90 \%$ Ones - Best AddCW and Worst AddCW set - Compare with Figure 5-21 ..... 176
Figure 6-61: 90\% ones - Best AddCW and Worst AddCW set - Compare with Figure 5-22 ..... 176
Figure 6-62: $10 \%$ ONES - Two $(31,23)$ COdes. Compare with Figure 5-25 ..... 177
FIGURE 7-1: DC-CONSTRAINED CHANNEL - HPF CONCATENATED WITH AWGN ..... 179
FIGURE 7-2: (A) Binary SEQUENCE $[1,0,0,1,1,1,0]$ TRANSMITTED USING BIPOLAR CODING.
(B) Midpoint locations ..... 180
FIgure 7-3: Showing the effect of the HPF Channel with $R C$ time constants 10 and 2, and how THE SNR IS NOT CONSTANT OVER A BIT PERIOD ..... 181
FIGURE 7-4: HPF CHANNELS EFFECT OF A LONG RUN OF LOGIC OS WITH AN $R C$ CONSTANT OF 10 and 2. It CAN BE SEEN THAT THE VOLTAGE IS DECAYING TO 0 VOLTS ..... 181
Figure 7-5: HPF CHANNEL MODEL SIMULATION ..... 182
FIGURE 7-6: LOOKING FOR 1000 ERRORS AT EACH SNR FROM 5.5DB TO 10DB WITH T $=30$. ..... 183
FIGURE 7-7: BPSK ON AN AWGN CHANNEL VERSUS THE HPF Channel wITH AWGN ..... 183
Figure 7-8: Range of BER values that occurs with changing source statistics ..... 184
Figure 7-9: Effect of EC codes on this channel is minimal ..... 185
FIGURE 7-10: COMPARISON OF PERFORMANCE OF $(15,11)$ AND $(15,10)$ CODES WITH BALANCED SOURCE.. ..... 185
FIGURE 7-11: SIMFPGA RESULTS FOR THE $(15,10)$ CODE SHOWING RUNLENGTH PROBABILITIES ..... 186
FIGURE 7-12: Range of BER Results FOR THE $(15,10)$ CODE ..... 187
FIGURE 7-13: BER PERFORMANCE OF THE $(15,8)$ CODE ..... 188
Figure 7-14: The $(15,8)$ code on The HPF channel compared to BPSK on an AWGN channel. ..... 188
Figure 7-15: Performance of the ( 15,8 ) code does not change with Varying source statistics 189
Figure 7-16: BER PERFORMANCE OF THE $(31,25)$ CODE ..... 190
Figure 7-17: SimFPGA Results for the $(31,25)$ CODE SHOWING RUNLENGTH PROBABILITIES ..... 190
FIGURE 7-18: Range of BER Values for the $(31,25)$ COde versus the $(15,10)$ COde ..... 191
FIGURE 7-19: BER PERFORMANCE OF THE $(31,23)$ CODE IS BETTER THAN THAT OF THE $(15,8)$ CODE ..... 191
FIGURE 7-20: BER PERFORMANCE OF THE $(31,23)$ CODE ON THE DC-CONSTRAINED CHANNEL IS BETTER THAN THE BER PERFORMANCE OF BPSK ON AN AWGN CHANNEL ..... 192
Figure A-1: PSD displays the contribution of power at each frequency ..... 198
FIgure A-2: Once coding is introduced, calculating the PSD Can be challenging ..... 199
Figure A-3: Calculating the power and RMS of a Sine wave ..... 199
Figure A-4: Digital oscilloscope measurements ..... 200
FIGURE A-5: 50\% DUTY CYCLE SQUARE WAVE ON SCOPE ..... 202
Figure A-6: Calculating the average and RMS ..... 202
Flgure A-7: Calculating $A_{0}$ AND $A_{N}$ ..... 203
Figure A-8: Calculating $B_{N}$ ..... 203
 ..... 203
Figure A-10: $\mathrm{C}_{\mathrm{N}}$ Values in a List with frequency values and $N$ index below ..... 203
FIGURE A-11: $\Theta_{\mathrm{N}}$ VALUES SHOWN IN LIST WITH FREQUENCY VALUES AND $N$ INDEX BELOW ..... 204
FIGURE A-12: DO SHOWING THE FREQUENCY DOMAIN PSD OF THIS SQUARE WAVE ..... 205
Figure A-13: Calculated dB $\mathrm{Rm}_{\mathrm{ms}}$ Values of the square wave ..... 205
Figure A-14: SA showing the frequency domain PSD of this square wave ..... 206
Figure A-15: Calculated dBm values of the square wave ..... 206
Figure A-16: SQUARE WAVE WITH A 6.25\% DUTY CYCLE. ..... 207
Figure A-17: Calculating Average and RMS values for the 6.25\% duty cycle square wave. ..... 207
FIGURE A-18: $6.25 \%$ DUTY CYCLE SQUARE WAVE IN THE FREQUENCY DOMAIN ON THE DO AND SA ..... 208
Figure A-19: PDF for the discrete time random process ..... 209
Figure A-20: PDF of the random process for Example 4 ..... 211
Figure A-21: DIScRete time random process mapped to continuous time with delta functions ..... 214
Figure A-22: Waveform is a convolution of the pulse shape and train of delta functions ..... 215
Figure A-23: SQUARE PULSE SHAPE BEING CONVOLVED WITH ITSELF ..... 215
Figure A-24: Graphical depiction of EQuation A. 19 ..... 216
Figure A-25: Fourier transform of the autocorrelation function is the PSD ..... 216
Figure A-26: Matlab SCRIPT Showing PSD that matches with Example 3 ..... 216
Figure A-27: Mealy Finite State Machine (FSM) ..... 218
Figure A-28: Rate $1 / 2$ CONVOLUTIONAL ENCODER ..... 218
Figure A-29: FSM of the convolutional encoder of Example 6 ..... 219
Figure A-30: Transition matrix for Example 7 ..... 222
Figure A-31: State diagram For Example 7 ..... 222
Figure A-32: Matrix powers Can find the two step probability ..... 223
FIGURE A-33: 4-STEP, 5-STEP AND 100-STEP PROBABILITIES FOR EXAMPLE 7 ..... 224
Figure A-34: Original state diagram of the example system ..... 225
Figure A-35: Expanded state diagram for the example system ..... 226
Figure A-36: SQUARE PULSE SHAPE USED IN EXAMPLE 8 ..... 232
FIGURE A-37: STORES THE PULSE SHAPE $P(F)$ AND WEIGHT, FINDS THE DBV $V_{\text {RMS }}$ VALUE AT 100 KHz ..... 234
Figure A-38: Find dBV rms Values at 200 KHz and 300 KHz ..... 234
Figure A-39: Find dBV RMS Values at 500 kHz and 700 KHz ..... 234
Figure A-40: Finding the dBm value at 100 KHz ..... 234
Figure A-41: Finding the dBm Values at 300 KHz and 500 KHz ..... 234
Figure A-42: State diagram of the random binary signal ..... 235
Figure A-43: Ti-89 and Matlab plotting the spectrum ..... 239

## List of Tables

Table 2-1: Systematic CWs formed by Equations 2.3 To 2.5 ..... 10
Table 3-1: SW to CW mapping of this bimode (7,3) Setup. Compare with Table 2-1 ..... 35
TABLE 3-2: SW TO CW MAPPING OF THIS MULTIMODE $(7,2)$ SETUP. ..... 38
TABLE 3-3: SW TO CW MAPPING OF THIS MULTIMODE (7,1) SETUP ..... 41
TABLE 3-4: $(7, X)$ CODE PERFORMANCE WITH EQUIPROBABLE SOURCE STATISTICS ..... 42
TABLE 3-5: Using ADDCWS $C_{6}(X)=0110001$ AND $C_{9}(X)=1001110$ ..... 46
TABLE 3-6: USING ADDCWS $C_{6}(X)=0110001$ AND $C_{8}(X)=1000101$ ..... 46
Table 4-1: WITH THE MSW CRITERIA THE ALL-ZERO CW WOULD BE TRANSMITTED ..... 58
TAbLE 4-2: DECISION PROCESS FROM SIMFPGA FOR THE FIRST SW ..... 61
TABLE 4-3: DECISION PROCESS FROM SimFPGA FOR THE SECOND SW ..... 62
TABLE 4-4: DECISION PROCESS FROM SimFPGA FOR THE THIRD AND FOURTH SW ..... 63
Table 4-5: DECISION PROCESS FROM SImFPGA FOR THE $1^{\text {ST }}$ SW ..... 65
TABLE 4-6: DECISION PROCESS FROM SIMFPGA FOR THE SECOND, THIRD AND FOURTH SW. ..... 65
TAble 4-7: Best AddSW SETS FROM THE $(15,8)$ SEARCH ..... 93
Table 4-8: RUNLENGTH PROBABILITIES FOR THE BEST ADDCWS ..... 93
TABLE 4-9: WORST ADDSW SETS FROM THE $(15,8)$ SEARCH ..... 94
Table 4-10: RUNLENGTH PROBabILITIES FOR THE WORST CWS ..... 94
TABLE 4-11: ADDSW SET $\{0 \mathrm{H}, 108 \mathrm{H}, 200 \mathrm{H}, 300 \mathrm{H}, 4 \mathrm{FFH}, 5 \mathrm{FFH}, 6 \mathrm{~F} 7 \mathrm{H}, 7 \mathrm{FFH}\}$ ..... 95
TABLE 4-12: ADDSW SET \{0H,10CH,200H,300h,4FFH,5FFh,6F3H,7FFh\} ..... 95
TABLE 4-13: ADDSW SET $\{0 \mathrm{H}, 120 \mathrm{H}, 200 \mathrm{H}, 300 \mathrm{H}, 4 \mathrm{FFH}, 5 \mathrm{FFH}, 6 \mathrm{DFH}, 7 \mathrm{FFH}\}$ ..... 95
TABLE 4-14: ADDSW SET $\{0 \mathrm{H}, 130 \mathrm{H}, 200 \mathrm{H}, 300 \mathrm{H}, 4 \mathrm{FFH}, 5 \mathrm{FFH}, 6 \mathrm{CFH}, 7 \mathrm{FFH}\}$ ..... 95
Table 4-15: ADDSW SET $\{0 \mathrm{H}, 107 \mathrm{H}, 2 \mathrm{C} 8 \mathrm{H}, 323 \mathrm{H}, 4 \mathrm{DCH}, 537 \mathrm{H}, 6 \mathrm{~F} 8 \mathrm{H}, 7 \mathrm{FFH}\}$ ..... 95
TABLE 4-16: ADDSW SET $\{0 \mathrm{H}, 14 \mathrm{DH}, 29 \mathrm{BH}, 370 \mathrm{H}, 48 \mathrm{FH}, 564 \mathrm{H}, 6 \mathrm{~B} 2 \mathrm{H}, 7 \mathrm{FFH}\}$ ..... 95
TABLE 4-17: ADDSW SET \{0H,1AAH,29Bh,323H,4DCH,564H,655H,7FFH\} ..... 96
TABLE 4-18: ADDSW SET $\{0 \mathrm{H}, 1 \mathrm{E} 7 \mathrm{H}, 2 \mathrm{C} 8 \mathrm{H}, 323 \mathrm{H}, 4 \mathrm{DCH}, 537 \mathrm{H}, 618 \mathrm{H}, 7 \mathrm{FFH}\}$ ..... 96
TABLE 6-1: COMMANDS FOR BOTH THE TRANSMITTER AND RECEIVER FPGAS ..... 134
TABLE 6-2: STRING TABLE HELD IN THE FIRST 400H BYTES OF RAM ..... 136
TABLE 6-3: SUMMARY OF THE STATSGEN MODULE GATHERING STATISTICS ..... 142
TABLE 6-4: SUMMARY OF THE SELECTBEST MODULE ..... 144
TABLE 6-5: OUTPUT FROM SIMFPGA ..... 151
Table 6-6: Calculated AVG and RMS versus measured ..... 162
Table 6-7: First nine $C_{N}$ TERMS FROM COMPACT Fourier Series (LINEAR SCaLE) ..... 164
Table 6-8: First nine $C_{N}$ TERMS FROM COMPACT Fourier Series in dB Rms ..... 165
Table 6-9: First nine $C_{N}$ TERMS from Compact Fourier Series in dBm ..... 166
Table A-1: SHOWING HOW THE MEASURED AND CALCULATED VALUES MATCH UP ..... 205
TABLE A-2: SHOWING HOW THE MEASURED AND CALCULATED VALUES MATCH UP. ..... 206
TABLE A-3: MEASURED VS CALCULATED FOR DBM AND DBV $V_{\text {RMS }} 6.25 \%$ DUTY CYCLE SQUARE WAVE ..... 208
TABLE B-1: (7,3) ADDSWS $-\left(2^{3}\right)^{0}=1$ SET. ..... 240
TABLE B-2: $(7,2)$ ADDSWS $-\left(2^{2}\right)^{1}=4$ SETS ..... 240
TABLE B-3: $(7,1)$ ADDSWS $-\left(2^{1}\right)^{3}=8$ SETS ..... 240
TABLE B-4: $(15,10)$ ADDSWS $-\left(2^{10}\right)^{0}=1$ SET ..... 241
TABLE B-5: $(15,9)$ ADDSWS $-\left(2^{9}\right)^{1}=512$ SETS ..... 241
TABLE B-6: $(15,8)$ ADDSWS $-\left(2^{8}\right)^{3}=16777216$ SETS .....  241
TABLE B-7: $(31,25)$ ADDSWS $-\left(2^{25}\right)^{0}=1$ SET ..... 242
TABLE B-8: ( 31,24 ) ADDSWS $-\left(2^{24}\right)^{1}=16777216$ SETS ..... 242
TABLE B-9: $(31,23)$ ADDSWS $-\left(2^{23}\right)^{3}=590295810358705651712$ SETS ..... 242

## List of Abbreviations

| AddSW | Add Source Word |
| :--- | :--- |
| AddCW | Add Code Word |
| CS | Constrained Sequence |
| CW | Code Word |
| DO | Digital Oscilloscope |
| EC | Error Correction / Error Correcting |
| ESD | Energy Spectral Density |
| LB | Last Bit |
| LC | Line Coding |
| LSb | Least Significant bit |
| MSb | Most Significant bit |
| MSW | Minimum Squared Weight |
| PSD | Power Spectral Density |
| RDS | Running Digital Sum |
| SW | Source Word |
| SA | Spectrum Analyzer |
| WRDS | Word-end Running Digital Sum |

## 1. Introduction

The objective of a communication system is to transfer information from a source to a destination with as much accuracy as possible. While errors will inevitably occur, modern digital communication systems typically package their data in an intelligent manner to reduce these errors. As such, both error control coding and constrained sequence coding are a form of intelligent packaging used to deliver information from a source to a destination with a minimum number of errors $[1,2,3]$.

For numerous reasons errors can occur during transmission. Error control coding allows for the detection and often correction of these errors by encoding source data in a redundant manner. Constrained sequence coding, also known as line coding [4,5], is an alternate encoding method that pre-conditions source data before transmission in an attempt to prevent errors from happening in the first place.

Both of these coding techniques aim to achieve the same goal of error free transmission. As a result, a logical advancement is to try and combine the two approaches to extract the benefits from both coding techniques [6-11]. Simple configurations involve using both codes in a concatenated fashion where the original source data is encoded with an error control code first, called the outer code, followed by a constrained sequence code, called the inner code. This simple arrangement aspires to avoid errors as a result of the inner constrained sequence code, with the expectation that the outer error control code can correct any errors that do occur.

A problem known as error propagation occurs with this setup when the constrained sequence decoder encounters errors that inevitably occur during transmission. That is, random bit errors from the channel are increased during constrained sequence decoding since the constrained sequence decoder has no means of handling these errors. As a result, the outer error control code must be powerful enough to correct the original errors as well as the additional errors introduced by the constrained sequence decoder. If the number of errors exceed the error correcting ability of the error control code, then the error control decoder may also introduce additional errors into the decoded sequence.

Since this problem results from the fact that the constrained sequence decoder cannot adequately handle errors, common sense would mandate that any errors be corrected before removing the constrained sequence code. However this is not easily accomplished since the order of decoding operations would no longer be the inverse of the encoding operations. As a result this problem has lead many channel coding researchers to investigate ways to combine both coding techniques into one monolithic code. With a combined code data could be simultaneously pre-conditioned for the channel to try and prevent errors from occurring, and as well, when errors do occur on the channel, they could be error corrected first before constrained sequence decoding, thus avoiding error propagation. This thesis introduces a new combined error control and constrained sequence code based on linear block codes. The new code introduced is analyzed, simulated, and implemented in FPGA circuits for proof of concept. The coding technique presented in this thesis can be used in fiber optic back planes which traditionally rely on CS coding alone, or even in CD and DVD formats such as the new Blu-ray or HD-DVD standards, which employ both forward EC codes and dc-free CS codes when formatting data for storage on the disk.

### 1.1. Thesis overview

Chapter 2 presents concepts of error control coding and constrained sequence coding that are used in this thesis. Chapter 3 follows by discussing general approaches that can be used to create a combined error control and constrained sequence code. Following this is a gradually evolving example that clearly details the new approach introduced in this thesis. Chapter 4 outlines the mathematics required for analyzing code performance in the time domain. The results of a computer search are also presented and recommendations are made detailing how this coding technique can be applied to other linear block codes. Chapter 5 investigates the coding scheme in the frequency domain and presents the results and analysis of the power spectral density of various configurations of this coding technique. Chapter 6 presents the FPGA hardware implementation and compares the measured time domain and frequency domain results with analytical results, while Chapter 7 considers the bit error rate performance of this
combined code and compares various configurations. Finally, Chapter 8 summarizes the concept, configurations and performance of the combined coding technique as well as presents suggestions for future work. The appendix serves as a reference for calculating power spectral density in general as well as for block codes.

## 2. Error Control Coding and Constrained Sequence Coding Background

### 2.1. Digital Communication System Basics

A digital communication systems requires a source, which for the purpose of this thesis is considered to only output the binary symbols 0 and 1 . Binary symbols will also be referred to as bits or digits depending on the context. This binary information is transmitted over a channel and is received by a receiver. The channel introduces a number of effects such as attenuation, distortion, interference and noise, which inevitably results in the receiver making errors [4,5]. The channel however is most easily modeled as simply introducing errors in the form of flipping bits from either a 0 to a 1 , or vice versa. The receiver then receives the incoming data, and makes bit by bit decisions on whether or not it received a 0 or a 1. This is shown in Figure 2-1.


Figure 2-1: Simple digital communication system

### 2.2. Error Control Coding

Transmitting raw binary information over a channel provides no means of detecting errors which inevitably occur. The reason for this is because any combination of 0 s and 1 s on the channel is considered valid. Due to the addition of noise (that is modeled as simply flipping bits) the receiver can never really be certain if the bits it received were the ones that were sent.

A typical example would be copying a file from one computer to another. If the data was transmitted byte by byte, where one byte is 8 -bits, all $2^{8}=256$ possible bit
combinations of 0 and 1 would be considered valid. Thus if the source computer transmits 00000000 and the destination computer receives 00000001 (as a result of channel noise), the destination computer would have to assume that this was the data transmitted and the file would now contain errors.

The solution then is to reduce the number bytes considered valid. For instance, if out of the 256 possible bit combinations only 00000000 and 11111111 were considered valid, the receiver would now have a way of detecting errors on the channel. The destination computer would now know that byte 00000001 is invalid (error detection), and furthermore could infer that the intended byte was 00000000 (error correction). In fact, in this specific case, the channel can add as many as three bit errors per byte, such as 00000111 , and the destination computer could still infer that this byte was 00000000. (Note that four bit errors such as 00001111 would result in the receiver being unable to determine the intended byte, and five or more bit errors such as 00011111 would result in the receiver mistakenly inferring that 11111111 was the intended byte). This ability to detect and correct errors comes at the expense of increased redundancy. With only two valid bytes available for transmission only $\log _{2}(2)=1$ bit of information is transmitted with each byte. Therefore in order to transmit the data byte 10101010, the source computer would need to transmit the eight bytes 11111111, 00000000, 11111111, $00000000,11111111,00000000,11111111,00000000$. Thus the source computer must transmit eight times as much information, which results in a redundancy of $800 \%$ or an efficiency of $12.5 \%$, to send one byte of data $[1,2,3]$. As a result this system can be considered as emitting l-bit source words (SW) that are mapped to 8 -bit code words (CW). This is an example of error control (EC) coding and is shown in Figure 2-2.


Figure 2-2: Digital communication system with EC coding

Redundancy is an essential part of EC coding since the more redundancy added the more errors that can be corrected. For example, if instead of a $1: 8$ mapping a 1:16 or 1:32 mapping was used, the destination computer could correct as many as seven or fifteen bit errors per CW respectively. Conversely, the more redundancy added the more inefficient the system becomes. This is known as the coding tradeoff.

There is also no restriction that requires SWs to be 1-bit. As will be seen in this thesis, mappings such as 4 -bit SWs to 7 -bit CWs, 11 -bit SWs to 15 -bit CWs and higher can be used. This mapping can be increased to any level desired, and as stated in Shannon's work, we can drive our error rates as low as we choose [12]. In either case, the fundamental concept of EC coding is that more bits are being sent on the channel then is needed to convey the original message, but this is being done in order to detect errors, and in many cases correct errors.

The previous example demonstrates the fundamental concepts and motivations of EC coding. The following sections introduce terms and equations required for analyzing and comparing various forms of EC coding.

### 2.3. Goals of Error Control Coding

Goals of EC coding include:

1. Detect and/or correct as many bit errors as possible
2. High efficiency
3. Simple scheme
4. Reduce error extension (to be defined)

Some of these goals are contradictory, since to improve efficiency usually means reducing the error controlling ability and vice versa. On the other hand, attempting to improve both would be at the cost of increased encoding and decoding complexity.

### 2.4. Block Codes

The example presented in Section 2.2 broadly illustrates a simple block code. In block codes the source data is grouped in sequential blocks of $k$ bits defined as a source word (SW). Each $k$ bit block is then encoded into an $n$ bit block defined as a code word (CW), where $n$ is larger than $k[1,2,3]$. This is often referred to as SW to CW mapping. The resultant code is called an ( $n, k$ ) block code and thus in the example above, the code would be called an $(8,1)$ block code.

The extra ( $n-k$ ) bits added are called parity check bits. This is the redundant information added to the source data that allows for error detection and/or correction. If the CW is constructed in such a manner that the SW appears at the beginning of the CW , it is said to be a systematic code. Figure 2-3 shows a systematic CW.


Figure 2-3: Systematic CW

As shown in Figure 2-4, the example in Section 2.2 used systematic CWs since the original SW appears at the beginning of the CW.

> | 0 | 00000000 |
| :--- | :--- |
| 1 | 11111111 |

Figure 2.4: Mapping of the $1: 8$ code

### 2.5. Parity Check Codes

The redundant information that is added to the source data is usually done in a logical fashion. The goal is to have each CW unique, and different from all other CWs by the greatest amount possible. One way to do this is with parity check codes.

It is convenient to represent an $(n, k)$ block code in matrix form as a row vector whose elements are the code symbols (bits). The original SW in vector form can be represented as $d$, and the CW in vector form as $c$ as shown in Equation 2.1.

$$
\begin{align*}
& d=\left[d_{1}, d_{2}, \ldots d_{k}\right] \\
& c=\left[c_{1}, c_{2}, \ldots c_{k}, \ldots c_{n}\right] \tag{2.1}
\end{align*}
$$

As described in Section 2.4, in a systematic code the first $k$ bits of $c$ would equal the $k$ bits of $d$. The remaining $n-k$ parity check bits are created through addition over a finite field of length 2 , known as $\mathrm{GF}(2)$, which is also defined as modulo-2 addition, or XORing in programming notation. Let $m=n-k$ and let the systematic $C W$ vector $c$ be created as follows:

$$
\begin{align*}
& c 1=d 1 \\
& c 2=d 2 \\
& \vdots \\
& c k=d k \\
& c_{k+1}=p_{11} d_{1}+p_{12} d_{2}+\ldots+p_{1 k} d_{k}  \tag{2.2}\\
& c_{k+2}=p_{21} d_{1}+p_{22} d_{2}+\ldots+p_{2 k} d_{k} \\
& \vdots \\
& c_{k+m}=p_{m 1} d_{1}+p_{m 2} d_{2}+\ldots+p_{m k} d_{k}
\end{align*}
$$

The coefficients $p_{i j}$ can be collected into a binary parity matrix $P$ that describes which SW bits are used to form the parity check bits. This logical mapping allows for $2^{k}$ unique SWs to be mapped to $2^{k}$ unique CWs out of a possible $2^{n}$ words. The SW to CW mapping can also be pre-computed and stored in a large look-up table (LUT) in computer memory, in which case each SW is treated as an index into the LUT, and the output is the CW .

### 2.6. Generator Matrix

Using a LUT to assign SWs to CWs becomes unmanageable as the CW size grows and the number of possible SWs increases. This is because memory size and access time start to become a factor. An alternative is to represent the above operation in matrix form as shown in Equation 2.3 and 2.4.

$$
\begin{gather*}
c=d G=\left[d_{1}, d_{2}, \ldots, d_{k}\right]\left[\begin{array}{cccccccc}
1 & 0 & \ldots & 0 & p_{11} & p_{21} & \ldots & p_{m 1} \\
0 & 1 & \ldots & 0 & p_{12} & p_{22} & \ldots & p_{m 2} \\
\ldots & \ldots & \ldots & \ldots & \ldots & \ldots & \ldots & \ldots \\
0 & 0 & \ldots & 1 & p_{1 k} & p_{2 k} & \ldots & p_{m k}
\end{array}\right]  \tag{2.3}\\
c=d G \text { where }  \tag{2.4}\\
G=\left[\begin{array}{lll}
I_{k} & P^{T}
\end{array}\right]
\end{gather*}
$$

Here $G$ is known as the generator matrix and the CWs can now be generated on the fly using Equation 2.4. This provides a memory saving since only the generator matrix needs to be stored instead of all the CWs. This comes at the cost of added complexity in the form of matrix multiplications. For instance, the following could be a generator matrix with $k=4$ :

$$
G=\left[\begin{array}{lllllll}
1 & 0 & 0 & 0 & 1 & 0 & 1  \tag{2.5}\\
0 & 1 & 0 & 0 & 1 & 1 & 1 \\
0 & 0 & 1 & 0 & 1 & 1 & 0 \\
0 & 0 & 0 & 1 & 0 & 1 & 1
\end{array}\right] \text { where } I_{k}=\left[\begin{array}{llll}
1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 \\
0 & 0 & 0 & 1
\end{array}\right] \text { and } P=\left[\begin{array}{llll}
1 & 1 & 1 & 0 \\
0 & 1 & 1 & 1 \\
1 & 1 & 0 & 1
\end{array}\right]
$$

Using Equation 2.3 with generator matrix $G$ from Equation 2.5, Table 2.1 shows all possible systematic CWs. Notice that the SW is left intact (bolded) inside the CW .

Table 2-1: Systematic CWs formed by Equations 2.3 to 2.5

| Index | SW | $C W$ | Index | SW | $C W$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 0 | 0000 | 0000000 | 8 | 1000 | 1000101 |
| 1 | 0001 | 0001011 | 9 | 1001 | 1001110 |
| 2 | 0010 | 0010110 | 10 | 1010 | 1010011 |
| 3 | 0011 | 0011101 | 11 | 1011 | 1011000 |
| 4 | 0100 | 0100111 | 12 | 1100 | 1100010 |
| 5 | 0101 | 0101100 | 13 | 1101 | 1101001 |
| 6 | 0110 | 0110001 | 14 | 1110 | 1110100 |
| 7 | 0111 | 0111010 | 15 | 1111 | 1111111 |

### 2.7. Parity Check Matrix

Decoding is done by comparing the incoming CW (which possibly has errors) with all possible CWs. The CW that is the closest match is assumed to be the one that was transmitted. It is then simply a matter of removing the $(n-k)$ parity bits to extract the SW. This is known as maximum likelihood decoding [2]. This was seen in the example of Section 2.2, where receiving 00000111 would result in CW 00000000 being inferred and SW 0 being decoded. Conversely receiving 00011111 would result in CW 11111111 being inferred and SW 1 being decoded.

As the number of CWs increase however, this comparison operation can become too time consuming. A more elegant approach is to use the Parity Check Matrix $H$. Since $G$ can be considered to create a $k$-dimensional sub-space of CW vectors, its dual space is of dimension $(n-k)$ and can be generated by $(n-k)$ linearly independent vectors that are all orthogonal to the subspace generated by the generator matrix. These vectors can be combined into an $(n-k)$ by $n$ matrix $H$, such that

$$
\begin{equation*}
G H^{T}=0 \tag{2.6}
\end{equation*}
$$

The parity check matrix $H$ can be formed in a manner similar to how $G$ is formed using the parity matrix $P$.

$$
\begin{align*}
& G=\left[I_{k} P^{T}\right]  \tag{2.7}\\
& H=\left[\begin{array}{ll}
P & I_{n-k}
\end{array}\right] \tag{2.8}
\end{align*}
$$

such that

$$
G H^{T}=\left[\begin{array}{ll}
I_{k} & P^{T}
\end{array}\right]\left[\begin{array}{c}
P^{T}  \tag{2.9}\\
I_{n-k}
\end{array}\right]=P^{T} \oplus P^{T}=0
$$

### 2.8. Syndrome Decoding

Since every CW generated by $G$ is orthogonal to all vectors in $H$, then $c H^{T}=0$, i.e. the result is the all zero vector. Furthermore a received $\mathrm{CW} r$ that has been corrupted with errors such that it is no longer a valid CW will no longer be orthogonal and $r H^{T} \neq 0$, i.e. the result will not be the all zero vector.

Denoting $r$ as the 1 by $n$ received vector that results from sending the code vector $c$ over a noisy channel, and considering the case of a single error in the $i^{\text {th }}$ position, then

$$
r=c+e \quad \text { where } e=\left[\begin{array}{llllll}
0 & 0 & 0 & 1 & 0 & 0 \tag{2.10}
\end{array}\right)
$$

Since the channel is modeled as simply flipping a bit, this can be represented in matrix form as shown in Equation 2.10, where an error vector $e$ is added to the original CW over GF(2). Next, evaluating $r H^{T}$ yields

$$
\begin{equation*}
r H^{T}=(c+e) H^{T}=c H^{T}+e H^{T}=0+e H^{T}=e H^{T}=s \tag{2.11}
\end{equation*}
$$

The 1 by $m$ vector $s$ is called the syndrome of the received vector $r$. A syndrome that is all zero indicates that $r$ is a code vector and is presumably correct. A non-zero syndrome indicates that an error occurred.

Using the syndrome $s$ and noting that $e H^{T}$ is the $i^{\text {th }}$ row of $H^{T}$ [2], the error position can be identified by comparing $s$ to the rows of $H^{T}$. Error correction can now be done simply by flipping the $i^{\text {th }}$ bit in the received CW. Decoding by this simple comparison is called syndrome decoding.

### 2.9. Hamming bound

With syndrome decoding an $(n, k)$ linear block code can correct up to $t$ errors per CW if $n$ and $k$ satisfy the following Hamming bound.

$$
\begin{equation*}
2^{n-k} \geq \sum_{i=0}^{t}\binom{n}{i} \tag{2.12}
\end{equation*}
$$

where

$$
\begin{equation*}
\binom{n}{i}=\frac{n!}{i!(n-i)!} \tag{2.13}
\end{equation*}
$$

A block code for which the equality holds is known as a perfect code. Note that satisfying the Hamming bound is necessary but not sufficient for the construction of a $t$ error correcting parity check code.

### 2.10. Hamming Distance

The Hamming distance $d\left(c_{i}, c_{j}\right)$ between two CWs $c_{i}$ and $c_{j}$ from the same code is defined as the number of positions in which their elements differ. The Hamming weight $w\left(c_{i}\right)$ of a CW $c_{i}$ is defined as the number of 1 s in $c_{i}$. Thus the Hamming weight of $c_{i}$ is the Hamming distance between $c_{i}$ and 0 (the all-zero CW ), that is

$$
\begin{equation*}
w\left(c_{i}\right)=d\left(c_{i}, 0\right) \tag{2.14}
\end{equation*}
$$

Similarly the Hamming distance can be rewritten in terms of Hamming weight as

$$
\begin{equation*}
d\left(c_{i}, c_{j}\right)=w\left(c_{i}+c_{j}\right) \tag{2.15}
\end{equation*}
$$

The minimum distance $d_{\text {min }}$ of a linear block code is defined as the smallest Hamming distance between any pair of CWs in the code. An ( $n, k$ ) linear block code can correct up to $t$ errors if and only if

$$
\begin{equation*}
d_{\min } \geq 2 t+1 \tag{2.16}
\end{equation*}
$$

### 2.11. Linear Code Properties

Almost all useful block codes possess the property of linearity. It is defined as

1. Multiplying a CW by a valid scalar produces another CW
2. Adding two CWs together also produces another CW.

When dealing with binary symbols a valid scalar is 0 or 1 , and hence multiplication by 0 produces the all-zero CW , and multiplication by 1 leaves the CW unchanged. Addition with binary symbols over $\mathrm{GF}(2)$ means that adding a CW with itself will result in the all zero CW , and the addition of two different CW s will produce another valid CW . For instance if $c_{i}$ and $c_{j}$ are two CWs, then the addition $c_{i}+c_{j}$ must also be a CW. Since $c H^{T}=0$ then $\left(c_{i}+c_{j}\right) H^{T}=0$. As well since $c_{i}+c_{j} \neq 0, c_{i} H^{T}=0$, and $c_{j} H^{T}=0$, then $c_{i}+c_{j}$ must also be a valid CW .

### 2.12. Hamming Codes

Hamming codes were the first linear block codes published on error correction. They have a minimum distance $d_{\text {min }}=3$, and thus they have the ability to correct single
bit errors per CW. They are also perfect codes by definition of the Hamming bound and they belong to a subset of linear block codes known as cyclic codes. This subset of codes is popular since encoding and decoding can be done in a simple fashion.

### 2.13. Cyclic Codes

Cyclic codes are a subset of linear block codes. In addition to the properties of linearity, they contain an additional property that if they are shifted cyclically (circularly shifted or rotated in programming notation), the result is also a CW. For instance, Table 2-1 above is showing a $(7,4)$ Hamming code, where it can be seen that CW1 is $[0,0,0,1,0,1,1]$, which when shifted left forms CW2, CW5 and then CW11. Further cyclic shifting, which involves wrapping the most significant bits back to the least significant position, gives CW6, CW12 and then CW8 before forming CW1 again. This cyclic nature of the CWs can be exploited to form simpler encoding and decoding configurations versus the matrix approaches used to this point. This effect is also easier to demonstrate when error control codes are represented using polynomials. The following subsections demonstrate how binary digits can be represented as polynomials, and how simple algebraic operations such as addition, subtraction, multiplication and division can be performed. Note that since the binary alphabet only has two symbols, all algebraic operations are performed in $\mathrm{GF}(2)$. As a result addition is the same as subtraction, and this operation can be implemented with XOR gates in hardware. Similarly, multiplication and division can be accomplished through shifting, which is done using feed-back and feed-forward shift registers respectively.

### 2.13.1. Polynomial representation of codes

$$
\begin{equation*}
101101=1 x^{5}+0 x^{4}+1 x^{3}+1 x^{2}+0 x^{1}+1 x^{0}=x^{5}+x^{3}+x^{2}+1=d(x) \tag{2.17}
\end{equation*}
$$

### 2.13.2. Addition and Subtraction

$$
\begin{align*}
& r(x)=c(x)+c(x) \\
& r(x)=\bmod \left(\left(x^{5}+x^{3}+x^{2}+1\right)+\left(x^{5}+x^{3}+x^{2}+1\right), 2\right)  \tag{2.18}\\
& r(x)=\bmod \left(\left(2 x^{5}+2 x^{3}+2 x^{2}+2\right), 2\right)=0
\end{align*}
$$

### 2.13.3. Multiplication

$$
\begin{align*}
& c(x)=g(x) d(x) \\
& c(x)=x^{2}\left(x^{5}+x^{3}+x^{2}+1\right)=x^{7}+x^{5}+x^{4}+x^{2} \tag{2.19}
\end{align*}
$$

### 2.13.4. Division

$$
\begin{aligned}
& d(x) / g(x)=q(x) \text { and } r(x) \\
& \begin{array}{cc}
x^{3}+x & \leftarrow(x) \\
x^{3}+x+1 \overline{x^{6}+x^{3}} & 1011 \frac{1010}{\frac{x^{6}+x^{4}+x^{3}}{x^{4}}} 4 q(x) \\
\frac{x^{4}+x^{2}+x}{x^{2}+x} \leftarrow r(x) & \frac{1011}{1000} \\
& \frac{1011}{110} \leftarrow r(x)
\end{array}
\end{aligned}
$$

With division there is a quotient $q(x)$ and a remainder $r(x)$.

### 2.14. Encoding of Cyclic Codes

To encode cyclic codes first choose a generator polynomial ${ }_{[2]} g(x)$ such that

1. It is of degree $(n-k)$, in order to have $n-k$ redundant symbols.
2. It is a factor of $x^{n}+1$, to ensure that the code is cyclic.

The simplest way to create CWs is to multiply the SW $i(x)$ of length $k$ by the generator polynomial $g(x)$, i.e. $c(x)=i(x) g(x)$. To match the systematic form however the data bits must remain intact, and the parity bits must be simply concatenated. Thus the first step is to shift the data bits $i(x)$ up by $(n-k)$, i.e. $i(x) x^{n-k}$. Then since it still needs to be a multiple of $g(x), i(x) x^{n-k}$ can be divided by $g(x)$ to see if the remainder $r(x)$ is zero. If it is not zero then $i(x) x^{n-k}$ must be modified so that it is a multiple of $g(x)$. The simplest way to do this is to subtract the non-zero remainder from the original dividend [2]. This operation is shown in Equation 2.21 noting that addition and subtraction are the same operation in GF(2).

This procedure works since this is an application of Euclidean division which states that the dividend $=(q u o t i e n t)($ divisor $)+$ remainder. Since $i(x)$ was premultiplied by $x^{n-k}$, the degree of the remainder will always be less than the degree of the divisor and the dividend terms will all be greater than or equal to $(n-k)$. Consequently adding the divisor to the quotient will not modify any terms in $i(x)$, maintaining systematic form. As well, choosing the divisor to be the generator polynomial $g(x)$ ensures that the result is always a CW . In other words, the polynomial representation of a CW can be represented by

$$
\begin{equation*}
c(x)=i(x) x^{n-k}+R_{g(x)}\left[i(x) x^{n-k}\right] \tag{2.21}
\end{equation*}
$$

where $R_{g(x)}$ [] represents taking the remainder after division through $g(x)$. All of this work is neatly represented by the circuit in Figure 2-5 which consists only of shift registers and XOR gates.


Figure 2-5: Cyclic code encoder

This circuit represents a simultaneous multiplication by $x^{3}$, and a division by $x^{3}+x+1$. In order to build a systematic CW the registers are initially cleared, then the $i(x)$ bits are shifted through one at a time. After all $k$ bits have been shifted in, the registers will contain the remainder. Thus using the circuit shown in Figure 2-5 to encode $i(x)=x^{3}+1$, or in binary 1001, with the generator polynomial $g(x)=x^{3}+x+1$, is identical to the process shown above in Equation 2.20 under polynomial division. For example if $k=4$ and $n=7, i(x)$ is first multiplied by $x^{7-4}$ to yield $x^{6}+x^{3}$, and then this dividend is divided by $g(x)$. The final step is to append the remainder $r(x)=x^{2}+x$ to form the CW $c(x)=x^{6}+x^{3}+x^{2}+x$, or in binary 1001110.

Inspection of the $\mathrm{CW} c(x)$ created above with the generator polynomial $g(x)=x^{3}+x+1$ shows that it is the same CW created with the generator matrix $G$ in Equation 2.5, i.e. CW9 in Table 2-1. Furthermore this generator polynomial can be used to construct all the CWs shown in Table 2-1. Thus while the explanation of cyclic codes is more difficult, the implementation is simpler and these encoders can be easily implemented with digital circuits using very basic logic components.

### 2.15. Decoding of Cyclic Codes

The received signal from the channel is a linear combination of the $\mathrm{CW} c(x)$ plus an error vector $e(x)$, denoted $y(x)=c(x)+e(x)$. Since all CWs are multiples of $g(x)$, testing if the received word is valid is as simple as dividing it by $g(x)$ and checking if the remainder $r(x)$ is zero. If it is, then it can be assumed that there are no errors. If $r(x)$ is not zero then there was at least one or more errors present. This remainder is once
again called the syndrome. Equation 2.22 demonstrates how the received vector $y(x)$ is divided by the generator polynomial $g(x)$ and any non-zero remainder is a result of a nonzero error vector $e(x)$. If the error vector $e(x)$ was zero then the syndrome $s(x)$ will also be zero, and it can be assumed that there were no errors.

$$
\begin{equation*}
s(x)=R_{g(x)}[y(x)]=R_{g(x)}[c(x)+e(x)]=0+R_{g(x)}[e(x)] \tag{2.22}
\end{equation*}
$$

This syndrome can be found through division with the encoder circuit shown in Figure 2-5. A benefit of cyclic codes is that it is not necessary to keep track of all possible syndromes as it was with syndrome decoding. In fact, only the syndrome that represents an error vector in the most significant bit (MSb) position needs to be tracked. This is because a left cyclic shift of the error pattern results in a syndrome that changes by the amount equivalent to $R_{g(x)}[x \cdot s(x)]$ [2]. What this means is, if a non-zero syndrome is present in the shift registers, each clock of the division circuit is equivalent to a left cyclic shift of the error pattern. For instance, assume a corrupted word $y(x)$ has an error in its $5^{\text {th }} \mathrm{MSb}$, i.e. $e(x)=[0,0,1,0,0,0,0]$. When it is clocked into the divisor circuit a syndrome will be formed that indicates this error. Now a further clocking of the divisor circuit will result in a left cyclic shift of the error vector to $e(x)=[0,1,0,0,0,0,0]$, and thus, a syndrome will be formed indicating an error in the $6^{\text {th }} \mathrm{MSb}$. One further clocking of this circuit then will produce a syndrome indicating an error in the $7^{\mathrm{th}} \mathrm{MSb}$, i.e. $e(x)=[1,0,0,0,0,0,0]$. Thus, as long as the circuit is monitoring for this syndrome it can correct the flipped bit. To check for an error in the MSb , the syndrome to monitor for when $g(x)=[1,0,1,1]$ is shown in Equation 2.23.

$$
\begin{equation*}
s(x)=R_{g(x)}[1,0,0,0,0,0,0]=[1,0,1] \tag{2.23}
\end{equation*}
$$

Therefore it will take exactly $n$ clocks to form the original syndrome, and then $n-1$ more clocks to cycle through all the remaining $n-1$ possible error vectors [2].

The practical significance of this result is that the circuit only needs to look for a single error pattern and continue clocking the division circuit until the syndrome for this error pattern is matched. Once this happens it simply corrects the erroneous bit by inverting it. On the other hand if the syndrome is never matched it can be assumed that there were no errors. Decoders using this principle are called Meggitt Decoders [2].

The circuit in Figure 2-6 shows how the above process can be achieved using very simple digital circuits. For example, taking the $C W C(x)=[1,0,0,1,1,1,0]$, and flipping the $5^{\text {th }} \mathrm{MSb}$ as described gives $y(x)=[1,0,1,1,1,1,0]$. Figure 2-6 shows that for the first $n=7$ clocks the corrupted $y(x) \mathrm{CW}$ is fed in, which forms the original syndrome $s(x)=[1,1,0]$ as shown at $c l k=7$. As the last bit of $y(x)$ is being fed in the first bit of $y(x)$ is leaving the buffer of length $n-1$, where it is XORed with the syndrome $s(x)$ through the boolean circuit. Had there been an error in the MSb of $y(x)$ the syndrome would have been 101 causing the output of the AND gate to be high, thus flipping or correcting this erroneous bit.

Hence it can be seen from Figure 2-6 that on the $9^{\text {th }}$ clock $(c l k=9)$, the syndrome $s(x)=[1,0,1]$ has been matched and the erroneous bit has been corrected. The original CW $c(x)$ is then formed by ignoring the first $n-1$ bits that leave the buffer and keeping the last $n$ bits. Furthermore since this is a systematic code the original $\mathrm{SW} d(x)$ is recovered by removing the last $n-k$ bits. Clearly this circuit is simpler than decoding with the parity check matrix.


Figure 2-6: Meggitt decoder and operation correcting the $5^{\text {th }} \mathbf{M S b}$

### 2.16. Advantages of Error Control Coding

Before the use of error control coding the only way to improve the performance of a communication system was to increase the transmitted power, or increase the amount of bandwidth used, both of which come with a never ending financial penalty. However EC coding allows for the clever encoding of data which results in an improvement in performance using the current allotted bandwidth and transmit power, at the cost of added complexity. Thus EC codes do an excellent job of bringing the bit error rate down to any level desired, as long as the system can tolerate the added redundancy.

### 2.17. Drawbacks of Error Control Coding

The drawback of EC coding is that it is impossible to detect and correct all errors. Since this is a stochastic problem there is always some chance that errors will slip through the system undetected. In addition, when the number of errors in a received word exceed the error correcting abilities of the EC code, the error correction process may in fact introduce additional errors into the decoded sequence. This is called error extension and it occurs because the EC code is fooled by the excessive errors and incorrectly flips bits that did not have errors to begin with, consequently introducing additional errors into the system.

### 2.18. Constrained Sequence Coding

When digital pulse signals are conveyed over a practical transmission medium, certain bit patterns can be more prone to errors than others [13]. Knowing some characteristics about the channel and the type of corruption the signals will undergo allows for the information to be encoded prior to transmission to avoid transmitting those sequences that are likely to be decoded in error. These types of encoded sequences are being constrained and thus this type of coding is called Constrained Sequence (CS) coding. It is also known as Line Coding (LC) in digital transmission systems [4] and Recording Coding (RC) in digital storage systems. These terms will be used interchangeably.

### 2.19. Line Coding Basics

When a binary data stream is transmitted over a channel it is first converted into an electrical or light waveform through transmission encoding [4]. One of the simplest formats used is on-off coding also known as unipolar coding shown in Figure 2-7. Here a logic 1 is represented by a positive voltage and a logic 0 is represented by no voltage. This can also be interpreted as assigning a positive square pulse shape $p(t)$ to a logic 1 , and assigning no pulse to a logic 0 . This is the most prevalent signaling arrangement used in fiber optics as the laser is either on or off [14,15].


Figure 2-7: On-Off Signaling Formats

If the voltage is constant over the symbol period $T$, then the format is called NonReturn Zero (NRZ). If however there are too many like valued symbols in a row it is difficult to determine the end of one symbol and the start of the next. Thus some systems use Return Zero (RZ) signaling format to force the voltage to return to zero before the end of each symbol period. This comes at the cost of less energy per symbol, and thus a decreased signal to noise ratio. Both formats are shown above in Figure 2-7.


Figure 2-8: Polar Signaling Formats

Another commonly used format is polar coding, also known as bipolar coding shown in Figure 2-8. Following from the last example, a logic 1 is transmitted by the square pulse shape $p(t)$ and a logic 0 is transmitted by the inverted square pulse shape $-p(t)$. This is the most power efficient scheme since it requires the least power to achieve a given error probability [4].

Regardless of the transmission encoding used the serial bit stream requires periodic maintenance as it travels on the channel in order to combat the accumulation of noise and signal distortion. This requires that the signal be received, decoded, and regenerated for further transmission. Most often the receivers used are AC coupled as they are easier to design and capable of better performance [16,15]. The high-impedance amplifier in Figure 2-9 is often used in fiber optic transmission systems because it offers the lowest noise level and hence the highest detection sensitivity [16]. However, because of the high load impedance the frequency response of the amplifier is limited by the RC time constant at the input [15].


Figure 2-9: Simple high-impedance preamplifier design using a bipolar transistor

AC coupling in these receivers has the effect of blocking the average dc value of the signal. As a result a $50 \%$ duty cycle unipolar square wave will appear as a $50 \%$ duty cycle polar square wave on the other side of the capacitor as shown in Figure 2-10.


Figure 2-10: $\mathbf{5 0 \%}$ duty cycle square wave through a dc blocking capacitor

There is an additional effect caused by the large RC time constant. The high impedance receiver tends to integrate the detected signal as shown in Figure 2-11. This leads to a variation of the midpoint known as baseline wander [16].


Figure 2-11: AC coupled signal and baseline wander

When a series of like valued symbols arrive in succession, such as more 1s than 0 s , the transmission is said to be unbalanced. This causes the integration effect to become more severe as shown in Figure 2-12 because the signal decays towards zero [15,16,17]. Note that the integration effect has been exaggerated for demonstration purposes. Nevertheless it is clear that the AC coupled signal in Figure 2-12 would have an increased probability of error in the presence of noise. Thus in order to minimize baseline wander the output needs to be balanced. That is, there must be an equal number of 1 s and 0 s on average in the output sequence.


Figure 2-12: AC coupling of an unbalanced sequence

The receiver also must know the duration of a symbol interval (length of a bit), in order to determine when to make a decision regarding the value of each symbol. To avoid transmission of a separate clock signal receivers typically derive a clock from transitions in the received data stream. This is done by having the oscillator in the receiver lock on to level shifts in the received signal. If there is a long series of logic 1 s or 0 s in a row the receiver's oscillator frequency may drift and become unsynchronized. When this occurs the receiver can lose track of where it is supposed to sample the transmitted data, as shown in Figure 2-13. This is a more serious problem than bits simply being corrupted by the channel [1] as now the system is completely missing data.


Figure 2-13: Clock recovery circuit losing synchronization

Another reason to require adequate transitions is to reduce timing jitter [5]. Random deviations of the incoming pulses from their ideal locations are always present even in the most sophisticated systems [5]. The tuned circuits used for timing extraction are sensitive to the pulse patterns, and long strings of 0 s or 1 s introduce additional jitter as the oscillation slows [4,5]. As well, most data transmitted is packaged in a regular fashion known as framing. This typically consists in a set number of bits for header, payload and tail information. The framing patterns result in a periodic frame structure and these regular patterns can also cause pattern dependent jitter [4].

All of these factors collectively contribute to errors in the received signal. However when CS coding is introduced the transmission signal characteristics are improved in such a way that the regular patterns get randomized, and the long strings get broken up as the transition density is increased. Consequently the clock recovery in the receiving terminal is stabilized and jitter is reduced, decreasing the inter symbol interference (ISI) and improving the accuracy of data reception.

### 2.20. Line Coding Goals

Goals of line coding include:

1. Adequate transitions for timing recovery
2. Balanced transmission to reduce dc drift
3. Low jitter
4. High efficiency, i.e. low redundancy

## 5. Simple implementation

6. Low error extension
7. Good performance regardless of the source statistics

The unpredictable nature of the digital source makes achieving the above goals challenging. For instance, if the probability of the source outputting a 0 is very high, the outgoing sequence would naturally have long runs of 0 s . As a consequence the receiver would have very few transitions to lock onto. The outgoing sequence would also be heavily unbalanced which would lead to drift, jitter and ultimately errors. Thus it would be up to the CS code to modify this bit stream into a more suitable sequence for the channel, removing the long runs of like valued bits. A block diagram of a system incorporating CS coding, also known as line coding, is shown in Figure 2-14.


Figure 2-14: Digital communication system with CS coding

### 2.21. Line Coding approaches

Manchester Encoding is an almost ideal CS coding scheme in that it meets nearly all line coding goals. In this system a logic 1 is mapped to the two bit pattern 01 , and a logic 0 is mapped to the complement two bit pattern 10. This guarantees at least one transition per bit as well as a completely balanced transmission [5]. Even if a source constantly emitted logic 0 s the transmitted sequence would simply appear like a $50 \%$ duty cycle square wave. The drawback with this approach however is that with this $1: 2$ mapping the output sequence is $50 \%$ redundant.

4B/5B encoding is a line coding technique sometimes called block coding [13]. Here every four bits from the source are encoded into five bits for the channel. With $2^{4}=16$ possible SWs mapped to 16 of the $2^{5}=32$ possible CWs, a straightforward implementation results in the 5-bit CWs always having two 1 s even if the source data is all Os. This allows for clock synchronization and is more efficient than Manchester Encoding. A drawback however is that it is no longer balanced since CWs have an odd length.

5B/6B uses the same idea as $4 \mathrm{~B} / 5 \mathrm{~B}$ with the improvement that there are now three 0 bits and three 1 bits in each 6-bit CW. As a result the output is dc balanced. This coding scheme also provides an added error checking capability since invalid data patterns, such as more than three 0 s or three 1 s in a word can be detected.

There are many similar codes such as $7 \mathrm{~B} / \mathbf{8 B}, \mathbf{1 0 B} / \mathbf{1 2 B}$ and so on which map $m$ information bits into $n$ transmission bits and they are commonly referred to as $\boldsymbol{m B n B}$ codes where the $B$ denotes binary signaling. They are also known as Alphabetic Codes where the pre-selected set of $n$ bit CWs are called an alphabet $[1,4,5]$. The mapping is typically done using look-up tables stored in ROMs. The physical limitations of ROMs impose CW length restrictions and speed (access time) restrictions. Because of this they have not gained popularity in high bit rate systems.

Scrambling is a bit-level (or pulse-level) processing applied to a digital signal just prior to transmission on the channel. The objective of scrambling is to improve clock recovery as well as randomize the bit sequence as much as possible to remove long strings of 1 s or 0s. Scrambling is designed to work well regardless of source statistics and also does not affect the bandwidth requirements of the system [14,18]. Therefore the bit rate of the system is the same before and after scrambling and thus it has found great popularity in fiber optic systems.

Scrambling can be described with the same polynomial representation used to describe cyclic EC codes. For example, to scramble a stream of source data $s(x)$ with the scrambling polynomial $d(x)$ using Self Synchronous Scrambling (SSS), the data stream is multiplied by the degree of the scrambling polynomial and divided by $d(x)$. The scrambled sequence is the quotient [18] as shown in Equation 2.24.

$$
\begin{equation*}
c(x)=Q_{d(x)}\left[s(x) x^{d}\right] \tag{2.24}
\end{equation*}
$$

The scrambled sequence is formed using the same polynomial operations as were used with EC coding. For example, consider a sequence that contains a long run of zeros, i.e. $s_{0}(x)=01000000001$, and scrambling polynomial $d(x)=100101$. The scrambled sequence would be

$$
\begin{equation*}
c_{0}(x)=Q_{d(x)}\left[s_{0}(x) x^{d}\right]=00010010111 \tag{2.25}
\end{equation*}
$$

Clearly the scrambled sequence $c_{0}(x)$ meets our line coding goals far better than the unscrambled sequence. While scrambling can greatly increase the probability of avoiding long runs of like valued bits it does not provide the same guarantee that many block coding techniques such as $m B n B$ codes provide. It can be seen that there is still situations that can cause the transmitted sequence to contain long run of 1 s or 0 s . For instance, if $s_{1}(x)=01001010001001010100101$, the output sequence would be

$$
\begin{equation*}
c_{1}(x)=Q_{d(x)}\left[s_{1}(x) x^{d}\right]=01000000001000000100000 \tag{2.26}
\end{equation*}
$$

Here the unscrambled sequence appears better suited to meet our constrained sequence goals then the scrambled sequence. To account for this guided scrambling [14] is an extension and improvement of SSS which augments the SW prior to scrambling. As a result it is capable of guiding the outcome of an otherwise normal scrambling process to produce a highly efficient balanced bit stream [14].

### 2.22. Advantages of Constrained Sequence Coding

CS codes do an excellent job of ensuring sufficient transitions for timing recovery and encoding the output sequence to match channel constraints [18]. In this way they can be thought of as pre-conditioning the data in a suitable fashion for transmission on a real channel. This new data sequence will then suffer less corruption than had the data been sent uncoded [14,18,19].

CS coding is often thought of as spectral shaping [20] since the channel constraints are being met by modifying the frequency domain characteristics of the transmitted signal. In essence, CS coding is a proactive approach to preventing errors from occurring in the first place.

### 2.23. Disadvantages of Constrained Sequence Coding

The major drawback of CS coding comes from their inability to effectively handle errors that inevitably occur. When errors are present in the received sequence error propagation results as the CS decoder incorrectly decodes the message. In many cases this error extension can affect numerous subsequent CWs [18].

### 2.24. Combining Error Control Coding and Constrained Sequence Coding

Since EC coding and CS coding both aim to improve the accuracy of digital communication systems, many systems utilize both approaches to extract the benefits from the two coding techniques [6]. Figure 2-15 shows the configuration that is normally used. The idea is to apply an EC code first and then use CS coding to condition the information for the channel. The hope is that the overall number of errors will be reduced, and any errors that do occur can be corrected by the EC code.


Figure 2-15: Typical configuration that incorporates EC coding and CS coding

The difficulty with this approach arises during demodulation/decoding. When errors inevitably occur the CS decoder has no means of dealing with them. As a result it incorrectly decodes the incoming sequence, in effect increasing or multiplying the number of errors $[6,7,8]$. This error multiplication can be thought of as turning a single error into a burst (multiple bit errors) [7]. Hence the error correcting code must be a burst correcting code even when noise in the channel is dominated by random errors [7]. In general more redundancy is needed to correct $t$ error bursts than to correct $t$ random errors [2]. This means that there is a greater cost in efficiency and complexity with the above approach. Therefore a logical conclusion is that errors should be corrected prior to CS decoding to avoid the error multiplication as shown in Figure 2-16.

However straightforward exchange of the decoders is impossible since the required order of operations is no longer followed. The EC decoder would not recognize any of the words arriving off of the channel since the CS encoder would have modified the EC CWs for the channel before they were transmitted. The EC decoder would then mistakenly try to correct these words unintentionally introducing more errors. Finally, since the CS decoder is now last, it would receive words that it would also not recognize (as a result of the EC decoder) and most likely introduce additional errors when it attempts to remove the effect of the line code.


Figure 2-16: Desired configuration that incorporates EC coding and CS coding

Channel coding researchers are now investigating ways of combining the above codes into one monolithic code to avoid this error multiplication and extract the benefits of both. Several methods are similar in their approach of adding additional redundancy in the form of multimode coding, where one SW is mapped to a selection set of CWs [7], or through appending additional CS bits to an existing EC code [10], or through augmentation of EC and CS code bits [6]. These codes are sometimes called combi-codes [21] or CC-EC codes [11] and are depicted in Figure 2-17.


Figure 2-17: Combined approach that incorporates EC coding and CS coding

This thesis introduces a simple approach to achieving the above goal of integrating EC coding with CS coding that is easily implemented and has good performance tradeoffs. The approach uses linear cyclic block codes and the simple family of Hamming codes for proof of concept. CS coding is achieved through multimode coding and this new technique is thoroughly analyzed, simulated, and implemented in hardware using field programmable gate arrays (FPGAs).

## 3. Integrating EC coding and CS coding

Creating a combined code is not straightforward since the methodologies behind EC coding and CS coding are different. Error control CWs are designed to be as different from one another as possible in order to increase the probability of detecting and/or correcting the most errors. Therefore CWs such as the all-zero CW $c(x)=0000000$ or the all-one $\mathrm{CW} c(x)=1111111$ are often present. These CWs have no transitions and they are completely unbalanced, but clearly it would take multiple flipped bits (many errors) to mistake one for the other. Constrained sequence CWs on the other hand are designed to give balanced output and ensure an adequate number of transitions. Thus with properties like these CWs often exist that only differ by a single bit. Consequently single bit errors can often make one CW appear as another.

In order to create a combined code one of the following two approaches can be employed. More redundancy can be added to an existing CS code, to increase the distance between the CWs in an attempt to turn it into an EC code. Alternatively, more redundancy could be added to an existing EC code in the form of restricting the CWs that are used to only those that meet the systems CS coding requirements in an attempt to turn it into a CS code.

If the latter approach is taken and a typical ( $n, k$ ) error correcting block code is used, inspection of the $2^{k}$ possible CWs will show that many of them meet the CS coding goals outlined in Section 2.20. Therefore if transmission is restricted to this subset of EC CWs, and since these CWs can natively correct errors, a combined EC and CS code is obtained. This is the novel approach introduced in this thesis, which uses the EC CWs themselves as the CS code.

While the above approach qualifies as a combined EC and CS code, straightforward implementation presents some difficulties. For example, the mapping of SWs to the restricted set of CWs can be accomplished through look-up tables. However this would result in the encoder being subject to all of the same ROM limitations as $m B n B$ codes, preventing its use in high bit rate systems. As well, many EC codes have

CWs that contain an odd number of bits. In these cases any individual CW will have more 1s than 0s and vice versa. Thus in a worst case situation where a source continually emits the same SW , the encoder would continually transmit the same odd length CW , resulting in a transmission that grows increasingly unbalanced. These problems need to be addressed, and the complete details for the combined EC and CS code presented in this thesis are presented through example in the next Section.

### 3.1. A Simple Scheme

The major flaw in the $1: 1$ mapping of SWs to CWs described above is that the system cannot guarantee balanced output. If however there existed the flexibility to send the CW, or the complement of the CW, this guarantee could be made. For example, examining the CWs of the $(7,4)$ Hamming code shown in Table $2-1$ shows that $c_{7}(x)=0111010$ and $c_{8}(x)=1000101, c_{6}(x)=0110001$ and $c_{9}(x)=1001110$, all the way to $c_{0}(x)=0000000$ and $c_{15}(x)=1111111$ are all CW complements of one other. Thus in a worst case situation where a source continually emits the same SW repeatedly, this system would remain balanced since the encoder could alternately transmit the original CW followed by its complement. That is $c_{7}(x)=0111010$ which has four 1 s and three 0 s could be transmitted, followed by $c_{8}(x)=1000101$ which has three 1 s and four Os, thus balancing the transmission. An encoder that maps one SW to a choice of more than one CW is classified as a multimode encoder, which was first discussed in Section 2.24. Specifically, this encoder is further classified as a bimode encoder since each SW is mapped to a choice of only two CWs.

This multimode encoder could be implemented without look-up tables and using the original digital logic gate encoders of Figure 2-5 by taking advantage of linear code properties of block codes. For example, from Section 2.11, any CW added to the all-zero CW will result in the original CW unchanged. However any CW added to the all-one CW will result in the CWs complement. For example, $c_{7}(x)=0111010$ added with $c_{0}(x)=0000000$ is $c_{7}(x)=0111010$ (no change), but $c_{7}(x)=0111010$ added with
$c_{15}(x)=1111111$ is $c_{8}(x)=1000101$ (every bit inverted). Thus based on this idea of CW addition, Figure 3-1 shows a possible setup of a bimode coding scheme where the encoder can choose to send the original CW or its complement.


Figure 3-1: A simple EC + CS encoder using bimode coding

From the discussion above SWs are still converted to CWs in the usual fashion. However with the above system, these CWs are then added to both the all-zero and the all-one CW. This has the effect of presenting the original CW and its complement to the Select Best decision block. The decision of selecting which CW to transmit is based on feedback from CWs already transmitted on the channel. Depending on these sequence statistics, the CW that keeps the overall number of 1 s and 0 s sent on the channel approximately equal will be chosen. Therefore with this setup, in a worst case situation where a source continually emits the same SW , such as the all-zero SW , this system can ensure balanced transmission. These output sequences are contrasted in Figure 3-2, which shows the original all-zero sequence listed as before $_{(7,4)}$, versus the improved sequence listed as after $r_{(7,3)}$. Clearly this is an improved sequence since it is balanced and there is a transition once every CW.

$$
\begin{array}{rllll}
\text { before }_{(7,4)} & 0000000 & 0000000 & 0000000 & 0000000 \\
\text { after }_{(7,3)} & 0000000 & 1111111 & 0000000 & 1111111
\end{array}
$$

Figure 3-2: The effect of the bimode coding: balanced output

As shown below in Table 3-1 this setup has SW 0 mapped to two CWs 0 and 15. Likewise, SW 1 is mapped to CWs 1 and 14 and so on. Hence SWs 8 to 15 can no longer be used to represent data, or in other words, the ability to achieve balanced transmission
has resulted in an additional bit of redundancy, thus modifying the $(7,4)$ Hamming code to a $(7,3)$ Hamming code (i.e. there are now $2^{3}=8$ possible SWs, 0 through 7 as listed in Table 3-1). Since the overall length of the code stayed the same, this is known as expurgating the code [2], i.e. one of the source bits is now acting like a parity bit. This extra parity bit is the MSb of each SW listed in Table 3-1.

Table 3-1: SW to CW mapping of this bimode (7,3) setup. Compare with Table 2-1

| Index | $S W$ | $C W_{A}$ | $\overline{C W_{A}}$ |
| :---: | :---: | :---: | :---: |
| (complement) |  |  |  |
| 0 | 0000 | $0000000_{(0)}$ | $1111111_{(15)}$ |
| 1 | 0001 | $0001011_{(1)}$ | $1110100_{(14)}$ |
| 2 | 0010 | $0010110_{(2)}$ | $1101001_{(13)}$ |
| 3 | $0011_{(3)}$ | $0011101_{(3)}$ | $1100010_{(12)}$ |
| 4 | 0100 | $0100111_{(4)}$ | $1011000_{(11)}$ |
| 5 | 0101 | $0101100_{(5)}$ | $1010011_{(10)}$ |
| 6 | 0110 | $0110001_{(6)}$ | $100111_{(9)}$ |
| 7 | $0111_{1}$ | $0111010_{(7)}$ | $1000101_{(8)}$ |

Decoding of these CWs is simple. Examination of Table 3-1 shows that the MSb of every CW is either a 0 or a 1 (shown in bold). This MSb is the information bit that is now acting as a parity bit. Thus the decoder can determine whether or not the received CW is the original or the complement, simply by inspecting the MSb. Thus as shown in Figure 3-3 the decoder first error corrects the received word, and then based on the value of the MSb , removes the CS coding by adding either the all-zero or all-one CW . Recovering the SW is then the simple operation of removing the parity check bits.


Figure 3-3: A simple EC + CS decoder using two AddCWs

There are two main advantages of this scheme. First of all the original encoder and decoder circuits which consist of simple boolean logic can still be used with only a slight modification for CS coding. Secondly, received words arriving off the channel avoid error propagation since they are error corrected first before removing the CS coding through CW addition. Thus the encoder of Figure 3-1 and decoder of Figure 3-3 demonstrate a combined EC and CS code that incorporates feedback to maintain balanced output. In this coding scheme the all-zero and all-one CWs used for CW addition are defined as Add Code Words (AddCWs). Therefore the encoder of Figure 3-1 and decoder of Figure 3-3 use two AddCWs.

### 3.2. Improved scheme

Looking once again at the output data stream after $_{(7,3)}$ in Figure 3-2 demonstrates that this setup can guarantee balanced transmission. However it also demonstrates that it cannot guarantee numerous transitions. For example, this bimode system presented above turns the all-zero sequence into an alternating sequence of seven zeros followed by seven ones. Therefore in this $(7,3)$ example there is a transition every seven bits since $n=7$. However if the system used a larger code such as the $(63,57)$ code expurgated to $(63,56)$, there would only be a transition every sixty-three bits since $n=63$. That is, the output sequence would alternately consist of sixty-three zeros followed by sixty-three ones. While the output would still be balanced, this situation is not desirable since so few transitions could pose problems for clock recovery. Therefore a way to introduce more transitions is required.

This can be accomplished using the same AddCW technique if the encoder has more CW choices per SW. This is because the encoder could choose to transmit the CW that not only balances the transmission but also contains the most transitions. Based on the CW addition technique above, this can be achieved by doubling the number of AddCWs. For example, consider the same worst case situation where the all-zero CW $c_{0}(x)=0000000$ is continually emitted. If the encoder has two additional AddCWs available for CW addition, such as $c_{4}(x)=0100111$ and $c_{11}(x)=1011000$ as shown in

Figure 3-4, then the Select Best decision block would have four choices for transmission that are: $\quad c_{\text {result } 0}(x)=0000000, \quad c_{\text {result } 1}(x)=0100111, \quad c_{\text {result } 2}(x)=1011000 \quad$ and $c_{\text {result }}(x)=1111111$. This is also a multimode scheme where one SW is mapped to a choice of four CWs.


Figure 3-4: A simple EC + CS encoder using four AddCWs

In this situation it is clear that $c_{\text {result } 1}(x)$ and $c_{\text {result } 2}(x)$ are better choices for transmission than $c_{\text {result } 0}(x)$ and $c_{\text {result } 3}(x)$ simply because they contain transitions within each CW. The encoder can now alternately choose $c_{\text {result } 1}(x)$ and $c_{\text {result } 2}(x)$ for transmission resulting in the improved output sequence listed as after $_{(7,2)}$ in Figure 3-5. Comparison with the original all-zero sequence original ${ }_{(7,4)}$ and the simple bimode scheme before $_{(7,3)}$ demonstrate how this multimode setup with double the number of AddCWs can introduce transitions and still maintain balanced output.

| original $_{(7,4)}$ | 0000000 | 0000000 | 0000000 | 0000000 |
| ---: | :--- | :--- | :--- | :--- |
| before $_{(7,3)}$ | 0000000 | 1111111 | 0000000 | 1111111 |
| after $_{(7,2)}$ | 0100111 | 1011000 | 0100111 | 1011000 |

Figure 3-5: The effect of the $\mathbf{( 7 , 2 )}$ scheme is balanced output and numerous transitions

This CW addition technique has taken the all-zero CW with zero transitions and a 7:0 ratio of 0 s to 1 s per CW , and produced a sequence of alternating CWs that have three
transitions and a $3: 4$ ratio and $4: 3$ ratio of 0 s to 1 s respectively. This technique is successful since the addition of a CW that has poor statistics (i.e. unbalanced with few transitions) to a CW that has better statistics (i.e. balanced with numerous transitions), results in a new CW whose statistics are an average of the two, upper bounded by the statistics of the better CW.

As show in Table 3-2, SW 0 is now mapped to four CWs, $0,4,11$ and 15. Likewise, SW 1 is now mapped to CWs $1,5,10$ and 14 and so on. Hence SWs 4 to 15 can no longer be used to represent data, or in other words, the ability to add transitions has resulted in an additional bit of redundancy over the previous bimode system. Now the first two bits of each SW can be viewed as redundant bits. Thus the $(7,4)$ Hamming code has now been expurgated to a $(7,2)$ Hamming code. Also note that these CWs still maintain the complementary nature of the system since $C W_{A}$ and $\overline{C W_{A}}$ as well as $C W_{B}$ and $\overline{C W_{B}}$ are complements.

Table 3-2: SW to CW mapping of this multimode (7,2) setup

| Index | $S W$ | $C W_{A}$ | $C W_{B}$ | $\overline{C W_{B}}$ | $\overline{C W_{A}}$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 0 | 0000 | $0000000_{(0)}$ | $0100111_{(4)}$ | $1011000_{(11)}$ | $1111111_{(15)}$ |
| 1 | 0001 | $0001011_{(1)}$ | $0101100_{(5)}$ | $1010011_{(10)}$ | $1110100_{(14)}$ |
| 2 | 0010 | $0010110_{(2)}$ | $0110001_{(6)}$ | $1001110_{(9)}$ | $1101001_{(13)}$ |
| 3 | 0011 | $0011101_{(3)}$ | $0111010_{(7)}$ | $1000101_{(8)}$ | $1100010_{(12)}$ |

Decoding of these CWs does not change significantly. Examination of Table 3-2 shows that the two MSbs of each CW are either $00,01,10$ or 11 (shown in bold), or in decimal notation this is $0,1,2$ or 3 . Thus the decoder now inspects the two MSbs of the error corrected words to determine which of the four AddCWs was added. Thus as shown in Figure 3-6, the decoder first error corrects the received CW, then based on the value of the two MSbs, removes the CS coding by adding the correct AddCW. Recovering the SW is then the simple operation of removing the parity check bits.


Figure 3-6: A simple EC + CS decoder using four AddCWs

Looking at the output data stream after $_{(7,2)}$ in Figure 3-5, it is clear that this system can guarantee balanced transmission and definitely improves the number of transitions. However there is another factor that needs to be considered which is the run of like valued bits across CW boundaries. For instance, when the concatenation of CWs is considered a run of four 1 s and four 0 s in still present in the output sequence as shown in Figure 3-7.

$$
\text { after }_{(7,2)} \frac{0100111101100001001111011000}{4} \frac{1}{4}
$$

Figure 3-7: Concatenation of CWs shows runs of four like valued bits

While this may not seem significant, consider that this is a small code of $n=7$, and this represents a run of like valued bits more than half the CW length. Thus in a longer code with $n=63$, this could mean a run of more than thirty-three like valued bits in a row, which could cause problems for clock recovery. Therefore a way to break up these runs of like valued bits is required.

This could be accomplished once again using the same AddCW techniques if the encoder had more CW choices per SW. This is because the encoder could choose to transmit the CW that not only balances the transmission and contains the most transitions, but also limits the runlengths of like valued bits. That is, building again on the previous example consider the same situation where the all-zero $\mathrm{CW} c_{0}(x)=0000000$ is continually emitted. If four additional AddCWs were available for CW addition, such as $c_{2}(x)=0010110, c_{6}(x)=0110001, c_{9}(x)=1001110$ and $c_{13}(x)=1101001$ as shown in

Figure 3-8, then the Select Best decision block would have eight choices for transmission that are: $\quad c_{\text {result }}(x)=0000000, \quad c_{\text {result } 1}(x)=0010110, \quad c_{\text {result } 2}(x)=0100111$, $c_{\text {result } 3}(x)=0110001, c_{\text {result } 4}(x)=1001110, c_{\text {result } 5}(x)=1011000, c_{\text {result } 6}(x)=1101001$ and $c_{\text {result }}(x)=1111111$. This is also a multimode scheme where one SW is mapped to a choice of eight CWs.


Figure 3-8: A simple EC + CS encoder using eight AddCWs

In this situation the Select Best decision block would select $c_{\text {result } 1}(x)=0010110$ and $c_{\text {result }}(x)=1101001$ for transmission since they not only ensure balanced transmission and contain numerous transitions, but a maximum runlength of two like valued bits is achieved. This is shown as after ${ }_{(7,1)}$ in Figure 3-9.

Comparison with the original $(7,4)$ all-zero sequence original ${ }_{(7,4)}$, simple bimode $(7,3)$ scheme before $_{(7,3)}$ and the four AddCW $(7,2)$ system before $_{(7,2)}$ demonstrate how this multimode setup with eight AddCWs can introduce transitions, maintain balanced output and constrain the output to a maximum runlength of two like valued bits.

| original $_{(7,4)}$ | 0000000 | 0000000 | 0000000 | 0000000 |
| ---: | :--- | :--- | :--- | :--- |
| before $_{(7,3)}$ | 0000000 | 1111111 | 0000000 | 1111111 |
| before $_{(7,2)}$ | 0100111 | 1011000 | 0100111 | 1011000 |
| after $_{(7,1)}$ | 0010110 | 1101001 | 0010110 | 1101001 |

Figure 3-9: SW to CW mapping of this $(\mathbf{7 , 1})$ setup

As shown in Table 3-3, SW 0 is now mapped to eight CWs, $0,2,4,6,9,11,13$ and 15 . Likewise, SW 1 is now mapped to CWs $1,3,5,7,8,10,12$ and 14. Hence SWs 2 to 15 can no longer be used to represent data, or in other words, the ability to decrease the runlengths has resulted in an additional bit of redundancy over the previous setup. Thus the $(7,4)$ Hamming code has now been expurgated to a $(7,1)$ Hamming code. Once again the CWs have still maintained the complementary nature of the system since $C W_{A}$ and $\overline{C W_{A}}$, as well as $C W_{B}$ and $\overline{C W_{B}}, C W_{C}$ and $\overline{C W_{C}}$ and $C W_{D}$ and $\overline{C W_{D}}$ are complements.

Table 3-3: SW to CW mapping of this multimode $(7,1)$ setup

| $I$ | $S W$ | $C W_{A}$ | $C W_{B}$ | $C W_{C}$ | $C W_{D}$ | $\overline{C W_{D}}$ | $\overline{C W_{C}}$ | $\overline{C W_{B}}$ | $\overline{C W_{A}}$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 0 | 0000 | $0000000_{(0)}$ | $0010110_{(2)}$ | $0100111_{(4)}$ | $0110001_{(6)}$ | $1001110_{(9)}$ | $1011000(12)$ | $1101001_{1(13)}$ | $1111111_{(15)}$ |
| 1 | 0001 | $0001011_{(2)}$ | $0011101_{(3)}$ | $0101100_{(5)}$ | $0111010_{(7)}$ | $1000101_{(8)}$ | $1010011_{(10)}$ | $1100010_{(12)}$ | $1110100_{(12)}$ |

Decoding of these CWs is identical to the previous decoders. Examination of Table 3-3 shows that the three MSbs of each CW are either $000,001,010011,100,101$, 110 or 111 . In decimal notation, this is $0,1,2,3,4,5,6$, or 7 . The decoder now checks the three MSbs to determine which of the eight AddCWs was added. Thus as shown in Figure 3-10, the decoder first error corrects the received CW, then based on the value of the three MSbs, removes the CS coding by adding the correct AddCW. Recovering the SW is then the simple operation of removing the parity check bits.


Figure 3-10: A simple EC + CS decoder using eight AddCWs

### 3.3. Tradeoffs

All three schemes presented yield balanced outputs, and when more AddCWs are available, increase the transition density. This can also be viewed as limiting the runlengths of like valued bits. These benefits however come at the cost of added redundancy. A quick comparison of the coding schemes is shown in Table 3-4, assuming equiprobable source statistics.

With 1 bit of redundancy for $\operatorname{CS}$ coding the $(7,3)$ code is balanced and has on average an $80 \%$ chance of runlengths that are three or less. However it still has a $20 \%$ chance of runlengths that are higher than this, with a $0.147 \%$ chance of a run of twelve (not shown in the table). With two bits of redundancy for CS coding, the $(7,2)$ code has a $96.41 \%$ chance of runlengths of three or less, and no possibility of a run longer than four. Finally with three bits of redundancy for CS coding, the $(7,1)$ code can guarantee runlengths of two or less with a probability of $100 \%$. Note that the $(7,4)$ code is completely unbounded in terms of runlengths, i.e. it is possible to have runlengths of any length, whose likelihood is dependent on the statistics of the source data stream.

Table 3-4: (7,x) code performance with equiprobable source statistics

| RunLengths | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
| :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- | :--- |
| $(7,3)$ Code | 0.2719 | 0.2609 | 0.2683 | 0.0364 | 0.0151 | 0.0049 | 0.0562 | 0.0378 | 0.0285 |
| $(7,2)$ Code | 0.3794 | 0.3571 | 0.2276 | 0.0357 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| $(7,1)$ Code | 0.4285 | 0.5714 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 |

Clearly with more AddCWs available the CS properties are improved. This however comes at a tradeoff of adding more redundancy.

### 3.4. Goals of the combined EC and CS system

The above examples demonstrate the combined EC and CS concept, whose multimode approach is made possible through CW addition. The following sections reiterate the most important goals, outline common pitfalls, and demonstrate the details of this approach and how it can be implemented with any linear block code.

The main goals of the system are:

1. Simple implementation;
2. Guarantee of the desired performance regardless of the incoming sequence statistics;
3. Addition of minimal redundancy into the encoded sequence.

Due to the cyclic nature of the simple Hamming EC code, encoding and decoding can be achieved using the simple logic circuits of Figure 2-5 and 2-6, i.e. look-up tables in ROMs are not required ensuring the use in high speed systems. The introduction of CS coding through CW addition can also be easily implemented using simple logic circuits, i.e. XOR gates since the addition is over GF(2).

Guaranteeing the desired performance of balanced transmission, numerous transitions and limiting the runlengths of like valued bits is achieved through feedback in the system. As discussed the system monitors the overall number of 0 s and 1 s sent on the channel to record transitions and runlengths, and then uses this information to determine the best CW for transmission. This feedback guarantees the system's CS goals will be maintained regardless of source statistics.

Finally, the $(7,4)$ code discussed so far was used because it is small and presentable, but it suffers from too much redundancy with this scheme. However this
redundancy is less of a factor when larger codes are used. For instance with three additional bits of redundancy (eight AddCWs), the $(15,11)$ code expurgated to $(15,8)$ is $46 \%$ redundant, but the $(31,26)$ expurgated to $(31,23)$ is only $25 \%$ redundant, and the $(63,57)$ expurgated to $(63,54)$ is only $14 \%$ redundant and so on. Therefore, the redundancy can be minimized to any desired level as larger and different codes are used.

### 3.5. Generic Systems

The following systems presented can be used with any linear block code and can be incorporated into existing digital circuits with a minimum of additional complexity.

Balanced transmission: As discussed through example in Section 3-1, balanced transmission can be achieved with a single additional bit of redundancy. The generic structure of the complementary bimode encoder is shown in Figure 3-11, and the corresponding decoder is shown in Figure 3-12.


Figure 3-11: Generic bimode complementary encoder using two AddCWs


Figure 3-12: Generic bimode complementary decoder using two AddCWs

With this setup the two AddCWs are recommended to be the all-zero and all-one CWs as shown in Figure 3-13. This gives the Select Best decision block the ability to transmit the original CW or its complement. (Note that other CWs could be used here and
this will be discussed below). Figure 3-13 also displays these CWs as $00 . .00$ and 11... 11 since their length is dependent on the $(n, k)$ block code used. Therefore if the $(7,4)$ Hamming code is used like in the examples of Section 3.1 and 3.2, then their length is $n=7$ bits. However if the $(1023,1014)$ Hamming code was used, then their length would be $n=1023$ bits.

$$
00 \ldots 00 \quad 11 \ldots 11
$$

Figure 3-13: AddCW format for a system using two AddCWs

In Figure 3-13 the MSbs of the two AddCWs (shown in bold) are 0 and 1 respectively, which as discussed in Section 3-1, is the extra bit of redundancy that allows the decoder to determine whether the received CW is the original or the complement. Strictly speaking then, for the decoder to function properly these two AddCWs do not necessarily have to be the all-zero and all-one CW. All that is required is that the two AddCWs used have their MSbs as 0 and 1 respectively. As long as this criteria is met, the decoder will be able to determine which AddCW needs to be added at the receiver in order to recover the original CW (i.e. remove the CS coding). Therefore Figure 3-14 shows the specific format that these AddCWs must take, where the $x$ 's represent any available CW that has the MSbs in this fashion.

$$
0 \mathrm{x} \ldots \mathrm{xx} \quad 1 \mathrm{x} \ldots \mathrm{xx}
$$

Figure 3-14: Format of the two AddCWs for proper decoder operation

Note that since the code is cyclic, not using the all-zero and all-one CWs as AddCWs only has the effect of changing the overall mapping of SWs to CW. For example, if the two AddCWs used were $c_{6}(x)=0110001$ and $c_{9}(x)=1001110$, the SW to CW mapping would be that shown in Table 3-5. This table can be compared with Table 3-1 where the AddCWs were the all-zero and all-one CW.

Table 3-5: Using AddCWs $c_{6}(x)=0110001$ and $c_{9}(x)=1001110$

| Index | $S W$ | $C W_{A}$ | $\overline{C W_{A}}$ |
| :---: | :---: | :---: | :---: |
| (complement) |  |  |  |
| 0 | 0000 | $0110001_{(6)}$ | $1001110_{(9)}$ |
| 1 | 0001 | $0111010_{(7)}$ | $1000101_{(8)}$ |
| 2 | 0010 | $0100111_{(4)}$ | $1011000_{(11)}$ |
| 3 | 0011 | $0101100_{(5)}$ | $1010011_{(10)}$ |
| 4 | 0100 | $0010110_{(2)}$ | $1101001_{(13)}$ |
| 5 | 0101 | $0011101_{(3)}$ | $1100010_{(12)}$ |
| 6 | 0110 | $0000000_{(0)}$ | $1111111_{(15)}$ |
| 7 | 0111 | $0001011_{(1)}$ | $1110100_{(14)}$ |

It is clear that with this new SW to CW mapping the output transmission will still be balanced, since in a worst case situation where a source continually emits the same CW such as the all-zero $\mathrm{CW} c_{0}(x)=0000000$, it would be mapped alternately to $c_{6}(x)=0110001$ followed by $c_{9}(x)=1001110$ and so on.

Note that if the two AddCWs were not chosen to be complements then balanced output can no longer be guaranteed. For example Table 3-6 shows the SW to CW mapping if the two AddCWs were $c_{6}(x)=0110001$ and $c_{8}(x)=1000101$.

Table 3-6: Using AddCWs $c_{6}(x)=0110001$ and $c_{8}(x)=1000101$

| Index | $S W$ | $C W_{A}$ | $\overline{C W_{A}}$ |
| :---: | :---: | :---: | :---: |
| 0 | 0000 | $0110001_{(6)}$ | $1000101_{(8)}$ |
| 1 | 0001 | $0111010_{(7)}$ | $1001110_{(9)}$ |
| 2 | 0010 | $0100111_{(4)}$ | $1010011_{(10)}$ |
| 3 | 0011 | $0101100_{(5)}$ | $1011000_{(11)}$ |
| 4 | 0100 | $0010110_{(2)}$ | $110000_{(12)}$ |
| 5 | 0101 | $0011101_{(3)}$ | $1101001_{(13)}$ |
| 6 | 0110 | $0000000_{(0)}$ | $1110100_{(14)}$ |
| 7 | 0111 | $0001011_{(1)}$ | $1111111_{(15)}$ |

In this case if the all-zero CW $c_{0}(x)=0000000$ is continually emitted it would be mapped alternately to $c_{6}(x)=0110001$ followed by $c_{8}(x)=1000101$ and so on, which are both unbalanced with a 0 to 1 ratio of $4: 3$. Hence this system would continually grow unbalanced. As a result, the AddCWs used in this bimode system must be chosen such that their MSbs are 0 and 1 respectively, and that they are CW complements. Therefore, since inspection of Table 3-1 and Table 3-5 shows that the only difference is the SW to CW mapping, for simplicity it is recommended that the AddCWs are always chosen to be the all-zero and all-one CW. In general the only requirement that this places on an EC code is that it contains the all-zero and all-one words as CWs.

Balanced transmission, Transitions: As discussed in Section 3-2, balanced transmission can be guaranteed and transition density can be improved with two additional bits of redundancy. Figure 3-15 shows the generic structure of the multimode complementary encoder that uses four AddCWs, with the corresponding decoder shown in Figure 3-16.


Figure 3-15: Generic multimode complementary encoder using four AddCWs


Figure 3-16: Generic multimode complementary decoder using four AddCWs

As with the previous setup in order for the decoder to know which AddCW was added at the transmitter, the AddCWs must follow the format shown in Figure 3-17. Since this system uses four AddCWs, two MSbs are used and they must be in the form $00,01,10$ and 11 , which in decimal is $0,1,2$ and 3 . Therefore strictly speaking, each AddCW could be any CW that meets this criteria.
00...xx 01...xx 10...xx 11...xx

Figure 3-17: Choosing an AddCW set with four AddCWs

However, from the previous discussion in order to guarantee balanced transmission the encoder needs the ability to send complementary CWs. Therefore the AddCWs should maintain the format of Figure 3-17, and they should be chosen to be complements as shown in Figure 3-18. Hence the all-zero and all-one CWs are chosen, and $\mathrm{CW} a$ can be chosen in the form $01 \ldots \mathrm{xx}$, and $\bar{a}$ is simply its complement.

$$
00 \ldots 00 \quad \text { a } \quad \bar{a} \quad 11 \ldots 11
$$

Figure 3-18: Strict format of the four AddCWs

Balanced transmission, Transitions, limit Runlengths: As discussed in Section 3-2, balanced transmission can be guaranteed, transition density can be improved, and runlengths can be greatly controlled with three additional bits of redundancy. Figure 3-19 shows the generic structure of the multimode complementary encoder that uses eight AddCWs, with the corresponding decoder shown in Figure 3-20.

As with the previous setup in order for the decoder to know which AddCW was added at the transmitter, the AddCWs must follow the format shown in Figure 3-21. Since this system uses eight AddCWs, three MSbs are used and they must be in the form $000,001,010,011,100,101,110$ and 111 , which in decimal is $0,1,2,3,4,5,6$ and 7 . Therefore strictly speaking each AddCW could be any CW that meets this criteria.


Figure 3-19: Generic multimode complementary encoder using eight AddCWs


Figure 3-20: Generic multimode complementary decoder using eight AddCWs
000..xx 001..xx 010...xx 011...xx 100..xx 101...xx 110...xx 111...xx

Figure 3-21: Choosing an AddCW set with eight AddCWs

However, from previous discussions in order to guarantee balanced transmission the encoder needs the ability to send complementary CWs. Therefore the AddCWs should maintain the format of Figure 3-21, and they should be chosen to be complements as shown in Figure 3-22. Hence the all-zero and all-one CWs are chosen, and CW $a$ can be chosen in the form $001 \ldots \mathrm{xx}, \mathrm{CW} b$ can be chosen in the form $010 \ldots \mathrm{xx}$, and CW $c$ can be chosen in the form $011 \ldots \mathrm{xx}$. Then CWs $\bar{a}, \bar{b}$ and $\bar{c}$ are simply chosen as the complements of $a, b$ and $c$ respectively.

```
00.1.00 a b c c \ b \
```

Figure 3-22: Strict format of the eight AddCWs

### 3.6. General notes on AddCWs

### 3.6.1. AddSWs and hexadecimal notation

From the previous example in Figure 3-4, the four AddCWs used were $c_{0}(x)=0000000, \quad c_{4}(x)=0100111, \quad c_{11}(x)=1011000$ and $c_{15}(x)=1111111$, which follow the above guidelines since $c_{0}(x)$ and $c_{15}(x)$ as well as $c_{4}(x)$ and $c_{11}(x)$ are CW complements. However these could have also been listed in terms of SWs. For instance, inspection of Table 2-1 confirms that the SWs $s_{0}(x)=0000, s_{4}(x)=0100, s_{11}(x)=1011$ and $s_{15}(x)=1111$ are the SWs that form the CWs $c_{0}(x), c_{4}(x), c_{11}(x)$ and $c_{15}(x)$ respectively. Therefore these SWs are defined as Add Source Words (AddSWs). Thus using a compact hexadecimal notation this AddCW set can be represented as the AddCWs $\{00 \mathrm{~h}, 27 \mathrm{~h}, 58 \mathrm{~h}, 7 \mathrm{Fh}\}$, or the AddSWs $\{0 \mathrm{~h}, 4 \mathrm{~h}, \mathrm{Bh}, \mathrm{Fh}\}$. These two representations will be used interchangeably since AddCWs are simply AddSWs that have been encoded.

### 3.6.2. Number of AddCWs can be any power of two

Analysis in this chapter has shown that the number of AddCWs can be increased to any power of two. While the generic systems listed in Section 3.5 have only used $2^{1}=2,2^{2}=4$ and $2^{3}=8$ AddCWs, more could be utilized such as $2^{4}=16,2^{5}=32$ or higher. These AddCWs would need to be chosen in the same complementary fashion, and the decoder would need to extract the appropriate number of MSbs in order to add the correct AddCW at the receiver. Note that as discussed in Section 3-4, the use of more AddCWs increases the transition density and decreases the runlengths of like valued bits. However these CS coding benefits come at an efficiency cost in terms of code rate.

### 3.6.3. MSbs and identification bits

Up to this point the encoders and decoders have used the MSbs to identify which AddCW has been used. However these identification bits can be located anywhere in the CW and are located in the MSb position simply due to simplicity.

### 3.6.4. Good AddCW set

An aspect not covered in this chapter is what makes a good AddCW set. For example, in the (7,2) Hamming code example from Figure 3-4, the four AddCWs used were arbitrarily chosen as $\{00 \mathrm{~h}, 27 \mathrm{~h}, 58 \mathrm{~h}, 7 \mathrm{Fh}\}$. However based on the AddCW set format outlined above, three more AddCW sets could have been used: $\{00 \mathrm{~h}, 2 \mathrm{Ch}, 53 \mathrm{~h}, 7 \mathrm{Fh}\}$, $\{00 \mathrm{~h}, 3 \mathrm{hh}, 4 \mathrm{Eh}, 7 \mathrm{Fh}\}$ or $\{00 \mathrm{~h}, 3 \mathrm{Ah}, 45 \mathrm{hh}, 7 \mathrm{Fh}\}$. The effects of using these different AddCW sets is thoroughly examined in Chapter 4, as well as the criteria regarding what makes a good AddCW.

### 3.6.5. Number of AddCW sets

To the casual observer it is often surprising to see that only four AddCW sets exist with the $(7,2)$ Hamming code introduced in this chapter. It is unexpected since the format from Figure 3-17 may lead one to believe that there are $(4)(4)(4)(4)=16$ AddCW sets. However, this $(7,2)$ code only has four possible AddCW sets which results from the AddCW recommendations and the size of this code. Recall from Figure 3-18 that two AddCWs are recommended to be the all-zero and all-one CW. Furthermore only CW $a$ can be freely chosen to meet the format $01 \ldots \mathrm{xx}$, and $\bar{a}$ is simply chosen to be its complement. Therefore the correct number of possible AddCW sets is $(1)(4)(1)(1)=4$, and they were listed above in Section 3.6.4. These four AddCW sets can also be written in compact hexadecimal notation in terms of AddSWs as $\{0 \mathrm{~h}, 4 \mathrm{~h}, \mathrm{Bh}, \mathrm{Fh}\},\{0 \mathrm{~h}, 5 \mathrm{~h}, \mathrm{Ah}, \mathrm{Fh}\}$, $\{0 \mathrm{~h}, 6 \mathrm{~h}, \mathrm{9h}, \mathrm{Fh}\}$ and $\{0 \mathrm{~h}, 7 \mathrm{~h}, 8 \mathrm{~h}, \mathrm{Fh}\}$ respectively.

This small number of AddCW sets simply results from the small size of the code. If a larger code such as the $(31,26)$ Hamming code expurgated to a $(31,24)$ code was used, the number of AddCW sets increases to $(1)\left(2^{24}\right)(1)(1)=16777216$.

Likewise in an eight AddCW system, a $(7,4)$ Hamming code expurgated to a $(7,1)$ code will only have eight possible AddCW sets. This is because Figure 3-22 shows that two of the AddCWs are recommended to be the all-zero and all-one CW. Furthermore only CWs $a, b$ and $c$ can be freely chosen to meet the format $001 \ldots \mathrm{xx}, 010 \ldots \mathrm{xx}$, and $011 \ldots \mathrm{xx}$ respectively, and $\bar{a}, \bar{b}$ and $\bar{c}$ must be chosen as their CW complements.

In general for any linear block code using this multimode coding approach the number of AddCW sets will be

$$
\begin{equation*}
\left(2^{k}\right)^{\frac{z-2}{2}} \tag{3.1}
\end{equation*}
$$

where $k$ is from the codes expurgated size ( $n, k$ ) and $z$ is the number of AddCWs.

Therefore the number of possible AddCW sets for the $(7,4)$ code, expurgated to a $(7,1)$ code using eight AddCWs is $\left(2^{1}\right)^{\frac{8-2}{2}}=\left(2^{1}\right)^{3}=(1)(2)(2)(2)(1)(1)(1)(1)=8$. In terms of AddSWs these sets can be written in hexadecimal form as $\{0 h, 2 h, 4 h, 6 h, 9 h, B h, D h, F h\},\{0 h, 2 h, 4 h, 7 h, 8 h, B h, D h, F h\},\{0 h, 2 h, 5 h, 6 h, 9 h, A h, D h, F h\}$, \{0h,2h,5h,7h,8h,Ah,Dh,Fh \}, \{0h,3h,4h,6h,9h,Bh,Ch,Fh\}, \{0h,3h,4h,7h,8h,Bh,Ch,Fh\}, $\{0 h, 3 h, 5 h, 6 h, 9 h, A h, C h, F h\}$ and $\{0 h, 3 h, 5 h, 7 h, 8 h, A h, C h, F h\}$. Once again this small number of AddCW sets simply results from the restrictions placed on AddCW selection and the small size of the code. If a larger code such as the $(31,26)$ Hamming code expurgated to a $(31,23)$ code using eight AddCWs was used, the number of AddCW sets increases to $\left(2^{23}\right)^{3}=(1)\left(2^{23}\right)\left(2^{23}\right)\left(2^{23}\right)(1)(1)(1)(1)=2^{69}$. Note that Appendix B lists all AddSW sets used in this thesis.

### 3.7. Combined EC and CS code summary

This chapter introduced a combined EC and CS code that takes advantage of CW addition through linear code properties to achieve a multimode coding system. This combined code can be incorporated into existing systems with minimal complexity, avoiding the use of look-up tables in ROMs and making it applicable to high bit rate systems. Furthermore this system uses feedback to achieve its CS goals regardless of source statistics and avoids CS error propagation by error correcting CWs from the channel before removing the effect of the CS coding.

The real challenge of this system then comes in deciding which CWs to use as AddCWs. For instance, from Section 3.6 .4 with the $(7,2)$ Hamming code there are only four AddCW sets and therefore they can be directly compared against one another. However when dealing with larger codes there could be as many as $2^{24}, 2^{69}$ or more possible AddCW sets to choose from. As a result, directly comparing each AddCW set is not feasible. In general a good AddCW set should lead to a transmitted sequence that is balanced, contains numerous transitions, and has a high probability of low runlengths. Chapter 4 will examine the use of different AddCW sets and their effect on the output sequence in the time domain. Furthermore Chapter 5 will examine the use of different AddCW sets and their effect on the power spectral density of the transmitted sequence in the frequency domain.

## 4. Code Word Search and Analytical Results

The multimode coding system introduced in Chapter 3 involves generating an original CW and adding it to a specific set of CWs called AddCWs. This produces a new set of CWs from which the best is chosen for transmission. This chapter defines what is considered to be a best CW , as well as the metrics used for finding good AddCW sets.

### 4.1. Evaluating CW Statistics

The objective of the encoder is to transmit balanced sequences that contain numerous transitions. The encoder obtains this goal by transmitting CWs from the selection set that best matches these objectives. The encoder is able to make this decision by comparing the following CW statistics.

Transitions: A transition in a sequence occurs when there is a low to high or high to low level shift. Counting the number of transitions in a CW requires examination of the sequence of bits in the word and also requires keeping track of the last bit transmitted on the channel. For a simple example consider the case when the CW $\{0111010\}$ is transmitted, represented in hexadecimal notation as 3Ah. As shown in Figure 4-1 if the last bit transmitted was a 0 then the number of transitions would be four. However if the last bit transmitted was a 1 , then the number of transitions would have been five. Thus a particular CW will have a different number of transitions depending on the value of the last bit (LB) transmitted.


Figure 4-1: Counting Transitions

Running Digital Sum: In order to achieve a balanced output the number of 1 s and 0 s must be equal over the long term. These 1 s and 0 s will sometimes be referred to as logic 1s and logic 0 s to avoid confusion when the transmitted sequence uses bipolar transmission encoding of Section 2.19. That is, during transmission logic 0s are mapped to the value -1 and logic 1 s are mapped to the value +1 . Therefore a binary sequence $x$ of logic 1s and 0s during transmission is represented by Equation 4.1.

$$
\begin{equation*}
\{x\}=\left\{\ldots, x_{-1}, x_{0}, \ldots, x_{i}, \ldots\right\} \quad x_{i}=\{-1,1\} \tag{4.1}
\end{equation*}
$$

This mapping allows for the testing of a balanced transmission by simply summing up all of the bits in the sequence and testing if the result is zero. This process is known as evaluation of the Running Digital Sum (RDS) [19] and it is defined as

$$
\begin{equation*}
z_{i}=\sum_{j=-\infty}^{i} x_{j} \tag{4.2}
\end{equation*}
$$

The RDS is calculated and updated with each bit transmitted. The RDS value at the end of the CW is defined as the word end running digital sum (WRDS). To demonstrate this, assume that the system begins with an RDS of 0 , then continuing with the previous example, CW 3Ah would transition through various RDS values and ultimately result in a WRDS of +1 as shown in Figure 4-2. This updates the systems RDS to +1 . If CW 3Ah was transmitted once again, its WRDS would be +2 updating the overall systems RDS to +2 and so on.


Figure 4-2: Running Digital Sum of the code word 3Ah

Having an RDS of +2 indicates that the system has currently had two more logic 1 bits than logic 0 bits and thus the transmission is slightly unbalanced. The next CW would need to have two more logic 0 bits than logic 1 bits to bring the system RDS back down to 0 . This however represents an ideal situation that might not always be possible. It may take one or more additional CWs to bring the RDS back to 0 . However it has been proven that as long as the RDS is bounded the transmission will be balanced [19].

Minimum Squared Weight: The WRDS is a useful metric that gives the overall balance of a CW. However this metric only tracks the RDS at the end of a CW and does not give any indication of the range of RDS values assumed within the CW. For instance an RDS value that steadily grows and falls within a CW indicates long runs of like valued bits. Inspecting only the number of transitions and WRDS of a CW may not indicate these runs. For example, consider the case when the two CWs 2 F 0 h and 0 E 6 h are being considered for transmission as shown in Figure 4-3.


Figure 4-3: Minimum squared weight can take the internal RDS into account

It can be seen that both CWs have four transitions and a WRDS of 0 . Hence the encoder would consider this a tie since both CWs are equally suitable for transmission. However it can be seen that the first CW has a run of four like valued bits, while the second CW only has runs of three like valued bits. The encoder would need an additional metric in order to monitor these runs and select the second CW for transmission which is a better choice in terms of limiting the runlengths. As a result the squared weight metric takes into account the RDS within a CW. It is defined as the square of the RDS values summed across the sequence as shown in Equation 4.3.

$$
\begin{equation*}
z_{i}=\sum_{j=-\infty}^{1} R D S_{j}^{2} \tag{4.3}
\end{equation*}
$$

Figure 4-4 shows the same CWs as Figure 4-3 with the squared weight calculated. Due to the long runs of like valued bits the squared weight of the first sequence is much higher at forty-five than the squared weight of the second sequence at nine. Therefore by using the minimum squared weight (MSW) criteria the second CW would be chosen for transmission.


Figure 4-4: Minimum squared weight criteria chooses the second CW

### 4.2. Choosing the Best Code Word

The multimode coding system is presented with a choice of CWs for each SW, and must select only one CW for transmission. The CW chosen will be the one that best meets the system's requirements. Based on the above discussion there is some flexibility when choosing the best CW.

1. Choose the CW that has the most transitions: This will ensure that there is adequate timing information and will therefore satisfy the clock recovery objectives and reduce jitter. However this alone cannot ensure balanced transmission. For example the waveform in Figure 4-5 has numerous transitions but is clearly unbalanced.

$$
0 \longdiv { 1 1 0 } 1 1 0 \longdiv { 1 1 0 } 1 1
$$

Figure 4-5: Sequence has many transitions but it is unbalanced
2. Choose the CW that has the smallest abs(WRDS): This by itself will guarantee balanced transmission, but it cannot ensure a large number of transitions. For example the waveform in Figure 4-6 is clearly balanced but has very few transitions.

## $0 0 0 0 0 0 0 \longdiv { 1 1 1 1 1 1 1 0 0 0 0 0 0 0 \longdiv { 1 1 1 1 1 1 1 } 0 . }$

Figure 4-6: Sequence is balanced but it contains few transitions
3. Choose the CW that has the smallest squared weight: The MSW criteria works best as a tie breaking metric and does not perform well alone. The problem is that the MSW criteria squares the RDS values in an attempt to penalize long runs of like valued bits. If the run of like valued bits passes through an RDS of 0 , the squared weight will be small and the MSW criteria can be fooled into believing this CW is the best. For example consider the case when the current RDS value is +3 , the LB is 1 and the following CWs are considered for transmission.

Table 4-1: With the MSW criteria the all-zero CW would be transmitted

|  | WRDS | TRAN | SqW |
| :--- | :--- | :--- | :--- |
| 00 h | -4.0 | 1.0 | 35.0 |
| 27 h | +4.0 | 4.0 | 47.0 |
| 58 h | +2.0 | 3.0 | 95.0 |
| 7 Fh | +10.0 | 0.0 | 371.0 |

From Table 4-1 it can be seen that based on the lowest RDS criteria, CW 58h would be chosen as the best with a WRDS of +2 . With the most transitions criteria, CW 27 h would be chosen as the best with 4 transitions. However, with the MSW criteria, CW 00 h would be chosen to be the best, even though it is the all-zero CW which this multimode system is trying to avoid. This
demonstrates that the MSW criterion does not work well alone and is most useful as a tie breaking mechanism.

In general the CW decision process depends on the system requirements. If system performance is highly dependent on clock recovery, then the best CW to transmit will be the one with the most transitions. If system performance depends to a large extent on having a balanced transmission, then the best CW will be the one that minimizes the RDS. Therefore what is considered to be the best CW is subjective.

Furthermore, as already mentioned it is possible for more than one CW to satisfy the best CW requirements. In other words ties are possible. In these situations either CW is equally suitable for transmission. That is, the system can always choose to transmit the first CW that meets the systems goals, even if alternatives exist. This will still result in a superior output that is balanced and contains numerous transitions.

However an undesirable problem can arise in this situation. If ties occur in a regular fashion and the encoder continually chooses the first CW that meets the CS goals of the system, the encoder may be selecting this CW in a periodic fashion. This can result in discrete like components in the power spectrum of the transmitted waveform, possibly leading to cross-talk and possibly violating any spectral masks that must be followed. As a result it is useful to have enough tie breaking mechanisms to break any and all ties such that there is always a single clear best CW. Otherwise it is recommended that when one or more CWs are equally suitable for transmission, that the best CW is chosen randomly from the set of words considered equally good.

### 4.3. Selection process used in this thesis

In this thesis the analysis was done based on selecting the CW that minimized the RDS. This is because it is the only selection criteria that can guarantee balanced transmission. Other criteria such as the number of transitions and MSW are still employed in order to break ties.

### 4.4. Example of the CW selection process based on minimizing the RDS

Consider the following $(7,2)$ system using the AddSWs $\{0 \mathrm{~h}, 7 \mathrm{~h}, 8 \mathrm{~h}, \mathrm{Fh}\}$ encoded to the AddCWs $\{00 \mathrm{~h}, 3 \mathrm{Ah}, 45 \mathrm{~h}, 7 \mathrm{Fh}\}$ as shown in Figure 4-7.


Figure 4-7: (7,2) encoder with AddCWs \{ 00h, 3Ah, 45h, 7Fh \}

In this system there are $2^{2}=4$ possible SWs. Therefore in each encoding interval one of the four possible SWs is encoded to a CW, added to all four AddCWs listed above, and then the best CW is chosen from this set. In order to verify this process a simulation called SimFPGA was written and its interface is shown in Figure 4-8. Note that the name of this program comes from the fact that this system was implemented on two FPGA boards as will be discussed in Chapter 6. This program writes every step of the best CW decision process to a file.


Figure 4-8: SimFPGA - Program for verifying the CW selection process

Consider a simple example using the $(7,2)$ code where there are four 2-bit SWs for transmission, $00,10,11$, and 11 , or in decimal $0,2,3$ and 3. As shown in Figure 4-8 the two AddSWs $\{0 \mathrm{~h}, 7 \mathrm{~h}\}$ are entered into the program in hexadecimal; their complements $\{8 \mathrm{~h}, \mathrm{Fh}\}$ are automatically generated. These four AddSWs are then converted into the AddCWs $\{00 \mathrm{~h}, 3 \mathrm{Ah}, 45 \mathrm{~h}, 7 \mathrm{Fh}\}$ respectively. The output from this program is written to a file as shown in Table 4-2 below.

Table 4-2: Decision process from SimFPGA for the first SW

| SW $=00$ | WRDS | TRAN | SqW | stTran |
| :--- | :--- | :--- | :--- | :--- |
| 00000000 h | -7.0 | 0.0 | 140.0 | 0.0 |
| 0000003 Ah | +1.0 | 4.0 | 12.0 | 2.0 |
| 00000045 h | -1.0 | 5.0 | 12.0 | $2.0 * * * *$ |
| 0000007 Fh | +7.0 | 1.0 | 140.0 | 0.0 |
|  |  |  |  |  |
| WinningLoc $=2$ BitsLeft $=6$ | FilePointer $=0 \quad$ CW's sent 1 |  |  |  |
| Tie $=00000005$ NumTies $=0$ | state 1 of numstates 2 |  |  |  |
| $((((00000045 \mathrm{~h}$ | RDS $=-1.0 \quad$ LB $=1) \\|))$ |  |  |  |

The first 2-bit SW 00 results in CW 00 h which is added to all AddCWs generating the four CWs $\{00 \mathrm{~h}, 3 \mathrm{Ah}, 45 \mathrm{~h}, 7 \mathrm{Fh}\}$. From these four CW choices the encoder must now choose the best one for transmission. In order to do this the encoder arbitrarily assumes the initial RDS of the system is 0 and the initial LB is 0 . Table $4-2$ lists the metrics of each CW choice. It is clear that the all-zero and all-one CW perform poorly with respect to all metrics since they have high WRDS values of -7 and +7 , low transitions of 0 and 1 , and a squared weight of 140 . Therefore the choice comes down to either CW 3Ah or 45 h . Since these CWs are complements the abs(RDS) and squared weight $(\mathrm{SqW})$ are the same at 1 and 12 respectively. The only difference between them is the number of transitions. Since the LB was initially chosen to be 0 the winning CW selected is 45 h and it is considered to be the best.

Since CW 45 h was transmitted and it has a WRDS of -1 and a LB of 1 , the system RDS is updated to -1 and the LB is updated to 1 . Now the system can move onto the next SW. Continuing in the same fashion the next 2-bit SW 01 is encoded to the CW 16 h and
added to the 4 AddCWs to generate the CWs $\{16 \mathrm{~h}, 2 \mathrm{Ch}, 53 \mathrm{~h}, 69 \mathrm{~h}\}$. Table $4-3$ shows that two CWs will bring the system RDS back to 0 . Thus in terms of smallest abs(WRDS) either of these CWs would be considered the best and the encoder has encountered a tie.

Table 4-3: Decision process from SimFPGA for the second SW


Since the main requirement of the system has been met the encoder can now choose the CW that meets the secondary goals of maximizing transitions for clock recovery, i.e. removing long runs of like valued bits. The first tie breaking mechanism considered is the number of transitions. However both "good" CWs have 4 transitions and this tie persists. Only as a result of the MSW criteria has the tie been broken and CW 69 h is chosen to be the best since its squared weight is smaller than CW 53 h . This demonstrates how the number of transitions and MSW criteria can still be used even though the overall goal is only to transmit the CW with the lowest abs(RDS).

Finally consider when the source outputs the same SW 11 twice. As shown in Table 4-4 the first SW 11 causes the CW 27h to be transmitted and the second SW 11 causes the CW 62 h to be transmitted. This demonstrates that the same SW will often be represented with different CWs as a result of multimode coding. It also shows that the best CW will not always be the same for the same $S W$. The best $C W$ will often change depending on the current RDS and LB value. Therefore these two values are formally defined as a state of the system and will be represented as an (RDS,LB) state.

Table 4-4: Decision process from SimFPGA for the third and fourth SW

| $\mathrm{sw}=11$ | WRDS | tran | SqW | 1stTran |
| :---: | :---: | :---: | :---: | :---: |
| 0000001 Dh | +1.0 | 4.0 | 8.0 | 3.0 |
| 00000027h | $+1.0$ | 4.0 | 8.0 | 2.0**** |
| 00000058 h | -1.0 | 3.0 | 8.0 | 2.0 |
| 00000062 h | -1.0 | 3.0 | 8.0 | 3.0 |
| WinningLoc | BitsL | $\mathrm{ft}=2$ | File | inter $=$ |
| Tie $=0000$ | NumTi | $s=0$ |  |  |
| : sta | of $n$ | states |  |  |
| $111(10000$ |  |  | RDS | . 0 LB |
| $\mathrm{SW}=11$ | WRDS | TRAN | SqW | 1stTran |
| 0000001Dh | +2.0 | 4.0 | 11.0 | 3.0 |
| 00000027h | +2.0 | 4.0 | 7.0 | 2.0 |
| 00000058 h | +0.0 | 3.0 | 23.0 | 2.0 |
| 00000062h | +0.0 | 3.0 | 19.0 | 3.0**** |
| ```WinningLoc = 3 BitsLeft = 0 FilePointer = 0 . . CW's sent 4 Tie = 0000000F NumTies =0 state 0 of numstates 4 ((()( 00000062h RDS = 0.0 LB = 0 )))))``` |  |  |  |  |
|  |  |  |  |  |
|  |  |  |  |  |
|  |  |  |  |  |

### 4.5. Comparing unconstrained and constrained transmission

If no CS coding had been used in the above example then only the regular $(7,4)$ Hamming code would had have been used. As a result the transmitted sequence would have been $\{00 \mathrm{~h}, 16 \mathrm{~h}, 1 \mathrm{Dh}, 1 \mathrm{Dh}\}$. This sequence would have had 11 transitions, been unbalanced with an RDS of -6 , and it would have contained a run of nine like valued bits as shown in Figure 4-9.


Figure 4-9: Uncoded sequence is unbalanced and has a run of 9

Conversely, the multimode coding system that incorporates CS coding has resulted in an output sequence that is completely balanced with an RDS of 0 and containing 16 transitions.


Figure 4-10: Coded sequence is balanced but still has a run of 5 like valued bits

While this is a definite improvement inspection of Figure 4-10 shows that this constrained sequence still has a run of five like valued bits. The question that arises then is how likely is this run of length five? Furthermore, could this run be removed if different AddCWs were used?

### 4.6. Effect of Different AddCWs

Analysis in Sections 3.6.4 and 3.6.5 reported that there are only four possible AddCW sets in this $(7,2)$ system. They are $\{00 \mathrm{~h}, 27 \mathrm{~h}, 58 \mathrm{~h}, 7 \mathrm{Fh}\},\{00 \mathrm{~h}, 2 \mathrm{Ch}, 53 \mathrm{~h}, 7 \mathrm{Fh}\}$, $\{00 \mathrm{~h}, 31 \mathrm{~h}, 4 \mathrm{Eh}, 7 \mathrm{Fh}\}$ and $\{00 \mathrm{~h}, 3 \mathrm{Ah}, 45 \mathrm{~h}, 7 \mathrm{Fh}\}$, or in terms of AddSWs are $\{0 \mathrm{~h}, 4 \mathrm{~h}, \mathrm{Bh}, \mathrm{Fh}\}$, $\{0 \mathrm{~h}, 5 \mathrm{~h}, \mathrm{Ah}, \mathrm{Fh}\},\{0 \mathrm{~h}, 6 \mathrm{~h}, 9 \mathrm{~h}, \mathrm{Fh}\}$ and $\{0 \mathrm{~h}, 7 \mathrm{~h}, 8 \mathrm{~h}, \mathrm{Fh}\}$ respectively. Using the same sequence of four SWs, $00,10,11$ and 11 with the AddSW set $\{0 \mathrm{~h}, 4 \mathrm{~h}, \mathrm{Bh}, \mathrm{Fh}\}$ (versus $\{0 \mathrm{~h}, 7 \mathrm{~h}, 8 \mathrm{~h}, \mathrm{Fh}\}$ previously) an entirely different sequence will be transmitted as shown in Tables 4-5 and 4-6.

Here it can be seen that the first 2-bit SW 00 is now encoded to the CW 58 h instead of 45 h as it was previously. In addition, this CW has caused the RDS to be -1 and the LB to be 0 . Comparison with the last AddCW set shows that this is an (RDS,LB) state that was not reached before. As a result it is possible that this AddCW set will cause the encoder to experience different RDS and LB combinations and therefore have a different number of states than the last example. Table 4-6 continues this example by showing the decision process for the next three SWs.

Table 4-5: Decision process from SimFPGA for the $1^{\text {st }} \mathrm{SW}$


Table 4-6: Decision process from SimFPGA for the second, third and fourth SW


In this case the transmitted CWs are $\{58 \mathrm{~h}, 69 \mathrm{~h}, 3 \mathrm{Ah}, 45 \mathrm{~h}\}$. This sequence has 19 transitions, is completely balanced and now has a maximum run of only three like valued bits as shown in Figure 4-11.


Figure 4-11: Coded sequence is balanced and now has a run of only three like valued bits

Based on this result it would appear that this is a better AddCW set. However making this assumption would be premature since only four SWs were tested. A complete analysis will look at the probability of various runlengths occurring and can only be performed once all SW/CW combinations have been tested. In order to conduct this analysis and evaluate different AddCW sets, it is helpful to model the encoder as a finite state machine (FSM).

### 4.7. Finite State Machines

Any system that operates at discrete instants of time and takes on a finite number of configurations can be represented as a finite state machine (FSM) [22]. A FSM is classified as either a Moore machine, where the output is a function of the internal state only, or as a Mealy machine where the output is a function of the internal state and the present input [22]. The encoder presented in this thesis is classified as a Mealy machine since the output CW and next state are both dependent on the present SW and present (RDS,LB) state.

A FSM can also be represented graphically by a state diagram which shows the progression of states through which the system operates and the resulting outputs based on specific inputs.


Figure 4-12: Partial state diagram for the (7,2) encoder transmitting the four SWs $\mathbf{0}, 2,3$ and 3. This causes transitions from state (0,0), to state $(-1,0)$, to state $(0,1)$, to state $(\mathbf{1 , 0})$, and back to state $(\mathbf{0}, 1)$

For instance, in the second $(7,2)$ example above with AddSWs $\{0 h, 4 h, B h, F h\}$, the sequence of four SWs transmitted caused the system to move into four (RDS,LB) states as shown in Figure 4-12. The state diagram shows how each SW is mapped to each CW depending on which state the system is presently in. For instance the system was assumed to start with an RDS of 0 and a LB of 0 . As a result the first state is denoted $(0,0)$. The first SW emitted was 0 h and this caused the CW 58h to be transmitted and it moved the system into state $(-1,0)$, i.e. an RDS of -1 and a LB of 0 . This SW/CW interaction is shown as $0 / 58$ on the trace that moves from state $(0,0)$ to state $(-1,0)$. Likewise the next SW was 2 h with corresponding CW 69 h , shown as $2 / 69$ and this moved the system into state $(0,1)$. Finally the next two SWs were both 3 h , which caused the CWs 3 Ah and 45 h to be transmitted, shown as $3 / 3 \mathrm{~A}$ and $3 / 45$ respectively, which moved the system to state $(1,0)$ and then back again to state $(0,1)$.

Figure 4-12 only shows a partial state diagram after transmitting four CWs. In order to see the complete state diagram all SW/CW combinations must be explored from every state. Figure 4-13 shows the complete state diagram for this $(7,2)$ encoder
using the AddSW set $\{0 \mathrm{~h}, 7 \mathrm{~h}, 8 \mathrm{~h}, \mathrm{Fh}\}$ encoded to the AddCW set $\{00 \mathrm{~h}, 3 \mathrm{Ah}, 45 \mathrm{hh}, 7 \mathrm{Fh}\}$ where CW selection is based on minimizing the RDS. It consists of six (RDS,LB) states numbered 0 through 5 for convenience. Inspection of this state diagram shows that each state will be left the interval after it is entered. However the probability of entering each state is not equally probable. Furthermore since the goal of the encoder is to minimize the RDS, the states with the most entry points are states 2 and 3 which both represent an RDS of 0 , and a LB of 0 and 1 respectively.


Figure 4-13: State Diagram for (7,2) code with AddSWs $=\{0 \mathrm{~h}, \mathbf{4 h}, \mathrm{Bh}, \mathrm{Fh}\}$

Focusing on the left half of the state diagram, i.e. the three states 0,2 and 4 that each have a LB of 0 , it can be seen that there are eight different CWs that take the system into state 2 , three different CWs that take the system into state 0 , but only one CW that can take the system into state 4 . Clearly state 2 will be visited more often than states 0 and 4. This is an important distinction since even though all exit paths from a particular state are equally likely with equiprobable SWs, the probability of being in each state is not. This means that the CWs that leave state 2 will be seen on the channel more often
than the CWs that leave states 0 and 4 . Hence the probability of a CW being transmitted is weighted by the probability of being in a particular state. When attempting to calculate runlength probabilities these state probabilities need to be taken into consideration. This form of analysis can be performed by interpreting the FSM as a Markov Chain.

### 4.8. Markov Processes and Markov Chains

### 4.8.1. Markov Chains

A random process $X(t)$ is a Markov Process if the future value of the process is dependent only on its immediate present value. In other words the process is independent of the past; its future value is dependent only on its present value. This is expressed concisely in Equation 4.4, which states that the probability of $X(t)$ assuming a new value given all the previous values for all time is equal to the probability of $X(t)$ assuming a new value if only the immediate value $x_{k}$ is considered.

$$
\begin{equation*}
P\left[X\left(t_{k+1}\right)=x_{k+1} \mid X\left(t_{k}\right)=x_{k}, \ldots, X\left(t_{1}\right)=x_{1}\right]=P\left[X\left(t_{k+1}\right)=x_{k+1} \mid X\left(t_{k}\right)=x_{k}\right] \tag{4.4}
\end{equation*}
$$

In Markov chains the probability distribution functions (PDFs) that are conditioned on several time instants always reduce to one PDF that is conditioned only on the most recent time instant. This is known as the Markov property [23].

An integer valued Markov Process is a discrete time random process called a Markov Chain. Here the random variable $X_{n}$ takes on a countable number of values at discrete moments in time, where $T$ is the interval between discrete time events. The value of $X_{n}$ at the discrete time $n$ is referred to as the state of the process at time $n$.

$$
\begin{equation*}
X_{n}=X(n T) \tag{4.5}
\end{equation*}
$$

### 4.8.2. Markov Chain Properties

A system can be modeled by a Markov chain if the sequence of trials satisfies the following properties:

1. Each outcome belongs to a finite set of outcomes $\left\{a_{1}, a_{2}, \ldots, a_{m}\right\}$ called the state space of the system. If the outcome of the $n^{\text {th }}$ trial is $a_{i}$, then the system is defined to be in state $a_{i}$ at time $n$, or the system is in state $a_{i}$ at the $n^{\text {th }}$ step.
2. The outcome of any trial depends at most upon the outcome of the immediately preceding trial and not upon any other previous outcome.
3. There is a given probability $p_{i j}$ that state $a_{j}$ occurs immediately after $a_{i}$ occurs. This can also be interpreted as follows: if the process is currently in state $a_{i}$, then $p_{i j}$ is the probability of it moving to state $a_{j}$.

The numbers $p_{i j}$ are called the transition probabilities and they can be arranged in a matrix called the transition matrix as shown in Equation 4.6.

$$
P=\left[\begin{array}{cccc}
p_{11} & p_{12} & \cdots & p_{1 m}  \tag{4.6}\\
p_{21} & p_{22} & \cdots & p_{2 m} \\
\vdots & \vdots & & \vdots \\
p_{m 1} & p_{m 2} & \cdots & p_{m n}
\end{array}\right]
$$

The transition matrix $P$ is a stochastic matrix since each row is a probability vector that sums to 1 . Additionally a Markov chain has an initial probability vector $p$ which indicates the starting state. For instance, calculation of the probability of the system starting in state 0 and then moving from state 0 to 1 , and then from state 1 to 2 and so on would be done as shown in Equation 4.7.

$$
\begin{equation*}
p\left(X_{0}=i_{0}, X_{1}=i_{1}, X_{2}=i_{2}, \ldots, X_{n}=i_{n}\right)=p\left(i_{0}\right) P\left(i_{0}, i_{1}\right) P\left(i_{1}, i_{2}\right) \ldots P\left(i_{n-1}, i_{n}\right) \tag{4.7}
\end{equation*}
$$

For example, let $P$ be a transition matrix with the initial probability vector $p$ as shown in Equation 4.8 below. Since $p=\left[\begin{array}{ll}1 & 0\end{array}\right]$, the system begins in state 0 with $100 \%$ probability. The probability $P(0001110)$ denotes the probability of starting in state 0 , staying in state 0 for two more time steps, moving to state 1 on the $3^{\text {rd }}$ time step, staying in state 1 for the next two time steps, and finally ending in state 0 on the $6^{\text {th }}$ time step. As shown in Equation 4.9 this probability is approximately 0.01627. Likewise, the probability $P(0011110)=0.01808$ as shown in Equation 4.10.

$$
\begin{gather*}
P=\left[\begin{array}{ll}
3 / 4 & 1 / 4 \\
1 / 6 & 5 / 6
\end{array}\right] \quad p=\left[\begin{array}{ll}
1 & 0
\end{array}\right]  \tag{4.8}\\
P(0001110)=(1)\left(\frac{3}{4}\right)\left(\frac{3}{4}\right)\left(\frac{1}{4}\right)\left(\frac{5}{6}\right)\left(\frac{5}{6}\right)\left(\frac{1}{6}\right)=0.01627  \tag{4.9}\\
P(0011110)=(1)\left(\frac{3}{4}\right)\left(\frac{1}{4}\right)\left(\frac{5}{6}\right)\left(\frac{5}{6}\right)\left(\frac{5}{6}\right)\left(\frac{1}{6}\right)=0.01808 \tag{4.10}
\end{gather*}
$$

Equation 4.9 and 4.10 show two cases of the system starting in state 0 and after six time steps finishing in state 0 . If we were interested in all possible cases of starting in state 0 and after six time steps ending in state 0 , we would need to find all possible chains of the type $0 x x x x x 0$, where x indicates a don't care state, and add together the probabilities of these chains for the total probability which would be denoted as $P\left(X_{6}=0 \mid X_{0}=0\right)$. There would be $2^{5}=32$ possible chains, and clearly this operation would become cumbersome, especially for a large number of time steps.

The probability $P\left(X_{n}=j \mid X_{0}=i\right)$ can be obtained however from the $n^{\text {th }}$ power of the transition matrix $P^{n}$ [23]. This is due to a nice link between linear algebra and probability. The transition matrix $P$ can be interpreted as $P^{l}$, which is the probability that the system changes from the state $a_{i}$ to the state $a_{j}$ in one time step. Likewise $P^{2}$ is the probability that the system changes from the state $a_{i}$ to the state $a_{j}$ in two time steps and continuing in this fashion $P^{n}$ would represent the probability that the system changes
from state $a_{i}$ to the state $a_{j}$ in $n$ time steps. Thus the probability $P\left(X_{6}=0 \mid X_{0}=0\right)$ can be obtained from the $6^{\text {th }}$ power of the transition matrix $P$.

As shown in Equation 4.11 below, $P\left(X_{6}=0 \mid X_{0}=0\right)=0.424$, which means that if the starting state is state 0 , the probability of it ending in state 0 six time steps later is $42.4 \%$. Likewise the probability of starting in state 0 and ending in state 1 is $57.6 \%$ and so on.

$$
P^{6}=\left[\begin{array}{ll}
.424 & .576  \tag{4.11}\\
.384 & .616
\end{array}\right] \text { so } P\left(X_{6}=0 \mid X_{0}=0\right)=0.424
$$

In general a stochastic matrix is said to be regular if all the entries of $P^{n}$ for $n>1$ are positive (non-zero) [23]. In Equation 4.8 the stochastic matrix $P$ is considered to be a regular stochastic matrix by this definition.

### 4.8.3. Long Term Behavior of Markov Chains

Understanding the long term behavior of a Markov chain simplifies to understanding $P^{n}$ for large $n$. Each $P^{n}$ preserves the property of rows summing to one and having non-negative entries. Thus all $P^{n}$ matrices are still transition matrices. For example, $P$ after 20 time steps is shown in Equation 4.12.

$$
P^{20}=\left[\begin{array}{ll}
.400 & .599  \tag{4.12}\\
.399 & .600
\end{array}\right]
$$

It appears to be converging and one can conjecture that as $n$ approaches infinity

$$
P^{n}=\left[\begin{array}{ll}
.4 & .6  \tag{4.13}\\
.4 & .6
\end{array}\right]
$$

Each row (which is a probability vector) of $P^{n}$ for large $n$ is identical. This probability vector is of special interest and is called the invariant probability vector $\pi$ or $\lambda_{\infty}{ }^{[23]}$. The probability vector $\pi$ is invariant since

$$
\begin{equation*}
\pi=\pi P=\pi P^{2}=\pi P^{3}=\ldots=\pi P^{n} \text { as } n \rightarrow \infty \tag{4.14}
\end{equation*}
$$

This expression demonstrates that with distribution $\pi$, the chain is in equilibrium. That is, if the system starts with this probability distribution $\pi$, then this distribution will be maintained for all time. The $\pi$ vector is also a limiting probability vector since

$$
\begin{equation*}
\pi=\lim _{n \rightarrow \infty} \nu P^{n} \tag{4.15}
\end{equation*}
$$

where $v$ is any initial probability vector. Equation 4.15 states that for any initial probability vector $v, v P^{n}$ approaches the invariant probability vector $\pi$ for large $n$. Every Markov chain will have an invariant probability vector $\pi$ [23] and this vector gives the long term probability of being in any particular state. That is, if the system was running for some period of time, and it was suddenly stopped, $\pi$ would give the probability of being in each state. In general if $P$ is a regular stochastic matrix then the subsequent powers of $P$ approach a matrix $\Pi$ that has rows that consist of the fixed probability vector $\pi$ [23].

However $\pi$ is not always unique [23]. If $\pi$ is a unique vector and the rows of the transition matrix $P$ raised to large $n$ converge to $\pi$, then the chain/system is said to be ergodic. This comes from the Perron-Frobenius theorem [23]. This theorem states that if there exists an $n$ such that $P^{n}>0$ for $n>1$ then $P$ is ergodic [23]. Ergodicity of Markov
chains is a desired property since it guarantees that no matter which state you start in, if you run the system long enough the influence of the initial probability vector $p$ vanishes and the system will approach the invariant distribution $\pi$. This result has practical applications such as with Monte Carlo Markov Chain (MCMC) algorithms and simulations [23].

### 4.8.4. Non-Regular Markov Chains

It is possible to have a Markov chain that is ergodic and has a unique invariant probability vector that does not satisfy the Perron-Frobenius theorem. For example consider this trivial case

$$
P=\left[\begin{array}{ll}
0 & 1  \tag{4.16}\\
1 & 0
\end{array}\right] \quad P^{2}=\left[\begin{array}{ll}
1 & 0 \\
0 & 1
\end{array}\right] \quad P^{3}=\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right] \quad P^{4}=\left[\begin{array}{ll}
1 & 0 \\
0 & 1
\end{array}\right]
$$

This chain will never converge and the powers of $P$ will alternate between the two matrices in Equation 4.16. This is because the chain has a period of size $d=2$. For chains with a period of $d$ the subsequent powers of $P$ cycle between $d$ different matrices [23]. In this case $P^{n}$ will not converge and thus $\nu P^{n}$ will continue to alternate between $d=2$ vectors unless $v=\pi$. Therefore this chain is not regular.

However since the transition matrix states that when the system is in state 0 it must move to state 1 , and on the next time instant it must move back to state 0 and so on, one can intuitively conclude that $50 \%$ of the time must be spent in each state. Therefore even though there is no convergence it can be expected that $\pi=\left[\begin{array}{ll}.5 & .5\end{array}\right]$. In systems like this that are not regular the average of the transition matrices raised to large $n$ will always converge to a matrix $M$ [23]. Furthermore, if all of the rows of $M$ are identical then the invariant probability vector $\pi$ is unique [23].

$$
\begin{equation*}
\frac{1}{n} \sum_{k=1}^{n} P^{k} \rightarrow M \tag{4.17}
\end{equation*}
$$

Using Equation 4.17 the invariant probability vector can be found from averaging the transition matrices as shown in Equation 4.18 to find $\pi=\left[\begin{array}{ll}.5 & .5\end{array}\right]$.

$$
M=\frac{1}{2}\left(\left[\begin{array}{ll}
0 & 1  \tag{4.18}\\
1 & 0
\end{array}\right]+\left[\begin{array}{ll}
1 & 0 \\
0 & 1
\end{array}\right]\right)=\left[\begin{array}{ll}
.5 & .5 \\
.5 & .5
\end{array}\right] \text { thus } \pi=\left[\begin{array}{ll}
.5 & .5
\end{array}\right]
$$

The above example demonstrates a case where $\pi$ is unique and yet $P$ does not converge, hence they do not go hand in hand. As well this chain does not satisfy the Perron-Frobenius theorem and yet it is ergodic. This is an important distinction since many of the combined EC and CS codes presented in this thesis, when analyzed as Markov chains are also ergodic but do not satisfy the Perron-Frobenius system and their transition matrices never converge.

### 4.8.5. State Diagrams

A Markov chain can also be represented graphically by a state diagram just like a FSM. For example the state diagram for the Mealy FSM state graph of the $(7,2)$ encoder using the AddSWs $\{0 \mathrm{~h}, 4 \mathrm{~h}, \mathrm{Bh}, \mathrm{Fh}\}$ from Figure $4-13$ can be redrawn using the probability of taking a particular trace, instead of the SW/CW relationship, as shown in Figure 4-14. With this representation it is easy to see the probability of each CW being transmitted from each state. Furthermore, assuming that the source symbols are independent and equally likely the transition matrix can be constructed by inspection.


Figure 4-14: Markov Chain State Diagram

### 4.9. Analyzing the Encoder as a Markov Chain

Analyzing Figure 4-13 or Figure 4-14 shows that if the system is in state 0 it will move to state 2 with probability $1 / 4$ and move to state 3 with probability $3 / 4$. Likewise if the system is in state 2 it will move to state 0 with probability $1 / 4$, move to state 1 with probability $1 / 4$ and move to state 5 with probability $1 / 2$. Continuing in this fashion all possible transition probabilities can be found and they are shown in Equation 4.19.

$$
P=\left[\begin{array}{cccccc}
0 & 0 & 1 / 4 & 3 / 4 & 0 & 0  \tag{4.19}\\
0 & 0 & 1 / 4 & 3 / 4 & 0 & 0 \\
1 / 4 & 1 / 4 & 0 & 0 & 0 & 1 / 2 \\
1 / 2 & 0 & 0 & 0 & 1 / 4 & 1 / 4 \\
0 & 0 & 3 / 4 & 1 / 4 & 0 & 0 \\
0 & 0 & 3 / 4 & 1 / 4 & 0 & 0
\end{array}\right]
$$

Another approach to finding this transition matrix is to use the definitions first presented by Cariolaro and Tronca [24]. Define the input probability matrix $\theta_{u}$ for $u=1,2, \ldots, S$, where $S$ is the number of SWs as

$$
P_{u}\left(S W \mid l_{i}\right)_{\text {diag }}=\theta_{u}(i, j)=\left\{\begin{array}{cc}
P_{u}\left(S W \mid l_{i}\right), & i=j  \tag{4.20}\\
0 & \text { otherwise }
\end{array}\right.
$$

where $l_{i}$ denotes the $i^{\text {th }}$ state. Equation 4.20 is the probability of a particular SW occurring given the current state. Since there are 4 possible SWs and they are equally likely then there is a $1 / 4$ probability of any one of them being chosen regardless of the present state.

$$
\theta_{0}=\theta_{1}=\theta_{2}=\theta_{3}=\left[\begin{array}{cccccc}
.25 & 0 & 0 & 0 & 0 & 0  \tag{4.21}\\
0 & .25 & 0 & 0 & 0 & 0 \\
0 & 0 & .25 & 0 & 0 & 0 \\
0 & 0 & 0 & .25 & 0 & 0 \\
0 & 0 & 0 & 0 & .25 & 0 \\
0 & 0 & 0 & 0 & 0 & .25
\end{array}\right]
$$

Next define $E_{u}$ for $u=1,2, \ldots, S$ as the next-state matrices, where $E_{u}(i, j)=1$ if and only if state $l_{j}$ is entered from state $l_{i}$ given input $S_{u}$. For each possible SW the system will move to only one state and therefore the $E_{u}$ matrices will have a single 1 on each row. This is shown in Equation 4.22.

$$
\begin{array}{ll}
E_{0}=\left[\begin{array}{llllll}
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0
\end{array}\right] \quad E_{1}=\left[\begin{array}{llllll}
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 \\
1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0
\end{array}\right]  \tag{4.22}\\
E_{2}=\left[\begin{array}{llllll}
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 \\
1 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0
\end{array}\right] \quad E_{3}=\left[\begin{array}{llllll}
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0
\end{array}\right]
\end{array}
$$

By considering all the ways in which these state transitions can happen, and the probability of their occurrence, it is straightforward to verify that

$$
\begin{equation*}
P=\sum_{u=1}^{s} \theta_{u} E_{u} \tag{4.23}
\end{equation*}
$$

Using Equation 4.23 the transition matrix for the encoder is found to be

$$
\begin{align*}
& P=\left[\begin{array}{cccccc}
25 & 0 & 0 & 0 & 0 & 0 \\
0 & .25 & 0 & 0 & 0 & 0 \\
0 & 0 & .25 & 0 & 0 & 0 \\
0 & 0 & 0 & .25 & 0 & 0 \\
0 & 0 & 0 & 0 & .25 & 0 \\
0 & 0 & 0 & 0 & 0 & .25
\end{array}\right]\left[\begin{array}{llllll}
0 & 0 & 1 & 3 & 0 & 0 \\
0 & 0 & 1 & 3 & 0 & 0 \\
1 & 1 & 0 & 0 & 0 & 2 \\
2 & 0 & 0 & 0 & 1 & 1 \\
0 & 0 & 3 & 1 & 0 & 0 \\
0 & 0 & 3 & 1 & 0 & 0
\end{array}\right]  \tag{4.24}\\
& P=\left[\begin{array}{cccccc}
0 & 0 & .25 & .75 & 0 & 0 \\
0 & 0 & .25 & .75 & 0 & 0 \\
.25 & .25 & 0 & 0 & 0 & .5 \\
.5 & 0 & 0 & 0 & .25 & .25 \\
0 & 0 & .75 & .25 & 0 & 0 \\
0 & 0 & .75 & .25 & 0 & 0
\end{array}\right]
\end{align*}
$$

Next define $C_{u}$ for $u=1,2, \ldots, S$ as matrices, where the row $C_{u}(i)$ is the CW generated when state $l_{i}$ is entered due to a given input $S_{u}$. For the example above:

$$
\begin{align*}
& C_{0}=\left[\begin{array}{l}
01001111 \\
0100111 \\
1011000 \\
0100111 \\
10111000 \\
1011000
\end{array}\right] \quad C_{1}=\left[\begin{array}{l}
1010011 \\
1010011 \\
1010011 \\
0101100 \\
0101100 \\
0101100
\end{array}\right] \\
& C_{2}=\left[\begin{array}{l}
1101001 \\
1101001 \\
1010001 \\
0010110 \\
0010110 \\
0010110
\end{array}\right] \quad C_{3}=\left[\begin{array}{l}
0111010 \\
0111010 \\
1000101 \\
0111010 \\
1000101 \\
1000101
\end{array}\right] \tag{4.25}
\end{align*}
$$

With these definitions all information regarding the likelihood of entering a state and the CW emitted from each state is neatly formatted into a matrix representation. Note that the transition matrix $P$ found in Equation 4.24 is the same as the transition matrix found by inspection in Equation 4.19. $P$ can now be used to find the invariant probability vector $\pi$.

### 4.9.1. Finding the invariant probability vector of the encoder

From Section 4.8 .3 and Equation 4.15 the invariant probability vector $\pi$ can be found for any initial starting distribution $v$ by evaluating $v P^{n}$ as $n$ approaches infinity. The rows of $P^{n}$ approach the invariant distribution for large $n$ for regular stochastic matrices satisfying the property $\pi=\pi P$. However for the encoder described above, taking $P^{n}$ to large powers of $n$ shows that it alternates between the two matrices below for odd and even powers of $n$.

$$
P^{30}=\left[\begin{array}{cccccc}
\frac{3}{8} & \frac{1}{8} & 0 & 0 & \frac{1}{8} & \frac{3}{8}  \tag{4.26}\\
\frac{3}{8} & \frac{1}{8} & 0 & 0 & \frac{1}{8} & \frac{3}{8} \\
0 & 0 & \frac{1}{2} & \frac{1}{2} & 0 & 0 \\
0 & 0 & \frac{1}{2} & \frac{1}{2} & 0 & 0 \\
\frac{3}{8} & \frac{1}{8} & 0 & 0 & \frac{1}{8} & \frac{3}{8} \\
\frac{3}{8} & \frac{1}{8} & 0 & 0 & \frac{1}{8} & \frac{3}{8}
\end{array}\right] \quad P^{31}=\left[\begin{array}{ccccccc}
0 & 0 & \frac{1}{2} & \frac{1}{2} & 0 & 0 \\
0 & 0 & \frac{1}{2} & \frac{1}{2} & 0 & 0 \\
\frac{3}{8} & \frac{1}{8} & 0 & 0 & \frac{1}{8} & \frac{3}{8} \\
\frac{3}{8} & \frac{1}{8} & 0 & 0 & \frac{1}{8} & \frac{3}{8} \\
0 & 0 & \frac{1}{2} & \frac{1}{2} & 0 & 0 \\
0 & 0 & \frac{1}{2} & \frac{1}{2} & 0 & 0
\end{array}\right]
$$

This system has a chain of $d=2$ and will cycle between these $d$ matrices. This is because the Markov chain is not regular, and it cannot be expected to converge. The invariant distribution as found by Equation 4.17 is

$$
\begin{align*}
& \pi=\left[\begin{array}{llllll}
3 / 16 & 1 / 16 & 1 / 4 & 1 / 4 & 1 / 16 & 3 / 16
\end{array}\right] \tag{4.27}
\end{align*}
$$

This means that if the system was running for some amount of time and the number of times each state was visited was recorded, it would be found that the invariant distribution is $\pi=\left[\begin{array}{llllll}3 / 16 & 1 / 16 & 1 / 4 & 1 / 4 & 1 / 16 & 3 / 16\end{array}\right]=\left[\begin{array}{llllll}1875 & .0625 & .25 & .25 & .0625 & \text {.1875 }\end{array}\right]$, which is the probability of being in any particular state. Combining this information with the state diagram shown in Figure 4-15, it can be seen that the system is in the two most popular states, states 2 and 3 that correspond to an RDS of $0,50 \%$ of the time.


Figure 4-15: Markov Chain State Diagram with State Probabilities

Using Markov chain analysis it is now possible to evaluate the likelihood of a particular CW being sent. On average the probability of a CW being transmitted from any state is $1 / 4$, weighted by the probability of being in the state that emitted the CW.

For example, the probability of being in state 0 is $18.75 \%$. Therefore the CWs emitted from this state $\{27 \mathrm{~h}, 53 \mathrm{~h}, 69 \mathrm{~h}, 3 \mathrm{Ah}\}$ would be seen on the channel ( $18.75 \%$ )( $25 \%$ ) or $4.68 \%$ of the time. Likewise the probability of being in state 1 is $6.25 \%$. Therefore even though the CWs emitted from this state are the same $\{27 \mathrm{~h}, 53 \mathrm{~h}, 69 \mathrm{~h}, 3 \mathrm{Ah}\}$, they would only be seen on the channel $(6.25 \%)(25 \%)$ or $1.56 \%$ of the time. Continuing in this manner the likelihood of all possible CWs being transmitted can be calculated. With this information it is now possible to characterize the encoder in terms of runlength probabilities.

### 4.10. Finding Runlength Probabilities

To find the runlength probabilities it is convenient to decompose the state diagram into the format shown in Figure $4-16$. From this representation it easy to see the probability of specific CWs as the encoder enters and leaves states.


Figure 4-16: Decomposed state diagram illustrating the probability of a SW/CW entering each state, and probability of each CW leaving each state, allowing straightforward calculation of runlengths

For example, Figure $4-16$ shows that CW 58 h enters state 0 with probability $(.25)(.25)=.0625$. This is because CW 58 h is one of four possible CWs that can be emitted from state 2 , which the system visits on average $25 \%$ of the time. Once the system is in state 0 there are 4 possible CWs $\{27 \mathrm{~h}, 53 \mathrm{~h}, 69 \mathrm{~h}, 3 \mathrm{Ah}\}$ that can be emitted all with probability $1 / 4$. Hence the probability of the particular sequence 58 h followed by 27 h is $(.25)(.25)(.25)=.015625$.

In order to accurately evaluate runlength probabilities the concatenation of CWs must be taken into account. For example Figure 4-17 shows the sequence formed through the concatenation of the two CWs 58 h and 27 h .

```
10110000100111 «5827
1011 0000100 1114- Portion of interest
```

Figure 4-17: Concatenation of two CWs and the Portion of Interest

Note that CWs 58 h and 27 h alone only have a worst case runlength of 3 like valued bits. However once they are concatenated a runlength of 4 like valued bits appears. Therefore the string of three 0 s at the end of CW 58 h cannot be counted simply as a run of three, since it may lead to a longer run of $0 s$ when concatenated with the next CW. Likewise the final string of three 1 s in CW 27 h is not counted as a run of three either since it may lead to a longer run of 1 s when it is concatenated to the next CW . As shown in Figure 4-17 the middle portion of the concatenated CWs is called the portion of interest.

### 4.11. Analysis of runlength probabilities

As previously discussed the probability of this particular two CW sequence 58 h followed by 27 h is $(.25)(.25)(.25)=.015625$ or $(.25)^{* 1 / 4} * 1 / 4$. The probability of being within the seven bit portion of interest out of these fourteen bits is $7 / 14$. Inspection shows that there is one run of 1 , one run of 2 and one run of 4 that will occur with probability $1 / 7,2 / 7$ and $4 / 7$ respectively. Finally since the portion of interest spans two CWs, and the particular run of interest could commence at any encoding interval, including the $n^{\text {th }}$ and $(n+1)^{\text {th }}$ intervals, then the above sequence probability must be multiplied by two. For this example then,

$$
\begin{align*}
& \text { Run of } 1=.25\left(\frac{1}{4}\right)\left(\frac{1}{4}\right) 2\left(\frac{7}{14}\right) \frac{1}{7}=\frac{1}{448}=.002232 \\
& \text { Run of } 2=.25\left(\frac{1}{4}\right)\left(\frac{1}{4}\right) 2\left(\frac{7}{14}\right) \frac{2}{7}=\frac{2}{448}=.004464  \tag{4.28}\\
& \text { Run of } 4=.25\left(\frac{1}{4}\right)\left(\frac{1}{4}\right) 2\left(\frac{7}{14}\right) \frac{4}{7}=\frac{4}{448}=.008929
\end{align*}
$$

Enumeration like this is required for all possible two CW combinations, and even for this simple system there are 96 possible two CW combinations. Since this calculation is best performed by a computer, two programs were written.


Figure 4-18: Build_EuCu,exe Program that builds $\theta_{u}, E_{u}$ and $C_{u}$ matrices

The first program Build_EuCu.exe is shown in Figure 4-18 and it allows the user to select the expurgated code, selection criteria, source statistics and AddSWs. It then generates three files. The first file generated is shown below in Figure 4-19 called ProbOf1_50_States_(7,2)_[0,4,B,F]_Lowest_RDS.txt, and it shows the RDS and LB states generated by the code. These states are found by testing all SW/CW combinations and they are sorted from lowest to highest to show the symmetry in the system. Inspection shows that these are the same states from Figure 4-15.


Figure 4-19: Output states from Build_EuCu.exe

Two other files are generated by this program. One is used in the evaluation of the power spectral density that will not be considered until Chapter 5, and the other contains the $E_{u}$ and $C_{u}$ matrices shown in Figure 4-20.

| SW 0 |  | SW 1 |  | SW 2 |  | SW 3 |  |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 3 | 27 | 3 | 53 | 3 | 69 | 2 | 3A |
| 3 | 27 | 3 | 53 | 3 | 69 | 2 | 3A |
| 0 | 58 | 5 | 53 | 5 | 69 | 1 | 45 |
| 5 | 27 | 0 | 2 C | 0 | 16 | 4 | 3A |
| 2 | 58 | 2 | 2 C | 2 | 16 | 3 | 45 |
| 2 | 58 | 2 | 2 C | 2 | 16 | 3 | 45 |

Figure 4-20: Output $E_{u}$ and $C_{u}$ matrices from Build_EuCu.exe

This file lists the $E_{0}, C_{0}, E_{1}, C_{1}, E_{2}, C_{2}, E_{3}, C_{3}$ matrices as shown in Equations 4.22 and 4.25 respectively. The difference is that instead of writing the information in binary matrix form, it is written in a compact hexadecimal form. The reason for this compact notation is because this data is read into the second program CalcPIMandProbs.exe which is shown in Figure 4-21. This program parses the $E_{u}$ and $C_{u}$ information to calculate the transition matrix, invariant probability vector and runlength probabilities. The compact form may not seem necessary for this $(7,2)$ code which only had six states and four possible SWs. However large codes like the $(15, x)$ and $(31, x)$ codes could have thousands of CWs and hundreds of states and hence this file can grow to hundreds of megabytes. Therefore the most compact form possible was adopted.


Figure 4-21: CalcPIMandProbs.exe

The information in this file is read in a top to bottom fashion as follows. Looking at the first line in Figure $4-20$, when SW 0 is chosen and the system is in state 0 , it moves to state 3 and emits CW 27h. Likewise on the next line down, when SW 0 is chosen and the system is in state 1 , it moves to state 3 and also emits CW 27 h and so on.

CalcPIMandProbs.exe takes this information and generates a single file that is shown in Figure 4-22. Inspection shows that it has calculated the same transition matrix as found in Equation 4-24. Furthermore it has calculated the same invariant vector as found in Equation 4-27 and finally it has calculated all the runlength probabilities. It shows that this system with AddSWs \{0h,4h,Bh,Fh\}using the lowest RDS selection criteria has on average a $37.94 \%$ chance of a run of one, $35.71 \%$ chance of a run of two, $22.76 \%$ chance of a run of three and a $3.57 \%$ chance of a run of four. Runlengths of five or more will never occur.


Figure 4-22: Output from CalcPIMandProbs,exe

An additional feature of CalcPIMandProbs is that it can also show the intermediate steps of how the runlengths were calculated. This feature is only feasible for small codes because the file size grows very quickly. Nevertheless the intermediate steps are shown in Figures $4-23$ to $4-26$ where it can be seen that the same runlength probabilities are calculated as shown in Equation 4.28.


Figure 4-23: Runlength probabilities with SW 0


Figure 4-24: Runlength probabilities with SW 1


Figure 4-25: Runlength probabilities with SW 2


Figure 4-26: Runlength probabilities with SW 3

### 4.12. Code Word Search

In order to determine the best AddCW sets, the effect that different AddCW sets have on the number of transitions and runlength probabilities must be evaluated. To do so involves testing each set of AddCWs using the above Markov chain analysis and searching for the set that has the best statistics. This was done through an exhaustive computer search. For example, for the $(7,1)$ code the exhaustive search will begin with the AddSW set $\{0 \mathrm{~h}, 2 \mathrm{~h}, 4 \mathrm{~h}, 6 \mathrm{~h}, 9 \mathrm{~h}, \mathrm{Bh}, \mathrm{Dh}, \mathrm{Fh}\}$, since $\mathrm{SWs} 2 \mathrm{~h}, 4 \mathrm{~h}$ and 6 h are the first SWs to meet the selection criteria from Section 3.5 and Figure 3-21. Furthermore SWs 9h, Bh and Dh are chosen because they are the SW complements of $2 \mathrm{~h}, 4 \mathrm{~h}$ and 6 h , and SWs 0 h and Fh are chosen since it was recommended that all AddCW sets include the all-zero and all-one CW. Note that this AddSW set must be encoded into its corresponding AddCW set. After gathering statistics on this AddSW set the exhaustive search will increment SW 6h to the next SW 7h, whose complement is 8 h , and as a result the AddSW set $\{0 \mathrm{~h}, 2 \mathrm{~h}, 4 \mathrm{~h}, 7 \mathrm{~h}, 8 \mathrm{~h}, \mathrm{Bh}, \mathrm{Dh}, \mathrm{Fh}\}$ will be tested. In this fashion the computer search would continue testing the following AddSW sets $\{0 \mathrm{~h}, 2 \mathrm{~h}, 5 \mathrm{~h}, 6 \mathrm{~h}, 9 \mathrm{~h}, \mathrm{Ah}, \mathrm{Dh}, \mathrm{Fh}\}$, $\{0 h, 2 h, 5 h, 7 h, 8 h, A h, D h, F h\},\{0 h, 3 h, 4 h, 6 h, 9 h, B h, C h, F h\},\{0 h, 3 h, 4 h, 7 h, 8 h, B h, C h, F h\}$, $\{0 h, 3 h, 5 h, 6 h, 9 h, A h, C h, F h\}$ and $\{0 h, 3 h, 5 h, 7 h, 8 h, A h, C h, F h\}$, for testing a total of eight sets. Note that the total number of AddSW sets tested agrees with the number given by Equation 3.1.


Figure 4-27: CW search program

To perform this computer search the program FindAllProbs.exe in Figure 4-27 was created, which is a combination of the two programs BuildEuCu.exe and CalcPIMandProbs.exe.

Like the previous two programs FindAllProbs allows the user to choose the selection criteria and the expurgated code of interest. The major difference here is that the AddSWs are not specified. Instead this program runs through all possible AddSW sets (automatically converting them to AddCW sets) and tallies the statistics. For the $(7,2)$ code considered so far, FindAllProbs will test the four AddSW sets $\{0 h, 4 h, B h, F h\}$, $\{0 h, 5 h, A h, F h\},\{0 h, 6 h, 9 h, F h\}$ and $\{0 h, 7 h, 8 h, F h\}$.


Figure 4-28: CW search results for the $\mathbf{( 7 , 2 )}$

Figure 4-28 shows the results for the $(7,2)$ code found by FindAllProbs. It can be seen that for the AddSWs $\{0 \mathrm{~h}, 4 \mathrm{~h}, \mathrm{Bh}, \mathrm{Fh}\}$, the runlength probabilities match the results from the previous example in Figure 4-22. Furthermore, using the AddSWs \{0h,7h,8h,Fh\} leads to runs of length five $2.2 \%$ of the time, confirming the result shown in Figure 4-10.

Inspection of this output file shows that the two sets $\{0 \mathrm{~h}, 5 \mathrm{~h}, \mathrm{Ah}, \mathrm{Fh}\}$ and $\{0 \mathrm{~h}, 6 \mathrm{~h}, 9 \mathrm{~h}, \mathrm{Fh}\}$ are the best since they only have runs up to length 3 , i.e. a transition will occur in the output sequence at worst case every three bits. Deciding on a clear winner between these two sets however is subjective, since even though set $\{0 \mathrm{~h}, 6 \mathrm{~h}, 9 \mathrm{~h}, \mathrm{Fh}\}$ has the highest probability of runs of length one with $37 \%$, it also has a higher probability of runlengths of length three with $25.4 \%$.

### 4.13. Analysis and CW Search on larger codes

Up to this point only the $(7, x)$ codes have been discussed since their small size makes them easy to present and demonstrate. However, this multimode coding scheme with a length $n=7$ code suffers greatly in terms of code rate from the increased redundancy. Therefore larger codes such as the $(15,11)$ and $(31,26)$ codes are now considered.

With eight AddCWs the $(15,11)$ code is expurgated to a $(15,8)$ code resulting in $2^{8}=256$ possible SWs. This means there will be $(1)\left(2^{8}\right)\left(2^{8}\right)\left(2^{8}\right)(1)(1)(1)(1)=2^{24}$ AddCW sets to analyze based on the arguments from Section 3.6.5 and Equation 3.1. For each AddCW set the state diagram of the encoder must be constructed. This can only be found by testing all possible SWs, determining the CWs produced, and keeping track of the resulting states. For each newly discovered state however, the process must be repeated because each CW transmitted has the potential to cause a new state.

For instance, consider the case of transmitting the 15 -bit all-one CW from the (RDS,LB) state of $(0,0)$. This would put the system into a new state of $(15,1)$. If the allone CW is transmitted again it would put the system into another new state of $(30,1)$ and so on. This scenario simply illustrates how each CW has the potential to cause new states. On the other hand, if lowest RDS is the primary metric (balanced transmission), many of the states discovered should be duplicates. That is, the number of states in the system should not grow unbounded since ideally the system is attempting to return to the states with an RDS of 0 . However, if the maximum transitions or MSW selection criteria is used as the primary metric, then as successive words are encoded, it is possible for the
encoder to have an infinite number of states. For example, many CWs contain numerous transitions but they are unbalanced. Consider the case when the system chooses one of these CWs that results in the (RDS,LB) state of $(7,1)$. In the next encoding interval one of these CWs might be chosen again, resulting in the (RDS,LB) state of $(14,1)$. Continuing in this fashion with maximizing transitions as the primary metric, it is entirely possible that the system will continue choosing these CWs causing the number of states to be $(21,1),(28,1),(35,1)$ and so on, and therefore grow unbounded as encoding proceeds.

To limit the evaluation time a hard upper limit of 256 states was established for the number of states considered by FindAllProbs regardless of the primary decision metric. Any AddCW set that exceeds this limit is discarded.

Once the state diagram is found, the $\theta_{u}, E_{u}$ and $C_{u}$ matrices must be constructed in order to build the transition matrix and solve for the invariant distribution. Once this information is complete the runlengths can be found using the analysis presented in Section 4.10. Recapping this information shows how this problem grows in complexity.

With this $(15,8)$ code and 256 possible SWs, it can be estimated that each state can be entered and exited 256 ways. Assuming that there are only 32 states (well below the upper limit of 256 ), this will represent $\left(256^{2}\right)(32)$ iterations. This amount of work however is only for a single AddCW set and must be done once for each of the $2^{24}$ sets (from Section 3.6.5). Furthermore each iteration involves numerous matrix multiplications. As a result the computer search for the $(15,8)$ code needed to be run over a three month period using sixteen Pentium 3 machines. This intensive task was split up such that each computer worked on sets of 262144 AddCWs at a time, with each set taking approximately one week to complete running twenty-four hours a day.

It is obvious then that conducting the same CW search for the $(31,26)$ code is just not feasible since even when expurgated down to a $(31,23)$ code it still has over $2^{69}$ possible AddCW sets to test (from Section 3.6.5). Therefore the results gathered for the $(15,8)$ code will be used to make suggestions on what constitutes a good AddCW set for larger codes, which can be extrapolated to $(31, x)$ codes, $(63, x)$ codes and larger.

### 4.14. CW Search Results

Recall from Figure 3-22 that the AddCWs (or AddSWs since the codes considered are systematic) $\boldsymbol{a}, \boldsymbol{b}$, and $\boldsymbol{c}$ must have their MSbs in the binary format 001,010 and 011 . As a result the $(15,8) \mathrm{CW}$ search compared the 11 -bit SWs $a, b$, and $\boldsymbol{c}$ in the range 100 h to $1 \mathrm{FFh}, 200 \mathrm{~h}$ to 2 FFh and 300 h to 3 FFh respectively. Out of all the AddSW/AddCW sets tested only four had maximum runlengths of 7 . As well each of these four AddCW sets had a maximum of 14 states which means the RDS values of the sequences only varied in the range $\pm 3$ (Note that this means RDS values of $-3,-2,-1,0$, $+1,+2$, and +3 are possible, as well as LB values of 0 and 1). Therefore these four AddSW/AddCW sets were considered to be the best. They are shown in Table 4-7 and their runlength probabilities are shown in Table 4-8.

Table 4-7: Best AddSW sets from the $(\mathbf{1 5 , 8})$ search

| $\mathbf{A}$ | 0 h | 107 h | 2 C 8 h | 323 h | 4 DCh | 537 h | 6 F 8 h | 7 FFh |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\mathbf{B}$ | 0 h | 14 Dh | 298 h | 370 h | 48 Fh | 564 h | 6 B 2 h | 7 FFh |
| $\mathbf{C}$ | 0 h | 1 AAh | 29 Bh | 323 h | 4 DCh | 564 h | 655 h | 7 FFh |
| $\mathbf{D}$ | 0 h | 1 E 7 h | 2 CBh | 323 h | 4 CCh | 537 h | 618 h | 7 FFh |

Table 4-8: Runlength probabilities for the Best AddCWs

|  | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| \% | 0.3515993125 | 0.3252365187 | 0.1917786803 | 0.0966918019 | 0.0288151237 | 0.0056458649 | 0.0002328572 | 0 |
| \% | 0.3698323555 | 0.3238076524 | 0.2029532910 | 0.0765068932 | 0.0209136549 | 0.0059514779 | 0.0000347347 | 0 |
| C | 0.3582579240 | 0.3189974631 | 0.1980528103 | 0.0844292445 | 0.0340671827 | 0.0056263416 | 0.0005492588 | 0 |
| D | 0.3549359883 | 0.3238715642 | 0.1886010711 | 0.0963552075 | 0.0292343068 | 0.0067199212 | 0.0002818141 | 0 |

Inspection shows that these four best AddSW sets could guarantee runlengths of four or less $96.53 \%$ of the time. This means that the output sequence will have a transition every four bits (or less) $96.53 \%$ of the time. Furthermore these AddSW sets could guarantee runlengths of five or less $99.41 \%$ of the time, and six or less $99.97 \%$ of the time. Only $0.023 \%$ of the time would runlengths ever be higher than this to a maximum of seven. These are great results considering the CW length is fifteen.

In addition there were only four AddCW sets considered to be the worst. These sets had maximum runlengths of 15 and a maximum of 28 states representing system

RDS values that varied in the range $\pm 7$. These sets are shown in Table 4-9 with their runlength probabilities shown in Table 4-10.

Table 4-9: Worst AddSW sets from the $(15,8)$ search

| E | Oh | 108h | 200n | 300h | 4FFh | 5FFh | 6F7h | 7FFh |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| F | On | 10Ch | 200h | 300h | 4FFh | 5FFh | 6F3h | 7FFh |
| G | On | 120h | 200h | 300h | 4FFh | 5FFh | 6DFh | 7FFh |
| H | Oh | 130h | 200h | 300h | 4FFh | 5FFh | 6CFh | 7FFin |

Table 4-10: Runlength probabilities for the Worst CWs

|  | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 6 |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| $\mathbf{B}$ | 0.328473565 | 0.304656190 | 0.189284263 | 0.106383216 | 0.044738961 | 0.015932939 | 0.004961433 | 0.003010110 |
| $\mathbf{F}$ | 0.337667123 | 0.309838523 | 0.182793727 | 0.117408239 | 0.036369763 | 0.011969801 | 0.003141914 | 0.000671532 |
| $\mathbf{G}$ | 0.328816060 | 0.306967874 | 0.194495320 | 0.103729269 | 0.040345690 | 0.019072758 | 0.003513444 | 0.003903702 |
| $\boldsymbol{H}$ | 0.330055271 | 0.306040341 | 0.196458392 | 0.1 .10386161 | 0.036408923 | 0.015802792 | 0.001505133 | 0.000846763 |


|  | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| H | 0.002508024 | 3.740718-05 | 1.253448-05 | 9.944398-07 | 2.313834-07 | 7.45758-08 | 2.72768-08 | 0 |
| F | 0.000107569 | 2.492738-05 | 5.124678-06 | 1.482138-06 | 1.826898-07 | 4.99018-08 | 2.67338-08 | 0 |
| G | 0.000132108 | 1.813168-05 | 4.142278-06 | 1.158898-06 | 1.984281-07 | 7.32938-08 | 2.69138-08 | 0 |
| $\boldsymbol{H}$ | 0.002445873 | 4.117338-05 | 2. 948518-06 | 5.934328-06 | 1.86449H-07 | 5.1507R-08 | 2.75938-08 | 0 |

Even though these four AddSW sets had the worst statistics, they could still guarantee runlengths of four or less $92.87 \%$ of the time, runlengths of seven or less $99.44 \%$ of the time, and only $0.56 \%$ of the time would runlengths be higher than this to a maximum of fifteen. Note that the probability of a run of fifteen occurring is extremely small at $0.00000027 \%$ (i.e. $2.7276 \mathrm{E}-6 \%$ ).

### 4.15. AddCW recommendations

No distinct pattern emerges from inspection of Tables 4-7 to 4-10. However some general conclusions can be drawn from these results and recommendations can be made.

Details of the worst AddCW sets from Table 4-9 are shown in Table 4-11 to Table 4-14. These tables show that on average the Hamming weight is 3.5 , the average number of transitions is 4.9 , and ratio of 0 -bits to 1 -bits is 11.5 to 3.5 . Clearly these AddCWs are unbalanced with few transitions and on average will do very little to help the constrained sequence goals of the system.

Table 4-11: AddSW set $\{0 \mathrm{~h}, \mathbf{1 0 8 h}, \mathbf{2 0 0 h}, \mathbf{3 0 0 h}, \mathbf{4 F F h}, 5 \mathrm{FFh}, \mathbf{6 F 7 h}, \mathbf{7 F F h}\}$

| SW | CW | Weight | Tran | RDS | $0: 1$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 108 h | 1084 h | 3 | 6 | -9 | $12: 3$ |
| 200 h | 200 h | 4 | 5 | -7 | $11: 4$ |
| 300 h | 3002 h | 3 | 4 | -9 | $12: 3$ |

Table 4-12: AddSW set $\{\mathbf{0 h}, \mathbf{1 0 C h}, \mathbf{2 0 0 h}, \mathbf{3 0 0 h}, \mathbf{4 F F h}, \mathbf{5 F F}, \mathbf{6 F 3 h}, 7 \mathrm{FFh}\}$

| SW | CW | Waight | Tran | RDS | $0: 1$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 10 Ch | 10 Ch | 4 | 6 | -7 | $11: 4$ |
| 200 h | 200 D | 4 | 5 | -7 | $11: 4$ |
| 300 h | 3002 h | 3 | 4 | -9 | $12: 3$ |

Table 4-13: AddSW set $\{0 \mathrm{~h}, \mathbf{1 2 0 h}, \mathbf{2 0 0 h}, \mathbf{3 0 0 h}, \mathbf{4 F F h}, 5 \mathrm{FFh}, 6 \mathrm{DFh}, 7 \mathrm{FFh}\}$

| SW | CW | Weight | Tran | RDS | $0: 1$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 120 h | 1205 h | 4 | 7 | -7 | $1:: 4$ |
| 200 h | 200 h | 4 | 5 | -7 | $11: 4$ |
| 300 h | 3002 h | 3 | 4 | -9 | $12: 3$ |

Table 4-14: AddSW set $\{0 \mathrm{~h}, \mathbf{1 3 0 h}, \mathbf{2 0 0 h}, \mathbf{3 0 0 h}, \mathbf{4 F F h}, 5 \mathrm{FFh}, 6 \mathrm{CFh}, 7 \mathrm{FFh}\}$

| SW | CW | Weight | Tran | RDS | $0: 1$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| $130 h$ | $1300 h$ | 3 | 4 | -9 | $12: 3$ |
| $200 h$ | $200 D h$ | 4 | 5 | -7 | $11: 4$ |
| $300 h$ | $3002 h$ | 3 | 4 | -9 | $12: 3$ |

Therefore the first recommendation to make regarding the choice of a good AddCW set regardless of code size is to choose relatively balanced AddCWs that contain a significant number of transitions. This is validated by inspecting the statistics of the best AddCW sets as shown in Table 4-15 to Table 4-18. Here the average Hamming weight is 7.5 , the average number of transitions is 7.5 and the average ratio of 0 -bits to 1 -bits is also 7.5.

Table 4-15: AddSW set $\{0 h, 107 \mathrm{~h}, 2 \mathrm{C} 8 \mathrm{~h}, \mathbf{3 2 3 h}, 4 \mathrm{DCh}, 537 \mathrm{~h}, \mathbf{6 F 8 h}, 7 \mathrm{FFh}\}$

| SW | CN | Weight | Tran | RDS | $0: 1$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 107 h | 1076 h | 6 | 6 | -2 | $9: 6$ |
| 2 C h | 2 CBFh | 8 | 7 | +1 | $7: 8$ |
| 323 h | 323 Dh | 8 | 7 | +1 | $7: 8$ |

Table 4-16: AddSW set $\{0 \mathrm{~h}, 14 \mathrm{Dh}, 29 \mathrm{Bh}, \mathbf{3 7 0 h}, 48 \mathrm{Fh}, 564 \mathrm{~h}, \mathbf{6 B 2 h}, 7 \mathrm{FFh}\}$

| SW | CW | Weight | Tran | RDS | $0: 1$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| $14 D h$ | $14 D C h$ | 7 | 8 | -1 | $8: 7$ |
| 29 Bh | 2988 h | 7 | 8 | -1 | $8: 7$ |
| 370 h | 370 Ah | 7 | 8 | -1 | $8: 7$ |

95

Table 4-17: AddSW set $\{0 \mathrm{~h}, 1 \mathrm{AAh}, 29 \mathrm{Bh}, \mathbf{3 2 3 h}, 4 \mathrm{DCh}, 564 \mathrm{~h}, \mathbf{6 5 5 h}, \mathbf{7 F F h}\}$

| SW | CWN | Weight | Tran | RDS | $0: 1$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| $1 A A h$ | $1 A A 6 h$ | 7 | 10 | -1 | $8: 7$ |
| 29 Bh | $29 B 8 h$ | 7 | 8 | -1 | $6: 7$ |
| 323 h | $323 D h$ | 8 | 7 | +1 | $7: 8$ |

Table 4-18: AddSW set $\{0 \mathrm{~h}, 1 \mathrm{E} 7 \mathrm{~h}, \mathbf{2 C 8 h}, \mathbf{3 2 3 h}, 4 \mathrm{DCh}, 537 \mathrm{~h}, \mathbf{6 1 8 h}, \mathbf{7 F F h}\}$

| SM | CW | Weight | Tran | RDS | $0: 1$ |
| :---: | :---: | :---: | :---: | :---: | :---: |
| $1 E 7 h$ | $1 E 75 h$ | 9 | 7 | +3 | $6: 9$ |
| $2 C 8 h$ | $2 C 8 F h$ | 8 | 7 | +1 | $7: 8$ |
| $323 h$ | $323 D h$ | 8 | 7 | +1 | $7: 8$ |

Closer inspection of CWs 2C8Fh and 323Dh in Tables 4-15 and 4-18 show that they are cyclically shifted versions of one another. As well all CWs in Table 4-16 are cyclically shifted versions of one another. However the CWs in Table 4-17 show no such symmetry.

As a result it is difficult to make precise recommendations on what constitutes a good AddCW set. Furthermore it is tempting to assume that in this system all eight AddCWs are used equally. This however is not the case as can be seen in Figure 4-29. Here the SimFPGA output shows that AddCW1 (1076h), AddCW2 (2C8Fh) and their CW complements AddCW5 (5370h) and AddCW6 (6F89h), are each used approximately $16 \%$ of the time. On the other hand AddCW3 (323Dh) and its CW complement AddCW4 (4DC2h) are used less at approximately $13 \%$. Finally the all-zero and all-one CWs (AddCW0 and AddCW7) are only used approximately $3.7 \%$ of the time. Thus, deciding on what constitutes a good AddCW is made more difficult by the fact that not all AddCWs are used with the same frequency. Note that this information is found simply by counting how many times each AddCW is used when building the state diagram.


Figure 4-29: SimFPGA keeps track of percentage of time each AddCW is used

It can be stated that in general for codes with odd length, a good AddCW will belong to a set of CWs that are balanced in the fashion $(n-1) / 2:(n+1) / 2$ or $(n+1) / 2:(n-1) / 2$. From this set the best AddCWs will consist of those with as many transitions as possible. Furthermore if any of the AddCWs can be cyclically shifted to yield other AddCWs whose MSbs are in the form 001, 010, 011 (as shown in Figure 3-21), then these are most likely good AddCWs.

However since CWs such as 1076 h exist in the best set, and such CWs are not balanced and have less than the average amount of transitions, this is a loose recommendation. Nevertheless when these recommendations are followed the maximum runlengths produced can be expected to be near the minimal values observed of $(n-1) / 2$, where $n$ is the CW length.

Further analysis on what constitutes good AddCWs can also be done in the frequency domain by inspecting the power spectral density of these codes. This is considered in Chapter 5.

## 5. Power Spectral Density Results

Chapter 3 introduced the multimode coding technique and Chapter 4 discussed how to model it as a finite state machine and how to analyze it using Markov chain theory. Chapter 4 then continued by presenting the results of the AddCW enumeration for the $(7, x)$ and $(15, x)$ codes. It was shown that a great deal of insight about what constitutes a good AddCW set can be found by analyzing these time domain statistics. However an equally suitable method of assessing this combined EC and CS code is to calculate its power spectral density (PSD). This is because the coding rules of this technique will directly influence the shape of the power spectrum. This chapter presents how to theoretically calculate the PSD of an encoded sequence and how to interpret the results.

### 5.1. Relationship between the time and frequency domain

As discussed in Section 2.19 a binary sequence using bipolar coding maps logic 0 values to a negative square pulse shape $-p(t)$, and logic 1 values to a positive square pulse shape $+p(t)$. When a transmitted sequence contains an overall equal number of logic 0 s and 1s the transmission is said to be balanced. This is because the transmitted signal has a negative amplitude half of the time and an equal but opposite positive amplitude the other half of the time. As a result the dc average is 0 volts. Therefore inspecting this balanced signal in the frequency domain would reveal that the PSD would have a null at 0 Hz . For example Figure 5-1 shows the PSD of the Manchester code presented in Section 2.21. Recall that this line coding scheme maps logic 0 values to the 2 -bit pattern 10 , and maps logic 1 values to the complement 2-bit pattern 01. This guarantees at least one transition per bit as well as a completely balanced transmission [5]. As a result it can be seen that the PSD of this balanced coding technique has a null (zero power) at 0 Hz . This is shown on both the linear and logarithmic scales in Figure 5-1.


Figure 5-1: Demonstrating nulls at 0 Hz on both linear and logarithmic scales

To understand how coding can effect the shape of the PSD, consider Figure 5-2 which shows an output sequence with long runs of 0 s and 1 s . It is clear that a signal such as this will appear as a slowly varying square wave. As a result the PSD of this sequence would have the majority of its power at low frequencies.
low frequency

$$
1 1 1 1 1 1 1 0 0 0 0 0 0 0 \longdiv { 1 1 1 1 1 }
$$

Figure 5-2: Low frequency content of a signal

On the other hand consider an output sequence that has numerous transitions as shown in Figure 5-3. A signal such as this will appear as a rapidly changing square wave. These rapid changes are indicative of high frequencies in the signal. As a result this sequence would have a significant amount of its power at high frequencies.
high frequency

Figure 5-3: High frequency content of a signal

Since the multimode coding technique of Chapter 3 is designed to remove long runs of like valued bits (low frequencies) and introduce more transitions (high frequencies), the PSD of the encoded waveform is expected to have most of its power at high frequency and less power at low frequency. Thus the performance of the multimode coding system can be evaluated by analyzing the PSD of the output sequence.

### 5.2. Evaluating the Power Spectral Density

Communication signals are inherently random. This is because the receiver does not know beforehand the message that the transmitter sent. As a result a communication signal is usually treated as a random process, since the data is random with time. Each possible waveform is called a sample function and the collection of all possible waveforms is called the ensemble of the random process [23].

To completely characterize the random process a probability density function (PDF) or cumulative distribution function (CDF) would need to be known at every instant in time. Since this information is usually difficult to obtain, random processes are often characterized in terms of average sequence statistics [23]. There are generally two approaches used to evaluate the average statistics. A time average considers a single sample function over all time, and an ensemble average considers all possible sample functions and averages them at a single point in time. Processes where the time averages equal the ensemble averages are called ergodic [23]. If the random process has statistics that do not change with time it is also called stationary [23].

Useful information to obtain about a random process is the mean or expected value, as well as how quickly the process is expected to change which is given by its autocorrelation function [23]. The mean value of a random process has an intuitive meaning as the dc content of the signal and the autocorrelation function gives an indication of the frequency content of the random process. This is due to the fact that the correlation of a random process with itself at two different points in time depends on how rapidly the amplitude is expected to change with time [23]. All of these topics are investigated in detail with examples in Appendix A.

### 5.3. Evaluating the Power Spectral Density of a Coded System

As previously stated the coding rules of the multimode coding system introduced in Chapter 3 will have a direct influence on the shape of the PSD. Furthermore these coding rules make calculating the PSD slightly more complicated. As a result this section demonstrates through example the efficient procedure for calculating the PSD of a coded system as outlined by Cariolaro and Tronca [24]. Their method uses the finite state machine model of the encoder first presented in Section 4.7, and the Markov chain analysis presented in Section 4.9.

The multimode coding system is designed to transmit an output sequence with an equal number of 0 s and 1 s even if the source statistics are unbalanced. Thus if the logic values 0 and 1 are represented with voltage levels of 0 V and 1 V respectively, the average value transmitted is $(0+1) / 2=0.5 \mathrm{~V}$. However if a system used the polar NRZ format from Section 2.19 with 0 s and 1 s mapped to $\pm p(t)$, with $p(t)$ being a square pulse shape of amplitude 1 V , then the average value transmitted would be 0 since $(-1+1) / 2=0 \mathrm{~V}$. Therefore in terms of output CWs one can intuitively expect that the average CW will be the all-zero sequence. The output is collectively known as the mean sequence and the average CW is called the mean symbol vector $\eta_{c}$ [24].

Recall from the Markov chain analysis in Section 4.9 the input probability matrix $\theta_{u}$, code word matrix $C_{u}$ and long term invariant distribution vector $\pi$. Using these definitions it can be seen that each $\mathrm{CW} c_{u}(i)$ will occur with probability $\pi(i) \theta_{u}(i, i)$. Therefore with mutually exclusive input symbols the mean symbol vector $\eta_{c}$ or average CW is defined as

$$
\begin{equation*}
\eta_{c}=\pi \sum_{u=1}^{S} \theta_{u} c_{u} \tag{5,1}
\end{equation*}
$$

Continuing with the example from Section 4.9 the mean symbol vector would be

$$
\begin{align*}
& \eta_{c}=\left[\begin{array}{lllll}
.1875 & .0625 & .25 & .0625 & .1875
\end{array}\right]\left[\begin{array}{cccccc}
.25 & 0 & 0 & 0 & 0 & 0 \\
0 & .25 & 0 & 0 & 0 & 0 \\
0 & 0 & 25 & 0 & 0 & 0 \\
0 & 0 & 0 & .25 & 0 & 0 \\
0 & 0 & 0 & 0 & .25 & 0 \\
0 & 0 & 0 & 0 & 0 & .25
\end{array}\right]\left[\begin{array}{lllllll}
2 & 3 & 2 & 2 & 1 & 3 & 3 \\
2 & 3 & 2 & 2 & 1 & 3 & 3 \\
4 & 1 & 2 & 2 & 1 & 1 & 3 \\
0 & 3 & 2 & 2 & 3 & 3 & 1 \\
2 & 1 & 2 & 2 & 3 & 1 & 1 \\
2 & 1 & 2 & 2 & 3 & 1 & 1
\end{array}\right] \\
& \eta_{c}=\left[\begin{array}{lllllll}
.5 & .5 & .5 & 5 & .5 & .5 & .5
\end{array}\right] \tag{5.2}
\end{align*}
$$

The result in Equation 5.2 is intuitively pleasing since it demonstrates that the average CW (without bipolar NRZ mapping) is in-between the all-zero and all-one CW, i.e. a vector of all 0.5 . However if the bipolar NRZ mapping had been used the mean symbol vector would have been the all-zero vector as shown in Equation 5.3.

$$
\left.\begin{array}{l}
\eta_{c}=\left[\begin{array}{lllll}
1875 & 0625 & 25 & 25 & 0625
\end{array}, 1875\right.
\end{array}\right]\left[\begin{array}{ccccccc}
25 & 0 & 0 & 0 & 0 & 0  \tag{5.3}\\
0 & 25 & 0 & 0 & 0 & 0 \\
0 & 0 & .25 & 0 & 0 & 0 \\
0 & 0 & 0 & .25 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & .25
\end{array}\right]\left[\begin{array}{ccccccc}
0 & 2 & 0 & 0 & -2 & 2 & 2 \\
0 & 2 & 0 & 0 & -2 & 2 & 2 \\
4 & -2 & 0 & 0 & 0 & -2 & -2 \\
-4 & 2 & 0 & 0 & 2 & 2 & -2 \\
0 & -2 & 0 & 0 & 2 & -2 & -2 \\
0 & -2 & 0 & 0 & 2 & -2 & -2
\end{array}\right]
$$

This confirms that the transmitted sequence of this encoder has a dc average of zero volts, verifying that the goal of balanced transmission has been obtained.

From Appendix A the correlation of a random process with itself is called autocorrelation [4] and leads to spectral information of the random process. For comparison the autocorrelation function $R_{x x}\left(t_{1}, t_{2}\right)$ of a random process $X(t)$ is shown below in Equation 5.4.

$$
\begin{align*}
& R_{x x}\left(t_{1}, t_{2}\right) \equiv \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} x_{1} x_{2} f_{x}\left(x_{1}, x_{2} ; t_{1}, t_{2}\right) d x_{1} d x_{2}  \tag{5.4}\\
& R_{x x}\left(t_{1}, t_{2}\right)=E\left[X\left(t_{1}\right) X\left(t_{1}+\tau\right)\right]
\end{align*}
$$

Since the autocorrelation function of a stationary process only depends on time separation $\tau=t_{2}-t_{1}$, it is often represented in terms of the expectation operator [4] as shown in Equation 5.5.

$$
\begin{equation*}
R_{x}(\tau)=E[X(t) X(t+\tau)] \tag{5.5}
\end{equation*}
$$

As a result Cariolaro and Tronca define the CW sequence autocorrelation $R_{C, k}$ as

$$
\begin{equation*}
R_{C, k}=E\left[c_{n} c_{n+k}\right] \tag{5.6}
\end{equation*}
$$

Evaluation of $R_{C, k}$ at time separation $k=0$ is $R_{C, 0}=E\left[c_{n} c_{n}\right]$. Since CW $c_{u}(i)$ will occur with probability $\pi(i) \theta_{u}(i, i)$, its contribution to $R_{C, 0}$ will be $R_{C, 0}=c_{u}^{2}(i) \pi(i) \theta(i, i)$. To find the contribution from all CWs to $R_{C, 0}$ it is helpful to define $\Lambda$ as an $L$-square diagonal matrix such that

$$
\Lambda(i, j)=\left\{\begin{array}{cc}
\pi(i) & i=j  \tag{5.7}\\
0 & \text { otherwise }
\end{array}\right.
$$

Then the autocorrelation with zero time separation can be obtained by summing over all CWs as shown in Equation 5.8.

$$
\begin{equation*}
R_{C, 0}=\sum_{u=1}^{S} c_{u}^{\mathrm{T}} \Lambda \theta_{u} c_{u} \tag{5.8}
\end{equation*}
$$

Continuing in this manner and using the definitions defined by Cariolaro and Tronca, when the input symbols are independent of state, the autocorrelation for all time separations $k$ can be shown to be

$$
R_{C, k}= \begin{cases}\sum_{u=1}^{s} c_{u}^{\mathrm{T}} \Lambda \theta_{u} c_{u} & k=0  \tag{5.9}\\ \sum_{u=1}^{s} c_{u}^{\mathrm{T}} \Lambda \theta_{u} E_{u} \Pi^{k-1} \sum_{v=1}^{s} \theta_{v} c_{v} & k \geq 1 \\ R_{C, k}^{\mathrm{T}} & k \leq-1\end{cases}
$$

Then continuing with the example there are four SWs and as a result the autocorrelation at zero time separation $R_{C, 0}$ is found by summing up the four matrices $c_{0}^{\mathrm{T}} \Lambda \theta_{0} c_{0}+c_{1}^{\mathrm{T}} \Lambda \theta_{1} c_{1}+c_{2}^{\mathrm{T}} \Lambda \theta_{2} c_{2}+c_{3}^{\mathrm{T}} \Lambda \theta_{3} c_{3}$ shown in Equation 5.10 . This was done by mapping the CW matrices in Equation 4.25 to $\pm A$ with $A=1$.

$$
\begin{align*}
& c_{0}^{\mathrm{T}} \Lambda \theta_{0} c_{0}=\left[\begin{array}{lllllll}
+r & -r & +r & +r & -r & -r & -r \\
-r & +r & -r & -r & +r & +r & +r \\
+r & -r & +r & +r & -r & -r & -r \\
+r & -r & +r & +r & -r & -r & -r \\
-r & +r & -r & -r & +r & +r & +r \\
-r & +r & -r & -r & +r & +r & +r \\
-r & +r & -r & -r & +r & +r & +r
\end{array}\right] \quad c_{1}^{\mathrm{T}} \Lambda \theta_{1} c_{1}\left[\begin{array}{lllllll}
+r & -r & +r & -r & -r & +r & +r \\
-r & +r & -r & +r & +r & -r & -r \\
+r & -r & +r & -r & -r & +r & +r \\
-r & +r & -r & +r & +r & -r & -r \\
-r & +r & -r & +r & +r & -r & -r \\
+r & -r & +r & -r & -r & +r & +r \\
+r & -r & +r & -r & -r & +r & +r
\end{array}\right] \\
& C_{2}^{\mathrm{T}} \wedge \theta_{2} c_{2}=\left[\begin{array}{ccccccc}
+r & +r & -r & +r & -r & -r & +r \\
+r & +r & -r & +r & -r & -r & +r \\
-r & -r & +r & -r & +r & +r & -r \\
+r & +r & -r & +r & -r & -r & +r \\
-r & -r & +r & -r & +r & +r & -r \\
-r & -r & +r & -r & +r & +r & -r \\
+r & +r & -r & +r & -r & -r & +r
\end{array}\right] \\
& c_{3}^{\mathrm{T}} \wedge \theta_{3} c_{3}=\left[\begin{array}{lllllll}
+r & -r & -r & -r & +r & -r & +r \\
-r & +r & +r & +r & -r & +r & -r \\
-r & +r & +r & +r & -r & +r & -r \\
-r & +r & +r & +r & -r & +r & -r \\
+r & -r & -r & -r & +r & -r & +r \\
-r & +r & +r & +r & -r & +r & -r \\
+r & -r & -r & -r & +r & -r & +r
\end{array}\right] \tag{5.10}
\end{align*}
$$

where $r=0.25$.

Using Equation 5.9 the autocorrelation obtained for all CWs at time separation 0 is shown in Equation 5.11.

$$
R_{C, 0}=\left[\begin{array}{rrrrrrr}
1 & -.5 & 0 & 0 & -.5 & -.5 & .5  \tag{5.11}\\
-.5 & 1 & -.5 & .5 & 0 & 0 & 0 \\
0 & -.5 & 1 & 0 & -.5 & .5 & -.5 \\
0 & .5 & 0 & 1 & -.5 & -.5 & -.5 \\
-.5 & 0 & -.5 & -.5 & 1 & 0 & 0 \\
-.5 & 0 & .5 & -.5 & 0 & 1 & 0 \\
.5 & 0 & -.5 & -.5 & 0 & 0 & 1
\end{array}\right]
$$

The interpretation of this autocorrelation matrix is as follows. Equation 5.11 is showing how correlated the individual bits ( 0 through 6) of any 7 -bit CW are with themselves at time separation $k=0$. For instance the diagonal ones indicate that the $1^{\text {st }}$ bit is $100 \%$ correlated with the $1^{\text {st }}$ bit, the $2^{\text {nd }}$ bit is $100 \%$ correlated with the $2^{\text {nd }}$ bit and so on down to the $7^{\text {th }}$ bit is $100 \%$ correlated with the $7^{\text {th }}$ bit. Conversely, the correlation between the $1^{\text {st }}$ bit with the $2^{\text {nd }}$ bit is -.5 . This means that the first two bit combinations of any CW are more likely to be $1,-1$ or $-1,1$ then they are to be 1,1 or $-1,-1$. Continuing in this fashion the correlation between bit 0 and bit 2 is zero, which indicates that knowing the value of the $1^{\text {st }}$ bit gives no indication of the value of the $3^{\text {rd }}$ bit. Hence all four combinations $-1,-1,-1,1,1,-1$ or 11 are equally likely. The rest of the matrix can be understood in this fashion.

Before evaluating the autocorrelation of the encoded sequence for time separation $k>0$, inspection of Equation 5.9 suggests that the second summation term can be computed separately as shown in Equation 5.12 since it does not change with $k$.

$$
\beta=\sum_{v=1}^{S} \theta_{\nu} c_{v}=\left[\begin{array}{rrrrrrr}
0 & .5 & 0 & 0 & -.5 & .5 & .5  \tag{5.12}\\
0 & .5 & 0 & 0 & -.5 & .5 & .5 \\
1 & -.5 & 0 & 0 & -.5 & -.5 & .5 \\
-1 & .5 & 0 & 0 & .5 & .5 & -.5 \\
0 & -.5 & 0 & 0 & .5 & -.5 & -.5 \\
0 & -.5 & 0 & 0 & .5 & -.5 & -.5
\end{array}\right]
$$

Thus $R_{c, 1}$ is evaluated as

This autocorrelation matrix indicates how similar any CW is with the preceding CW on the channel. Clearly the correlation falls off rapidly. This is because in this example knowing the value of the $1^{\text {st }}$ bit in the first CW only gives a marginal -.125 indication of the value of the $1^{\text {st }}$ bit in the next CW and so on. Nevertheless the autocorrelation can be evaluated for greater time separation as shown in Equation 5.14.

$$
\begin{align*}
& \boldsymbol{R}_{c, 2}=\left[\begin{array}{rrrrrrr}
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 \\
-.0625 & .03125 & 0 & 0 & .03125 & .03125 & -.03125 \\
.0625 & -.03125 & 0 & 0 & -.03125 & -.03125 & .03125 \\
.0625 & -.03125 & 0 & 0 & -.03125 & -.03125 & .03125
\end{array}\right] \quad R_{c, 3}=\left[\begin{array}{lllllll}
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0
\end{array}\right]  \tag{5.14}\\
& R_{c, 100}=\left[\begin{array}{lllllll}
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0
\end{array}\right] \quad R_{c, \infty}=\left[\begin{array}{lllllll}
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0
\end{array}\right]
\end{align*}
$$

Notice that by time separation $k=3$ the autocorrelation matrices have converged to the all-zero matrix, which demonstrates that this random process loses correlation with itself very rapidly. Further notice that the autocorrelation matrix at $k=\infty$ can also be
obtained by multiplying the transpose of the mean symbol vector with itself as shown in Equation 5.15.

$$
\begin{equation*}
R_{c, \infty}=\eta_{c}^{\mathrm{T}} \eta_{c}=\eta_{c}^{2} \tag{5.15}
\end{equation*}
$$

Autocovariance matrices can also be found which indicate how the random process varies with time separation. They are related to the autocorrelation matrices as shown in Equation 5.16.

$$
\begin{align*}
& K_{c, k}=R_{c, k}-\eta_{c}^{2}  \tag{5.16}\\
& K_{c, k}=R_{c, k}-R_{c, \infty}
\end{align*}
$$

It can be seen from this expression that if the mean symbol vector is zero, the autocovariance matrices are equal to the autocorrelation matrices. As noted previously this will occur when the output transmission is balanced and bipolar signaling is used. $K_{c, 0}$ is given by $R_{c, 0}-R_{c, \infty}$, while $K_{c, 1}=R_{c, 1}-R_{c, \infty}$ and $K_{c, 2}=R_{c, 2}-R_{c, \infty}$ and thus as $k$ tends to infinity $R_{c, k}$ tends to $R_{c, \infty}$ and the all-zero matrix is reached as shown below in Equation 5.17

$$
\left.\begin{array}{rl}
K_{c, 0} & =\left[\begin{array}{rrrrrrr}
1 & -.5 & 0 & 0 & -.5 & -.5 & .5 \\
-.5 & 1 & -.5 & .5 & 0 & 0 & 0 \\
0 & -.5 & 1 & 0 & -.5 & .5 & -.5 \\
0 & -5 & 0 & 1 & -.5 & -5 & -.5 \\
-.5 & 0 & -.5 & -.5 & 1 & 0 & 0 \\
-.5 & 0 & -5 & -.5 & 0 & 1 & 0 \\
.5 & 0 & -.5 & -.5 & 0 & 0 & 1
\end{array}\right] \quad K_{c, 1}=\left[\begin{array}{rrrrrrr}
-.125 & .0625 & 0 & 0 & .0625 & .0625 & -.0625 \\
0 & -.0625 & 0 & 0 & .0625 & -.0625 & -.0625 \\
.125 & -.0625 & 0 & 0 & -.0625 & -.0625 & .0625 \\
.125 & -.0625 & 0 & 0 & -.0625 & -.0625 & .0625 \\
0 & .0625 & 0 & 0 & -.0625 & .0625 & .0625 \\
0 & -.0625 & 0 & 0 & .0625 & -.0625 & -.0625 \\
-.25 & .0625 & 0 & 0 & .1875 & .0625 & -.0625
\end{array}\right]  \tag{5.17}\\
K_{c, 2} & =\left[\begin{array}{rrrrrrr}
0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0625 & -.03125 & 0 & 0 & -.03125 & -.03125 & -.03125 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
-.0625 & .03125 & 0 & 0 & .03125 & .03125 & -.03125 \\
.0625 & -.03125 & 0 & 0 & -.03125 & -.03125 & .03125 \\
.0625 & -.03125 & 0 & 0 & -.03125 & -.03125 & .03125
\end{array}\right]
\end{array} K_{c, \infty}=\left[\begin{array}{lllllll}
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0
\end{array}\right]\right)
$$

The PSD is obtained using the autocorrelation and autocovariance matrices and it can be defined in terms of a continuous component $X_{C}(f)$, and a discrete component $X_{D}(f)$. These components are weighted by the Fourier Transform of the pulse shape $P(f)$.

$$
\begin{equation*}
W(f)=\frac{|P(f)|^{2}}{N T}\left(X_{C}(f)+\frac{X_{D}(f)}{N T} \sum_{m=-\infty}^{\infty} \delta\left(f-\frac{m f_{0}}{N}\right)\right) \tag{5.18}
\end{equation*}
$$

where

$$
\left.\begin{array}{c}
X_{C}(f)=v\left[K_{C, 0}+2 \sum_{k=1}^{\infty} K_{C, k} e^{-j 2 \pi j k N T}\right] v^{*} \\
X_{D}(f)=v R_{C, \infty} v^{*} \\
v=\left[\begin{array}{lllll}
1 & e^{j 2 \pi f T} & e^{j 4 \pi f T} & e^{j 6 \pi f T} & \ldots
\end{array} e^{j 2 \pi f(N-1) T}\right. \tag{5.21}
\end{array}\right] .
$$

In this balanced transmission example $R_{C, \infty}=0$ and by Equation 5.20 there is no discrete component $X_{D}(f)$. Therefore Equation 5.18 can be rewritten as

$$
\begin{equation*}
W(f)=\frac{|P(f)|^{2}}{N T}\left(X_{c}(f)+0\right) \tag{5.22}
\end{equation*}
$$

Since the covariance matrices are only non-zero up to $k=2, X_{C}(f)$ is found to be
where

$$
v=\left[\begin{array}{lllllll}
1 & e^{j 2 \pi f T} & e^{j 4 \pi f T} & e^{j 6 \pi f T} & e^{j 8 \pi f T} & e^{j 10 \pi f T} & e^{j 12 \pi f T} \tag{5.24}
\end{array}\right]
$$

The result is a continuous function of frequency $f$ that is weighted by the pulse shape. If a square pulse shape is used and normalized to $A=1 \mathrm{~V}$ with duration $T=10 \mu \mathrm{~s}$, Equation 5.25 shows its time domain representation and Fourier transform. The PSD is then this function (pulse shape) squared, and plotted on either a linear or logarithmic scale as shown in Figure 5-4.

$$
\begin{gather*}
p(t)=A \operatorname{Arcct}\left(\frac{t}{T}\right) \leftrightarrow P(\omega)=A T \operatorname{sinc}\left(\frac{\omega T}{2}\right) \\
P(f)=(10 \mu \mathrm{~s}) \operatorname{sinc}(\pi f 10 \mu \mathrm{~s}) \tag{5.25}
\end{gather*}
$$





Figure 5-4: PSD of a square pulse shape plotted on linear and logarithmic scales

Therefore using Equation 5.22 and evaluating $X_{C}(f)$ at $f=500 \mathrm{~Hz}$, for example, gives

$$
\begin{align*}
& \quad X_{C}(500)=.0543+.215 j=.2216 e^{1.3233 j}  \tag{5.26}\\
& W(500)=\frac{|(10 \mu s) \operatorname{sinc}(\pi 500 * 10 \mu s)|^{2}}{7 * 10 \mu s}(.0543+.215 j) \\
& W(500)=7.75 E-8+j 3.07 E-7 \\
& |W(500)|=3.167 E-7
\end{align*}
$$

This is the PSD calculated at a single frequency. This evaluation however needs to be done as a continuous function of frequency since by definition the power contribution of a single point on a density curve is zero [4]. This time consuming operation is best performed using mathematical software such as Matlab.

### 5.4. Evaluating the Power Spectral Density using Matlab

Recall from Section 4.11 the program Build_EuCu.exe. This program allows the user to select the expurgated code, selection criteria, source statistics and AddSWs to use. It then generates three files. The first two files created were shown in Figure 4-19 and Figure 4-20. They dealt with the number of states in the encoder and created the $\theta_{u}, E_{u}$ and $C_{u}$ matrices used to calculate the runlength probabilities of the code.

Continuing from the example in Section 4.11, the third file will contain a combination of the information found in the first two files, and it is used to evaluate the PSD. An example of this file is shown in Figure 5-5. This file begins with some comments and general information about the code used. Following this is each Input word (SW) along with its probability of occurrence which is read into the $\theta_{u}$ matrices, followed by the $E_{u}$ and $C_{u}$ matrices which are written in binary form.


Figure 5-5: Input file for PSD calculation

The $\theta_{u}, E_{u}$ and $C_{u}$ information is used by a Matlab program called "calcCompletePSDfromLCODEPWRIN.m". Additional parameters are entered into the dialog box as shown in Figure 5-6. These are a) the number of autocorrelation and autocovariance matrices to compute, b) whether or not the system uses NRZ mapping (i.e. balanced mapping), c) frequency stepping resolution, d) which figure number to use, e) maximum frequency to calculate values for and f) the duration of a single bit in seconds. Note that the pulse shape is hard coded to be square and normalized to 1 volt to match with the hardware implementation. This Matlab script will also output the PSD with and without the influence of the square pulse shape. This was done in order to see the effect of the coding alone on the shape of the PSD.


Figure 5-6: Information required for PSD calculation

Continuing the previous example with the parameters in Figure 5-6, the continuous component $X_{C}(f)$ can be plotted on a linear scale without the effect of the pulse shape as shown in Figure 5-7. This plot demonstrates the influence of the combined EC and CS coding scheme alone.


Figure 5-7: Linear PSD of the multimode system (no pulse shape included)

There are a few things to notice about this PSD. First of all as a result of the discrete nature of the autocorrelation the PSD is periodic in the frequency domain as shown in Figure 5-8a. Secondly since the symbols are all real valued the PSD is an even function and thus symmetrical about 0 Hz as shown in Figure $5-8 \mathrm{~b}$. Therefore the PSD repeats in this fashion to infinity where each lobe is a replica of the previous one as shown in Figure 5-7b.


Figure 5-8: Showing that the PSD is periodic and an even function

However once the pulse shape of Figure 5-4 is taken into consideration, the PSD begins to decay as sinc squared. This effect makes the null at 0 Hz less noticeable, but
since this PSD is periodic, it is clear that the null is present. This is shown in Figure 5-9 on both the linear and logarithmic scale.


Figure 5-9: Effect of the square pulse shape plotted on a linear and logarithmic scale

Note that the logarithmic scale is used in order to compare theoretical spectra with those measured from the FPGA implementation in the lab. This is because digital oscilloscopes and spectrum analyzers often display frequency domain results in decibels. For example Figure 5-10 shows this PSD plotted for the first two lobes as generated by this Matlab script, with the measured PSD from the FPGA implementation beside it. It is clear that the Cariolaro and Tronca approach matches closely with the measured values. This aspect will be more fully discussed in Chapter 6.


Figure 5-10: Comparing the spectrum analyzer to Matlab spectrum

### 5.5. Understanding the Discrete-Like Nature of the PSD

A final source of confusion is why the PSD sometimes has a "spiky" shape. Normally strong spectral spikes are indicative of periodic sequences in the time domain. These present themselves as discrete components in the PSD. However these strong spectral peaks are not caused by periodic sequences with this coding scheme. In this case the concentrated power in narrow bands in the frequency domain actually come from the sometimes limited selection of CWs. This is best seen by example.

Consider the $(7,1)$ Hamming code from Table 3-3 using the eight AddCWs $\{00 \mathrm{~h}, 16 \mathrm{~h}, 27 \mathrm{~h}, 31 \mathrm{~h}, 4 \mathrm{Eh}, 58 \mathrm{~h}, 69 \mathrm{~h}, 7 \mathrm{Fh}\}$. Figure $5-11$ shows this code as a state diagram. This small code is interesting to analyze since there are only two input SWs, 0 and 1. Furthermore, even though there are eight AddCWs in this system, the simplicity of this code results in only four CWs $\{69 \mathrm{~h}, 53,2 \mathrm{Ch}, 16 \mathrm{~h}\}$ ever being transmitted.


Figure 5-11: Simplest FSM model for the (7,1) code

The FSM model in Figure 5-11 indicates that the only possible CW sequences that leave state 0 and return are $\{69 \mathrm{~h}, 16 \mathrm{~h}\},\{69 \mathrm{~h}, 2 \mathrm{Ch}\},\{53 \mathrm{~h}, 16 \mathrm{~h}\}$ and $\{53 \mathrm{~h}, 2 \mathrm{Ch}\}$. One may expect this system to produce a discrete PSD since the transmitted sequences appear periodic. However this is not the case since even though there are only four possible sequences, they are emitted randomly. This results in the continuous PSD shown in Figure 5-12. This PSD was calculated assuming all four sequences are equally likely.


Figure 5-12: PSD of the multimode code shown in Figure 6-50

At first glance this spectrum appears to have discrete components. Understanding why this is can be found by analyzing the Fourier Series (see Appendix A) of the four sequences that can leave state 0 and return. As discussed above these sequences are $\{69 \mathrm{~h}, 16 \mathrm{~h}\}, \quad\{69 \mathrm{~h}, 2 \mathrm{Ch}\}, \quad\{53 \mathrm{~h}, 16 \mathrm{~h}\}$ and $\{53 \mathrm{~h}, 2 \mathrm{Ch}\}$, and their Fourier Series representations are shown in Figure 5-13 and 5-14. All four figures have seven frequency components in common as shown in Figure 5-15.

Comparing the spectrum in Figure 5-15 with the PSD in Figure 5-12 explains where the apparent discrete components come from. These seven frequency components will be dominant in this system regardless of randomness. For example if the output of the encoder was ever forced to always be one of the four sequences, then the output would be one of the four Fourier Series spectra shown below. However, since the output at any point in time is one of the four sequences chosen in a random fashion, each one of the four spectra is prevalent, but overall yields a continuous spectrum.


Figure 5-13: PSD of the periodic sequences $\{6916 \mathrm{~h}\}$ and $\{692 \mathrm{Ch}\}$


Figure 5-14: PSD of the periodic sequences $\{5316 \mathrm{~h}\}$ and $\{532 \mathrm{Ch}\}$


Figure 5-15: A verage of the seven frequency components

### 5.6. Spectra of various codes

The CW search results of Section 4.14 for CWs of length 15 revealed that the coding technique on average could guarantee runlengths of 4 or less $94 \%$ of the time. Only $6 \%$ of the time would runlengths be higher than this to a maximum of 11 .

Furthermore out of the 16777216 AddCW sets tested, only four sets were considered to be the worst since they could have runlengths as high as 15 . However these runlengths would occur with very low probability and they could still guarantee runlengths of 4 our less $92 \%$ of the time. The four best AddCW sets on the other hand could guarantee runlengths of 4 or less $96 \%$ of the time, and no runlengths higher than 7 . This section now investigates the PSD of these codes.

For comparison the PSDs are calculated using two heavily unbalanced source statistics of $10 \%$ probability of a logic 1 and $90 \%$ probability of a logic 1 . Furthermore only the four best and four worst AddCW sets are calculated, as all other AddCW sets will have statistics within their range.


Figure 5-16: 90\% ones - All four Best AddCW sets

Figure 5-16 shows the PSD of the four best AddCW sets with a source that emits logic 1 s with $90 \%$ probability. It is clear that the coding scheme is working since the nulls are still present and quite wide. The "spiky" nature of the PSD however is a result of the unbalanced source. Since the likelihood a SW containing many 1 s is very high, only a limited number of the available CWs are being transmitted. That is since only a subset of available SWs is being selected (i.e. SWs comprised mostly of 1 s ), only a subset of available CWs is being transmitted. Hence the correlation between CWs is higher and the PSD shows spiky peaks indicative of a periodic-like transmission. In other words, the same group of CWs is being repeatedly transmitted but in a random fashion.


Figure 5-17: 10\% ones - All four Best AddCW sets

Similarly Figure 5-17 shows the PSD of the four best AddCW sets with a source that emits logic 1s with $10 \%$ probability. This is of course the same but opposite problem as above since now the likelihood of a logic 0 is $90 \%$. Thus the PSD still exhibits the
spiky response. Nevertheless the attenuation of the PSD near dc shows that lower frequencies are less likely than higher frequencies, and hence these PSDs are indicative of a balanced sequence that has numerous transitions even with these unbalanced source statistics.

Note that when the source is balanced the PSD loses its spiky nature. Figure 5-18 shows the PSD of two of the best AddCW sets with a balanced source. Since all the SWs are equally likely the entire set of available CWs is used and hence the discrete-like components disappear from the PSD. It is now more "smooth" and similar to the PSD of a random binary waveform with the addition of the null at dc. Note that the PSD of all $(15,8)$ codes are practically identical when the source is balanced, and as a result they all resemble those presented in Figure 5-18.


Figure 5-18: PSD of $(\mathbf{1 5 , 8})$ codes with a balanced source

Continuing with this analysis Figure $5-19$ shows the PSD of the four worst AddCW sets with a source that emits logic ls with $90 \%$ probability. The null at dc indicates that the coding scheme is working, however since runlengths of up to length 15 are possible, the null is narrower. Furthermore it is clear that the worst AddCW sets have more power at low frequencies than the best AddCW sets.


Figure 5-19: 90\% ones - All four Worst AddCW sets

Similarly Figure 5-20 shows the PSD of the four worst AddCW sets with a source that emits logic 1s with $10 \%$ probability. While the best AddCW sets had PSDs with convex shapes, the worst AddCW sets have almost concave shapes. This indicates that the highest frequencies are less likely than some of the midband frequencies. On the other hand the power at low frequencies is still quite small but clearly larger than the PSD of the comparable best AddCW sets. For example Figure 5-21 shows the PSD of one of the best AddCW sets beside one of the worst AddCW sets when the probability of the source emitting logic 1 s is $90 \%$.


Figure 5-20: 10\% ones - All four Worst AddCW sets


Figure 5-21: 90\% ones - Best AddCW set followed by a worst AddCW set

While it is obvious that the best AddCW set has more power at high frequency it is most evident when they plotted together as shown in Figure 5-22. Here it is clearly seen that not only does the best AddCW set have most of its power at higher frequencies, it also has a wider null width indicating its ability to limit runlengths.


Figure 5-22: 90\% ones - Best and worst AddCW set

### 5.7. Spectra of larger codes

Based on the AddCW recommendations of Section 4.15 the PSD of larger codes based on the $(31,26)$ Hamming code are now investigated. The PSD of these codes however cannot easily be calculated using the Cariolaro and Tronca technique used so far. This is because their method involves modeling an encoder as a FSM and analyzing it as a Markov chain which involves numerous calculations for every possible way into and out of a state. In the case of the $(15,8)$ codes this meant on average $2^{8}$ ways into a state, $2^{8}$ ways out of each state, and sometimes as many as $2^{8}$ states. Thus the number of calculations is on the order of $256^{3}$. Furthermore the file sizes for the intermediate $\theta_{u}$, $E_{u}$ and $C_{u}$ matrices are on the order of 1 MB . Consequently for the $(31,23)$ codes the number of calculations is on the order of $\left(2^{23}\right)^{3}=8388608^{3}$ with file sizes for the intermediate $\theta_{u}, E_{u}$ and $C_{u}$ matrices well past the order of 1 GB . Thus calculating exact

PSDs for larger codes proved to be impractical with today's computing power. Thus an alternative approach used was to simulate the encoding scheme and calculate the PSD using FFT techniques.


Figure 5-23: Simulation for finding the PSD

Figure 5-23 shows the program SimFPGAwithFFT written to find the PSD through simulation. This program uses a technique called FFT overlap processing with a Hanning window to find an accurate spectrum representation. This technique involves windowing 1024 CWs , taking an FFT, sliding the window by 256 CWs (i.e. overlapping 768 CWs ), taking an FFT, and repeating over 64 windows and averaging the results. Figure 5-24 shows two PSDs found through simulation. Comparison with Figure 5-21 shows the accuracy of this technique. The noisiness of the spectra is due to the fact that the transmission is random and an infinite number of samples would need to be averaged in order to precisely match the PSD of Figure 5-21.


Figure 5-24: 90\% ones - best AddCW set followed by a worst AddCW set

Using the AddCW recommendations of Section 4.15, Figure 5-25 shows the PSD of the $(31,23)$ code, with a source emitting logic $1 s$ with $10 \%$ probability, using the AddSW sets $\{000000 \mathrm{~h}, 084 \mathrm{D} 4 \mathrm{D} 4 \mathrm{~h}, 1323232 \mathrm{~h}, 189 \mathrm{~B} 9 \mathrm{~B} 9 \mathrm{~h}, 2764646 \mathrm{~h}, 2 \mathrm{CDCDCDh}$, 37B2B2Bh, $3 F F F F F F h\}$ and $\{0000000 \mathrm{~h}$, 08AAAAAh, 1555555h, 1832323h, 27CDCDCh, 2AAAAAAh, 3755555h, 3FFFFFFh\}. It is immediately obvious from Figure 5-25 that the first AddCW set appears to have many bands of high power at high frequency. On the other hand the second AddCW set has a majority of power at high frequencies. This is indicative of a transmitted sequence that has very short runlengths.


Figure 5-25: 10\% ones - Two (31,23) codes

This can be further understood by inspecting the statistics of the transmitted sequence. In addition to the PSD the SimFPGAwithFFT program generates a file containing the number of states in the encoder, maximum runlengths and AddCW usage. These files are shown in Figures 5-26 and 5-27 respectively.


Figure 5-26: Statistics of the $\mathbf{( 3 1 , 2 3 )}$ ) code with the $\mathbf{1}^{\text {st }}$ AddCW set


Figure 5-27: Statistics of the $(31,23)$ code with the $2^{\text {nd }}$ AddCW set

Comparison of these two files shows that the first set only has maximum runlengths of $10(M R 1=M R 0=10)$, while the second set has maximum runlengths of 11 $(M R 1=M R 0=11)$. On the other hand, the second set has a $60.4 \%$ chance of runlengths of length 1 , while the first set only $27.5 \%$. This was clearly evident from the PSD plots.

Similarly it was noticed that the first set had more power at intermediate frequencies than at its highest frequencies. This can be seen in the statistics file since the probability of a run of length 2 is $40.7 \%$ and the probability of a run of length 3 is $20.7 \%$.

### 5.8. PSD Summary

The multimode coding technique introduced in Chapter 3 is designed to guarantee balanced transmission and remove long runs of like valued bits. An output sequence that has long runs of 0 s and 1 s will appear as a slowly varying square wave, and an output sequence with short runs of 0 s and 1 s will appear as a rapidly varying square wave. Hence these sequences can be analyzed in the frequency domain in terms of their PSD.

Cariolaro and Tronca [24] outline an efficient procedure for calculating the PSD of a coded system, the details of which were demonstrated in this chapter, with further examples found in Appendix A. It was shown that in systems where the number of states or size of the code is too large, it is preferred to simulate the PSD.

The following chapter looks at the FPGA hardware implementation of this multimode encoder. It also compares measured PSD results to the theoretical and simulated PSDs calculated in this chapter.

## 6. Implementing the Multimode Encoder and Decoder in Hardware

The previous chapters introduced the combined EC and CS coding concept, as well as outlined how to evaluate performance in both the time and frequency domain. This chapter chronicles how a working transmitter and receiver was implemented on two FPGA boards and PSD measurements were taken for comparison with results from Chapter 5. For proof of concept the goals of this communication system were simple:

1. Implement the combined CS coding and EC coding technique;
2. Transmit a file from transmitter to receiver to verify the encoding and decoding techniques;
3. Measure the PSD of the transmitted sequence to verify it agrees with calculations.

A block diagram of the communication system is shown in Figure 6-1. It consists of the two FPGA boards TX FPGA and RX FPGA, the three wires over which they communicate, and the controlling computers TX PC and RX PC. Computer control allows for a more sophisticated user interface than what is present on the FPGA boards themselves. Furthermore this setup allows a user to select any file on the local TX PC and transmit it to the RX PC using the combined EC and CS coding technique. Typically no distinction will be made between the PC and the FPGA and they will collectively be referred to as the TX and the RX.


Figure 6-1: Block diagram of the FPGA communication setup

The TX and RX communicate using three wires: serial data, serial clock and ground. The serial data line transmits the actual coded sequence and the ground is simply the common line between the two FPGAs. These two wires together would form what is commonly known as a twisted pair. A third wire is also included which allows the RX clock to synchronize with the TX. This is because a typical RX must synchronize to the clock rate of the TX in order to determine the duration of a single bit. Conventional communication systems accomplish this using a phase lock loop (PLL) to derive this clock from the transitions in the incoming bit stream on the serial data line. In fact the combined EC and CS code introduced in this thesis is designed to aid PLLs to accomplish this task. However the added complexity of designing a stable PLL for use with the RX was considered unnecessary since the analysis reported in Chapters 4 and 5 has proven that the outgoing bit stream of this multimode coding technique contains numerous transitions. Furthermore this coding technique was designed to be added to an existing communication system. Therefore as long the calculated PSDs presented in Chapter 5 match with the measured PSDs of this hardware implementation, the system would be proven to be valid even without the implementation of a PLL at the receiver. As a result, to simplify the receiver circuitry a clock signal was connected directly from the transmitter.


Figure 6-2: FPGA transmitter and receiver communicating

Figure 6-2 shows the laboratory setup of the block diagram of Figure 6-1. Two PCs and two FPGAs using three connecting wires are shown. The TX and RX each 128
consisted of a Digilent Digilab 2E (D2E) development board as shown in Figure 6-3, and a Digilent Digital I/O board 2 (DIO2) expansion board shown in Figure 6-4. The D2E boards each featured a 200K-gate Xilinx Spartan 2E FPGA (XC2S200E) in a PQ208 package running at 50 MHz with 143 user I/O pins. The DIO2 boards provided input and output functions with fifteen push buttons, eight toggle switches, sixteen LEDs, an LCD screen and four 7 -segment displays.


Figure 6-3: Digilent Digilab FPGA board


Figure 6-4: DIO2 peripheral board for input and output

### 6.1. Pinouts

On each FPGA board pin 1 was the common ground (GND), pin 73 was the serial data (SDATA) and pin 74 was the serial clock (SCLK). The data and clock pins were easily accessed from the D-connector on the development boards in locations D40 and D39 as shown in Figure 6-5. These translated into the physical pin locations on the FPGA as pin 73 and 74 respectively.


| Transmitter |  |  |
| :---: | :---: | :---: |
| Name | Pin $\#$ | In/0ut |
| SDATA | 40 | Out |
| SCLK | 39 | Out |
| GND | 1 | - |


| Receiver |  |  |
| :---: | :---: | :---: |
| Name | Pin $\#$ | In/0ut |
| SDATA | 40 | In |
| SCLK | 39 | In |
| GND | 1 | - |



Figure 6-5: The pinouts used on the FPGA boards

### 6.2. SDATA and SCLK signals

Figure 6-6 shows a typical data and clock sequence. It is clear from the digital oscilloscope screenshots that SCLK rises halfway through SDATA. Thus the RX samples SDATA midway through each bit on the rising edge of SCLK.


Figure 6-6: SDATA sequence (top) and SCLK sequence (bottom)

### 6.3. Controlling the FPGAs over the PC parallel port

The D2E development boards each had a 25 -pin parallel port connector that is used during programming. This port could also be used to communicate with the FPGA after programming. The parallel port of a PC, when accessed in standard parallel port (SPP) mode, is typically located at address 378 h (BIOS configurable), and is made up of three individual ports. Of the 25 pins there is an 8 -bit bi-directional data port (address 378h), a 5-bit input only status port (address 379 h ) and a 4-bit output only control port (address 37Ah). The remaining 8 pins are all grounds as shown in Figure 6-7.


Figure 6-7: Parallel Port connector on the back of a PC

It is also important to note that some of the bits of the SPP are inverted by PC hardware ( 7404 inverters) for legacy printer reasons. This must be compensated in software by inverting these bits before writing to the write only control port, and by inverting these bits when reading from the read only status port.

PC parallel ports when used in SPP mode run at 125 kHz . This means that new data can be written to or read from the SPP bus every 8 us. However when writing data there is often a 2 us settling time in the form of random transitions on the parallel port data bus. For example, using the Agilent Logic Wave logic analyzer it can be seen that there are transient values on the data bus when the data is changing from 65 h to 00 h as shown in Figure 6-8.


Figure 6-8: The 'transient' nature of the parallel port when switching values

This presents a small problem when dealing with an FPGA that can sample the parallel port inputs at 50 MHz . For instance, if the transient values that occur when the data is changing from 65 h to 00 h proceeds through the series $65 \mathrm{~h}, 61 \mathrm{~h}, 21 \mathrm{~h}, 20 \mathrm{~h}, 00 \mathrm{~h}$, the FPGA would see the above sequence as five separate and valid data values. This issue must be resolved by implementing a debouncing circuit in VHDL code. That is, before reading any change on the parallel port inputs, a timer should begin that will re-sample the parallel port bits after the 2 us settling time has passed. Then a simple comparison with the original data determines what the new value is, and the transient values are ignored as shown in the FSM in Figure 6-9.


Figure 6-9: FSM model of the parallel port debounce circuit

### 6.4. Transmitter FPGA State Machine

The transmitter FPGA runs a main state machine that consists of four states as shown in Figure 6-10. On power up it is forced into state 1 which is the reset state that initializes the encoder. After this the FSM cycles through states 2, 3, and 4 indefinitely. Only by powering the FPGA off and on, or through user input in state 2 will the system ever enter the reset state again.


Figure 6-10: Main transmitter FPGA FSM

1. Reset State - this state is entered during power on or through specific user input (such as command Eh in Table 6-1). Once entered all variables and buffers are cleared and the system is ready to accept further commands such as transmitting a file.
2. Check for Input - in this state the transmitter checks for input commands from the parallel port as discussed in Section 6-3. User input is set up as follows. The 4-bit control port of the SPP is used to issue one of sixteen individual commands to the FPGA, with additional data for each command coming from the 8-bit data port. These are summarized in Table 6-1.

For example, to write the data value AAh to the FPGAs RAM at address 0E20h, begin by placing the FPGA into a HighZ state by issuing command Fh. This makes the FPGA back-off the bi-directional data bus so the parallel port can write data without any bus contention. The next step is to assign the RAM address to modify by writing the address to the 12 -bit FilePointer. Since the data bus is
only 8 -bits this must be done is two steps. First the upper 4-bits of the address is written to the data bus by writing data value 0Eh followed by command 2 h . Then the lower 8 -bits is written to the data bus by writing data value 20 h followed by command 2h. At this point the FPGA has latched the value E20h into the FilePointer register. Finally in order to write a new value AAh into this RAM location, simply write data value AAh, followed by command 1 h .

Table 6-1: Commands for both the transmitter and receiver FPGAs

| 4 bit Control Port Command | Function | Description |
| :---: | :---: | :---: |
| 0h | RAM Read mode | Latch file pointer RAM address Enable RAM read mode Increment file pointer |
| 1h | RAM Write mode | Latch file pointer RAM address Enable RAM write mode Increment file pointer |
| 2h | Assign File Pointer | Write to file pointer. This allows for non-sequential access to RAM |
| 3h | Assign File Size | File size of the data written to RAM |
| 4h,5h | Write AddSWs | Assign AddSWs |
| 6h,7h | Read AddCWs | Read AddCWs |
| 8h | Set number of AddCWs | Set either 0,2,4 or 8 AddCWs |
| 9h | Initialize the Encoder | This builds the AddCWs from the AddSWs |
| Ah | Running Mode Continuous | Begins transmitting the file continuously. When it reaches the end of the file, it does not stop and simply wraps around. |
| Bh | Running Mode Once | Begins transmitting the file, and when it reaches the end of the file it stops. |
| Ch | Set Serial Transmission Speed | Either $1 \mathrm{kHz}, 10 \mathrm{kHz}, 100 \mathrm{kHz}$ or 1 MHz |
| Dh | Enable RAM | If RAM is in ReadMode (0), data is now put onto the data bus. If RAM is in WriteMode (1) it will read data from the data bus and store it. |
| Eh | Reset | Reset all variables |
| Fh | HighZ | Similar to reset, except not all variables are reset, and data bus that is connected to the parallel port is put into HighZ state. |

3. Service Output - This state involves writing data to the 16 character by 2 line LCD screen ( 32 ASCII characters) on the DIO2 expansion board. The LCD was used for debugging and testing purposes. For instance in order to verify that the multimode coding system was choosing the correct CW at each coding interval, a debug mode was set up that allowed stepping through the CW selection process.

In this mode there needed to be a way to display up to a 32 bit number (8-hex characters). The LCD screen was the only viable choice since the DIO2 board only had four 7 -segment displays and thus could only display 4-hex characters.

It is important to note that the LCD screen uses the Samsung KS0066 controller which has enough Display Data RAM (DDRAM) to handle up to a 40 character by 2 line display. Thus each of the 80 DDRAM location corresponds to a single character location on the screen even though physically the LCD screen only has 32 character locations as shown in Figure 6-11. This must be taken into account in order to see the text displayed correctly. Thus it is not immediately obvious but the simplest way of writing the 10 character string "Hello" and "World" on two separate lines is to use 45 characters, not 10 . This is accomplished in the following manner.

Displaying the 5 character string "Hello" on the first line is trivial since it requires placing the LCD screen into data mode followed by sending the five ASCII characters $48 \mathrm{~h}, 65 \mathrm{~h}, 6 \mathrm{Ch}, 6 \mathrm{Ch}, 6 \mathrm{Fh}$ for ' H ', 'e', ' l ', ' l ', ' o '. Writing the 5 character string "World" on the second line however is a bit more involved.

In order to get to the next line on the LCD screen the cursor must manually be moved 35 locations to the right since the generic KS0066 controller has 40 DDRAM locations per line. One method to do this is to switch the LCD screen into control mode followed by sending the "move cursor right" command 35 times. While this is not difficult, switching into control mode can be avoided by just writing 35 spaces instead (ASCII character 20h). It is then just a matter of writing the string "World" which is done by sending the 5 ASCII characters 57 h , 6Fh, 72h, 6Ch, 64h, for 'W', 'o', 'r', 'l', 'd'. This is illustrated in Figure 6-11, where the LCD and corresponding DDRAM locations are shown.


Figure 6-11: LCD screen and corresponding DDRAM locations

Accordingly the simplest way to display a string on the LCD screen without any additional overhead is to pad the string out to 80 characters with spaces. In this manner each string starts at the first location of the LCD screen and overwrites every DDRAM location. This has the added benefit of erasing any past characters that were on the LCD, thus clearing the screen before displaying each new string without ever having to switch modes.

Table 6-2: String table held in the first 400h bytes of RAM

| Ram Address | String |
| :---: | :--- |
| 000 h | Multimode Coder Online |
| 051 h | Source Word |
| 0 A 2 h | Code Word |
| 0 F 3 h | Hamming Code is |
| 144 h | Number of AddCWs |
| 195 h | RDS |
| 1 E 6 h | Num Tran |
| 237 h | 1st Tran at loc / MSW |
| 288 h | Winning Location |
| 2 D 9 h | Last Bit is |
| 32 Ah | Not Updating RDS or LB for Test |
| 37 Bh | Updating RDS and LB as normal |

For this reason all strings displayed on the LCD are stored in RAM as 80 bytes plus a 0 terminator for a total of 81 bytes ( 51 h ). That is, the $1^{\text {st }}$ string is stored from 000 h to 050 h , the $2^{\text {nd }}$ string is stored from 051 h to 0 Alh and so on. Therefore the first 1024 bytes (400h) of RAM are used to hold a string table that consists of 12 strings as shown in Table 6-2.

In accordance with the above discussion the following FSM in Figure 6-12 illustrates how to display information on the LCD screen. One of the 12 strings to display must be chosen, followed by an optional number.


Figure 6-12: FSM for displaying strings and numbers on the LCD screen
4. Running Mode - When the TX is in this state it is transmitting data serially to the RX. This process involves reading SWs from RAM, encoding them as CWs, performing the multimode CW addition, calculating statistics for each candidate CW to see which one maximizes the CS coding requirements, and finally transmitting these CWs serially. There are two modes of operation.
I. Run Once Mode - This mode is used for sending a data file once from the TX to the RX. The file is stored in the FPGA's RAM, and the TX sends each CW on the channel until the end of file is reached. Once the end of file is reached the TX automatically appends a 00 h data byte to indicate the end of the file. Note that depending on the code and AddCWs used, this 00 h byte is transmitted fully EC and CS coded.
II. Run Continuously Mode - This mode is used when measuring the PSD so the spectrum analyzer has a continuous signal to measure. In this mode the file is transmitted repeatedly which means when the end of file is reached, the transmitter wraps around to the beginning of the file.

Showing a complete block diagram of the Running mode FSM is not practical due to its size and complexity. Instead Figure 6-13 shows a high-level block diagram with a text description of the functionality of each state.


Figure 6-13: High-level block diagram of the running mode state FSM

Each block in Figure 6-13 represents a sequential FSM. As the FPGA was programmed with VHDL, each block also represents a VHDL module. The following section briefly describes the functionality of each one of these modules as well as the corresponding test benches to verify their functionality.

Read RAM and SW Packing: Since a data file is organized as a series of 8-bit bytes, the first major task is to take the file data and convert it into SWs of sizes as small as 1 -bit and as large as 26 -bits depending on the code being used. This is not straightforward since, for example, with the $(15,11)$ code 11-bits are needed to form a single SW. This would require reading two bytes from RAM to get 16-bits, to then form an 11-bit SW with 5 bits left over. Building the next SW however only requires reading from RAM once (unlike twice the last time) to get an additional 8 -bits for a total of 13 -bits. The next 11 -bit SW can now be created leaving 2-bits this time and so on. Thus the number of bits left over and the number of RAM reads varies for each coding interval depending on the code being used.


Figure 6-14: Testbench for SW packing

Figure 6-14 shows the testbench for this module called SW Packing. This particular test shows a $(15,9)$ SW being formed. From initialization the number of bits needed to form the SW (oneed) is set to 9 . The first byte of the file is read (theram_dout) from RAM address 400h (otheram_addr). Its value is 63 h and it is latched (oram_readlatched). The 8 -bits are then loaded into osw and it can be seen that one more bit is needed (oneed) to form the 9 -bit SW. Thus another byte is read (theram_dout) from address 401 h (otheram_addr) and its value 8 Ah is latched (oram_readlatched). Here only the MSb is taken from oram_readlatched and it is shifted left by one bit leaving 14 h . Now the complete

9-bit SW is formed (osw) which is 0C7h or 011000111b. Thus 7-bits remain (oneed) in oram_readlatched for building the next SW. To clarify how the two bytes 63h and 8Ah form the 9-bit SW C7h, the process is shown in Figure 6-15.


Figure 6-15: 63h and 8Ah form the 9-bit SW C7h or 011000111b

Hamming Encoder: As discussed in Section 2-14, the EC Hamming code can be implemented using very low level logic blocks. It is a configurable stand-alone module that allows selecting the generator polynomial as well as the length of the code. In addition, encoding can be done in parallel on the FPGA so a SW to CW conversion takes only a single clock cycle.


Figure 6-16: Testbench for the Hamming encoder

Figure 6-16 shows the testbench for the Hamming encoder module using a $(15,11)$ code. Encoding of the SWs $001 \mathrm{~h}, 002 \mathrm{~h}, 00 \mathrm{Fh}$, and 7 FFh each take a single clock and are encoded to $0013 \mathrm{~h}, 0026 \mathrm{~h}, 00 \mathrm{~F} 2 \mathrm{~h}$ and 7 FFFh respectively. Note that all SWs and CWs are represented using 32 bits and are truncated before serial transmission according to the $n$ and $k$ values of the $(n, k)$ code.

Codeword Addition: CW addition is the simplest block/module of Figure 6-13. As discussed in Section 2.5, modulo-2 addition can be done using XOR gates. It too can be done in parallel on the FPGA. Figure 6-17 shows the testbench for the
$(15,9)$ encoder．The CW 0C70h（ocw）is added modulo－2 to the four AddCWs $0000 \mathrm{~h}, 024 \mathrm{EBh}, 5 \mathrm{~B} 14 \mathrm{~h}$ and 7 FFFh ，to form the new CWs（onewcw0 to onewcw3）0C70h， $289 \mathrm{Bh}, 5764 \mathrm{~h}$ ，and 738 Fh ．Note that the flexible algorithm used always adds each CW to eight AddCWs for simplicity．Thus the testbench in Figure 6－17 displays eight new CWs，but only the first four are valid．The last four（onewcw4 to onewcw7）new CWs would not be considered for transmission．

| 7 | 几几几几 | 几几几几 | 几几「 |
| :---: | :---: | :---: | :---: |
|  | ［5］ 6123 | $4 \times 15 \times 18$ | $9 \times 123$ |
|  | －100000006 | 00000007 | ，00000000 |
| TT |  | Y00 | 000C70 |
|  |  |  | 100000C70 |
| What ${ }^{\text {a }}$ |  |  | 00002898 |
|  |  |  | 100005764 |
|  |  |  | 0000738 F |
| （T）${ }^{2}$（1） |  |  | 100000C70 |
|  |  |  | 000000770 |
| W9＝ |  |  | 00000C70 |
|  |  |  | R00000C70 |

Figure 6－17：Testbench for CW addition

Collect Statistics：As discussed in Section 4－1，the multimode coder evaluates the CW statistics and selects the best CW that minimizes the RDS and／or the MSW as well as maximizes the transitions．These statistics are gathered using the module called StatsGen．This module is instantiated once for every possible AddCW and thus the statistics for all 8 AddCWs can be calculated in parallel．


Figure 6－18：Testbench for the StatsGen module

Figure 6-18 shows the testbench for the StatsGen module. As discussed in Sections 4-2 and 4-3 the encoder keeps track of the current RDS (irds) and current last bit (ilastbit) of the previous CW transmitted on the channel (these comprise the state of the encoder). This continuous feedback is updated with every CW sent. To verify this functionality the testbench of Figure 6-18 collects the statistics on the same input CW (icw) 022Ch for the $(15,9)$ code three times, but with different irds and ilastbit equaling ( 0,0 ), $(-1,0)$ and $(0,1)$. As a result when the first CW 022Ch is sent, the calculated RDS is -7 and the number of transitions is 6 . However when the CW is sent a second time the current RDS is at -1 , resulting in a calculated RDS that is one less than before at -8 . This however has no effect on the number of transitions but does result in a higher MSW. The third and final time CW 022Ch is sent the RDS is back at 0 resulting in an RDS of -7 again. However since the current last bit was changed to a 1 this results in an extra transition for a total of 7 . This extra transition occurs between the last bit and the start of the CW. Finally the testbench also transmits CW 7A0Fh with an irds and ilastbit of $(0,1)$. This CW is more balanced than 022 Ch resulting in an RDS of 3. However this CW has fewer transitions with only 4. The results from the testbench in Figure 6-18 are summarized in Table 6-3.

This StatsGen module is the most important block of Figure 6-13. This is because the information gathered in this module is used to decide which new CW will represent the original SW, i.e. the multimode coding. This information is passed to the next block Select Best, which decides which of the new CWs will be transmitted.

Table 6-3: Summary of the StatsGen module gathering statistics

| CW | Current RDS | Current LB | RDS | Tran | MSW | 1stTran |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 022C | 0 | 0 | -7 | 6 | 412 | 5 |
| 022C | -1 | 0 | -8 | 6 | 575 | 5 |
| 022C | 0 | 1 | -7 | 7 | 412 | 5 |
| 7A0F | 0 | 1 | 3 | 4 | 84 | 4 |

Select Best and Update Channel Statistics: Once the statistics have been collected a decision must be made to determine which CW will be sent on the channel. In the $(15, x)$ and $(7, x)$ encoders the decision was based on which of the new CWs had the lowest RDS, followed by the tie breaking criteria which included most transitions, lowest MSW, and the CW with the first transition. Note that the CW with the first transition test was only used to provide a final tie break in order to always be able to predict which CW the encoder would select in the unlikely event that two or more CWs had the identical RDS, MSW, and number of transitions. Simulations demonstrated that this selection metric was used less than $1 \%$ of the time with any AddCW set.

Also note that for the $(31, x)$ codes only the lowest RDS, most transitions and first transition metrics were used. This was due to the size and resource limitations of the FPGA. Missing the MSW test has very little impact on the code performance, since the RDS and transitions test were the most important, and the MSW test was only used to break ties.


Figure 6-19: Testbench for the SelectBest module

Continuing with the example from Figure 6-17 the testbench for the SelectBest module is shown in Figure 6-19. Here the statistics are compared for the four CWs 0C70h, 289Bh, 5764h, and 738Fh. This is done sequentially. The algorithm initially chooses CW0 to be the best, and then the module compares each new CW's statistics to the current best statistics and only updates the winner if the statistics are better. For example from Figure 6-19 it can be seen that the winning location (owloc) is temporarily changed from 0 to 1 , before location 2 is selected as the best.

This process is summarized in Table 6-4 where it can be seen that CW1 (289Bh) is better than CW0 ( 0 C 70 h ) since it has a lower absolute RDS of -1 as compared to -5 . CW1 (289Bh) is then the current best until comparison with CW2 (5764h) indicates that they are tied in terms of lowest absolute RDS. Since both of these CWs will minimize the RDS, preference should be given to the CW that has the most transitions. Thus CW2 (5764h) with 10 transitions is chosen to be the best CW to transmit since it has one more transition than CW1 (289Bh). Comparison with CW3 (738Fh) indicates that CW2 (5764h) is still the better choice and thus it is the overall winner and is selected for transmission.

Table 6-4: Summary of the SelectBest module

| CW location | $C W$ | RDS | Tran | MSW | 1stTran |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 0 | $0 C 70 h$ | -5 | 4 | 116 | 3 |
| 1 | 289 Bh | -1 | 9 | 72 | 1 |
| 2 | 5764 h | 1 | 10 | 72 | 1 |
| 3 | 738 Fh | 5 | 3 | 116 | 3 |

Send Serially: Once the best CW is selected it must be transmitted serially along with a serial clock to the RX FPGA. Figure 6-20 shows the testbench for the Parallel2Serial module. Here the best CW 5764h chosen from the testbench above in Figure 6-19 is being transmitted serially. Once the go pulse is received the data is sent starting with the MSb first on the sdata line. The sclk signal is also sent
with a low to high transition occurring midway between each data bit. Once the entire 15 -bits are transmitted the Parallel2Serial module signals that it is ready to transmit another CW. This hand shaking is used to prevent the FPGA that is running at 50 MHz from swamping the Parallel2Serial module that transmits at a maximum rate of 1 MHz .


Figure 6-20: Testbench for the Parallel2Serial module

### 6.5. PC Software: Controlling the TX FPGA

The TX FPGA was controlled using Windows software with the interface shown in Figure 6-21. As shown in Table 6-1 all sixteen commands were issued using the control port, with additional instructions coming from the data port. In order to explain the operation and features of the software, an example is given illustrating how to transmit a file.


Figure 6-21: Software that controls the TX FPGA

### 6.6. Transmitting a File

This section briefly describes how to transmit a standard ASCII text file that contains the 11 byte string "Hello World". The first step after powering on the board is to ensure that the FPGA is in a known state by pressing Reset FPGA Board. This forces the FPGA into the reset state and initiates the main FSM of Figure 6-10. Since the FPGA has just powered up the string table from Table 6-2 must be loaded into the first 400 h bytes of RAM. This is accomplished using the Send Strings button and pressing the Check Strings button reads back the string table from RAM to verify that they were stored without error. Now the number of AddCWs can be set up as $0,2,4$ or 8 using the drop down box as shown in Figure 6-22. For example with length 31 Hamming CWs, this would set up a $(31,26),(31,25),(31,24)$, or $(31,23)$ code respectively.


Figure 6-22: Setting up the number of AddCWs

Once the number of AddCWs has been selected, pressing the Send number of AddCW's button writes this value to the FPGA. The resultant code is immediately displayed on the DIO2 boards 7 -segment display. For example, eight AddCWs has been selected resulting in the $(31,23)$ code as shown in Figure 6-23.


Figure 6-23: Eight AddCWs have been selected for use with the (31,x) code

Once the encoder knows the number of AddCWs to use it must be told what these AddCWs are. This is done by sending the encoder AddSWs which it then encodes into AddCWs. Pressing the Read AddSW's from File button allows selection of an ASCII text file that contains the desired AddSWs as shown in Figure 6-24. Note that only the first four AddSWs are stored in this file. This is because the system is set up to automatically use the CW complements. Hence these four AddSWs are encoded to four AddCWs and four AddCW complements, resulting in a total of eight AddCWs.


Figure 6-24: Reading the AddSWs from file to use with the $\mathbf{( 3 1 , 2 3 )}$ code

The Send AddSW's button writes these AddSWs into the FPGA memory. The Initialize Encoder button causes the encoder to encode the four AddSWs to eight AddCWs using the Hamming encoder module as explained above. The encoder then automatically writes these AddCWs back to the PC to allow for verification that they were encoded correctly, as shown in Figure 6-25.


Figure 6-25: Encoder writes back the 8 AddCWs it is going to use

The file to transmit is selected by pressing the Choose a file to Send button which brings up a file selection dialog box. Once a file is selected it is transferred to the FPGA's RAM by pressing either the Send the file to FPGA button or the Send Encoder File button as shown in Figure 6-26. The difference between these two functions is that the Send the file to FPGA button writes the file contents into the string tables address space. This is done to allow the loading of different strings into the string table (other than the default hard coded ones) if required. The Send Encoder File button writes the file contents directly above the string tables' address space.


Figure 6-26: Two ways to transmit the file into FPGA RAM

There are two ways to verify that the file was loaded correctly. First by pressing the Read Encoder File saves to "FileReadBack.txt", the file is read back from the FPGA and a byte by byte comparison is done with the original file that was sent. Any errors that occurred are displayed in a popup box and the file read back is saved to the file FileReadBack.txt. This allows for saving and inspection of the data at a later time.

Secondly by pressing the Read back from FPGA button the file is read back and displayed in the two windows as shown in Figure 6-27. The top window displays the ASCII representation of the data, while the bottom window shows the decimal representation. Figure 6-27 shows the file "Hello World" was read back and what a possible error might look like.


Figure 6-27: Reading back the file Hello World, and what a possible error might look like

Once the file is loaded into the FPGA's RAM and the AddCWs are selected, the next step is to select the serial transmission speed. This is done by selecting the Parallel 2 Serial Speed as either $1 \mathrm{kHz}, 10 \mathrm{kHz}, 100 \mathrm{kHz}$ or 1 MHz as shown in Figure $6-28$. These various speeds were useful for testing and trouble shooting. For example, the logic analyzer could capture far more samples at 1 kHz than it could at 1 MHz . However, the spectrum analyzer could not measure frequencies below 9 kHz , and thus having the ability to select different speeds proved quite useful.


Figure 6-28: Choosing the serial transmission speed

Another debugging feature implemented was the ability to change the amount of RAM the encoder could use, specified as RAM Size. This is because when transmitting a file repeatedly, the end of the file is inevitable reached and the file data is deliberately wrapped around back to the start. In order to verify that the rollover occurred correctly, it
was easier and faster to simply reduce the size of RAM (as far as the FPGA was concerned) to force a roll over sooner than it was to load in a smaller version of the file.


Figure 6-29: Setting the amount of RAM that the FPGA could use

The final step is to select which method to use when transmitting the file. If Running Mode (Once) is selected then the file is transmitted until the file in RAM is exhausted. That is, this mode is used to transmit a single file once from the TX to the RX. The other method of transmitting a file is to choose Running Mode (Repeat). In this mode the transmission runs continuously, sending the file over and over again until the transmitter is stopped by pressing either Initialize Encoder or Reset FPGA Board. This mode is used when measuring PSDs since the digital oscilloscope and spectrum analyzer are designed to monitor continuous signals.

### 6.7. Debug mode of the FPGA

Once the system was designed and implemented it had to be verified that the encoder was indeed selecting the best CW for transmission. When switch 0 (SW0) is set on the DIO2 board, the encoder is put into a debug mode where with alternate use of buttons 8 and 9 , each phase of the CW selection process can be stepped through.

As discussed in Section 4-4, a simulation called SimFPGA was developed to emulate the way the FPGA hardware would perform the encoding. Table 6-5 shows a small sample of the output file created with this program. In this example the $(31,26)$ code is expurgated to $(31,23)$ using 8 AddCWs created from the set of four AddSWs which are $\{0 \mathrm{~h}, 0800000 \mathrm{~h}, 1000000 \mathrm{~h}, 1800000 \mathrm{~h}\}$. Since this is a large code the FPGA needs to read from RAM three times in order to obtain enough data to form the 23-bit SW. In this case the three bytes read are 2Ah (00101010b), 22h (00100010b) and 56h
(01010110b). These three bytes (24-bits) form the 23-bit SW 015112Bh (00101010001000100101011b) with 1-bit left over. Table 6-5 shows the 31-bit CW added to all eight AddCWs along with the RDS, number of transitions, MSW and first transition location statistics.

Table 6-5: Output from SimFPGA

| ****** RamRead $=2 \mathrm{~A}$ |  | FilePointer $=0$ of 49 ****** |  |  |  |
| :---: | :---: | :---: | :---: | :---: | :---: |
| ****** RamRead | $=22$ | FilePo | inter $=$ | 1 of 49 * | $\star * * * * *$ |
| ****** RamRead | $=56$ | FileP | inter = | 2 of 49 * | ****** |
| $\mathrm{SW}=0015112 \mathrm{Bh}$ | RDS | TRAN | MSW | 1stTran |  |
| 02A22562h | -11.0 | 18.0 | 1768.0 | 6.0 |  |
| 12A2256Ch | -7.0 | 20.0 | 880.0 | 3.0 |  |
| 22A2257Eh | -3.0 | 18.0 | 744.0 | 2.0 |  |
| 32A22570h | -7.0 | 18.0 | 448.0 | 2.0 |  |
| 4D5DDA8Fh | $+7.0$ | 19.0 | 448.0 | 2.0 |  |
| 5D5DDA81h | $+3.0$ | 19.0 | 744.0 | 2.0 **** |  |
| 6D5DDA93h | $+7.0$ | 21.0 | 880.0 | 3.0 |  |
| 7D5DDA9Dh | +11.0 | 19.0 | 1768.0 | 6.0 |  |

Figure 6-30 shows the encoder in debug mode displaying this process on the LCD screen. Note that all values are given in hexadecimal including negative numbers. For example an RDS of -11 is really F5h (using two's complement representation) and 18 transitions would be displayed as 12 h .


Figure 6-30: Encoder in debug mode showing the CW selection process

From Table $6-5$ it can also be seen that the $6^{\text {th }} \mathrm{CW}$ or CW5 is the winner with an RDS of $+3,19$ transitions and an MSW of 744. Figure 6-31 demonstrates that after stepping through all possible CWs and their statistics, the FPGA selects the Winning Location to be CW5.


Figure 6-31: Winning location is CW5 or 5D5DDA81h

### 6.8. Receiver State Machine

The RX FPGA runs the same main state machine as the TX which consists of four states as shown in Figure 6-10 or Figure 6-32. On power up it is forced into state 1 which is the reset state that initializes the decoder. After this the FSM cycles through states 2,3 , and 4 indefinitely. Only by powering the FPGA off and on, or through user input in state 2 will the system enter the reset state again.


Figure 6-32: Main receiver FPGA FSM

## 1. Reset State - Identical to the TX reset state

2. Check for Input - Identical to the TX check for input state.
3. Service Output - Identical to the TX service output state.
4. Running Mode - This is the main mode of the communication system. It is similar to the equivalent TX state in that many of the same operations are present. The major difference however is that most functions are now in reverse. When the RX is in this state it is receiving data serially from the TX and processing it to recover the original data. This involves combining the serial data into CWs, performing error correction, extracting the MSbs to apply the CW addition again to remove the CS coding and finally storing these SWs into RAM.


Figure 6-33: High-level block diagram of the running mode state FSM

Showing a complete block diagram of the Running mode FSM is not practical due to its size and complexity. Instead Figure 6-33 shows a high-level block diagram with a text description of the functionality of each state. Each block represents either a VHDL module and/or a sequential FSM.

Serial to Parallel Conversion: Once in running mode the RX continuously monitors for changes on the serial clock line. The serial data (sdata) is sampled when a rising edge (low to high transition) occurs on the serial clock (sclk). Whenever the RX has sampled $n$ bits it packs them into an $n$-bit CW which is passed to the Hamming Decoder for error correction and conversion to a SW.

Figure 6-34 shows the testbench for the Serial2Parallel module. The CW 5D5DDA81h has been received serially and stored into data. At this point the Serial2Parallel module signals that a CW is ready.


Figure 6-34: Testbench for the Serial2Parallel module

Hamming Decoder: Once a CW has been received it is sent to the Meggitt Decoder (see Figure 2-6) which attempts to correct any errors in the CW. Recall from Section 2-12 that Hamming codes can correct single bit errors per CW. Thus if the number of errors exceeds the power of the error correction code there will possibly be error extension. Therefore Figure 6-35 shows the CW 5D5DDA81h from above being decoded in three different scenarios. The first time the CW does not contain any errors and it is correctly decoded to SW 2EAEED4h. The second time however the CW is forced to contain two bit errors. This results in CW 5D5DDA87h being decoded to SW 2EAEFD4h. Note that this SW still contains one error. In the last case the CW only has a single bit error resulting in CW 5D5DDA80h being decoded to the correct SW 2EAEED4h.


Figure 6-35: Testbench for the Meggitt Decoder module

CW Addition: As described in Section 3-2 when a CW has been received and corrected the original CW can be recovered by adding the same AddCW that was added at the TX. This is done by checking the MSbs of the CW as shown in Figure 3-20. However an alternative and equally valid approach takes advantage of the fact that CWs used in this system are systematic (see Section 2.4). Since the Meggitt Decoder module returns the error corrected SW (ohamd_sw), the CW addition module can recover the original SW by adding the correct AddSW. The testbench in Figure 6-36 demonstrates this.

Continuing with the example presented in Table 6-5 the original SW 015112Bh was transmitted as CW 5D5DDA81h. This was error corrected and decoded by the Meggitt Decoder module to the SW (ohamd_sw) 2EAEED4h. In order to recover the original SW the three MSbs of this 23 -bit SW are inspected to determine that the winning location (owloc) must have been 5 . Therefore adding AddSW5 (oaddsw5) which is 2FFFFFFh to this SW recovers the original SW (ointsw) 015112Bh.

|  | 7ecthesweddifoncikgit |
| :---: | :---: |
|  | nestheswadthitnrohamd siw |
|  |  |
|  | hestheswaditantonsw |
|  | Nexthernodation orumithe |
|  | denthesmbationot |
|  | hosineswad |
|  |  |
|  |  |
|  |  |
|  |  |
|  |  |
|  | hesheswodriowoadim? |


|  |  |  |
| :---: | :---: | :---: |
|  |  |  |
| $\longrightarrow-15$ |  |  |
| x-xox | 00151128 |  |
| $0 \times 1 \times 12 \times 14$ | $15] 17 \times 15$ |  |
| X $\times$ X $\times$ X $\times 100000$ | 00 |  |
|  | 00 |  |
|  |  |  |
| XXXXXX 018000 |  |  |
| WXXSXXX027FF |  |  |
|  | FF |  |
|  | FF |  |
|  |  |  |

Figure 6-36: Testbench for the CW addition module

SW Unpacking: The final step in this process is to recover the original 8-bit data values (bytes) from the $k$-bit SWs and store them into RAM. However, similar to the SW Packing module, this is complicated by the variable $k$-bit SW size. For instance if $k$ is a small number as in the case of the $(7,1)$ code, it will take eight SWs to fill a single byte. However if $k$ is a large number, as in the case of the $(31,25)$ code, a single SW will fill three bytes in a row with one bit left over.

Figure 6-37 shows the testbench for the SW Unpacking module. Here SW 015112Bh from Figure $6-36$ is unpacked and stored into RAM. The 23 -bit SW is 8 -bits +8 -bits +7 -bits. Thus 2 Ah is stored at address 400 h , and 22 h is stored at address 401 h . Referring back to Table $6-5$ shows that these are the correct values. The remaining 7-bits will not be written to RAM until the next SW arrives.


Figure 6-37: Testbench for the SW Unpacking module

### 6.9. PC Software: Controlling the RX FPGA

As shown in Figure 6-38, the RX software has many of the same functions as the TX software. This is because the RX FPGA needs to set up the same Hamming code, use the same AddCWs and so on. The difference however is that the RX can receive the bit sequence at any of the predetermined transmission rates without having to specifically select one. Furthermore the RX was constantly in a state of monitoring the clock and data lines for activity. In order to explain the operation and features of the software an example follows which outlines how to receive a file.


Figure 6-38: Software that controllers the RX FPGA

### 6.10. Receiving a File

This section briefly describes how to receive a standard ASCII text file such as the eleven byte "Hello World" file discussed in Section 6.6. Initializing the RX FPGA is almost identical to the TX FPGA. The first step after powering on the board is to ensure the FPGA is in a known state by pressing Reset FPGA Board. This forces the FPGA into the reset state shown in Figure 6-32, and therefore begins the main FSM. After this the string table from Table 6-2 is loaded, the number of AddCWs is set up and the AddSWs are loaded into the FPGA memory in the exact same manner as the TX FPGA. At this point however the setup changes.

To receive a file what is typically done is to first set up the RX RAM with a preloaded dummy file. Then any data received by from the TX will overwrite this data to verify that communication was successful. For example, Figure 6-39 shows the dummy file "numbers.txt" loaded into the RX FPGA's RAM. This is displayed when the Read back from FPGA button is pressed. The top window displays the ASCII representation and the bottom window displays the decimal representation, i.e. ' 1 ' $=49$ decimal, $' 2 '=50$ decimal and so on.


Figure 6-39: Dummy file read back from the RX FPGA RAM

Pressing the Running Mode button places the RX into a mode where any data received on the sdata line is automatically stored into the RX RAM. Thus when the TX transmits the "Hello World" file in the Run Once Mode, the 11-byte file plus 00h byte terminator is sent to the RX. Pressing the Read back from FPGA button now shows the current RAM contents.


Figure 6-40: RAM contents read back from the RX FPGA

Figure $6-40$ shows how the received 11 -byte file has overwritten the RAM contents. Notice that the top window only shows the string "Hello World", while the bottom window still displays the 40 byte RAM contents. This is because in the Run Once Mode the TX automatically appends a 00 h byte to indicate the end of the
transmission. In the lower window it can be seen that the $12^{\text {th }}$ byte is a 0 terminator. In the top window the ASCII representation stops displaying data at this point since strings in $\mathrm{C}++$ are 0 terminated. However, since it was requested that 40 bytes were to be read back from RAM, the lower window still shows all 40 bytes in decimal form.

### 6.11. Verify that the system could correct single bit errors

In this 3-wire system the channel length was less than 10 cm and the maximum transmission rate was 1 MHz in base band. Hence the channel was practically ideal (noiseless). Therefore, in order to test the error correcting abilities of the combined code, errors were intentionally introduced into the transmitted bit stream. Using SW(6) or SW(7) a 1-bit or 2-bit error per CW could be introduced on every CW sent on the channel as shown in Figure 6-41.


Figure 6-41: How the switches on the DIO2 board controlled the TX FPGA

To verify that bit errors were being introduced the RX was configured to store all received CWs into a file called FileReadBack.dat, to compare with the file transmitted. Figure 6-42 shows the same sixteen CWs transmitted using the (7,2) code with and without $S W(6)$ on. The top line shows the CWs all have a single bit error in the LSb position while the bottom line shows the CWs without errors. Note that as a result of the error correcting ability, single bit errors have no effect on the accuracy of the SWs and they will still be decoded correctly.

```
000000 45 3^ 69 45 27 58 5B 53 2C 27 3A 45 16 69 27 58 E:iE'XXS,':E.i'X
000000 44 3B 68 44 26 59 59 52 2D 26 3B 44 17 68 26 59 D:hD&YYR-&:D.h&Y
```

Figure 6-42: Top line is CWs with single bit errors, bottom line is error free CWs

### 6.12. The effect of more than single bit errors

When $\operatorname{SW}(7)$ is set the TX introduces 2-bit errors per CW into the transmitted bit stream. This number of errors exceeds the error correcting ability of the EC code, and thus the SWs will no longer be decoded correctly. This effect can be demonstrated best by observing the result of transmitting the "Hello World" file again with SW(7) on. The resulting decoded data is displayed in Figure 6-43.

It is obvious from Figure 6-43 that the text in the ASCII window is completely unrecognizable when compared to Figure 6-40. This is because the excess bit errors resulted in the wrong SWs being decoded and as a result the decoded ASCII text is garbled. Another interesting result is that the original dummy file "numbers.txt" appears in both the ASCII and decimal windows in its entirety. This is because the terminating 00 h byte which followed the "Hello World" text was also corrupted (the $12^{\text {th }}$ byte is decoded as a 1 as shown in the lower window in Figure 6-43). Thus without the 0 to terminate the string the entire requested 40 bytes is displayed.


Figure 6-43: "Hello World" file received with 2-bit errors per CW

### 6.13. Equipment used to measure the PSD

The two devices used to measure the spectral characteristics of the transmitted sequences were the Agilent 54621A Digital Oscilloscope (DO) and the Agilent E4402B $\mathbf{9 k H z}-\mathbf{3 G H z}$ Spectrum Analyzer (SA) as shown in Figure 6-44.


Figure 6-44: Digital oscilloscope and spectrum analyzer used

### 6.14. Testing discrete spectra - $\mathbf{5 0 \%}$ duty cycle square wave

In order to have confidence in the measured spectra some initial tests were performed. The first step was to observe the spectrum of a unipolar $3 \mathrm{Vpp}, 100 \mathrm{kHz}, 50 \%$ duty cycle square wave on the SA and DO , then compare this spectrum to calculations. The actual lab signal measured was -31 mV to 2.875 V or approximately 2.9 Vpp . The measured average voltage was 1.438 V and the measured $\mathbf{R M S}$ voltage was 2.046 V , as shown in the DO screenshots in Figure 6-45.


Figure 6-45: 50\% duty cycle unipolar square wave

Calculating the ideal average and RMS value is shown in Equations 6.1 and 6.2.

$$
\begin{gather*}
A v g=\frac{1}{T_{0}} \int_{T_{0}} f(t) d t \quad A v g=\frac{1}{T_{0}}\left[\int_{0}^{T_{0} / 2} 2.875 d t+\int_{T_{0} / 2}^{T_{0}}-0.31 d t\right]=1.42 V  \tag{6.1}\\
R M S=\sqrt{\frac{1}{T_{0}} \int_{T_{0}}(f(t))^{2} d t} \quad R M S=\sqrt{\frac{1}{T_{0}}\left[\int_{0}^{T_{0} / 2}(2.875)^{2} d t+\int_{T_{0} / 2}^{T_{0}}(-0.31)^{2} d t\right]}=2.03 V \tag{6.2}
\end{gather*}
$$

Table 6-6: Calculated AVG and RMS versus measured

## Ideal Measured

AVG $1.42 \mathrm{~V} \quad 1.43 \mathrm{~V}$
RMS 2.03V 2.04 V

Table 6-6 shows how the actual square wave produced by the FPGA agrees with calculations in terms of average and RMS voltages.

### 6.15. Fourier Series Representation

Converting periodic signals from the time domain to the frequency domain can be done with a Trigonometric Fourier Series. The most general form is

$$
\begin{equation*}
f(t)=a_{0}+\sum_{n=1}^{\infty}\left(a_{n} \cos (n \omega t)+b_{n} \sin (n \omega t)\right) \quad \text { where } \quad \omega=\frac{2 \pi}{T_{0}} \tag{6.3}
\end{equation*}
$$

$$
\begin{array}{ll}
a_{0}=\frac{1}{T_{0}} \int_{0}^{T_{0}} f(t) d t \\
a_{n}=\frac{2}{T_{0}} \int_{0}^{T_{0}} f(t) \cos (n \omega t) d t & n=1,2,3 . .  \tag{6.4}\\
b_{n}=\frac{2}{T_{0}} \int_{0}^{T_{0}} f(t) \sin (n \omega t) d t & n=1,2,3 . .
\end{array}
$$

Using the relation

$$
\begin{equation*}
a_{n} \cos (n \omega t)+b_{n} \sin (n \omega t)=C_{n} \cos (n \omega t+\theta n) \tag{6.5}
\end{equation*}
$$

where

$$
\begin{align*}
& C_{n}=\sqrt{a_{n}^{2}+b_{n}^{2}} \\
& \theta_{n}=\tan ^{-1}\left(\frac{-b_{n}}{a_{n}}\right) \tag{6.6}
\end{align*}
$$

this can also be written as a Compact Fourier Series which has the form:

$$
\begin{equation*}
f(t)=C_{0}+\sum_{n=1}^{\infty} C_{n} \cos (n \omega t+\theta n) \tag{6.7}
\end{equation*}
$$

This form gives a one-sided spectrum identical to what is displayed on a DO and SA. In addition it is very easy to evaluate a magnitude and phase spectrum that together completely describe the waveform. Calculating the Compact Fourier Series of the $50 \%$ duty cycle square waveform yields:

$$
\begin{equation*}
a_{0}=\int_{0}^{T_{0} / 2} 2.875 d t+\int_{T_{0} / 2}^{T_{0}}-0.31 d t=1.42 \tag{6.8}
\end{equation*}
$$

$$
\begin{gather*}
a_{n}=\frac{2}{T_{0}}\left(\int_{0}^{T_{0} / 2} 2.875 \cos (n \omega t) d t+\int_{T_{0} / 2}^{T_{0}}-0.31 \cos (n \omega t) d t\right)=0 \text { for } n=1,2, \ldots  \tag{6,9}\\
b_{n}=\frac{2}{T_{0}}\left(\int_{0}^{T_{0} / 2} 2.875 \sin (n \omega t) d t+\int_{T_{0} / 2}^{T_{0}}-0.31 \sin (n \omega t) d t\right)  \tag{6.10}\\
b_{n}=\frac{.0098(\cos (2 n \pi)-93.741(\cos (n \pi)-0.989))}{n} \text { for } n=1,2, \ldots
\end{gather*}
$$

Using these values the $C_{n}$ coefficients of the Compact Trigonometric Fourier series can be calculated. The result is shown in Table 6-7.

Table 6-7: First nine $\boldsymbol{C}_{\boldsymbol{n}}$ terms from Compact Fourier Series (linear scale)

| Harmonic | Frequency (kHz) | Calculated |
| :---: | :---: | :---: |
| 1 - Fundamental | 100 | 1.85 |
| 2 | 200 | $1.39 \mathrm{E}-15$ |
| 3 | 300 | 0.616 |
| 4 | 400 | $45.26 \mathrm{E}-15$ |
| 5 | 500 | 0.370 |
| 6 | 600 | $30.17 \mathrm{E}-15$ |
| 7 | 700 | 0.264 |
| 8 | 800 | $46.49 \mathrm{E}-15$ |
| 9 | 900 | 0.205 |

These $C_{n}$ values represent peak values of sinusoids and can be used to recreate the waveform. Also since ideal $50 \%$ square waves contain no even harmonics, these values are extremely small (approximately zero) as shown in Table 6-7, and only exist as a result of calculator precision (i.e. machine precision).

In order to compare these values with the DO and SA they must be converted to the decibel (dB) scale. The DO displays its results in terms of $d B$ per volt RMS
( $\mathrm{dBV}_{\mathrm{RMS}}$ ) across a 1 Mohm load (ideally infinite), while the SA displays its results in dB per milliwatt across a 50 ohm load ( $\mathbf{d B m}$ ). Therefore in order to get the same units for comparison the following conversions are used.

$$
\begin{align*}
& d B V r m s=10 \log \left(\left(\frac{C_{n}}{\sqrt{2}}\right)^{2}\right)  \tag{6.11}\\
& d B m=10 \log \left(\frac{\left(\frac{C_{n}}{\sqrt{2}}\right)^{2}}{(50 / 1 m W)}\right) \tag{6.12}
\end{align*}
$$

Table 6-8 shows the first nine $C_{n}$ values converted to $\mathrm{dBV}_{\text {rms }}$ which are compared to measured values from the DO .

Table 6-8: First nine $C_{n}$ terms from Compact Fourier Series in dBV ${ }_{\text {RMS }}$

| Harmonic | Frequency (kHz) | DBVrms (measured) | DBVrms (calculated) |
| :---: | :---: | :---: | :---: |
| Fundamental | 100 | 2.5 | 2.33 |
| 2 | 200 | -60 | $<-60$ |
| 3 | 300 | -7.19 | -7.21 |
| 4 | 400 | $<-60$ | $<-60$ |
| 5 | 500 | -11.56 | -11.64 |
| 6 | 600 | 660 | $<-60$ |
| 7 | 700 | -14.44 | -14.57 |
| 8 | 900 | $<60$ | $<-60$ |
| 9 | 900 | -16.56 | -16.77 |

Note that the Fourier Series indicates that there should only be odd harmonics. However the PSD measured on the DO clearly shows even harmonics less than - 60 $\mathrm{dBV}_{\text {rms }}$ as shown in Figure $6-46$. This is due to the fact that the square waves emitted
from the FPGA are not perfect, i.e. they do not have an infinitely fast rise time. Hence there will be some power in the even harmonics. For comparison the -30 dBV RMS value is the center line of Figure 6-46 allowing one to gauge that the first peak is near 2 dBV RMs.


Figure 6-46: PSD measured on the DO and SA

Table 6-9: First nine $\boldsymbol{C}_{\boldsymbol{n}}$ terms from Compact Fourier Series in dBm

| Harmonic | Frequency (kHz) | DBm (measured) | DBm (calculated) |
| :---: | :---: | :---: | :---: |
| Fundamental | 100 | 15.44 | 15.32 |
| 2 | 200 | -40 | $<-200$ |
| 3 | 300 | 6.11 | 5.78 |
| 4 | 500 | 1.65 | $<-200$ |
| 5 | 600 | -40 | 1.34 |
| 6 | 700 | -1.37 | $<-200$ |
| 7 | 900 | -3.45 | -1.57 |
| 8 | 900 |  | -3.75 |
| 9 |  |  |  |

Figure 6-46 also shows the PSD as measured on the SA. In this case the first peak has a marker on it which indicates the peak is at 15.44 dBm . Table 6-9 shows how well the calculations match up with the SA measured values. The even harmonics are also noticeable on the SA which again are due to the fact that the square waves are not ideal. In addition, both PSDs should ideally be line spectra, however the resolution settings on
the DO and SA and the imperfect nature of the square waves result in these broader spikes (concentrated frequency bands) in the frequency domain.

### 6.16. Accuracy of discrete PSD measurements

While the DO is useful for displaying time domain waveforms it is generally not considered to give the most accurate PSD. This is because the PSD is based on the FFT and using various digital signal processing (DSP) techniques such as windows. Therefore the majority of measured PSDs presented in this thesis will be from the SA since it is considered to give a more accurate representation of the PSD.

When measuring spectra with the SA one factor that has to be taken into consideration is its bandwidth ( BW ) of 9 kHz to 3 GHz . The manual stated that frequency measurements below 9 kHz were not accurate. This is demonstrated in Figure 6-47 where a $6.25 \%$ duty cycle square wave is measured on the SA and compared with calculations. Clearly it can be seen that as the frequency falls below the low end of the bandwidth of the analyzer the measured values no longer agree with calculations. In addition, neither the DO nor SA is capable of giving accurate readings at $\mathrm{DC}(0 \mathrm{~Hz})$. As a result the DC value shown in Figure 6-47 is the measured DC averages from the time domain.


Figure 6-47: Shows how the SA is not accurate below 9 kHz

### 6.17. Accuracy of continuous PSD measurements

The BW of the SA must also be taken into account when measuring continuous spectra. For example Figure 6-48 demonstrates the difference between the measured PSD of a random binary waveform and the calculated PSD. The 9 kHz cutoff frequency of the SA has a pronounced effect on the measured PSD in that the power seems to drop dramatically, only to ramp up very high at DC . The power drop off results from the BW limitation of the SA and the ramp up at DC is due to the DC marker of the SA. This is a feature found on many SA's since PSD measurements are often double sided. That is, any spectrum appearing to left of the DC marker (negative frequency range) is essentially a mirror image of the spectrum to the right of the marker (positive frequency range). These two effects at low frequencies should not be the present as shown in the PSD plot generated by Matlab.


Figure 6-48: PSD of a random binary square wave, measured versus calculated

Another factor that needs to be taken into account is the resolution bandwidth (ResBW) of the SA. When the SA takes power measurements it does so using banks of analog filters. It dwells in certain frequency bandwidths set by ResBW, and measures the power in these bands. As a result, large ResBWs tend to give a more coarse picture of the shape of the PSD, while smaller ResBWs tends to give a more detailed picture of the shape of the PSD. Aside from the shape, the actual power measured will differ depending on the ResBW also. For example in Figure 6-49 the same PSD is measured using a

ResBW of 3 kHz and 1 kHz respectively over a bandwidth of 300 kHz and 100 KHz respectively. The power is measured at 50 kHz is different in both plots at -0.825 dBm and -5.042 dBm respectively.


Figure 6-49: Same PSD measured with two different ResBW's

The 3 kHz ResBW plot on the left shows a coarser picture of the shape of the PSD and a high measured power of -0.825 dBm at 50 kHz . However the 1 kHz ResBW plot on the right shows a detailed picture of the shape of the PSD over one lobe, and a smaller measured power of -5.042 dBm at 50 kHz . This three times increase in ResBW directly corresponds to approximately a $10 \log [3]=4.77 \mathrm{~dB}$ change in the measured power, i.e. $5.042-.825=4.2 \mathrm{~dB}$. As shown in the following section, this difference in measured power is to be expected with continuous spectra, and for every factor of 10 increase in ResBW, the measured power should drop by a factor of 10 dB , and vice versa.

### 6.18. How the ResBW affects the PSD

Recall from Figure 5-12 the PSD of the $(7,4)$ Hamming code expurgated to a $(7,1)$ code from Section 5-5. Figure 6-50 shows the PSD measured on the SA with different ResBWs. The top line corresponds to a ResBW of 3 kHz , the middle line to 1 kHz and the bottom line to 300 Hz . The first thing to notice is that the ResBW has a factor of 10 difference between the 3 kHz plot and the 300 Hz plot, and this directly relates to a 10 dB
difference in continuous power levels. What is happening is that when the ResBW is set to 3 kHz , the SA measures power in 3 kHz bands and plots the result as a single point in the spectrum. Hence it more crudely estimates where the power is located since it is unable to distinguish frequencies smaller than 3 kHz . Contrast this with the 300 Hz ResBW which has 10 times the frequency resolution. The SA now measures and plots power in the 300 Hz bands. While it takes longer for the SA to generate this plot, it can display more accurate information with these smaller bands. As a result the plot appears more spiky. Also the more the ResBW is lowered the more the continuous component of the PSD drops. This is because the SA is measuring and plotting the power in smaller frequency bands. However, when there are strong spectral components in narrow frequency ranges, their power will not change as the ResBW is decreased. This can be seen in Figure 6-50 where the dominant discrete like components maintain their relative power levels as the ResBW is decreased. Clearly, when comparing measured PSDs to calculated PSDs, it is only practical to compare their relative shapes and not their overall power levels.


Figure 6-50: The effect of decreasing the ResBW

### 6.19. Comparing spectra of various codes

This section deals with measuring the PSDs of various codes with different AddCW sets and comparing the results to the theoretical PSDs presented in Section 5.6. As discussed in that section, two heavily unbalanced source statistics of $10 \%$ and $90 \%$ probability of a logic 1 are considered. The source statistics were generated using the computer program BuildRandomBitFile as shown in Figure 6-51.

Figure 6-51 demonstrates the use of this program to generate a file of size 1024 bytes that contains approximately $10 \%$ logic 1 s . The file generated in this example actually had $9.83 \%$ probability of a logic 1 . The output file consists of 1024 bytes or 8192 bits which are primarily logic 0s as shown in Figure 6-52. This file is then transmitted repeatedly on the channel using Run Mode Continuously by the TX.


Figure 6-51: BuildRandomBitFile program for generating random data files


Figure 6-52: 'Random_ones_10_SizeIs1024.bin' shows the file is mostly logic 0s

### 6.20. $(15,8)$ Best AddCW sets

The following PSDs were measured with the SA using the randomly generated 8192-bit files. Analysis of these PSDs can be found in Section 5.4.


Figure 6-53: 90\% ones - All four Best AddCW sets. Compare with Figure 5-16

Note that in all of the above PSD plots, the SA is set to 10 dB per division, while the comparable plots of Section 5.6 are 5 dB per division. The following Figure 6-54 shows how one of the spectra would compare on the same scale. The agreement of the curves is clear.


Figure 6-54: Comparing the $\mathbf{9 0 \%}$ ones best AddCW set 1 AA on the $\mathbf{5 d B}$ /div scale


Figure 6-55: 10\% ones - All four Best AddCW sets. Compare with Figure 5-17


Figure 6-56: 50\% ones shows little difference between codes - Compare with Figure 5-18

Figure 6-56 once again demonstrates that comparing these codes with balanced source statistics does not portray the effectiveness of this coding technique.

### 6.21. $(15,8)$ Worst AddCW sets



Figure 6-57: 90\% ones - two of the Worst AddCW sets. Compare with Figure 5-19


Figure 6-58: 10\% ones - two of the Worst AddCW sets. Compare with Figure 5-20



Figure 6-59: Comparing the $\mathbf{9 0 \%}$ ones worst AddCW set 108 on the $\mathbf{5 d B} / \mathrm{div}$ scale

Again the majority of the PSD plots were taken on the 10 dB per division scale, while the comparable plots of Section 5.6 are 5 dB per division. Figure 6-59 compares the PSDs on the same scale in the same manner as Figure 6-54.

As in Section 5.6 the PSDs can be contrasted side by side. For example the PSD of one of the best AddCW sets plotted beside one of the worst AddCW sets when the probability of the source emitting a logic 1 is $90 \%$ is shown in Figure 6-60. While it is obvious that the best AddCW set has more power at high frequency, it is most evident when they are both plotted on the same scale as shown in Figure 6-61.


Figure 6-60: 90\% ones - Best AddCW and Worst AddCW set - Compare with Figure 5-21


Figure 6-61: 90\% ones - Best AddCW and Worst AddCW set - Compare with Figure 5-22

In Figure 6-61 it is clearly seen that not only does the best AddCW set have most of its power at higher frequencies, it also has a wider null indicating its ability to limit runlengths and generate a signal with less baseline wander.

### 6.22. Spectra of larger codes

Based on the AddCW recommendations in Section 4.15, the PSD of larger codes based on the $(31,26)$ Hamming code are now compared to those of Section 5.7. Recall that these spectra could not be calculated exactly using the Cariolaro and Tronca method. Instead they were simulated using the SimFPGA program. Comparing the spectra in Figure 6-62 with Figure 5-25 show how closely they match.


Figure 6-62: $10 \%$ ones - Two $(\mathbf{3 1}, 23)$ codes. Compare with Figure 5-25

### 6.23. Summary

This chapter outlined and explained the methods used to create a hardware proof of concept system for the multimode coding technique introduced in Chapter 3. The communication system was implemented on two FPGA boards, one as a transmitter and the other as the receiver. There were two modes of operation. The first was used to transmit a file once on the channel to be decoded by the receiver. The second mode was used to transmit the file repeatedly on the channel in order to allow for measurement of the PSD. The PSD was measured in order to verify that the calculations presented in Chapter 5 were correct and that the coding technique operates as designed.

## 7. Bit Error Rate Performance

In this chapter the bit error rate (BER) performance of the multimode coding technique is investigated. This is achieved through simulations on a high-pass additive white Gaussian noise (AWGN) channel that models ac-coupled receivers.

### 7.1. Channel model

Recall from Section 2.19 that the serial bit stream requires periodic maintenance as it travels on the channel in order to combat the accumulation of noise and signal distortion. This requires that the signal be received and regenerated for further transmission. Receivers that derive their clock from the incoming bit stream require numerous transitions for adequate clock recovery. If there is a long series of 1 s or 0 s in a row the receiver's oscillator frequency may drift and lose synchronization, consequently losing track of where it is supposed to sample the transmitted data.

Recall that in the frequency domain these strings of like valued bits can be considered low frequency content as shown in Figure 5-2. Furthermore a bit stream with many transitions is considered to contain high frequency content as shown in Figure 5-3. Therefore receivers that derive their clock from the incoming bit stream can be summarized in the frequency domain as requiring a signal to have a majority of high frequency content and little low frequency content.

It is also very common for receivers to be AC coupled since they are easier to design and are capable of better performance [15,16]. However this results in the frequency response of the receiver being limited by the $R C$ time constant of the dc blocking capacitor and input impedance of the receiver [15]. These types of receivers not only block the average DC value, but they also tend to integrate the detected signal giving rise to baseline wander [16].

Therefore the effects and demands of this type of receiver are simplified by considering it as a distinct type of channel referred to as a dc-constrained channel, i.e. a
channel that heavily attenuates the low frequency and dc power of a signal [17]. As a result one simple model of a dc-constrained channel is that of a first-order high pass filter (HPF) concatenated with an AWGN source [17] as shown in Figure 7-1. This model will be referred to as the HPF channel model.


Figure 7-1: dc-constrained channel - HPF concatenated with AWGN

Using this channel model it will be illustrated that the time domain waveform of the transmitted bit stream will decay exponentially towards zero during each symbol interval, where the rate of decay will be dependent on the value of the $R C$ time constant $\tau$. For example, letting $V_{s, k}$ and $V_{e, k}$ be the voltages at the start and the end of the $k^{\text {th }}$ bit interval $(k \geq 0)$, the voltage during the $k^{\text {th }}$ interval is represented by Equation 7.1 [17]. The instantaneous voltage at the end of the interval is represented by Equation 7.2 [17], where $T$ denotes the duration of each bit interval.

$$
\begin{equation*}
V(t)=V_{s, k} e^{-(t-k T) / \tau} \tag{7.1}
\end{equation*}
$$

where $k T \leq t \leq(k+1) T$ and $\tau=R C / T$, and

$$
\begin{equation*}
V_{e, k}=V_{s, k} e^{-1 / \tau} \tag{7.2}
\end{equation*}
$$

At the start of the next interval the starting voltage $V_{s, k+1}$ takes on one of the values:

$$
V_{s, k+1}= \begin{cases}V_{e, k} & , \text { if logic values are the same }  \tag{7.3}\\ V_{e, k}+2 & , \text { if logic values change from }-1 \text { to } 1 \\ V_{e, k}-2 & , \text { if logic values change from } 1 \text { to }-1\end{cases}
$$

Using Equations 7.1 to 7.3 the effect of this channel model can be demonstrated using Matlab. For example, consider transmitting the bit sequence $1,0,0,1,1,1,0$ using bipolar coding with a square pulse shape. Figure 7-2 (a) shows the time domain waveform on a noiseless channel, while for reference Figure 7-2 (b) shows the midpoints of the symbols in order to see the individual bit locations.


Figure 7-2: (a) Binary sequence $[1,0,0,1,1,1,0]$ transmitted using bipolar coding. (b) Midpoint locations

Figure 7-3 on the other hand shows the binary sequence on the HPF channel with RC time constants of $\tau=10$ and $\tau=2$ respectively. In both cases it is clear that the time domain waveform decays exponentially towards zero due to the frequency response of the channel. That is, the more like valued bits in a row transmitted on this channel (i.e. the greater the low frequency content), the more severe the power loss is to the intelligible signal. Conversely the AWGN is practically unaffected on this channel since the distribution of white noise has zero mean. As a result it is clear that the output SNR of
the receiver will not be constant over a symbol period. The loss of the low frequency portion of the signal power causes a fluctuating SNR that degrades the BER.


Figure 7-3: Showing the effect of the HPF channel with $R C$ time constants 10 and 2, and how the SNR is not constant over a bit period

While Figure 7-3 demonstrates how the output SNR is not constant per symbol interval, a greater problem occurs if there are not adequate transitions in the output sequence. If for example there was a long run of logic 0 s , the signal would decay to practically 0 V as shown in Figure 7-4. In this case no amount of equalization can recover the time domain waveform since in this extreme situation the receiver is working with practically zero signal power.


Figure 7-4: HPF channels effect of a long run of logic 0 s with an $R C$ constant of 10 and 2 . It can be seen that the voltage is decaying to 0 Volts

Thus the motivations to create a coding technique that can combat the effects of this type of channel are clear. For improved BER performance on this channel the transmitted sequence not only needs to be balanced, but it also must contain numerous transitions and have limits on the length of consecutive like valued bits.

### 7.2. Simulation

In order to test the multimode coding technique on this HPF channel model a simulation called HPF__BER_Sim.exe was written; the user interface for this program is shown in Figure 7-5. This program allows the user to select the linear block code, selection criteria and AddCWs to use, similar to the SimFPGA program introduced in Section 4.4. The source statistics can be controlled using the Source Probability of a 0 and the $R C$ time constant of the channel can be controlled using the RC time constant Tau. The simulation runs at each SNR until the set number of decoding errors has been encountered. Finally, the type of filtering in the receiver can be chosen as either a Rectangular Filter or Matched Filter. Note that the channel can also be specified as Not a HPF channel, at which time the receiver uses a Matched Filter.


Figure 7-5: HPF channel model simulation

An example is shown in Figure 7-6. Here the simulation was configured to run the $(31,23)$ code using eight AddCWs. The source statistics were balanced and the $R C$ time constant $\tau$ was set to 30 .


Figure 7-6: Looking for 1000 errors at each SNR from 5.5 dB to 10 dB with $\boldsymbol{\tau}=\mathbf{3 0}$

### 7.3. BPSK Simulation results

The channel was first tested without coding using bipolar signaling (BPSK) with matched filtering as a benchmark. The $R C$ time constant chosen for all channel simulations was $\tau=30$. This value was selected in order to test the performance of the multimode coding technique on a channel that was highly sensitive to baseline wander.


Figure 7-7: BPSK on an AWGN channel versus the HPF channel with AWGN

Figure 7-7 demonstrates the effect of the channel on a BPSK signal that does not use any form of EC or CS coding. For comparison a regular BER curve is also plotted for a BPSK signal on a typical AWGN channel (i.e. not the HPF channel).

The arrow in Figure 7-7 clearly shows how the BER of the BPSK signal on the HPF channel is worse than the BER on a standard AWGN channel. However the lower two curves only show the BER when the source emits logic 0 s and 1 s with equal probability. If the instantaneous source statistics become unbalanced to the point where logic 0s are transmitted $90 \%$ of the time, the BER curve almost flattens out. Thus the BER performance is highly dependent on the source statistics. Figure 7-8 shows the approximate range of BER values that a BPSK signal would experience on this channel with different statistics of the source.


Figure 7-8: Range of BER values that occurs with changing source statistics

### 7.4. Effect of error control codes

It is of interest to determine if error control coding can alleviate the BER degradation on this channel. Figure 7-9 demonstrates the limited effectiveness of the simple Hamming codes used so far in this thesis. In the case of an unbalanced source only marginal improvement is observed with diminishing returns as larger error control codes are used.


Figure 7-9: Effect of EC codes on this channel is minimal

### 7.5. Simple multimode code

As discussed in Section 3.1 and 3.2, a primary goal of this multimode coding technique is to guarantee balanced transmission regardless of the source statistics. It was shown that the use of a single extra bit of redundancy for CS coding is sufficient for this to be achieved. As confirmation Figure 7-10 shows the BER improvement using this technique with the $(15,11)$ code expurgated to $(15,10)$.


Figure 7-10: Comparison of performance of $(\mathbf{1 5 , 1 1})$ and $(\mathbf{1 5 , 1 0})$ codes with balanced source

This figure demonstrates that both codes have the ability to correct single bit errors per CW, and at low SNRs the $(15,11)$ code actually performs slightly better than the $(15,10)$ code. This is because with the extra bit of redundancy the overall per bit energy versus noise spectral density is slightly lower for the $(15,10)$ code than the $(15,11)$ code. At low SNRs the coding gain of the $(15,10)$ code is not realized as the signal energy is simply too low. However around an SNR of 8.5 dB a crossover occurs. It is at this point that the ability of the $(15,10)$ code to guarantee balanced transmission results in improvement of the BER over that of the $(15,11)$ code, and it can be seen that the BER curves are actually diverging. This is a significant improvement considering only a single extra bit of redundancy was used. It is not dramatic however since analysis in Chapter 4 showed that the $(15,10)$ code has minimal ability to limit runlengths. As shown in Figure 7-11 maximum runlengths (MR1 and MR0) of 22 like valued bits in a row are still possible. Furthermore, the average runlength (aMR1 and aMR0) with balanced source statistics is approximately 3.6 like valued bits in a row. With unbalanced source statistics however the maximum runlengths (MR1 and MR0) are actually less at 13, but the average run (MR1 and MR0) has increased to approximately 5 like valued bits in a row. These results are still better than the $(15,11)$ code which can have runlengths that are completely unbounded.


Figure 7-11: SimFPGA results for the $(\mathbf{1 5 , 1 0})$ code showing runlength probabilities

Figure 7-12 compares the range of BER values the $(15,10)$ code can assume compared with BPSK. Clearly the range of results for the $(15,10)$ code is less than the corresponding range of the BPSK signal. As well, when the SNR is greater than 9 dB , the performance of the $(15,10)$ code with unbalanced source statistics $(90 \%$ chance of a logic 0 ) is actually better than the performance of the BPSK signal with balanced source statistics ( $50 \%$ chance of a 0 ). It can be concluded that the multimode coding technique is removing the BER dependency on the source statistics.


Figure 7-12: Range of BER results for the $(\mathbf{1 5 , 1 0})$ code

### 7.6. Effect of 8 AddCWs with the $(15,8)$ code

In Chapter 4 and Chapter 5 the $(15,11)$ code expurgated to a $(15,8)$ code with eight AddCWs was analyzed in great detail. Using the $(15,8)$ code with any one of the four best AddCW sets results in the BER curve shown in Figure 7-13. With balanced source statistics the first test demonstrates the ability of this code to further improve the BER curve on the HPF channel beyond the performance of the $(15,10)$ code at high values of SNR. This ability is a result of the fact that the $(15,8)$ code is not only balanced, but that the worst case maximum runlength is only 7 , versus 22 with the $(15,10)$ code.


Figure 7-13: BER performance of the $\mathbf{( 1 5 , 8 )}$ code

The benefits of this coding technique are appreciated more when compared to BPSK signaling on a regular AWGN channel. In Figure 7-14 it can be seen that the BER curve of the $(15,8)$ code is approaching the BER curve of the original BPSK signal on a regular AWGN channel.


Figure 7-14: The $(\mathbf{1 5 , 8})$ code on the HPF channel compared to BPSK on an AWGN channel

Finally, when using the $(15,8)$ code with an unbalanced source, simulation results show that the BER curve remains fixed. That is, the BER curve for the $50 \%$ zeros and
$90 \%$ zeros are identical. This is shown in Figure $7-15$ which demonstrates that the source statistics no longer have any effect of the BER curve performance as a result of the CS coding features of this multimode coding technique.


Figure 7-15: Performance of the ( $\mathbf{1 5 , 8}$ ) code does not change with varying source statistics

### 7.7. Performance of larger codes

It was suggested in Chapter 3 that the additional redundancy of the multimode coding technique can be minimized by using larger codes. Based on Section 7.5 and the results of the $(15,10)$ code, the performance of the $(31,26)$ code expurgated by 1 -bit to a $(31,25)$ code with two AddCWs is now investigated.

Figure 7-16 shows the range of values the BER curve can assume for the $(31,25)$ code. At first glance it may be surprising to see that the range of BER values is far greater than the corresponding $(15,10)$ code considered in Section 7.5. This can be understood by looking at the SimFPGA results. As shown in Figure $7-17$ when the source statistics are balanced the maximum runlength (MR1 and MR0) of the $(31,25)$ code is approximately 20 , which is 2 less than the $(15,10)$ code. As well, while the average runlength (aMR1 and aMR0) of approximately 4.5 is slightly higher than that of the $(15,10)$ code, the redundancy in the $(31,25)$ code is considerably less. Thus with a balanced source the $(31,25)$ code performs slightly better than the $(15,10)$ code.


Figure 7-16: BER performance of the $(\mathbf{3 1 , 2 5})$ code

Conversely when the source statistics become unbalanced, the $(31,25)$ code generated maximum runlengths (MR1 and MR0) of approximately 25 like valued bits in a row, versus only 13 for the $(15,10)$ code. In addition the average runlength (aMR1 and a MR0) increases to approximately 8.7 , versus only 5 for the $(15,10)$ code. Hence the BER curve for the $(31,25)$ code with unbalanced source statistics is significantly worse than the curve for the corresponding $(15,10)$ code, and hence the range of BER values for the $(31,25)$ code is much greater as shown in Figure 7-18.


Figure 7-17: SimFPGA results for the $(\mathbf{3 1 , 2 5})$ code showing runlength probabilities


Figure 7-18: Range of BER values for the $(\mathbf{3 1 , 2 5})$ code versus the $(\mathbf{1 5 , 1 0})$ code

### 7.8. Effect of 8 AddCWs with the $(31,23)$ code

Using the AddCW recommendations of Section 4.15, the $(31,26)$ code expurgated to $(31,23)$ with eight AddCWs is now investigated. In general it can be seen from Figure $7-19$ that this code performs better than the $(15,8)$ code. This is due to the fact that the $(31,23)$ code has less redundancy than the $(15,8)$ code, while still providing EC and CS abilities. Figure 7-19 also demonstrates that like the $(15,8)$ code, the performance of the $(31,23)$ code is not affected by the source statistics.


Figure 7-19: BER performance of the $(\mathbf{3 1 , 2 3})$ code is better than that of the $(\mathbf{1 5 , 8})$ code

Finally it can be seen in Figure 7-20 that the BER performance of the $(31,23)$ code is actually better than BPSK signaling on an AWGN channel only. This clearly demonstrates how with larger codes the drawbacks associated with redundancy in this multimode coding technique can be minimized. With larger codes the efficiency increases and consequently the BER penalty incurred decreases.


Figure 7-20: BER performance of the $\mathbf{( 3 1 , 2 3 )}$ code on the dc-constrained channel is better than the BER performance of BPSK on an AWGN channel

### 7.9. Conclusions

This chapter has investigated the BER performance of the multimode coding technique on a dc-constrained AWGN channel. On this type of channel the source statistics play a significant role in BER performance. As the source statistics change, so too does the BER experienced in the system.

It was shown that on this type of channel EC coding techniques alone do little to overcome this dependency on source statistics. However by using the multimode coding technique introduced in Chapter 3, the BER performance of the system can be improved. With a single extra bit of redundancy it was shown that the range of BER values experienced can be restricted and improved versus a system that does not use this coding.

In addition, it was shown that once the system incorporates three additional bits of redundancy, the BER performance can be improved and completely lose its dependency on the source statistics. Furthermore, the redundancy required to obtain this CS coding effect can be minimized by using larger block codes.

## 8. Conclusion

This thesis has introduced a new combined EC and CS code based on linear block codes. This chapter summarizes the development of this coding technique and offers suggestions for future work.

### 8.1. Thesis summary

Following the introduction and discussion of the concepts of EC and CS coding presented in Chapter 1 and 2, Chapter 3 introduced a novel combined EC and CS code which is based on multimode coding. It was apparent that with a single extra bit of redundancy any linear block code that includes the all-zero and all-one codeword can be used to construct a balanced dc-free code since the CWs are effectively partitioned into two complementary sets, giving the encoder two CW choices per SW. It was also shown that this partitioning can be accomplished through linear code word addition. Furthermore this technique was extended by giving the transmitter four and eight CW choices per SW through the use of four and eight AddCWs respectively, thus it is called a multimode coding technique.

Chapter 4 reviewed the mathematics required for analyzing the encoder as a Markov chain and evaluating its performance in the time domain. This involved a computer search to find the best AddCW sets, and results were presented contrasting the performance between the best and worst AddCW sets. The chapter concluded with recommendations and applications of this technique to other systems.

Chapter 5 investigated the frequency domain characteristics of the multimode coding scheme, and the power spectral densities of various code configurations were evaluated and analyzed. Chapter 6 presented the hardware implementation that used two FPGAs and compared their measured time domain and frequency domain results to the results in Chapter 4 and 5 , demonstrating how they agreed with calculations.

Finally Chapter 7 looked at the bit error rate performance of this combined coding technique on a dc-constrained channel and clearly showed how its performance is superior to uncoded systems.

### 8.2. Future work

In this thesis thorough analysis was completed on the $(15,11)$ code expurgated to a $(15,8)$ code. It was shown that four sets of AddCWs had the best performance in terms of limiting runlengths. However no specific pattern emerged from these results. While most of the AddCWs were balanced and had numerous transitions, there was the odd exception that did not adhere to this rule. Therefore only general guidelines could be recommended for selecting AddCWs for use with larger codes where a computer search was impractical. While it was shown in Chapter 7 that these recommendations with the $(31,23)$ code produced excellent results, the optimum AddCW sets for this code and larger codes remain unknown. Further work could quantify the results from the analysis of the $(15,8)$ code and extend them to larger codes.

Finally, as proof of concept, the only linear block code considered in this thesis was the simple Hamming code. Future work could apply this multimode coding technique to other more powerful error control block codes.

## References

[1] R. Togneri, C. Desilva, Fundamentals of Information Theory and Coding Design, Chapman \& Hall/CRC, London, 2003.
[2] P. Sweeney, Error Control Coding: From Theory to Practice, John Wiley \& Sons Ltd, Chichester, 2002.
[3] T. Cover, J. Thomas, Elements of Information Theory, Wiley-Interscience, New York, 1991.
[4] B. Lathi, Modern Digital and Analog Communications Systems, Third Edition, Oxford University Press, New York, 1998.
[5] B. Sklar, Digital Communications: Fundamentals and Applications, Second Edition, Prentice Hall PTR, New York, 2001.
[6] I. J. Fair, Y. Xin, "A method of integrating error control and constrat. $1 e d$ sequence coding," Elec Letters, vol. 36, pp. 210-215, February 2003.
[7] F. Zhai, Y. Xin, I. J. Fair, "DC-Free Multimode EC Block Codes," Proceedings of the 2003 IEEE International Symposium on Information Theory, p. 76, Yokohama, Japan, June 29 - July 4, 2003.
[8] I. J. Fair, D. R. Bull, "DC-Free Error Control Coding through Guided Convolutional Coding," Proceedings of ISIT 2002, p. 297, Lausanne, Switzerland, June 30 - July 5, 2002.
[9] I. J. Fair, Y. Xin, "Constrained Sequences with Embedded Redundancy for Error Control," Elec Letters, vol. 36, pp 215-217, February 2000.
[10] T. Wadayama, A. J. Han Vinck, "DC-Free Binary Convolutional Coding," IEEE Trans. Infor. Theory, vol 48, no. 1, pp. 162-173, January 2002.
[11] R. H. Deng, M. A. Herro, "DC-Free Coset Codes," IEEE Trans. Infor. Theory, vol. 34, pp. 786-792, July 1988.
[12] CE Shannon. "A mathematical theory of communication," Bell System Technical Journal, Journal 27 pp. 379-423, July and October 1948.
[13] G. Clark Jr., J. Cain, Error-Correction Coding for Digital Communications, Springer Plenum US, 1981.
[14] I. J. Fair, W. D. Grover, W. A. Krzymien, R. I. MacDonald, "Guided Scrambling: a new line coding Technique for High Bit Rate Fiber Optic Transmission Systems," IEEE Trans. Commun., vol. 39, no. 2, pp. 289-297, February 1991.
[15] S. D. Personick, "Receiver design for optical fiber systems," Proc. IEEE, vol. 65, pp. 1670-1678, December 1977.
[16] T. Van Muoi, "Receiver design for High-Speed Optical-Fiber Systems,"Journal Lightwave Tech, vol. LT-2, no. 3, pp 243-267, June 1984.
[17] F. Zhai, Y. Xin, and I. J. Fair, "Performance Evaluation of DC-Free EC Block Codes," Proceedings of the 2004 Ninth International Conference on Communication Systems, pp. 451-455, Singapore, September 6-8, 2004.
[18] G. Byeong, C. Seok, Scrambling Techniques for Digital Transmission, SpringerVerlag Ltd, Great Britain, 1994.
[19] K. A. S. Immink, Codes for Mass Data Storage System, Shannon Foundation Publishers, The Netherlands, 1999.
[20] I. J. Fair, V. K. Bhargava, Q. Wang, "Evaluation of the power spectral density of Guided Scrambling Coded Sequences," IEE Proc.-Commun., vol. 144, no. 2, April 1997.
[21] S. Al-Bassam, B. Bose, "Design of Efficient Balanced Codes," IEEE Trans. Comput, vol. 43, no. 3, March 1994.
[22] T. Floyd, Digital Fundamentals, Eighth Edition, Prentice Hall, New York, 2002.
[23] A. Papoulis, S. Pillai, Probability, Random Variables and Stochastic Processes, Fourth Edition, McGraw-Hill Science, New York, 2001.
[24] G. L. Cariolaro, G. P. Tronca, "Spectra of Block Coded Digital Signals," IEEE Trans Commun, vol. Com-22, no. 10, pp 1555-1564, October 1974.

## Appendix A - Calculating Power Spectral Density

The energy spectral density (ESD) is a measure of the energy contributed by all spectral components of an energy signal. If a signal has infinite energy it is considered to be a power signal and its power spectral density (PSD) is a measure of the power contributed by all spectral components. Both the ESD and PSD are important quantities in evaluating the performance of a communication system. For example, in Figure A-1, modulation shifts the spectrum of a baseband signal to a higher frequency band. If the allocated BW for this system was 100 kHz , measuring the ESD or PSD on a spectrum analyzer would indicate whether or not the system is operating within this limit.


Figure A-1: PSD displays the contribution of power at each frequency

## A.1. Determining the Power Spectral Density

There are three ways to find spectral densities

1. Measure them with a Digital Oscilloscope or Spectrum Analyzer
2. Calculate them mathematically
3. Simulation

The first approach can be difficult since it involves actually building a working system. This may not be practical if you are only interested in the PSD of some new communication approach. The second approach can also be difficult simply due to the mathematics involved, and it becomes more challenging once coding is introduced. For example, what is the PSD of a Hamming code? The third approach is therefore often used. In this Appenelix, the second approach is considered in detail.


Figure A-2: Once coding is introdsced, calculating the PSD can be challenging

## A.2. How to calculate RMS for non-sinusoidal signals

Quite often measuring equipment such as oscilloscopes work in RMS when dealing with time varying voltages. Therefore to compare calculations with measured values it is convenient to also work in RMS. Thus it is useful to review this concept.

The RMS (root mean square) value of a signal is defined in general as the square root of the power of a time varying signal. It was originally used to determine the equivalent DC voltage required to deliver the same power as a sinusoidal signal.

Example 1-RMS: Consider a 1 Vpp sine wave at 1 MHz . This is a signal that varies from -0.5 V to +0.5 V and would be written as $y_{1}(t)=0.5 \sin (2 \pi f t)$. Find the power and RMS voltage with $t_{0}=1 / f$.


Figure A-3: Calculating the power and RMS of a sine wave

As shown in Figure A-3 the power is found to be 125 mW assuming the voltage is dissipated over 1 ohm. The square root of the power gives 353.55 mV RMS. Thus 353.55 mV is the equivalent DC voltage that would give the same power over 1 ohm , that is $(353.55 \mathrm{mV} \mathrm{DC})^{2}=125 \mathrm{~mW}$.

A simplified way to find the RMS voltage for sinusoidal waveforms is:

$$
\begin{equation*}
R M S=\frac{V p p}{2 \sqrt{2}} \quad R M S=\frac{V p e a k}{\sqrt{2}} \tag{A.1}
\end{equation*}
$$

Either way the result is $\frac{1 V p p}{2 \sqrt{2}}=353.55 \mathrm{mV} \quad \frac{0.5 V_{\text {peak }}}{\sqrt{2}}=353.55 \mathrm{mV}$


Figure A-4: Digital oscilloscope measurements

Oscilloscope screen shots shown above in Figure A-4 have a peak to peak value of 1.073 V and as a result the measured the RMS value is 381 mV . This matches very closely to 379.36 mV and $1.073 /(2 \sqrt{2})=379.36 \mathrm{mV}$.

## A.3. Comparing Fourier calculations to spectrum analyzer output

Converting periodic time domain signals to the frequency domain is done with Fourier Series representations. These are classified as the Trigonometric Fourier Series, the Compact Fourier Series and the Exponential Fourier Series.

## A.4. Trigonometric Fourier Series

$$
\begin{equation*}
f(t)=a_{0}+\sum_{n=1}^{\infty}\left(a_{n} \cos \left(n \omega_{o} t\right)+b_{n} \sin \left(n \omega_{o} t\right)\right) \tag{A.2}
\end{equation*}
$$

where $\omega_{o}=\frac{2 \pi}{T_{o}}, \quad T_{o}$ is the period

$$
\begin{array}{ll}
a_{0}=\frac{1}{T_{o}} \int_{0}^{T_{0}} f(t) d t & \\
a_{n}=\frac{2}{T_{o}} \int_{0}^{T_{o}} f(t) \cos \left(n \omega_{o} t\right) d t & n=1,2,3 . .  \tag{A.3}\\
b_{n}=\frac{2}{T_{o}} \int_{0}^{T_{o}} f(t) \sin \left(n \omega_{0} t\right) d t & n=1,2,3 . .
\end{array}
$$

## A.5. Compact Fourier Series

An equivalent form of the trigonometric Fourier series is to combine the sine and cosine terms into a single sinusoid using the identity

$$
\begin{align*}
& a_{n} \cos \left(n \omega_{o} t\right)+b_{n} \sin \left(n \omega_{o} t\right)=C_{n} \cos \left(n \omega_{o} t+\theta_{n}\right)  \tag{A.4}\\
C_{n}= & \sqrt{a_{n}^{2}+b_{n}^{2}} \\
\theta_{n}= & \tan ^{-1}\left(\frac{-b_{n}}{a_{n}}\right) \quad f(t)=C_{0}+\sum_{n=1}^{\infty} C_{n} \cos \left(n \omega_{o} t+\theta_{n}\right) \tag{A.5}
\end{align*}
$$

## A.6. Exponential Fourier Series

This is the most compact form of the Fourier series and it is based on the Euler identities since each sinusoid is expressed as the sum of two exponentials.

$$
\begin{gather*}
\sin (\theta)=\frac{e^{j \theta}-e^{-j \theta}}{2 j} \quad \text { and } \quad \cos (\theta)=\frac{e^{j \theta}+e^{-j \theta}}{2}  \tag{A.6}\\
f(t)=\sum_{-\infty}^{\infty} D_{n} e^{j n \omega_{o} t} \quad \text { where } \quad D_{n}=\frac{1}{T_{o}} \int_{T_{o}} f(t) e^{-j n \omega_{o} t} d t \tag{A.7}
\end{gather*}
$$

The important thing to note with an Exponential Fourier Series is that it is a twosided series. Thus there are negative and positive frequency components that range from $-\infty$ to $+\infty$. This is in contrast to the Compact and Trigonometric Fourier Series that are one-sided series that only have positive frequency components in the range 0 to $+\infty$. Note that these two forms are related by Equation A. 8 for real valued signals.

$$
\begin{equation*}
C_{n}=2 D_{n} \quad n=1,2,3 \ldots \tag{A.8}
\end{equation*}
$$

Example 2 - Compact Fourier Series: Calculate the Fourier series of a $100 \mathrm{kHz}, 2.9 \mathrm{~V}$, $50 \%$ duty cycle unipolar square wave.


Figure A-5: 50\% duty cycle square wave on scope


Figure A-6: Calculating the average and RMS

Figures A-5 and A-6 show that the calculated average and RMS values are practically identical to the measured values. Calculating the Trigonometric Fourier Series by Equation A. 3 with $f_{0}=100 \mathrm{kHz}$ and $t_{0}=1 / f_{0}=10 \mu \mathrm{~s}$ is shown below in Figures A-7 and A-8.


Figure A-7: Calculating $a_{0}$ and $a_{n}$


Figure A-8: Calculating $b_{n}$

Figure A-7 shows that $a_{0}=1.422 \mathrm{~V}$ which is identical to the DC average shown in Figure A-6. As well, the $a_{n}$ and $b_{n}$ values are calculated for $n=1,2,3, \ldots$ which are used to calculate the Compact Fourier Series using Equation A.5.


Figure A-9: Shows the $a_{n}$ and $b_{n}$ values in a list, and the conversion to find the $C_{n}$ values


Figure A-10: $\mathrm{C}_{\mathrm{n}}$ values in a list with frequency values and $\boldsymbol{n}$ index below.


Figure A-11: $\boldsymbol{\theta}_{\mathrm{n}}$ values shown in list with frequency values and $\boldsymbol{n}$ index below

Figures A-10 and A-11 above show the Compact Fourier Series representation of the signal. These magnitude and phase values indicate exactly how to recreate $f(t)$. That is you would need a DC level of 1.422 V , plus a 1.85 V peak cosine wave of frequency 100 kHz with initial phase -1.57 rad , plus a 0.616 V peak cosine wave of frequency 300 kHz with initial phase -1.57 plus and so on. Clearly these are peak values which is important when converting to RMS.

## A.7. Comparing to a Spectrum Analyzer and Digital Oscilloscope

In order to compare measured results to calculations, two things must be taken into account. First, most spectrum analyzers (SA) and digital oscilloscopes (DO) only show one-sided spectra, and second, SAs and DOs often work in RMS, and display their results in decibels (dB). Specifically the DOs considered here work in $\mathbf{d B V} V_{R M S}$, which is dB per $1 \mathrm{~V}_{\mathrm{RMS}}$ normalized to 1 ohm , and SAs work in $\mathbf{d B m}$, which is dB per 1 mW normalized to 50 ohms.

As a result the Compact Fourier Series is a natural choice since the $C_{n}$ values are already one sided. All that is required for comparison is to convert them to dB . This can easily be done by dividing them by $\sqrt{2}$ first (Equation A-1) since they are Vpeak, and then converting them to the appropriate dB scale as shown in Equations A-9 and A-10.

$$
\begin{align*}
& \mathrm{dBV}_{R M S}=10 \log \left(\frac{\left(\frac{c_{n}}{\sqrt{2}}\right)^{2}}{1}\right)  \tag{A.9}\\
& \mathrm{dBm}=10 \log \left(\frac{\left(\frac{c_{n}}{\sqrt{2}}\right)^{2}}{50^{*} .001}\right) \tag{A.10}
\end{align*}
$$

For example Figure A-12 shows the square wave from Example 2 in the frequency domain. Figure A-13 shows the $\mathrm{dBV}_{\text {RMS }}$ values calculated from the $C_{n}$ values, and Table A-1 shows how these calculated values match up with the measured values from the DO .


Figure A-12: DO showing the frequency domain PSD of this square wave


Figure A-13: Calculated $\mathrm{dBV}_{\text {RMS }}$ values of the square wave

Table A-1: Showing how the measured and calculated values match up

| Harmonic | Frequency (kHz) | DBV |  |
| :---: | :---: | :---: | :---: |
| Fundamental measured | Calculated |  |  |
| 2 | 100 | 2.5 | 2.31 |
| 3 |  |  | -7.22 |
| 4 | 300 | -7.19 | -11.66 |
| 5 | 500 | -11.56 |  |
|  |  |  | -14.58 |
| 7 | 700 | -14.44 | 660 |
| 9 | 900 | -16.56 | -16.76 |

Clearly this approach is very accurate.

Similarly Figure A-14 shows the same output on the SA. Figure A-15 shows the dBm values being calculated from the $C_{n}$ values, and Table A-2 shows how these calculated values match up with the measured values.


Figure A-14: SA showing the frequency domain PSD of this square wave


Figure A-15: Calculated dBm values of the square wave

Table A-2: Showing how the measured and calculated values match up

| Harmonic | Frequency (kHz) | dBm measured | Calculated |
| :---: | :---: | :---: | :---: |
| Fundamental | 100 | 15.42 | 15.32 |
| 3 |  |  |  |
| 3 | 300 | 6.11 | 5.78 |
| 5 | 500 |  |  |
| 6 | 700 | 1.65 | 1.34 |
| 7 |  | -1.37 | -1.57 |
| 9 | 900 | -3.45 | -3.75 |

Clearly this approach is very accurate.

Example 3 - Repeat Example 2 with a square wave that does not have a $50 \%$ duty cycle:


Figure A-16: Square wave with a $6.25 \%$ duty cycle

Figure A-16 shows a square wave with one 10us pulse every 160 us. This can also be thought of as a 100 kHz square wave with a $6.25 \%$ duty cycle. Looking at Figure A-16 shows that there is a slight DC offset of approximately 94 mV , so the 2.65 V peak to peak signal swings from 94 mV to 2.75 V .


Figure A-17: Calculating Average and RMS values for the 6.25\% duty cycle square wave

Figure A-17 shows how to calculate the average. The signal has 94 mV for $15 / 16^{\text {th }}$ of the period, and 2.75 V for $1 / 16^{\text {th }}$ of the period for an average of 260 mV , or 271.2 mV measured. As well, the same approach can be used for the RMS voltage where 693.5 mV is calculated and 692.1 mV is measured. Note that $15 / 16^{\text {th }}$ of the period can also be considered as 15 out of 16 bits and $1 / 16^{\text {th }}$ of the period can be considered 1 out of 16 bits.

As shown in Figure A-18 the measured spectra appear more dense and more like the spectrum that corresponds to a single square wave. This result is pleasing since as the
duty cycle decreases the pulses become farther apart and the time domain signal itself begins to appear more and more like a single square wave. Therefore the Fourier Transform could almost be used to calculated the spectrum (or "frequency content of the signal") instead of the Fourier Series.


Figure A-18: $6.25 \%$ duty cycle square wave in the frequency domain on the DO and SA

Table A-3: Measured vs calculated for dBm and $\mathrm{dBV} V_{\text {RMS }}$ for $6.25 \%$ duty cycle square wave

| Harmonic | Frequency <br> (kHz) | dBm <br> measured | dBm calculated | $\begin{aligned} & \text { DBV }_{\text {RMS }} \\ & \text { measured } \end{aligned}$ | DBV ${ }_{\text {RMS }}$ calculated |
| :---: | :---: | :---: | :---: | :---: | :---: |
| 1 | 6.25 | -1.5 | 0.347 | -12.81 | -12.66 |
| 2 | 12.50 | -0.1 | 0.178 | -12.81 | -12.83 |
| 3 | 18.75 | -0.11 | -0.105 | -13.13 | -13.11 |
| 4 | 25.00 | -0.373 | -0.508 | -13.44 | -13.51 |
| 5 | 31.25 | -0.857 | -1.039 | -13.75 | -14.05 |
| 6 | 37.50 | -1.52 | -1.708 | -14.38 | -14.71 |
| 7 | 43.75 | -2.38 | -2.527 | -15.31 | -15.53 |
| 8 | 50.00 | -3.44 | -3.519 | -16.25 | -16.52 |
| 9 | 56.25 | -4.67 | -4.710 | -17.5 | -17.72 |
| 10 | 62.50 | -6.16 | -6.14 | -19.06 | -19.15 |
| 11 | 68.75 | -7.82 | -7.88 | -20.94 | -20.89 |
| 12 | 75.00 | -9.91 | -10.05 | -22.81 | -23.06 |
| 13 | 81.25 | -12.64 | -12.84 | -25.62 | -25.85 |
| 14 | 87.50 | -16.47 | -16.72 | -29.69 | -29.73 |
| 15 | 93.75 | -22.76 | -23.17 | -35.94 | -36.18 |

## A.8. Random Binary Signals

Examples so far have considered only the cases of repeating periodic signals. A different approach is needed when these signals are random. Consider a random binary waveform that takes on two different values $+A$ and $-A$, with transitions that occur only at integer multiples of the symbol period $T_{b}$ with symbol values that are equally likely. This can be considered a discrete time random process with the probability density function (PDF) shown in Figure A-19.


Figure A-19: PDF for the discrete time random process

Since this is now a random process (random signal) there is no easy way to do Fourier analysis. Therefore how does one find the PSD of a signal that is not deterministic? Clearly this signal can take on an infinite number of sequences. The PSD would be known if the duty cycle was always $50 \%$ or $6.25 \%$ like in Example 1 and 2, however now the signal can have any duty cycle at any given time. Since each sequence is different and not periodic the best we can realistically do is to evaluate this signal in terms of some average statistics of bit sequences. In order to do this we need the concept of a random process.

A random process (also known as a stochastic process) is a mapping of an experimental outcome to a function of time. Each waveform associated with an experiment is called a sample function. The collection of all sample functions is called the ensemble of the random process. In general there are two approaches to evaluating average statistics of random processes:

1. Time average: Examine one sample function from the ensemble (collection of sequences) over an extended period of time, i.e. examine a typical sequence
2. Ensemble average: Take averages over all possible different sample functions. This is usually the more comprehensive approach since we don't know if any one sample function is representative of the entire ensemble.

A process whose ensemble statistics don't change with time is a stationary process. They are usually called strict sense stationary (SSS) processes if all statistics do not change with time. i.e. PDF's and CDF's do not change with time. They are wide sense stationary (or weak sense stationary) (WSS) if at least the first and second order statistics do not change with time, i.e. the mean and variance. Processes in which all time averages of all sample functions are equal to the ensemble averages are called strict sense ergodic processes. If at least the first and second order time averages are equal to the first and second order ensemble averages then the process is called wide sense ergodic (or weak sense ergodic).

Note that in reality there is no such thing as a stationary process. However many processes can be considered stationary for the time interval of interest and this stationary assumption allows for a manageable mathematical model.

Some common statistics about random processes that are often useful

- $E[x]$ - Expected value or average. Indicates the DC level of the signal
- $E\left[x^{2}\right]$ - Expected square value - Total power $(\mathrm{AC}+\mathrm{DC})$ of the signal
- $\sigma^{2}=E\left[x^{2}\right]-(E[x])^{2}-$ Variance - Total AC power (i.e. subtracting DC power)
- $\sigma=$ Standard Deviation - RMS value of the AC components

NOTE: When processes are zero mean (such as noise) the variance is the total power and the standard deviation is the total RMS value.

In general only two statistics are usually considered which are the mean and the autocorrelation of the random process. The autocorrelation of a random process is a measure of how similar the process is to itself as time increases. That is, correlating the function with itself as a function of time separation gives an indication of how quickly one can expect the random process to change. This will be demonstrated by example.

Example 4-A random process: Consider the transmission of a signal from a TX to an RX where the frequency is given as $f \mathrm{~Hz}$ and the attenuation is minimal. The received phase can be any value from 0 to $2 \pi$. This is shown in Equation A.11:

$$
\begin{equation*}
X(t)=A \cos (w t+\Theta) \tag{A.11}
\end{equation*}
$$

where $A$ and $\omega$ are constants and the phase $\Theta$ is a uniform random variable over $[-\pi, \pi]$. Therefore the PDF is shown in Figure A-20.


$$
f_{\odot}(\theta)=\left\{\begin{array}{cc}
\frac{1}{2 \pi} & -\pi \leq \theta \leq \pi \\
0 & \text { otherwise }
\end{array}\right.
$$

Figure A-20: PDF of the random process for Example 4

As shown in Equations A. 12 and A. 13 respectively, the first step is to find the time averaged mean and ensemble mean.

## Time averaged mean:

$$
\begin{align*}
& \bar{x}=\langle x(t)\rangle=\lim _{T \rightarrow \infty} \frac{1}{T} \int_{-T / 2}^{T / 2} A \cos (\omega t+\theta) d t \\
& \bar{x}=\frac{A}{T_{0}} \int_{-T_{0} / 2}^{T_{0} / 2} \cos (\omega t+\theta) d t=0 \tag{A.12}
\end{align*}
$$

## Ensemble mean:

$$
\begin{align*}
& \mu_{x}(t)=E[X(t)]=\int_{-\infty}^{\infty} A \cos (\omega t+\theta) f_{\theta}(\theta) d \theta  \tag{A.13}\\
& \mu_{x}(t)=\frac{A}{2 \pi} \int_{-\infty}^{\infty} \cos (\omega t+\theta) d \theta=0
\end{align*}
$$

By inspection the average value of any cosine wave regardless of phase will be 0 , i.e. no DC value. Thus even when considering the ensemble mean (all sample functions) there will be an equal number of sample functions above and below zero and thus the mean is still 0 . Therefore this process is Ergodic in the mean since $\langle x(t)\rangle=E[X(t)]$

The next step is to find the time averaged autocorrelation and ensemble autocorrelation as shown in Equations A. 14 and A. 15 respectively.

## Time averaged autocorrelation:

$$
\begin{align*}
& \left.\bar{R}_{x x}(\tau)=\langle\mathbf{r}(t) \mathrm{t}) \mathrm{r}(t+\tau)\right\rangle \\
& \bar{R}_{x \rightarrow \infty}(\tau)=\lim _{T \rightarrow-\infty} \frac{\int_{-T / 2}}{A^{2} \cos (\omega t+\theta) \cos (\omega(t+\tau)+\theta) d t} \\
& \bar{R}_{x x}(\tau)=\frac{A^{2}}{T_{n}} \int_{-}^{T_{0} / 2} \frac{1}{2}[\cos \omega t+\cos (2 \omega t+2 \theta+\omega \tau)] d t  \tag{A.14}\\
& K_{x x}(\tau) \\
& \int_{-T_{0} / 2}^{2}[\cos \omega t+\cos (2 \omega t+2 \theta+\omega \tau)] d t \\
& \bar{R}_{x x}(\tau)=\frac{A^{2}}{2} \cos \omega \tau
\end{align*}
$$

## Ensemble autocorrelation:

$$
\begin{align*}
& R_{x x}(t, t+\tau)=E[X(t) X(t+\tau)] \\
& R_{x x}(t, t+\tau)=\frac{A^{2}}{2 \pi} \int_{-\pi}^{\pi} \cos (\omega t+\theta) \cos (\omega(t+\tau)+\theta) d \theta \\
& R_{x x}(t, t+\tau)=\frac{A^{2}}{2 \pi} \int_{-\pi}^{\pi} \frac{1}{2} \cos \omega \tau+\cos (2 \omega t+2 \theta+\omega \tau) d \theta  \tag{A.15}\\
& R_{x x}(t, t+\tau)=\frac{A^{2}}{2} \cos \omega \tau
\end{align*}
$$

This process is Ergodic in the autocorrelation since $\langle x(t) x(t+\tau)\rangle=E[X(t) X(t+\tau)]$.

Furthermore since the mean is constant (does not depend on time) and the autocorrelation is a function of time separation $\tau$ only (does not depend on time) then we can conclude that this process is wide sense stationary (WSS).

Example 5 - Random Binary Signal. Recall from Section 2.19 the random binary waveform where 0 is mapped to $-A$ and 1 is mapped to $+A$. Transitions occur only at integer multiples of the symbol period $T_{b}$ and the symbol values are equally likely. If we let $A=1$ then by inspection we can see that if $50 \%$ of the time there are -1 s , and $50 \%$ of the time there are +1 s , then the mean value (DC value, expected value) of this random process should be 0 . This is found to be true as shown in Equation A.16.

$$
\begin{equation*}
E[x]=\sum_{i=0}^{1} x f_{x}(x)=-1\left(\frac{1}{2}\right)+1\left(\frac{1}{2}\right)=0 \tag{A.16}
\end{equation*}
$$

Note that Equation A. 16 finds the ensemble average and not the time average. Time averages are not as useful as ensemble averages in this case since a single sample function will not be indicative of all sample functions. Thus only ensemble averages will be considered from this point on.

The autocorrelation of a random binary waveform is not as straightforward as the previous example since it is known as a cyclostationary random process. This means that by the above definitions this random binary process is not strictly stationary, or even wide sense stationary. However it turns out that the statistics (mean, autocorrelation etc) are periodic. Therefore an average over mean and an average autocorrelation can be evaluated to obtain a stationary result. Hence the term cyclostationary (cyclically stationary).

To analyze random binary processes it is useful to interpret discrete time signals as trains of delta functions as shown in Figure A-21 and Equation A.17. This maps a discrete time random process to a continuous time random process.


Figure A-21: Discrete time random process mapped to continuous time with delta functions

$$
\begin{equation*}
x(t)=\sum_{k=-\infty}^{\infty} a_{k} \delta\left(t-k T_{b}\right) \tag{A.17}
\end{equation*}
$$

This works well since we can consider the final waveform to be the convolution of the train of delta functions with a pulse shape $p(t)$ as shown in Figure A-22 and Equation A-18.


Figure A-22: Waveform is a convolution of the pulse shape and train of delta functions

$$
\begin{equation*}
x(t)=p(t) * \sum_{k=-\infty}^{\infty} a_{k} \delta\left(t-k T_{b}\right) \tag{A.18}
\end{equation*}
$$

Therefore the average autocorrelation function for cyclostationary random processes as a function of time separation $\tau$ is defined as

$$
\begin{equation*}
R_{x x, a v g}=p(\tau)^{*} p(-\tau) *\left(\frac{1}{T} \sum_{k=-\infty}^{\infty} E\left[x_{n} x_{n+k}\right] \delta\left(\tau-k T_{b}\right)\right) \tag{A.19}
\end{equation*}
$$

Equation A. 19 is indicating that to evaluate the autocorrelation of this cyclostationary process, first convolve the pulse shape with itself, then convolve the result with the autocorrelation of the source symbols averaged over all time. An example of a square pulse shape being convolved with itself is shown in Figure A-23.


Figure A-23: Square pulse shape being convolved with itself

Equation A. 20 shows the autocorrelation of the source symbols.

$$
E\left[x_{n} x_{n+k}\right]= \begin{cases}k=0 & E\left[x_{n}^{2}\right]=A^{2}  \tag{A.20}\\ k \neq 0 & E\left[x_{n} x_{n+k}\right]=E\left[x_{n}\right] E\left[x_{n+k}\right]=0\end{cases}
$$

Finally the autocorrelation by Equation A. 19 is shown graphically in Figure A-24.


Figure A-24: Graphical depiction of Equation A. 19

The Fourier Transform of the autocorrelation function of a random process is the power spectral density.


Figure A-25: Fourier transform of the autocorrelation function is the PSD


Figure A-26: Matlab script showing PSD that matches with Example 3

Figure A-25 shows the process to find the PSD from the autocorrelation function. Figure A-26 shows the Matlab script that performs this calculation and plots the PSD. Comparing the shape of this PSD to that found in Example 3 shows their similarity. The difference is that this PSD is continuous since this random process contains every possible duty cycle square wave. Thus Example 5 demonstrated how to find the spectrum of a random binary waveform.

## A.9. Finding the PSD of a Coded Binary Signal

This challenging topic was first covered by Cariolaro and Tronca. Their approach is to first model the communication system as a finite state machine, then analyze it using the theory of Markov Chains.

## A.10. Finite State Machines

Any system which operates at discrete instants of time and takes on a finite number of configurations can be represented as a Finite State Machine (FSM). A general description of a FSM is a system that operates on a finite number of inputs and assumes a specific internal state based on those inputs. The output of the FSM can be determined in two ways.

1. Moore machine: The output is a function of the internal state only.
2. Mealy machine: The output is a function of the internal state and the present input.

A general Mealy FSM is shown in Figure A-27. In this model let $S_{n}, C_{n}$ and $L_{n}$ represent the input, output and internal state values of the system respectively. The system then operates in time intervals $T$, with a new state being generated every $t=n T$.

Therefore using this FSM to model a coded system, $S_{n}$ would be the input $S W s, C_{n}$ would be the output CWs, and $L_{n}$ would represent the internal state of the encoder.


Figure A-27: Mealy Finite State Machine (FSM)

$$
\begin{gather*}
C_{n}=f\left[L_{n}, S_{n}\right]  \tag{A.21}\\
L_{n+1}=g\left[L_{n}, S_{n}\right]
\end{gather*}
$$

A FSM can also be represented graphically by a state diagram which shows the progression of states through which the system operates and the resulting outputs based on specific inputs.

Example 6 - FSM encoder: Below is a typical $1 / 2$ rate convolutional encoder with constraint length 2 . The $1 / 2$ comes from the fact that 1 source bit (SW) is encoded to 2 coded bits (CW).


Figure A-28: Rate ${ }^{1 / 2}$ convolutional encoder

For example, consider the encoded output corresponding to the following input sequence

10100

Details of the operation of the circuit are:

| Clk | Regs | $\mathbf{i} / \mathbf{p}$ | $\mathbf{o} / \mathbf{p}$ |
| :---: | :---: | :---: | :---: |
|  | 00 |  |  |
| 1 | 01 | 1 | 11 |
| 2 | 10 | 0 | 10 |
| 3 | 01 | 1 | 00 |
| 4 | 10 | 0 | 10 |
| 5 | 00 | 0 | 11 |

and the output sequence would be

## 1110001011

Modeling this convolutional encoder as a FSM allows creation of the state diagram as shown in Figure A-29. This is done by considering all possible states with the corresponding inputs and outputs.


Figure A-29: FSM of the convolutional encoder of Example 6

Clearly the FSM model of a communications system is useful since once the FSM model is complete, the behavior of the system can be determined very easily. As well, Cariolaro and Tronca require modeling the coding system as a FSM and analyzing it as a Markov Chain in order to find the PSD.

## A.11. Markov Chains

A random process $X(t)$ is a Markov Process if the future value (or state) of the process is dependent only on its immediate present value (or state). In other words, the process is independent of the past, and its future value is dependent only on its present value. Equation A. 22 indicates that the probability of $X(t)$ assuming a new value given all the past values for all time is equal to the probability of $X(t)$ assuming a new value if only the immediate value $x_{k}$ is considered.

$$
\begin{equation*}
P\left[X\left(t_{k+1}\right)=x_{k+1} \mid X\left(t_{k}\right)=x_{k}, \ldots, X\left(t_{1}\right)=x_{1}\right]=P\left[X\left(t_{k+1}\right)=x_{k+1} \mid X\left(t_{k}\right)=x_{k}\right] \tag{A.22}
\end{equation*}
$$

An integer valued Markov Process is a discrete time random process that is called a Markov Chain. Here the random variable $X_{n}$ takes on a countable number of values at discrete moments in time, where $T$ is the interval between discrete time events. The value of $X_{n}$ at the discrete time $n$ is referred to as the state of the process at time $n$.

$$
\begin{equation*}
X_{n}=X(n T) \tag{A.23}
\end{equation*}
$$

In Markov chains the probability distribution functions (PDFs) that are conditioned on several time instants always reduce to one PDF that is conditioned only on the most recent time instant. This is known as the Markov property. For this reason the value of $X_{n}$ at the discrete time $n$ is referred to as the state of the process at time $n$.

## A.12. Markov Chain Properties

In general a system can be modeled by a Markov chain if the sequence of trials satisfy the following properties:

1. Each outcome belongs to a finite set of outcomes $\left\{a_{1}, a_{2}, \ldots, a_{m}\right\}$ called the state space of the system. If the outcome on the $n^{\text {th }}$ trial is $a_{i}$, then the system is defined to be in state $a_{i}$ at time $n$, or the system is in state $a_{i}$ at the $n^{\text {th }}$ step.
2. The outcome of any trial depends at most upon the outcome of the immediately preceding trial and not upon any other previous outcome.
3. There is a given probability $p_{i j}$ that state $a_{j}$ occurs immediately after the occurrence of $a_{i}$. Therefore if the system is currently in state $a_{i}$, then $p_{i j}$ is the probability of moving to state $a_{j}$.

The numbers $p_{i j}$ from requirement 3 are called the transition probabilities and they can be arranged in a matrix called the transition matrix as shown in Equation A.24.

$$
P=\Pi=\left[\begin{array}{cccc}
p_{11} & p_{12} & \cdots & p_{1 m}  \tag{A.24}\\
p_{21} & p_{22} & \cdots & p_{2 m} \\
\vdots & \vdots & & \vdots \\
p_{m 1} & p_{m 2} & \cdots & p_{m n}
\end{array}\right]
$$

The transition matrix $\Pi$ is a stochastic matrix since each row is a probability vector that sums to 1 . In addition to the transition matrix a Markov chain has an initial probability vector $p$ that indicates the starting state. This is best shown in an example.

Example 7 - Simple Markov Chain: After work a woman enjoys either working out or playing tennis. Since the workout can be quite tiring she never works out 2 days in a row. However if she plays tennis one afternoon, she is just as likely to play tennis the next afternoon as she is to work out. This can be modeled as a Markov chain since the outcome on any day depends only on what happened the preceding day. The transition matrix is shown in Figure A-30.

$$
\mathrm{P}=\begin{gathered}
W \\
\mathrm{~W} \\
\mathrm{~T}
\end{gathered}\left(\begin{array}{cc}
0 & 1 \\
\frac{1}{2} & \frac{1}{2}
\end{array}\right)
$$

Figure A-30: Transition matrix for Example 7

In this figure, W denotes working out and T denotes playing tennis. The rows denote the activity on the current day, and the columns denote the activity on the next day. This matrix is then read in the following manner. If the woman worked out one afternoon she definitely plays tennis the next afternoon (i.e. she never works out 2 days in a row). If however she played tennis one afternoon, $50 \%$ of the time she will play tennis the next afternoon and $50 \%$ of the time she will work out the next afternoon. This can also be shown as a state diagram as shown in Figure A-31.


Figure A-31: State diagram for Example 7

Here it is very easy to see that if she works out one day she does not work out the next. However if she plays tennis one day she is just a likely to play tennis the next day as she is to work out.

Now consider this. What is the likelihood that she will play tennis two days from the current day? This is easy to solve. If she works out today she definitely plays tennis the next day $(100 \%)$, and then two days later there is a $50 \%$ chance of playing tennis. If she plays tennis today she works out the next day $50 \%$ of the time and plays tennis $50 \%$ of the time. Two days from now if she had worked out she definitely plays tennis ( $100 \%$ ), otherwise there is a $50 \%$ chance that she will play tennis. In terms of a probability tree there are two ways she can play tennis again and one way she will work out again in 2 days. So to summarize

If the woman is currently working out, two afternoons from now:

- the probability that she will work out again is $1 / 2$.
- the probability that she will play tennis is $1 / 2$.

If the woman is currently playing tennis, two afternoons from now:

- the probability that she will work out is $1 / 4$.
- the probability that she will play tennis again is $3 / 4$.

Notice that this can be done elegantly using matrix powers as shown in Figure A-32.

$$
\mathbf{p}=\begin{array}{cc}
W & T \\
T
\end{array}\left(\begin{array}{cc}
0 & 1 \\
\frac{1}{2} & \frac{1}{2}
\end{array}\right) \quad \mathbf{p}^{2}=\left(\begin{array}{cc}
0 & 1 \\
\frac{1}{2} & \frac{1}{2}
\end{array}\right)\left(\begin{array}{cc}
0 & 1 \\
\frac{1}{2} & \frac{1}{2}
\end{array}\right)=\begin{gathered}
W \\
W \\
T
\end{gathered}\left(\begin{array}{cc}
\frac{1}{2} & \frac{1}{2} \\
\frac{1}{4} & \frac{3}{4}
\end{array}\right)
$$

Figure A-32: Matrix powers can find the two step probability

The component $p_{i j}$ in the transition matrix $P$ of a Markov chain is the probability that the system changes from the state $a_{i}$ to the state $a_{j}$ in one step (i.e. one time instant). From the above example in one time instant we can move from either state 1 to 2 , or from state 2 to 1 , or stay in state 2 . The probabilities in the transition matrix then show the onestep probabilities.

However we may want to know the probability of moving from state $a_{i}$ to state $a_{j}$ in two or more steps. This can be accomplished simply by matrix multiplication as shown in Equation A. 25

$$
\begin{equation*}
P^{2}=P P \quad \text { or } \quad \Pi^{2}=\Pi \Pi \tag{A.25}
\end{equation*}
$$

Based on this result, it is easy to evaluate the probability of this woman working out or playing tennis in four days, five days or even one hundred days using $P^{4}, P^{5}$, and $P^{100}$ as shown in Figure A-33.


Figure A-33: 4-step, 5-step and 100-step probabilities for Example 7

The interesting thing to note in Figure A-33 is that these matrix powers are converging to a special matrix that has identical rows. After a certain number of steps (matrix powers), the probability of being in state W or T approaches a constant regardless of the starting state. This special matrix is called the transition matrix $\Pi$ (sometimes called the $T$ matrix) and the rows of this matrix are known as the invariant probability vector $\pi$ (also sometime called $\lambda_{\infty}$ ).

## A.13. Partial Summary

It would be useful to summarize the most important concepts so far.

- Oscilloscopes and spectrum analyzers generally work in RMS and dB
- Communication systems must be analyzed as random processes
- Coding in systems adds an extra dimension of difficulty
- Cariolaro and Tronca developed a way to find the PSD of these systems, they require modeling the system as a FSM, and then analyzing the system using Markov Chains.

With these definitions the PSD of systems that use coding can be performed using the approach developed by Cariolaro and Tronca. Their approach is quite involved and two more examples will be explored to explain how their approach works.

## A.14. PSD of Coded Systems - Simplest Example

Example 8 - $50 \%$ duty cycle square wave revisited: Evaluate the PSD of the same $50 \%$ duty cycle square wave from Example 2, with 2.9 V peak and fundamental frequency of 100 kHz .

While this is not really a coding system it can be considered a coding operation with the mapping $1 \rightarrow 10$ and $0 \rightarrow 10$. Hence this is a $1: 2$ mapping and each CW is 2 -bits long. Secondly we can assume that the source outputs a 1 or a 0 bit (SW) with equal probability. The first step is to represent the system as a finite state machine and draw a state diagram. However in this example system there would only be one state no matter what the input symbols are and the output would always be 10 as shown in Figure A-34.


Figure A-34: Original state diagram of the example system

This would lead to a very trivial example and therefore the model used in this example will be modified to have two states as shown in Figure A-35. This does not change the functionality since the system is still always outputting a $50 \%$ duty cycle square wave.


Figure A-35: Expanded state diagram for the example system

This model is adequate since we can consider:

- 1 s and 0 s are equally likely
- In state $S_{0}$ a 1 is mapped to a 10 and a 0 is mapped to a 10
- In state $S_{1}$ a 1 is mapped to a 10 and a 0 is mapped to a 10

A couple observations:

- Due to this coding procedure (mapping), the source statistics are irrelevant (on purpose). That is, even if the source outputs 1 s constantly, the output sequence will always be 10101010 etc.
- Since both a 1 or a 0 forces a move to the other state, we can intuitively determine that $50 \%$ of the time is spent in each state.
- Since the output is always 10 repeatedly, if we had to determine the average output it would have to be 10 .

The next step is to define the input probability matrix $\theta_{u}$ for $u=1,2, \ldots, S$ as shown in Equation A.26, where $S$ is the number of SWs. In this example, $S=2$ since the 1 -bit SWs are either 0 or 1 . Equation A. 26 is showing the probability of a particular SW being chosen given the current state. It is written in diagonal form for mathematical convenience.

$$
P\left(S W \mid l_{i}\right)_{\text {diag }}=\theta_{u}(i, j)=\left\{\begin{array}{cc}
P\left(S W \mid l_{i},\right. & i=j  \tag{A.26}\\
0 & \text { otherwise }
\end{array}\right.
$$

Also define $E_{u}$ for $u=1,2, \ldots, S$ as the next-state matrices, where $E_{u}(i, j)=1$ if and only if state $l_{j}$ is entered from state $l_{i}$ given input $S_{u}$. Thus for each possible SW $S_{u}$, the system will move to only one state so the $E_{u}$ matrices will have a single 1 on each row. Finally define $C_{u}$ for $u=1,2, \ldots, S$ as the CW matrix, where $C_{u}(i, j)$ is the CW generated when state $l_{j}$ is entered due to a given input $S_{u}$. These are shown in Equations A. 27 and A. 28.

$$
\begin{array}{lll}
\theta_{0}=\left[\begin{array}{cc}
0.5 & 0 \\
0 & 0.5
\end{array}\right] & E_{0}=\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right] & C_{0}=\left[\begin{array}{l}
10 \\
10
\end{array}\right] \\
\theta_{1}=\left[\begin{array}{cc}
0.5 & 0 \\
0 & 0.5
\end{array}\right] & E_{1}=\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right] & C_{1}=\left[\begin{array}{l}
10 \\
10
\end{array}\right] \tag{A.28}
\end{array}
$$

Therefore by considering all the ways in which these state transitions can occur and the probability of their occurrence, it is clear that a transition probability matrix can be constructed by Equation A.29.

$$
\begin{gather*}
P=\Pi=\sum_{u=1}^{S} \theta_{u} E_{u}  \tag{A.29}\\
P=\Pi=\left[\begin{array}{cc}
0.5 & 0 \\
0 & 0.5
\end{array}\right]\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right]+\left[\begin{array}{cc}
0.5 & 0 \\
0 & 0.5
\end{array}\right]\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right]=\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right] \tag{A.30}
\end{gather*}
$$

The transition probability matrix $P$ in Equation A. 30 indicates that if the system is currently in state 0 , the probability of moving to state 1 is $100 \%$. Likewise if the system is currently in state 1 the probability of moving to state 0 is $100 \%$, as expected.

Once the transition probability matrix is found the invariant probability distribution can be found by taking $P$ to high powers. However from Markov Chain Theory it is found that $P$ is a non-regular, irreducible, periodic Markov Chain with $d=2$.

Thus the subsequent powers will oscillate between $d$ different matrices. However like most things in probability, we can once again average over these matrices to find a stationary distribution. Therefore finding the invariant stationary distribution is

$$
\begin{align*}
& \frac{1}{n} \sum_{k=1}^{n} \Pi^{k} \rightarrow M  \tag{A.31}\\
& M=\frac{1}{2}\left(\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right]+\left[\begin{array}{ll}
1 & 0 \\
0 & 1
\end{array}\right]\right)=\left[\begin{array}{ll}
.5 & .5 \\
.5 & .5
\end{array}\right] \text { therefore } \pi=\left[\begin{array}{ll}
.5 & .5
\end{array}\right]
\end{align*}
$$

Thus vector $\pi$ indicates, as expected, that this system spends $50 \%$ of the time in each state. As well the $\mathrm{CW} c_{u}(i)$ will occur with probability $\pi(i) \theta_{u}(i, i)$ and thus the average $\mathbf{C W}$ or mean symbol vector can be found by summing over all CWs multiplied by their likelihood as shown in Equation A. 33.

$$
\eta_{c}=\pi \sum_{u=1}^{S} \theta_{u} c_{u}=\left[\begin{array}{ll}
.5 & .5
\end{array}\right]\left[\begin{array}{ll}
.5 & 0  \tag{A.33}\\
0 & .5
\end{array}\right]\left[\begin{array}{l}
20 \\
20
\end{array}\right]=\left[\begin{array}{ll}
.5 & .5
\end{array}\right]\left[\begin{array}{l}
10 \\
10
\end{array}\right]=[10]
$$

As expected the average $\mathbf{C W}$ is 10 . As already discussed this is the only CW so it must obviously be the average CW.

Now consider evaluation of the symbol sequence autocorrelation $R_{c, k}=E\left[c_{n} c_{n+k}\right]$. Consider first $R_{C, 0}$ or evaluation of $E\left[c_{n} c_{n}\right]$. Since symbol $c_{u}(i)$ occurs with probability $\pi(i) \theta_{u}(i, i)$, the contribution of this symbol to $R_{C, 0}$ is $c_{u}^{2}(i) \pi(i) \theta_{u}(i, i)$. Therefore to find the contribution from all CWs to $R_{C, 0}$, Cariolaro and Tronca first define $\Lambda$ as the $L$-square diagonal matrix such that

$$
\Lambda(i, j)=\left\{\begin{array}{cc}
\pi(i) & i=j  \tag{A.34}\\
0 & \text { otherwise }
\end{array}\right.
$$

Then summing over all CWs gives

$$
\begin{equation*}
R_{C, 0}=\sum_{u=1}^{s} c_{u}^{\mathrm{T}} \Lambda \theta_{u} c_{u} \tag{A.35}
\end{equation*}
$$

Therefore when the input symbols are independent of state the autocorrelation is be shown to be

$$
R_{C, k}=\left\{\begin{array}{lr}
\sum_{u=1}^{s} c_{u}^{\mathrm{T}} \Lambda \theta_{u} c_{u} & k=0  \tag{A.36}\\
\sum_{u=1}^{s} c_{u}^{\mathrm{T}} \Lambda \theta_{u} E_{u} \Pi^{k-1} \sum_{v=1}^{s} \theta_{v} c_{v} & k \geq 1 \\
R_{C,-k}^{\mathrm{T}} & k \leq-1
\end{array}\right.
$$

Continuing with the example above, there are two SWs so the autocorrelation at zero time separation $R_{C, 0}$ is found by summing up $c_{0}^{\mathrm{T}} \Lambda \theta_{0} c_{0}+c_{1}^{\mathrm{T}} \Lambda \theta_{1} c_{1}$.

$$
\begin{align*}
& c_{0}^{\mathrm{T}} \Lambda \theta_{0} c_{0}=\left[\begin{array}{ll}
1 & 1 \\
0 & 0
\end{array}\right]\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{ll}
1 & 0 \\
1 & 0
\end{array}\right]=\left[\begin{array}{ll}
.5 & 0 \\
0 & 0
\end{array}\right] \\
& c_{1}^{\mathrm{T}} \Lambda \theta_{1} c_{1}=\left[\begin{array}{ll}
1 & 1 \\
0 & 0
\end{array}\right]\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{ll}
1 & 0 \\
1 & 0
\end{array}\right]=\left[\begin{array}{ll}
.5 & 0 \\
0 & 0
\end{array}\right]  \tag{A.37}\\
& R_{C, 0}=c_{0}^{\mathrm{T}} \Lambda \theta_{0} c_{0}+c_{1}^{\mathrm{T}} \Lambda \theta_{1} c_{1}=\left[\begin{array}{ll}
1 & 0 \\
0 & 0
\end{array}\right]
\end{align*}
$$

Equation A. 37 depicts the correlation of all the CWs. In order to continue this evaluation for the first few time separations it is important to notice that in Equation A. 36 the second summation term does not change with $k$, and thus it can be computed separately as shown in Equation A. 38 .

$$
\sum_{v=1}^{s} \theta_{v} c_{v}=\left[\begin{array}{ll}
.5 & 0  \tag{A.38}\\
0 & .5
\end{array}\right]\left[\begin{array}{ll}
1 & 0 \\
1 & 0
\end{array}\right]+\left[\begin{array}{cc}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{ll}
1 & 0 \\
1 & 0
\end{array}\right]=2\left[\begin{array}{ll}
.5 & 0 \\
.5 & 0
\end{array}\right]=\left[\begin{array}{ll}
1 & 0 \\
1 & 0
\end{array}\right]
$$

Thus $R_{C, 1}$ is found to be the same as $R_{C, 0}$ as shown in Equation A.39.

$$
\begin{align*}
& R_{C, 1}=\sum_{u=1}^{S} c_{u}^{\mathrm{T}} \Lambda \theta_{u} E_{u} \Pi^{k-1} \sum_{v=1}^{S} \theta_{v} c_{v} \\
& R_{C, 1}=\sum_{u=1}^{S} c_{u}^{\mathrm{T}} \Lambda \theta_{u} E_{u} \Pi^{1-1}\left[\begin{array}{ll}
1 & 0 \\
1 & 0
\end{array}\right] \\
& R_{C, 1}=\sum_{u=1}^{S} c_{u}^{\mathrm{T}} \Lambda \theta_{u} E_{u}\left[\begin{array}{ll}
1 & 0 \\
0 & 1
\end{array}\right]\left[\begin{array}{ll}
1 & 0 \\
1 & 0
\end{array}\right]  \tag{A.39}\\
& R_{C, 1}=\sum_{u=1}^{S} c_{u}^{\mathrm{T}} \Lambda \theta_{u} E_{u}\left[\begin{array}{ll}
1 & 0 \\
1 & 0
\end{array}\right]=2\left[\begin{array}{ll}
1 & 1 \\
0 & 0
\end{array}\right]\left[\begin{array}{cc}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right]\left[\begin{array}{ll}
1 & 0 \\
1 & 0
\end{array}\right] \\
& R_{C, 1}
\end{align*}=2\left[\begin{array}{ll}
.5 & 0 \\
0 & 0
\end{array}\right]=\left[\begin{array}{ll}
1 & 0 \\
0 & 0
\end{array}\right] \quad .
$$

Continuing with this approach it is found that

$$
R_{C, 0}=R_{C, 1}=R_{C, 2}=R_{C, 3}=\ldots=R_{C, \infty}=\left[\begin{array}{ll}
1 & 0  \tag{A.40}\\
0 & 0
\end{array}\right]
$$

where

$$
\begin{equation*}
R_{C, \infty}=\eta_{c}^{\mathrm{T}} \eta_{c} \tag{A.41}
\end{equation*}
$$

which is the transpose of the mean symbol vector multiplied by itself. Finally the autocovariance is found as

$$
\begin{align*}
& K_{c, k}=R_{c, k}-\eta_{c}^{2}  \tag{A.42}\\
& K_{c, k}=R_{c, k}-R_{c, \infty}
\end{align*}
$$

This means that if the mean symbol vector is zero, the autocovariance matrices are equal to the autocorrelation matrices. As discussed previously, the mean symbol vector will be zero if the output waveforms are balanced, i.e. if 0 is mapped to -1 and 1 is mapped to +1 .

Thus $K_{C, 0}$ is found by $R_{C, 0}-R_{C, \infty}$ as $K_{C, 1}=R_{C, 1}-R_{C, \infty}$ and $K_{C, 1}=R_{C, 1}-R_{C, \infty}$ and thus as $k$ tends to infinity the all zero matrix results as shown below.

$$
K_{c, k}=R_{c, k}-R_{c, \infty}=\left[\begin{array}{ll}
0 & 0  \tag{A.43}\\
0 & 0
\end{array}\right] \text { as } k \rightarrow \infty
$$

In the example above $K_{C, k}$ is all zero for all $k$. This makes sense since the output never varies. In other words the output is periodic and in the frequency domain there is no continuous component. Finally the PSD is found by

$$
\begin{equation*}
W(f)=\frac{|P(f)|^{2}}{N T}\left(X_{C}(f)+\frac{X_{D}(f)}{N T} \sum_{m=-\infty}^{\infty} \delta\left(f-\frac{m f_{0}}{N}\right)\right) \tag{A.44}
\end{equation*}
$$

where

$$
\begin{equation*}
X_{C}(f)=v\left[K_{C, 0}+\sum_{k=-\infty}^{\infty} K_{C, k} e^{-j 2 \pi \pi k N T}\right] v^{*} \quad X_{D}(f)=v R_{C, \infty} v^{*} \tag{A.45}
\end{equation*}
$$

$$
v=\left[\begin{array}{llllll}
1 & e^{j 2 \pi f T} & e^{j 4 \pi f T} & e^{j \sigma \pi f T} & \cdots & e^{j 2 \pi f(N-1) T} \tag{A.46}
\end{array}\right]
$$

In this example since the covariance matrices are all zero there will be no continuous component and $X_{C}(f)=0$ and therefore

$$
\begin{equation*}
W(f)=\frac{|P(f)|^{2}}{N T}\left(0+\frac{X_{D}(f)}{N T} \sum_{m=-\infty}^{\infty} \delta\left(f-\frac{m f_{0}}{N}\right)\right) \tag{A.47}
\end{equation*}
$$

and the discrete PSD component is found to be

$$
X_{D}(f)=\left[\begin{array}{ll}
1 & e^{j 2 \pi f T}
\end{array}\right]\left[\begin{array}{ll}
1 & 0  \tag{A.48}\\
0 & 0
\end{array}\right]\left[\begin{array}{c}
1 \\
e^{-j 2 \pi f T}
\end{array}\right]=[1] \quad \text { where } \quad v=\left[\begin{array}{l}
1
\end{array} e^{j 2 \pi f T}\right]
$$

Therefore the overall PSD reduces to

$$
\begin{equation*}
W(f)=\frac{|P(f)|^{2}}{N T}\left(\frac{1}{N T} \sum_{m=-\infty}^{\infty} \delta\left(f-\frac{m f_{0}}{N}\right)\right) \tag{A.49}
\end{equation*}
$$

Equation A. 49 indicates that the PSD of this signal is a train of delta functions weighted by the Fourier Transform of the pulse shape squared. Since the pulse shape is a square wave we can find the Fourier transform from tables or by Equation A.50.


Figure A-36: Square pulse shape used in Example 8

$$
\begin{align*}
& \text { Arect }\left(\frac{t}{\tau}\right) \leftrightarrow A \tau \operatorname{sinc}\left(\frac{\omega \tau}{2}\right) \\
& 2.9 r e c t\left(\frac{t}{5 \mu s}\right) \leftrightarrow 2.9(5 \mu s) \operatorname{sinc}\left(\frac{2 \pi f 5 \mu s}{2}\right)  \tag{A.50}\\
& P(f)=(14.5 \mu s) \operatorname{sinc}(\pi f 5 \mu s)
\end{align*}
$$

With this pulse shape define the weight value as

$$
\begin{equation*}
\text { weight }=\frac{|P(f)|^{2}}{N T} \quad \text { from } \quad W(f)=\frac{|P(f)|^{2}}{N T}\left(\frac{1}{N T} \sum_{m=-\infty}^{\infty} \delta\left(f-\frac{m f_{0}}{N}\right)\right) \tag{A.51}
\end{equation*}
$$

Since it is a magnitude function squared it is now in the form of power dissipated over 1 ohm. However to match the oscilloscope we want it to be in dB and RMS. Therefore we need to take the square root $\sqrt{\text { weight }}$.

Furthermore Cariolaro and Tronca define this PSD as a two sided spectrum, i.e. the Fourier Transform of the pulse shape is related to the complex exponential Fourier Series. Therefore to match a one sided spectrum the $\sqrt{\text { weight }}$ needs to be multiplied by two as indicated by Equation A. 8 i.e. $2 \sqrt{\text { weight }}$. Therefore the final form to match the digital oscilloscope and spectrum analyzer is

$$
\begin{align*}
& W_{V_{\text {Rus }}}(f)=10 \log \left(\left(\frac{2 \sqrt{\text { weight }}}{\sqrt{2}}\right)^{2}\left(\frac{1}{N T} \sum_{m=-\infty}^{\infty} \delta\left(f-\frac{m f_{0}}{N}\right)\right)\right)  \tag{A.52}\\
& W_{d B m}(f)=10 \log \left(\frac{\left(\frac{2 \sqrt{\text { weight }}}{\sqrt{2}}\right)^{2}\left(\frac{1}{N T} \sum_{m=-\infty}^{\infty} \delta\left(f-\frac{m f_{0}}{N}\right)\right)}{50(.001)}\right) \tag{A.53}
\end{align*}
$$

Note: Equations A. 52 and A. 53 are general forms for this example only. Some values for dBV RMS are given in the following figures.


Figure A-37: Stores the pulse shape $P(f)$ and weight, finds the $d B V_{\text {RMS }}$ value at 100 kHz


Figure A-38: Find $\mathrm{dBV}_{\text {RMS }}$ values at 200 kHz and 300 kHz


Figure A-39: Find $\mathrm{dBV}_{\text {RMS }}$ values at 500 kHz and 700 kHz

Continuing in this fashion for dBm :


Figure A-40: Finding the dBm value at 100 kHz


Figure A-41: Finding the dBm values at $\mathbf{3 0 0 k H z}$ and 500 kHz

Comparing the dBV RMs results in Figures A-37, A-38 and A-39 to Table A-1, and the dBm results in Figures A-40 and A-41 to Table A-2 demonstrates the accuracy of this method.

While this example is much more complicated than Example 2, it does show the generality of the Cariolaro and Tronca method of finding PSDs. Their approach however is intended for more complicated systems as the next example will show.

## A.15. PSD of Coded Systems - Real World Example

Example 9 - Random Binary Signal: From Example 5 consider a $\pm A$ random square wave with pulse width 5us, modeled as a FSM as shown in Figure A-42.


Figure A-42: State diagram of the random binary signal

The first thing to notice here is that this output will be truly random. Therefore there will definitely be a continuous component in the PSD. As well since the output is balanced, i.e. $\pm A$, there will be no discrete component in the PSD. Like Example 8, the first things to define are the $\theta_{u}, E_{u}$ and $C_{u}$ matrices as shown in Equations A. 54 and A. 55 .

$$
\begin{array}{lll}
\theta_{0}=\left[\begin{array}{cc}
0.5 & 0 \\
0 & 0.5
\end{array}\right] & E_{0}=\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right] & C_{0}=\left[\begin{array}{l}
-1 \\
-1
\end{array}\right] \\
\theta_{1}=\left[\begin{array}{cc}
0.5 & 0 \\
0 & 0.5
\end{array}\right] & E_{1}=\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right] & C_{1}=\left[\begin{array}{l}
1 \\
1
\end{array}\right] \tag{A.55}
\end{array}
$$

By considering all the ways these state transitions can occur and the probability of their occurrence it is clear that a transition probability matrix will be the same as in Example 8.

$$
\begin{align*}
& \Pi=\left[\begin{array}{cc}
0.5 & 0 \\
0 & 0.5
\end{array}\right]\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right]+\left[\begin{array}{cc}
0.5 & 0 \\
0 & 0.5
\end{array}\right]\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right]=\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right]  \tag{A.56}\\
& \frac{1}{n} \sum_{k=1}^{n} \Pi^{k} \rightarrow M  \tag{A.57}\\
& M=\frac{1}{2}\left(\left[\begin{array}{ll}
0 & 1 \\
1 & 0
\end{array}\right]+\left[\begin{array}{ll}
1 & 0 \\
0 & 1
\end{array}\right]\right)=\left[\begin{array}{ll}
.5 & .5 \\
.5 & .5
\end{array}\right] \quad \text { therefore } \pi=[.5
\end{align*}
$$

As expected this system spends $50 \%$ of the time in each state. As well the CW $c_{u}(i)$ will occur with probability $\pi(i) \theta_{u}(i, i)$ and thus the average CW or mean symbol vector can be found by summing over all the 1-bit CWs.

$$
\eta_{c}=\pi \sum_{u=1}^{s} \theta_{u} c_{u}=\left[\begin{array}{ll}
.5 & .5
\end{array}\right]\left[\begin{array}{ll}
.5 & 0  \tag{A.58}\\
0 & .5
\end{array}\right]\left[\begin{array}{l}
-1 \\
-1
\end{array}\right]+\left[\begin{array}{ll}
.5 & .5
\end{array}\right]\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{l}
1 \\
1
\end{array}\right]=[0]
$$

As expected the average CW is 0 since $50 \%$ of the time the system outputs a -1 and the other $50 \%$ of the time the system outputs $a+1$.

Furthermore, since there are two SWs the autocorrelation at zero time separation $R_{C, 0}$ is found by summing up $c_{0}^{\mathrm{T}} \Lambda \theta_{0} c_{0}+c_{1}^{\mathrm{T}} \Lambda \theta_{1} c_{1}$ as shown in Equation A.59.

$$
\begin{align*}
& c_{0}^{\mathrm{T}} \Lambda \theta_{0} c_{0}=\left[\begin{array}{ll}
-1 & -1
\end{array}\right]\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{cc}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{l}
-1 \\
-1
\end{array}\right]=[.5] \\
& c_{1}^{\mathrm{T}} \Lambda \theta_{1} c_{1}=\left[\begin{array}{ll}
1 & 1
\end{array}\right]\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{l}
1 \\
1
\end{array}\right]=[.5]  \tag{A.59}\\
& R_{C, 0}=c_{0}^{\mathrm{T}} \Lambda \theta_{0} c_{0}+c_{1}^{\mathrm{T}} \Lambda \theta_{1} c_{1}=[1]
\end{align*}
$$

Equation A. 59 depicts the correlation of all the CWs at zero time separation. Like Example 8 the second summation term does not change with $k$ so it can be computed separately.

$$
\sum_{v=1}^{s} \theta_{v} c_{v}=\left[\begin{array}{ll}
.5 & 0  \tag{A.60}\\
0 & .5
\end{array}\right]\left[\begin{array}{l}
-1 \\
-1
\end{array}\right]+\left[\begin{array}{ll}
.5 & 0 \\
0 & .5
\end{array}\right]\left[\begin{array}{l}
1 \\
1
\end{array}\right]=\left[\begin{array}{l}
0 \\
0
\end{array}\right]
$$

Thus $R_{c, 1}$ is found to be

$$
\begin{align*}
& R_{C, 1}=\sum_{u=1}^{s} c_{u}^{\mathrm{T}} \Lambda \theta_{u} E_{u} \Pi^{k-1} \sum_{v=1}^{s} \theta_{v} c_{v}  \tag{A.61}\\
& R_{C, 1}=\sum_{u=1}^{s} c_{u}^{\mathrm{T}} \Lambda \theta_{u} E_{u} \Pi^{1-1}\left[\begin{array}{l}
0 \\
0
\end{array}\right]=[0]
\end{align*}
$$

Continuing with this approach shows that

$$
\begin{equation*}
R_{C, 1}=R_{C, 2}=R_{C, 3}=R_{C, \infty}=[0] \tag{A.62}
\end{equation*}
$$

Finally the autocovariance is found as

$$
\begin{align*}
& K_{c, 0}=R_{C, 0}-R_{C, \infty}=[1]-[0]=[1]  \tag{A.63}\\
& K_{C, 1}=K_{C, 2}=K_{C, \infty}=R_{C, 1}-R_{C, \infty}=[0]-[0]=[0]
\end{align*}
$$

This shows that there will be a continuous component. Noting that the only covariance matrix that is non-zero is $K_{C, 0}$, the result is

$$
\begin{align*}
& X_{C}(f)=v\left[K_{C, 0}+\sum_{k=-\infty}^{\infty} K_{C, k} e^{-j 2 \pi / k N T}\right] v^{*} \\
& X_{C}(f)=v K_{C, 0} v^{*}  \tag{A.64}\\
& X_{C}(f)=\left[\begin{array}{ll}
1 & e^{j 2 \pi J \tau}
\end{array}\right][1]\left[\begin{array}{c}
1 \\
e^{-j 2 \pi f \tau}
\end{array}\right]=2
\end{align*}
$$

Therefore the PSD in this case reduces to

$$
\begin{equation*}
W(f)=\frac{|P(f)|^{2}}{N T}(2+0) \tag{A.65}
\end{equation*}
$$

Like Example 8, since the pulse shape is a square wave, from Fourier Tables we find that

$$
\begin{align*}
& \operatorname{Arect}\left(\frac{t}{\tau}\right) \leftrightarrow A \tau \operatorname{sinc}\left(\frac{\omega \tau}{2}\right) \\
& 2 r e c t\left(\frac{t}{1}\right) \leftrightarrow 2 \operatorname{sinc}\left(\frac{2 \pi f}{2}\right)  \tag{A.66}\\
& P(f)=2 \operatorname{sinc}(\pi f)
\end{align*}
$$

The final expression for the PSD in this example is then

$$
\begin{equation*}
W(f)=\frac{8 \operatorname{sinc}^{2}(\pi f)}{2 T} \tag{A.67}
\end{equation*}
$$



Figure A-43: Ti-89 and Matlab plotting the spectrum

Inspection of the PSD from Example 5 shows the identical result. Therefore this approach by Cariolaro and Tronca can be applied to very complex coding systems to obtain the PSD.

## Appendix B - AddSWs, AddCWs and Codes

This Appendix lists all possible AddSWs that can be used with each Hamming code considered in this thesis. Encoding to AddCWs is accomplished as outlined in Section 2.14 using the generator polynomial $g(x)=1011=\operatorname{Bh}$ for $(7, x)$ codes, the generator polynomial $g(x)=10011=13 \mathrm{~h}$ for $(15, x)$ codes, and the generator polynomial $g(x)=101111=2$ Fh for ( $31, x$ ) codes. They are presented in the format introduced by Figures 3-18 and 3-22. The number of AddSW/AddCW sets is defined in Equation 3.1.

## B.1. (7,x) codes

Table B-1: $(7,3)$ AddSWs $-\left(2^{3}\right)^{0}=1$ set

| 00.00 | $11 \ldots 11$ |
| :---: | :---: |
| 0 | F |

Table B-2: $(7,2)$ AddSWs $-\left(\mathbf{2}^{2}\right)^{1}=4$ sets

| $00 . .00$ | $a$ | $\bar{a}$ | $11 \ldots 11$ |
| :---: | :---: | :---: | :---: |
| 0 | 4 | $B$ | $F$ |
| 0 | 5 | A | F |
| 0 | 6 | 9 | F |
| 0 | 7 | 8 | F |

Table B-3: $(7,1)$ AddSWs $-\left(\mathbf{2}^{1}\right)^{3}=8$ sets

| 00.100 | a | b | c | $\overline{\mathrm{C}}$ | $\overline{\mathrm{b}}$ | $\overline{\mathrm{a}}$ | $11 \ldots 11$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 0 | 2 | 4 | 6 | 9 | B | D | F |
| 0 | 2 | 4 | 7 | 8 | B | D | F |
| 0 | 2 | 5 | 6 | 9 | A | D | F |
| 0 | 2 | 5 | 7 | 8 | A | D | F |
| 0 | 3 | 4 | 6 | 9 | B | C | F |
| 0 | 3 | 4 | 7 | 8 | B | C | F |
| 0 | 3 | 5 | 6 | 9 | A | C | F |

## B.2. $(15, x)$ codes

Table B-4: $(15,10)$ AddSWs $-\left(\mathbf{2}^{10}\right)^{0}=1$ set

| $00 . .00$ | $11 \ldots 11$ |
| :---: | :---: |
| 0 | 7 FF |

Table B-5: $(15,9)$ AddSWs $-\left(2^{9}\right)^{1}=512$ sets

| $00 . .00$ | $a$ | $a$ | $11 . .11$ |
| :---: | :---: | :---: | :---: |
| 0 | 200 | 5 FF | 7 FF |
| 0 | 201 | 5 FE | 7 FF |
| 0 | 202 | 5 FD | 7 FF |
| 0 | $\cdots$ | $\ldots$ | 7 FF |
| 0 | $\cdots$ | $\cdots$ | 7 FF |
| 0 | 3 FE | 401 | 7 FF |
| 0 | 3 FF | 400 | 7 FF |

Table B-6: $(15,8)$ AddSWs $-\left(2^{8}\right)^{3}=16777216$ sets

| 00.100 | $a$ | $b$ | $c$ | $\bar{c}$ | $\bar{b}$ | $\bar{a}$ | $11 \ldots 11$ |
| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |
| 0 | 100 | 200 | 300 | 4 FF | 5 FF | 6 FF | 7 FF |
| 0 | 100 | 200 | 301 | 4 FE | 5 FF | 6 FF | 7 FF |
| 0 | 100 | 200 | 302 | 4 FD | 5 FF | 6 FF | 7 FF |
| 0 | $\ldots$ | $\ldots$ | $\ldots$ | $\cdots$ | $\ldots$ | $\ldots$ | 7 FF |
| 0 | 100 | 201 | 300 | 4 FF | 5 FE | 6 FF | 7 FF |
| 0 | 100 | 201 | 301 | 4 FE | 5 FE | 6 FF | 7 FF |
| 0 | 100 | 201 | 302 | 4 FD | 5 FE | 6 FF | 7 FF |
| 0 | $\ldots$ | $\cdots$ | $\cdots$ | $\cdots$ | $\cdots$ | $\ldots$ | 7 FF |
| 0 | 101 | 200 | 300 | 4 FF | 5 FF | 6 FE | 7 FF |
| 0 | 101 | 200 | 301 | 4 FE | 5 FF | 6 FE | 7 FF |
| 0 | 101 | 200 | 302 | 4 FD | 5 FF | 6 FE | 7 FF |
| 0 | $\cdots$ | $\cdots$ | $\cdots$ | $\cdots$ | $\cdots$ | $\cdots$ | 7 FF |
| 0 | 1 FF | 2 FF | 3 FF | 400 | 500 | 600 | 7 FF |

## B.3. $(31, \mathbf{x})$ codes

Table B-7: $(31,25)$ AddSWs $-\left(2^{25}\right)^{0}=1$ set

| $00 .-00$ | $11 \ldots 11$ |
| :---: | :---: |
| 0 | $3 F F F F F F$ |

Table B-8: $(31,24)$ AddSWs $-\left(2^{24}\right)^{1}=16777216$ sets

| $00 . .00$ | $a$ | $a$ | $11 . .11$ |
| :---: | :---: | :---: | :---: |
| 0 | 1000000 | 2FFFFFF | $3 F F F F F F$ |
| 0 | 1000001 | 2FFFFFE | $3 F F F F F F$ |
| 0 | $\ldots$ | $\ldots$ | $3 F F F F F F$ |
| 0 | $\ldots$ | $\cdots$ | $3 F F F F F F$ |
| 0 | $1 F F F F F E$ | 2000001 | $3 F F F F F F$ |
| 0 | $1 F F F F F F$ | 2000000 | $3 F F F F F F$ |

Table B-9: $(\mathbf{3 1 , 2 3})$ AddSWV $-\left(2^{23}\right)^{\mathbf{3}}=\mathbf{5 9 0 2 9 5 8 1 0 3 5 8 7 0 5 6 5 1 7 1 2}$ sets


