CSE 370 Mini Research Project: CDROM Tech
Dan Thompson
Jason Dang
Kenny Risk
Table Of Contents:
· Objective
· Assumptions
· Answer
· Conclusions
· References
Objective:
We were asked to go beyond 1s and 0s and research on how the string “CSE370” is represented at the physical level in a number of different media. We had a choice of media to concentrate on: CD-ROM, bar code, magnetic tape, 56KB modem, and fluorescent lights. Our group chose CD-ROM as the media to concentrate our efforts on.
Thus, our objective of this project is to discover how the string “CSE370” is physically encoded onto a CD-ROM.
Assumptions:
The core assumptions we made are as follows:
Answer:
Physical Media
Encoding Schemes
The String “CSE370”
Physical Media:
At the very base level a CD is a collection of ones and zeros, or rather, pits and lands, engraved on a very thin sheet of aluminum. Manufactured under very specific specifications, the pits are a mere .15 microns deep, .5 microns wide, and between .83 to 3 microns long (1 micron = 1 millionth of a meter). There are only 1.6 microns between tracks, resulting in a massive track density of 16,000 tracks per inch. During the reading process, a laser of wavelength 780 nanometers is shined on the disc as it rotates, and is reflected back. The reflected beam is passed through a beam splitter, and then into a sensor. As the laser passes over the pits and lands, the wavelength is varied, as shown below. Whenever the signal crosses zero, this is interpreted as a one, whereas anything else is interpreted as a zero.
Encoding
Schemes:
As our first step, we must encode each character into a series of bits. The ASCII standard of encoding character data assigns one byte to each character. Our string “CSE370” can be broken down as follows:
Character |
Hex |
Binary |
C |
43 |
0100 0011 |
S |
53 |
0101 0011 |
E |
45 |
0100 0101 |
3 |
33 |
0011 0011 |
7 |
37 |
0011 0111 |
0 |
30 |
0011 0000 |
So once we have encoded our string into a series of 1’s and 0’s, we’re done you say? Nope. We must go through another step of adjusting the data—namely, using EFM modulation. EFM modulation is a technique used in the coding of data onto a CD-ROM. What EFM does is take each 8-bit segment of data and change it to a 14-bit segment of data. This way, the 0-1 and 1-0 transitions are minimized, thus reducing the appearance of small pits, which furthermore reduces the chances of errors.
Under the EFM encoding standard, our string then is expanded to:
Character |
8-bit |
14-bit after
EFM |
C |
0100 0011 |
10001000100100 |
S |
0101 0011 |
00100000100100 |
E |
0100 0101 |
00000000100100 |
3 |
0011 0011 |
10000100010000 |
7 |
0011 0111 |
00100010001000 |
0 |
0011 0000 |
00000100000000 |
“100010001001000010000010010000000000100100100001000100000010001000100000000100000000”
Conclusions:
Based on the research conducted, we were able to derive both the digital and physical form of the string “CSE370” on a CD-ROM. However, further research is recommended, specifically on the in depth error correction schemes applied in the CD-ROM standard. Many additional forms of error correction are employed, as noted in the assumptions, such as parity check and interleaving of data, which allows for incredible error detection and correction.
References:
http://www.pctechguide.com/08cdrom2.htm#Manufacturing
http://www.ee.washington.edu/conselec/CE/kuhn/cdmulti/cdhome.htm
K.
Pohlman, The Compact Disk Handbook, A-R editions, 1992