Ethernet Evolution: From 10M to 10G
Ethernet Evolution: From 10M to 10G
1
What Will You Learn?
2
What Won’t You Learn?
• Pricing - We don’t know, We don’t care.
• Specific product features – We’re not here to sell.
• The following technologies:
– ATM
– QoS
– IPv6
– VoIP
– DWDM
– OSPF
– MPLS (well, a little on that)
– How to make money in the stock market
Those are all other workshops…
• How to design a real, complete network. (we’ll cover
Ethernet and switching, but not routing)
3
Outline
• Ethernet Essentials
• Media
• Core PHYs
• Auto-Negotiation
• Future Ethernet
– DTE Power via MDI
– 10 Gigabit Ethernet
– Ethernet in the FIRST Mile (EFM)
• Switched Network Design
– Spanning Tree
– Link Aggregation
– VLANs
• Future non-Ethernet (but want to be)
– RPR
– PONs
4
Ethernet Essentials - Outline
• Ethernet History
• Ethernet Standards
• Ethernet Frame
• Half Duplex MAC
• Repeaters
• Full Duplex MAC
• Switches
• Flow Control
5
Ethernet History
• Why is it called Ethernet?
– “In late 1972, Metcalfe and his Xerox PARC colleagues developed
the first experimental Ethernet system to interconnect the Xerox
Alto, a personal workstation with a graphical user interface. The
experimental Ethernet was used to link Altos to one another, and
to servers and laser printers. The signal clock for the experimental
Ethernet interface was derived from the Alto's system clock, which
resulted in a data transmission rate on the experimental Ethernet
of 2.94 Mbps.
– Metcalfe's first experimental network was called the Alto Aloha
Network. In 1973 Metcalfe changed the name to "Ethernet," to
make it clear that the system could support any computer-not just
Altos-and to point out that his new network mechanisms had
evolved well beyond the Aloha system. He chose to base the name
on the word "ether" as a way of describing an essential feature of
the system: the physical medium (i.e., a cable) carries bits to all
stations, much the same way that the old "luminiferous ether" was
once thought to propagate electromagnetic waves through space.
Thus, Ethernet was born.”
6
Ethernet History
7
Ethernet History: Pure
ALOHA
• Transmit when you want to, regardless of
others.
8
Ethernet History: Pure
ALOHA Collisions
• Extremely inefficient, since the worst-case
period of vulnerability is the time to transmit
two frames.
9
Ethernet History: Slotted
ALOHA
• Transmit only at the beginning of
synchronized “slot times”
• Collision inefficiency limited to one frame
transmission time
Frame
Frame Frame
10
Ethernet History: ALOHA v
Slotted ALOHA
• Throughput efficiency increases
dramatically for Slotted Aloha.
11
Ethernet History: CSMA/CD
• Take Slotted ALOHA to the next level, use the
slots as “contention periods”.
– If no collision occurs before the end of the period, then complete
transmission of the frame.
• CSMA/CD can be in one of three states:
contention, transmission, or idle. More on
CSMA/CD later…
12
Ethernet History: Collisions
• Collisions
– Two or more transmissions literally collided with one
another on the same medium.
– Result corrupts the data contents of the
transmissions.
• Possible due to the medium used by the
original Ethernet
13
Ethernet History: The Shared
Bus Topology
• Coaxial Cabling, 10 Mbps
– 10BASE-5 “ThickNet”
– 10BASE-2 “ThinNet”
• Bus Topology = Truly shared media
14
Ethernet History: Bus
Topology Extinction
• Problems with Bus Topology
– Break in Coax cable can sever service to multiple nodes
– Fault in Coax cable can disrupt service to all nodes
» ground fault
» Incorrect termination
– Adding/Removing nodes disrupts network
15
Ethernet History: Star Topology
• Bus Topology Evolved into a Star Topology
– Driven by cabling issues
» Single cable breaks/faults effect only one node
» Emergence of cheap unshielded twisted pair (UTP) cable
– Introduces “Hub” or “Concentrator” that isolates faulty
nodes/cables
16
Ethernet History: Hub
• Hub can refer to
either:
– Repeater (“Bus in a
Box”)
» Star Topology
with Logical Bus
– Switch / Bridge
» Still Star
Topology: Allows
simultaneous
transmissions
between different
stations
17
Ethernet History:
Standardization
• First IEEE Ethernet Standard in 1985
• Standardization
– Preferred over competing proprietary solutions
– Creates a shared, open market for component and
systems vendors
– Defines interfaces and mechanisms to permit
interoperable solutions
» Permits creation of heterogeneous (multi-vendor)
networks
18
Ethernet Standards
• Ethernet fits in the Open Standards
Interface (OSI) model of the International
Standards Organization as shown.
19
Ethernet Standards: IEEE 802
Architecture
20
Ethernet Standards: IEEE
802.3
• 802.3 Now encompasses
– Original 802.3: 10BASE-T 10BASE-5 10BASE-2 10BROAD-36
– 802.3u Fast Ethernet: 100BASE-TX 100BASE-FX 100BASE-T4
– 802.3x: Flow Control
– 802.3z Gigabit Ethernet: 1000BASE-SX / -LX / -CX
• 802.3ab Copper Gigabit Ethernet: 1000BASE-T
• 802.3ac Frame Tagging for VLAN support
• 802.3ad Link Aggregation
• 802.3ae 10 Gigabit Ethernet: Completion by March
2002
• 802.3af DTE Power via MDI: Completion by Sept
2001
21
Ethernet Frame
22
Ethernet Frame
23
Ethernet Frame: Addresses
• Station Addresses
– Must be unique on every LAN
– Unless locally administered (uncommon), each node has a unique
address assigned by the manufacturer. First 3bytes of address
are assigned by IEEE (Organization Unique Identifier - OUI)
• Source Address: Must always be Station Address
• Destination Address: May be –
– Unicast Address (Addressed to Station Address of one other
station)
– Multicast Address (Addressed to multiple stations
simultaneously)
» Broadcast Address (FF FF FF FF FF FF) To All Stations
24
Ethernet Frame: Other Fields
• Preamble: repeating 1010 pattern needed for some PHYs
• Start of Frame Delimiter: mark byte boundary for MAC
• Type / Length: length, or type of frame if >1536 (0x0600)
• Data: protocol data unit from higher layers (ie: IP datagram)
• Pad: Only used when necessary to extend frame to 64 bytes
• Checksum: CRC-32 Detect if frame is received in error
• Idle: Occurs between frames, must be at least 96 bit times.
25
MAC
• Ethernet Frame transmission and reception must be
controlled – via the Media Access Control (MAC) layer.
• Ethernet MAC
Operates in either Half or Full Duplex dependent on
Handles
Frame Transmission
Frame Reception
use
Does need to know speed of physical layer
26
MAC: Half Duplex
• Half Duplex: Only one station may transmit at a
time
• Requirement on shared mediums (Bus
Topologies)
– 10Base-2, 10Base-5
• Half Duplex Mechanism employed by
Ethernet is:
– Carrier Sense, Multiple Access with Collision
Detect (CSMA/CD)
27
MAC: CSMA/CD
• CS - Carrier Sense (Is someone already talking?)
• MA - Multiple Access (I hear what you hear!)
• CD - Collision Detection (Hey, we’re both talking!)
28
MAC: Collision Domain
• Collision detection can take as long as twice the
maximum network end to end propagation delay,
worst case.
• This “round-trip” delay defines the max Ethernet
network diameter, or collision domain.
• Round-trip delay = 512 bit times for all Ethernets
up to this point.
29
MAC: Collision Domain
• Space-Time depiction of Collision Domain
30
Repeaters
• Works at layer 1 (PHY layer) ONLY
– thus it doesn’t understand frame formats
• Repeat incoming signal from a port to all other ports with:
– restored timing
– restored waveform shape
– very little delay
31
Repeaters: 10Mbps
• 5-4-3 Rule
– 5 Segments
– 4 Repeaters
– 3 Populated segments (in the case of 10Base-5 or 10Base-2)
32
Repeaters: 100Mbps
33
Repeaters: 100Mbps
34
The Ethernet Bridge
• How do you make a half duplex Ethernet network
bigger than the Collision Domain allows?? Use a
Bridge.
• Repeaters are
inside
the collision domain,
since they propagate
collisions
• Bridges/Switches
break up the
domains, since they
operate at layer 2
and buffer packets
before sending them
35
Evolution…
• 10Base-T became dominant in early 90s
– Half Duplex / CSMA/CD is simple
– Repeaters are cheap (not complex / low-speed)
– Growth of Star Topology in building infrastructure
– Unshielded Twisted Pair (UTP) cheaper than coax
• Predominant use of UTP allows for the
creation of Full Duplex MAC.
– Media is no longer SHARED
» 10BASE-T devices transmit on one pair of UTP, and
receive on an entirely separate pair.
» Unlike coax – simultaneous reception and
transmission on the media does NOT corrupt the data
transmission.
36
MAC: Full Duplex
37
MAC: Full Duplex Pros and
Cons
• Pros: • Cons:
– aggregate throughput = –
Must be point-to-point link (i.e.,
200mbps no repeaters)
– no collision efficiency –
Higher throughput means higher
penalty speed equipment which means
– no need to defer to higher cost.
incoming transmission –
no built in back pressure
– no collision domain - mechanism (see MAC Control /
Distance is media Flow Control)
dependent and not
affected by the protocol.
– Need for point to point links
requires new hub device to
interconnect multiple station – a
full duplex capable switch.
38
The Ethernet Switch
• Each port of Switch has its own MAC
• Typically can support either Half or Full Duplex
• Can switch between different network speeds (i.e.,
10Mbps and 100Mbps)
• More on this to come…
39
Flow Control
• Supported on Full Duplex links only.
• Sends MAC Control Frames called Pause
Frames
• Pause Frames
– Destination address is a special multicast address
that is never forwarded by bridges/switches.
» Thus, Link Level Flow Control ONLY
– Tells MAC Control to pause frame transmission to the
MAC for a period of time
• Useful for input constrained devices such
as network interface cards (NICs) and
buffered distributors (more on those in a
bit).
40
MAC: Half Duplex at 1 Gbps
• Recall the maximum
network diameter
from 100Mbps is
only ~200m.
– Without modification to
the Ethernet MAC, the
allowed diameter at
1Gbps (10x faster) would
be ~20m (10x smaller)
– To fix this, the minimum
transmission size must
be increased such that
an acceptable distance
(~200m) is covered.
Recall the diagram at
right:
41
MAC: Half Duplex at 1Gbps
• Solution: MAC adds Extension (non-data) bits
after the end of the frame until transmission is at
least 4096bits
• Benefits of the extension system
– Extends collision diameter
– Maintains compatibility
– Bigger frame size would add complexity (more on that soon…)
MAC Frame
FCS Coverage
minFrameSize
Gigabit slotTime
Duration of Carrier Event
42
MAC: Half Duplex at 1Gbps
• Extension fields create waste in small packets
– This waste cannot be eliminated but can be reduced by frame
bursting
• Frame Bursting
– First frame, if less than 512 Bytes must be extended to 512 Bytes, but may be
followed minimum size inter frame gaps (IFGs) filled with extension bits and
frames of any size
– Maximum size transmitted burst ~64Kbits
– MAC must end burst if no frame is ready to be sent
– Useful for Servers/Switches transmitting high loads on a half duplex network
Preamble SFD MAC Frame with Ext IFG Preamble SFD MAC Frame Preamble SFD MAC Frame
IFG
Extension Extension
Bits Bits
Burst Timer (Max 64
kbits)
43
Buffered Distributors
• Very rare devices - That said, they are far more common than
half-duplex 1 Gbps devices (which are essentially non-existent)
Why?
• Where a repeater is a “bus in a box”, a buffered distributor (BD)
is “CSMA/CD in a box”.
– Each attached device is connected to an input buffer via a full duplex link. If the
input buffer is full, the BD uses PAUSE frames to stop traffic from the attached
device. When a frame is removed from the input buffer, it is transmitted out all
other ports – hence BDs are sometimes called “Full duplex repeaters”
– Thus the collision domain is the maximum delay within the box, infact, the
medium access method used in the box need not be CSMA/CD, so long as it is
fair.
• Benefit: Silicon cheaper than a switch (max speed 1Gbps) and
less complex than frame extension and bursting
• These are not specified in the standard explicitly, but all the
“enabling mechanisms” are - device internals (ie: “bus in a
box”) rarely need to be standardized
44
Ethernet JUMBO Frames
• BAD IDEA
• Recall that HALF DUPLEX 1 Gig Ethernet
chose to use extension bits to pad a frame
out to 512Bytes to enable a larger
collision domain
• Why didn’t they just change the frame size?
(change the min to 512Bytes, and max to,
oh say, 4K or 9K)
• Because such equipment would NOT work
with any previously deployed equipment
– NOT a smart move for the WORLDS
most popular networking technology.
45
Jumbo Frames cont. 1
• The Problem:
• “Software”/Users think that this is a simple
problem – simply change the MTU (max
transmission unit) to 4K (or more) then let
the switches adapt the frame size (fragment
and reassemble – this is HARD HARD HARD
to do in hardware – read as, costly) -- OR
– don’t care about backward compatibility
(interesting world you live in then… can I
join you?)
46
Jumbo frame cont. 2
• The Reality:
• Hardware is built with certain expectations
about frame sizes, interframe-gaps, and
transmission rates – simply “changing” the
MTU of your system and using hardware not
built for Jumbo frames WILL RESULT in
frame/data loss (things called “elasticity
buffers”, which bridge digital clock domains
in systems, must be built with these
“certain expectations”) --- not to mention
buffer allocation problems, etc…
47
Jumbo frames cont. 3
• That said, Jumbo frames _are_ out there but
they are NON-STANDARD
• But be careful (SANs like iSCSI intend to
use them, but there are real risks there)
• And don’t PLAN on them working beyond
the SINGLE vendor’s equipment that you
are using them with (yes, there’s a good
chance that multi-vendor jumbo frames
would work – but that’s what a standard
would protect – and I assure you – IEEE
802.3 will NEVER adopt a larger frame size,
as they are COMMITTED to supporting the
installed base.
48
This page left intentionally blank.
49
This page left intentionally blank.
50
This page left intentionally blank.
51
Physical Media Overview
52
Media Outline
53
Structured Cabling
• Defines a generic telecommunications cabling
system for commercial buildings.
• Specifies the performance of the cable and
connecting hardware used in the cabling system.
• Why?
– The installation of a cabling system is simpler and cheaper during
building construction than after the building is occupied.
– Such a cabling system must have the flexibility to allow the
deployment of current and future network technologies.
– A structured cabling standard provides a design target for the
developers of new network technologies (like 1000BASE-T).
54
Structured Cabling Standards
55
Unshielded twisted pair (UTP)
cable
56
The Category System
57
Performance Parameters for
UTP Cable
• DC resistance
• characteristic impedance and structural
return loss
• attenuation
• near-end crosstalk (NEXT) loss
• propagation delay
58
Parameters for UTP
Connecting Hardware
• DC resistance
• attenuation
• NEXT loss
• return loss
59
Attenuation
• Electrical signals lose power while traveling along
imperfect conductors.
• This loss, or attenuation, is a function of conductor
length and frequency.
• The frequency dependence is attributed to the skin
effect.
• Skin Effect:
– AC currents tends to ride along the skin of a conductor.
– This skin becomes thinner with increasing frequency.
– A thinner skin results in a higher loss.
• Attenuation increases up to 0.4% per degree Celsius
above room temperature (20oC).
60
Attenuation vs. frequency
61
Near-end crosstalk (NEXT)
loss
• Crosstalk:
– Time-varying currents in one wire tend to induce time-
varying currents in nearby wires.
• When the coupling is between a local transmitter and
a local receiver, it is referred to as NEXT.
• NEXT increases the additive noise at the receiver and
degrades the signal-to-noise ratio (SNR).
62
NEXT loss vs. frequency (pair A)
63
Return loss
64
Structured cabling overview I
• Work area
– for example, an office
• Telecommunications closet
– focal point of horizontal cabling
– access to backbone cabling and network equipment
• Equipment Room
– can perform any of the functions of a telecommunications closet
– generally understood to contain network resources (for
example, a file server)
• Entrance Facility
– the point at which the network enters the building, usually
in the basement
65
Structured cabling overview II
• Horizontal Cabling
– from the work area to the telecommunications closet.
– up to 90m of 4-pair unshielded twisted pair (UTP) cable.
• Backbone Cabling
– between telecommunications closets, equipment rooms, and
entrance facilities.
– up to 90m of 4- or 25-pair UTP cable.
• Flexible Patch Cords
– cables use solid conductors making them inflexible and difficult to
work with
– cords use stranded conductors for greater flexibility at the
expense of up to 20% more loss than the same length of cable.
– cords are used at points where the network configuration will
change frequently
66
Structured cabling overview
III
• Transition point
– connects standard horizontal cable to special flat cable designed to run
under carpets.
• Cross-connect
– a patch between two interconnects
– horizontal and backbone cabling runs end at interconnects
– network equipment may use an interconnect
• For UTP cabling systems, horizontal and backbone
runs are always terminated in the
telecommunications closet and equipment room
– for example, you cannot cross-connect a horizontal run to a backbone
run.
67
Structured cabling system
example
68
TIA/EIA-568-A channel definition
70
More UTP Copper Info
• looking down at the top of the RJ-45 (locking tab facing down )
1 2 3 4 5 6 7 8
Solid Solid Solid Solid Solid Solid Solid Solid
White, Orange, White, Blue, White, Green, White, Brown,
Orange White Green White Blue White White
Stripe Stripe Stripe Stripe Stripe Brown Stripe
Stripe Stripe
71
Fiber Types
• Fiber’s form and 3 basic types
72
Fiber Core Sizes
73
Modal Dispersion
• A flashy name to describe how in a multi-mode fiber, the multiple
paths arrive at the end of the fiber at different times.
• This multi-path delay causes a “small” input pulse to smear into a
“wide” output pulse, degrading the rate at which those data
pulses can be sent down the fiber (degrading the bandwidth)
• Note that a multimode step index fiber has high modal dispersion,
as a result, such fiber is not typically used today.
• Graded index fiber varies the refraction index of the fiber material
from the center of the fiber out to its edge. The result is that the
modes traveling the longer sinusoidal paths actually propagate
faster than the modes traveling the shorter, straighter paths –
thus, in a perfect graded index fiber, all modes would arrive at the
same time at the output of the fiber, and no smearing would occur.
(of course perfection is impossible to achieve)
74
Fiber Launches
75
Recent Modal Dispersion
Issues
• As a side-note: During the
development of Gigabit
Ethernet, it was discovered that
a sizeable percentage of
installed (old) multimode
graded index fiber had a
deviation in the center of the
core, as depicted at left. This
was determined to cause
increased modal dispersion
(DMD, differential modal delay
as the gig folk called it)
• The result of this discovery of a
flaw in the installed media was
the subsequent reduction in
supported link lengths for GbE
on installed multimode fiber.
• Which leads to the following
new term…
76
Launch Conditioning
• Many options exist to minimize the DMD effect, one of which
is to use a single-mode “pig-tail” to condition/reduce the
modes launched into the multi-mode segment, by launching
them off-center (and thus avoiding the index “notch” in the
center of the installed fibers)
77
Chromatic Dispersion
• Effects all fiber types, but is one of the primary
limiting factors of long-haul single-mode
connections.
• As mentioned earlier, EM waves propagate at
different speeds in different media. Well, simply
put, EM waves at different frequencies propagate at
different speeds, even in the same media! In
general, this is referred to as group delay, but in
Fiber Optics, its more commonly called chromatic
dispersion.
• This spreading of different colors smears the
transmitted pulses just as with modal dispersion
78
Bad Fiber Connections
• Always keep
your Fiber
CLEAN! And
Capped
when not in
use (both
patch and
ports)
79
Some Types of Connectors
LC
MTRJ
80
Most Common Connectors
• ST
• SC
• MTRJ
and LC
to SC
81
SFF Connectors
• Small Form Factor (SFF)
allows more interfaces
in same footprint
• 4 Competing types, but LC
and MT-RJ are the 2 big
ones
• SFF has a hot-pluggable
version (like GBICs)
82
Fiber Info
• Not only must the fiber be kept clean, but its end
should be polished (to eliminate scratches) and
rounded.
• The rounding of the ends allows the two fiber
ends to touch without an airgap forming between
them.
• DO Keep your fiber clean. If in doubt, use
an airgun or alcohol swab to clean the ends.
• DO Keep your fiber capped when not in use (to
prevent dust and scratches)
• DO Keep fiber ports on devices capped when not
in use – dust collected on a transmitter or receiver
can only – at best, be blown at with an airgun.
83
More Fiber Info
• DON’T EVER allow the fiber to bend more than the
diameter of your closed fist. Fiber is glass,
bent glass breaks and/or creates microfractures.
• DON’T touch the tip of the fiber with your
finger/body
– The core may be protruding and slice you open (it
is glass!)
• DON’T look down a fiber! Its YOUR EYESIGHT
at stake!! Use a power meter to check your
cable!
• Quick way to test fiber / find the right fiber –Hold one
fiber at the far end up to a light, look in the other
end to find the white dot! – If you don’t see it, it’s the
wrong fiber, or its EXTREMELY broken.
84
Core PHYs
85
Core PHYs Outline
86
Review
• MAC
– Sends/Receives data to/from higher layer client
– Handles addressing of data frames for the LAN
– Appends a checksum to ensure frame validity on reception
• Medium
– Available channel in infrastructure for data transmission
– May be copper or fiber
– Medium selected is typically a matter of cost and installed base
• PHY – Physical Layer
– Prepares MAC frame for Medium
– Drive signal across Medium (Tx to Rx)
87
PHY Objectives
88
PHY Stack Model - Slide 1
89
PHY Stack Model – Slide 2
• To support different media, different PHYs are
required.
• Each PHY balances the objectives (cost, speed,
etc) differently
90
PHY Evolution
• 1985 – IEEE 802.3 – 10Base-5 & 10Base-2
• 1987 – IEEE 802.3d – FOIRL
• 1990 – IEEE 802.3i – 10Base-T
• 1993 – IEEE 802.3j – 10Base-F
• 1995 – IEEE 802.3u – 100Base-T4 / TX / FX
• 1997 – IEEE 802.3y – 100Base-T2
• 1998 – IEEE 802.3z – 1000Base-SX / LX /
CX
• 1999 – IEEE 802.3ab – 1000Base-T
91
UTP Copper Evolution
Summary
• 1989 – 10Base-T:
– Cat-3 Cabling dominant (pre Cat-5)
• 1995 – 100Base-T4:
– Support Cat-3, using all 4 pairs, only half duplex capable
• 1995 – 100Base-TX:
– Capitalizing on CDDI (FDDI) work
– Requires Cat-5 (low installed base at time), full duplex capable
• 1997 – 100Base-T2:
– Cat-3 or better, requires only 2 pair, full duplex capable
• 1999 – 1000Base-T:
– Cat-5, requires all 4 pair, full duplex capable
92
10BASE-T Overview
93
10BASE-T: Data Encoding
• 10base-T uses Manchester Encoding
• + to - transition = “0” - to + = “1”
• Always DC balanced & Always has a transition
each bit-time for clock recovery.
94
10BASE-T: Between the
frames
• Idle between frames is filled with Link Test Pulses
(LTPs)
• Periodically signal presence of link partner
95
10BASE-T: Preamble and SFD
• Preamble allows device to recover clock of link
partner
• Since “x” bits will be lost until clock is recovered,
start frame delimiter (SFD) marks byte boundary
• Recall: Preamble: 7bytes 10101010 SFD=
10101011
96
10BASE-T: AUI
• Model at right for all
10Mbps Ethernet
PHYs including:
– 10BASE-2, 10BASE-5,
10BASE-T, 10BASE-FP / -
FB / -FL
• Attachment Unit
Interface (AUI) allows
different PHYs to be
attached to the MAC.
Also used in
repeaters.
• Medium Attachment
Unit (MAU) = PHY
97
100BASE-T4 Overview
• 100Mbs
• Cat-3 or Better, 100 meter max
• Uses all 4 pair
– Transmits on 3 pair, listens for collision on 4th.
– Creates half-duplex only limitation in PHY
• Extinct – Why?
– Introduced at same time as 100Base-TX(Cat-5 or better) when
10Base-T(Cat-3 or better) was prevalent.
– 100Base-TX based on pre-existing CDDI, thus low cost TX chips
emerged rapidly
– Provided customers with option –
» Buy T4 equipment supporting installed base of Cat-3
» OR… Install new Cat-5 cable and buy TX equipment
» Most choose to install new cable to “future proof” network
98
100BASE-TX Overview
99
100BASE-TX: Data Encoding
• 4B/5B Block Code
• 4B/5B means 100Mbps data requires 125Mbps on
the media (a 25% speedup – resulting in 20%
overhead(non-databits transmitted)
• Why is this useful? Creates control codes.
• “I” (Idle) = 11111 Which is sent continuously to
provide a good clock to the link partner’s receiver
• “J” and “K” = 11000 & 10001 – Start of Frame,
unique bit pattern, cannot be made from any
combination of other VALID symbols
• 2^5=32 symbols, only need 16 (2^4) symbols for
data, 1 for idle, 2 for start of frame, 2 for end of
frame, rest are invalid
100
100Base-TX: 4B/5B Tables
101
100BASE-TX: MLT-3 Coding
• 5B Idle code is 11111, at 125Mbs (NRZI) that’s a
125MHz tone, recall cat-5 performance specification
ends at 100MHz!
• Multi-Level Transition with 3 levels a.k.a. MLT-3
Reduces frequency of 5B encoded data.
• If data is “1”, then transition from current level to
next level. ie 1111=+1,0,-1,0 reducing freq by 1/4
• If bit data is “0”, then don’t transition
bit 0 1 1 0 1 0 0 1 1 1 0 0
MLT-3
102
100Base-TX: MLT-3 Eye
Diagram
• 1-Bit Time shown, with all possible transitions
103
100BASE-TX: Scrambling
• MLT-3 successfully reduces frequency of 5B
encoded data, but recall the 5B idle stream is sent
continuously between frames.
• After MLT-3 encoding, Idle went from a 125MHz
tone to a 31.25MHz tone (125/4). Since most of the
transmitted energy is at this frequency, it radiates
strongly, which the FCC does not like, hence…
• Scrambling of the 5B encoded stream is required.
• Prior to MLT-3 encoding, the 5B stream is pseudo-
randomly scrambled, this breaks up the repeating
1s and spreads the transmitted energy across
many frequencies, thus no single frequency has
much energy.
104
100BASE-TX: Media
Independent Interface
105
100BASE-T2
• 100Mbs
• Cat-3 or Better
• Uses only 2 pair
– Uses 5 level signaling
– Transmits and receives on both pairs simultaneously
– Allows use of other pairs.
– Full-duplex capable
• Extinct – Why?
– Limited market potential due to 100Base-TX
106
1000BASE-T Overview
• 1000Mbs
• Cat-5 or Better
• Uses all 4 pair
– Uses 5 level signaling
– Transmits and receives on both pairs simultaneously
– Full-duplex capable
• Emerging –
– Growing number of PHY chip vendors
– 10/100/1000 single chip solutions and multiport chips
107
The 1000BASE-T Solution
• Start with 125 MHz signaling rate
125Mbps
– Exactly like 100BASE-TX
• Transmit/Receive on all 4 pairs of cable x 4
– Similar to 100BASE-T4 500Mbps
• Use multilevel signaling – PAM5
– Similar to 100BASE-TX (3-level MLT-3 signaling) 500Mbps
– Exactly like 100BASE-T2 (5-level PAM5 signaling) x 2
– Allows 2 bits per symbol 1000Mbps
• Allow simultaneous bi-directional signaling on all 4
pair
– Exactly like 100Base-T2
– Allows full-duplex communication at 1000Mbps
108
1000Base-T: 4D-PAM5
• 4 Dimensional Pulse Amplitude Modulation with 5
levels
• 5 levels on 4 pairs yields 625 possible symbols
(5^4)
• To encode 8bits every 8ns (125MHz), only 256
symbols are needed (2^8).
• Remaining symbols can be used for control
characters (idle, start of packet, end of packet, etc).
• Data frames are transmitted using a convolutional
(Trellis) code
– Adds structure to the transmitted symbols needed for Viterbi
Decoding
109
1000Base-T: Error Correction
• Viterbi Decoder
– Takes the last few received symbols
– Calculates the most likely sequence of symbols
– A symbol error results in an impossible or unlikely
sequence
– The most likely sequence probably contains the
correct symbols
– “Most likely sequences” are known in advance due
to the trellis code structure
110
1000Base-T: Noise
Environment
111
1000Base-T: DSP
• Digital signal processing (DSP) techniques can
reduce effects of echo and NEXT
– These are caused by local transmissions
– Tap the signal on all 4 transmitters
– Cancel out these signals from the received waveform
• ELFEXT is not easily cancelled with DSP
– Originates from remote transmitters
– Relies on cable and connectors to meet new Cat5 specification, so
that ELFEXT is minimal
• Alien Crosstalk also unpredictable
– Unknown source
– Thus, cannot be cancelled
• 25-pair bundles not allowed!
– Crosstalk between bundled pairs too high
112
1000Base-T: Master/Slave
Timing
• Synchronization to implement adaptive
filters for canceling echo and NEXT
– Need a common master clock to reference
– Symbols transmitted & received at the same rate
• Link pair must have a Master and a
Slave
– Master uses internal clock to transmit
– Slave recovers the master clock from the data
received
– Slave uses recovered clock to transmit its data
• Need a means of deciding who is who
–
– uses Auto-Negotiation (more on that later)
113
10, 100, 1000 Signal
Comparison
114
UTP Copper Phys Wrap-up
• Combo PHYs
– 10/100 PHYs – 10Base-T & 100Base-TX
– 100/1000 PHYs – 100Base-TX & 1000Base-T
– 10/100/1000 PHYs – emerging
• Quad / Hex / Octal PHYs
– Reduces chip and other component count required to
manufacture multiport devices, thus lower cost.
– Most multi-phy devices are also Combo PHYs
115
Fiber Evolution Summary
• 1987 – Fiber Optic Inter-Repeater Link (FOIRL):
– Link 10Mbps repeaters via multimode fiber (2km max)
• 1993 – 10Base-F:
– 10Mbps to end stations via multimode fiber (2km max)
• 1995 – 100Base-FX:
– Capitalizing on FDDI work
– 100Mbps via multimode fiber (2km max) to repeaters or end
stations
• 1998 – 1000Base-SX / LX:
– Capitalizing on FibreChannel work, but spedup
– 1000Mbps
116
10Mbps Fiber Overview
• All use multimode fiber, driven by LEDs
• FOIRL: Fairly common
– FOIRL = Fiber Optic Inter-Repeater Link
– Typical use: FOIRL Medium Attachment Unit (MAU) attached to
Attachment Unit Interface (AUI) of repeaters w/o built in fiber
support
• 10BASE-F
– Catch all for 10Base-FP, 10Base-FB, 10Base-FL
• Distances:
– 10Base-FP: 500m max
– All others: 2000m max – limited by cable attenuation
• Same speed, but different techniques – identical
type of MAU required on both sides of fiber
117
100BASE-FX Overview
• 100 Mbps
• Multimode Fiber
• Inherently Full Duplex (Upper layer
MAC/Repeater may be half duplex though)
• Full Duplex Distance: 2000 meter runs
– Limited by cable attenuation
• Half Duplex Distance: 412 meters
– Point-to-point link, no repeaters
– Limited by collision domain
118
100BASE-FX: Similar to -TX
119
1000BASE-SX/LX Overview
• 1000 Mbps
• Multimode Fiber for SX or LX
• Singlemode Fiber for LX only
• Inherently Full Duplex (One fiber for Tx, one
for Rx)
– Upper layer MAC may be half duplex, but this is highly
unlikely
• Distance:
– Tricky issue
120
1000Base-SX/LX: Distance and
Launch Cable
121
Recent Modal Dispersion
Issu
es
• As a side-note: During the
development of Gigabit Ethernet, it
was discovered that a sizeable
percentage of installed (old)
multimode graded index fiber had a
deviation in the center of the core, as
depicted at left. This was determined
to cause increased modal dispersion
(DMD, differential modal delay as the
gig folk called it)
• The result of this discovery of a flaw in
the installed media was the
subsequent reduction in supported
link lengths for GbE on installed
multimode fiber.
• Which leads to the following new
term…
122
Launch Conditioning
• Many options exist to minimize the DMD effect, one of which
is to use a single-mode “pig-tail” to condition/reduce the
modes launched into the multi-mode segment, by launching
them off-center (and thus avoiding the index “notch” in the
center of the installed fibers)
123
Wavelength vs. Attenuation
124
1000Base-SX Distances
126
1000Base-SX/LX:
Data Encoding
• 8B/10B Block Code
• Code developed by IBM in the 80s, used by
FibreChannel.
• Like 4B/5B, has 20% overhead, used not only
for control characters (idle, start of packet, end
of packet) but also for data code redundancy.
• Running Disparity:
– Every data code has a + and a – running disparity version
– The form of each data code determines whether the NEXT data
code should be + or –
– If data is received that does not follow the running disparity rules,
then an error has occurred.
127
Non-Standard Interfaces
• 1000Base-LH
– Provides up to 10km distances over single mode
– Interoperates with LX for 5km (same wavelength, just
higher transmit power and lower receive sensitivity)
• 1000base-XD
– Provides up to 50km distances over single mode
– Does not interoperate with any other interface type
• 1000Base-ZX
– Provides up to 70km distances over single mode (100km
on dispersion shifted fiber)
– Does not interoperate with any other interface type
128
GBIC
• GigaBit Interface Converter –
industry agreement (SFF
Committee)
• Allows for hot-pluggable
transceiver
• Support both 1000Base-SX and LX,
so user can choose after
purchasing the box
129
This page left intentionally blank.
130
This page left intentionally blank.
131
This page left intentionally blank.
132
Autonegotiation
133
Autonegotiation vs.
Autosensing
• Autonegotiation • Autosensing/Speed
– standardized speed handshake Detection
– auto-configures to best possible link – several different proprietary
(e.g., 100 full duplex) methods
– still links with older or non-autoneg – only auto-configures to 10 or
devices 100, not duplex settings
– sometimes causes autosensing – creates many interoperability
(NOT autonegotiating) devices to headaches
link at 10 and not 100
– user error/misunderstanding cause
problems
134
ISO-OSI model
135
Autonegotiation - How?
• Constantly sends out 10base-T Link Test Pulses before linking
• The pulses are grouped together in defined “words” that convey
meanings, such as “I can do 100 half and full duplex”
• Older 10base-T devices just think they’re LTPs
• Autoneg devices understand them as words and exchange
handshake info to link at best possible link
• If the autoneg device sees regular LTPs coming in (not autoneg
words), it just links at 10 half duplex
• If the autoneg device sees Fast Ethernet IDLE stream coming in, it
just links at 100 half duplex.
• Unfortunately, this only works on copper for 10/100 - it’s supported
on fiber and copper for gig (i.e., there is no autonegotiation for
100base-FX)
136
NLP (Normal Link Pulse)
16 +/- 8 ms
138
When they are FLP Bursts!
•Stick a bunch of NLPs together and use it to signal info (like
Morse code)
Burst width = 2 ms
16 +/- 8 ms
139
Data Pulses
•In between the “clock pulses” we can put a data pulse to
signify a value of 1, or not put one to signify value of 0
16 +/- 8 ms
140
right….What’s an FLP for
again?
The data fields are pre-defined for specific meanings
The selector field is used for version info, such as “802.3
version 1 The NLPs used are all the same
voltage level (height), I just draw
Selector Field the data ones shorter to distinguish
them from the clock ones.
141
right….What’s an FLP for?
The Technology Ability Field is used to signal which modes
the device can handle, for example 100base-T Full Duplex
142
right….What’s an FLP for
again?
The last fields are used to tell the link partner of a remote
fault, acknowledge the receipt of its partner’s FLPs, or that
there are more “pages” of FLP fields to send
Selector Field Technology Ability Field
Remote
10BT 10BT100BT 100BT 100 Asym.
Pause Fault ACK NP
HDX FDX HDX FDX T4 Pause
143
A Sample FLP
Remote
10BT 10BT100BT 100BT 100 Asym.
Pause Fault ACK NP
HDX FDX HDX FDX T4 Pause
144
How It Works: 2 Aneg
Devices
• Device A sends FLPs out, while Device B does the
same
• Each device receives the other’s FLPs and sets their
ACK bit to true (a value of “1”)
• They then choose the best possible mode that both
support and start transmitting IDLE, and it’s
linked!
Device A Device B
TX RX
100Mbps
Aneg 100Mbps
Aneg
Full Duplex RX
10/100 TX Full Duplex
10/100
HD/FD HD/FD
145
How It Works: an Aneg Device
with a
10Mbps Legacy Device
• Device A sends FLPs out, while Device B sends out
LTPs
• Device A “parallel detects” the LTPs and links with
Device B in 10mbps half-duplex mode
Warning!
Device A Device B
TX RX
10Mbps
Aneg 10Mbps
Half
10/100 RX TX Half or Full
Duplex
HD/FD Duplex
146
How It Works: an Aneg Device
with a
100Mbps Legacy Device
• Device A sends FLPs out, while Device B sends out
Fast Ethernet IDLE
• Device A “parallel detects” the IDLE and links with
Device B in 100mbps half-duplex mode
Warning!
Device A Device B
TX RX
100Mbps
Aneg 100Mbps
Half
10/100 RX TX Half or Full
Duplex
HD/FD Duplex
147
Autoneg Problems
148
Aneg Error #1: User
Misconfiguration
User configures Device B to be 100Mbps Full Duplex, not
knowing this disables Autoneg (sending FLPs)
Device A sends FLPs out, while Device B sends out IDLE
Device A sees IDLE and assumes Device B is 100Mbps Half-
Duplex, thus it links in Half-Duplex mode
Mismatch
Device A Device B
TX RX
100Mbps
Aneg 100Mbps
Half
10/100 RX TX Full
Duplex
HD/FD Duplex
149
Duplex Mismatch Problem
If Device A and B send out a frame at the same time, then:
1)Device A will believe a collision occurred and corrupt its outgoing
frame while discarding Device B’s frame, and then attempt to resend
its own frame
2)Device B will not resend its frame, and see Device A’s frame as
corrupted
Device A Device B
TX RX
100Mbps
Aneg 100Mbps
Half
10/100 RX TX Full
Duplex
HD/FD Duplex
150
Duplex Mismatch Symptoms
Device A will record many “Late Collisions” in its counters
Device B will record many “CRC Errors” in its counters
The connection will appear slow, as the dropped frames aren’t
resent until a higher layer times out (usually 1 second later)
Device A Device B
TX RX
100Mbps
Aneg 100Mbps
Half
10/100 RX TX Full
Duplex
HD/FD Duplex
151
Aneg Error #2: User
Misconfiguration
• User configures Device A for 100Mbps Full-Duplex
and Device B for Half-Duplex (or Device B is only
Half-Duplex) while leaving Aneg (sending FLPs)
turned on for both
• Each device receives the other’s FLPs, but the
mismatch never resolves to a link because there is
no common option
Device A Device B
TX RX
Aneg Aneg
100Mbps RX TX 100Mbps
Full Duplex Half Duplex
152
Aneg Error #3: Autosensing Devices
Autosensing devices do not use FLPs
Instead, they send IDLE while watching the RX line for LTPs or
IDLE and decide based on that
So Device A will send FLPs, which Device B will interpret as LTPs
and switch to transmitting LTPs itself
Meanwhile, Device A may have seen the IDLE from B and “parallel
detected” over to sending 100Mbps IDLE, which Device B will now
see as 10Mbps junk data (yes, this is legal) - this should fix itself
in a few seconds, or never
Device A Device B
TX RX
Aneg
100Mbps 10Mbps
Autosensing
10/100
Half RX HalfHD
TX 10/100
HD/FD
Duplex Duplex
153
Aneg Error #3: Autosensing Devices
OR, Device A will see the LTPs from Device B and parallel detect to
10mbps, thereby successfully forming a 10mbps link, which is not
optimal
Device A Device B
TX RX
Aneg
10Mbps 10Mbps
Autosensing
10/100
Half RX HalfHD
TX 10/100
HD/FD
Duplex Duplex
154
Gigabit Ethernet
Autonegotiation
• Gigabit Ethernet uses the same Aneg mechanism as 10/100 copper
Ethernet (remember there is no Aneg for 10 or 100 fiber)
• For 1000base-SX and 1000base-LX, it’s used to determine duplex
and flow-control parameters using the pre-defined fields shown
below
• For 1000base-T, the Next Page field is used to indicate there is
additional info that must be exchanged using more “pages”
RemoteRemote
Reserved Asym.
FDX HDX Pause Pause Reserved Fault 1 Fault 2ACK NP
155
Gig Aneg Error #1: User
Misconfiguration
• There are almost no Half-Duplex Gig devices, so we never
have the mismatch problem of 10/100 Aneg
• The biggest problem is when users configure one device to
“Autoneg Enabled” and the other device to “Autoneg
Disabled”
• Such misconfigured devices will not link, or only one side
will think it’s linked
Device A Device B
TX RX
Aneg Not Aneg
1000base- RX TX
SX or 1000base-
LX SX or
LX
156
Gigabit Copper
Autonegotiation
• For 1000base-T, the Next Page field is used to indicate there is
additional info that must be exchanged using more “pages”
• These pages contain the same type of info regarding duplex and
speeds, but they also contain a couple new things:
– a field to indicate whether the device is a single-port or multi-port
device
– a field for whether the master/slave determination is manually
configured
– a field for whether this device is the master or slave
– a seed value 10BT 10BT100BT 100BT 100 Asym. Remote
HDX FDX HDX FDX T4Pause Pause Fault ACK NP
Selector Field Technology Ability Field
157
Gigabit Copper Autoneg
(cont.)
• The purpose for the new fields is to resolve
which side is the master and which the
slave for a training sequence to adjust to
the cable characteristics (like a DSL modem
does)
• If the user does nothing, then it will
automatically select a master based on:
– if one side is multi-port, it is the master
– if both sides are or both sides are not multi-port, then the
seed is used to randomly choose
• Or the user can manually configure one to
be the master and the other a slave
158
Gig Copper Aneg Error #1:
User Misconfiguration
• There are almost no Half-Duplex Gig devices, so we never
have the mismatch problem of 10/100 Aneg
• The biggest problem is when users configure one device to
“Autoneg Enabled” and the other device to “Autoneg
Disabled”
• Such misconfigured devices will not link, or only one side
will think it’s linked
Device A Device B
TX RX
Aneg Not Aneg
1000base-T RX TX 1000base-T
Master Slave
159
Gig Copper Aneg Error #2:
User Misconfiguration
• A user configures both devices to be master or both
to be slave
• Such misconfigured devices will never link - even if
they’re also both autonegotiating, or both manual
• The problem is manually configuring master/slave
overrides automatically selecting which side is which
Device A Device B
TX RX
1000Mbps
Manual RX TX 1000Mbps
Master Manual
Master
160
Power over Ethernet - 802.3af
161
Etherpower = No More Wall
Warts !!
Hadriel … what a
mess !
162
Power over Ethernet: the
concept
Ethernet Switches
48VDC Modular
or
Rectifier and
‘mid-span
Battery Tray
insertion’ Data Appliances
163
Power over Ethernet: the details
Mid-span Power
Patch Panel
Dongle
164
UTP Copper
• looking down at the top of the RJ-45 (locking tab facing down )
1 2 3 4 5 6 7 8
Solid Solid Solid Solid Solid Solid Solid Solid
White, Orange, White, Blue, White, Green, White, Brown,
Orange White Green White Blue White White
Stripe Stripe Stripe Stripe Stripe Brown Stripe
Stripe Stripe
165
Power Over What?
Unused pairs (4/5,7/8) Signal pairs
• Safer (1/2, 3/6)
• Cheaper for phone side
• Doesn’t need all 4
• Allows for mid-span
insertion (but this breaks pairs
1000Base-T • Works with 1000Base-
• Allows for cheap “dongles” T
and non-data equipment
• Cheaper for switch
side
166
Power Method Options
• Voltage: 44-57v nominal DC
• Amperage: 350mA Max
• Power: 12.95 Watts max during use
• Use either pair sets: the source can feed
power on one or the other, but the sink
(device to be powered) will draw it from
either
– this makes the receiver (the phone side) more
expensive
– this allows for both a cheap powered patch panel and
cheap powered switch method
– this does not allow both data and unused pairs to send
power simultaneously (not safe)
167
Power Discovery
168
Houston, we have a
problem…
• One company has filed a suit against another for
patent infringement
• Their patent covers a security method whereby
they monitor a small current (<1ma). If the
current drops off then they produce an alarm.
• Power over Ethernet's minimum current is 10ma
and if the PD's current draw falls below 10ma
then the PSE removes power.
• By sending current over active Ethernet links,
their patent may cover power over Ethernet
• If they don’t sign a Fair Use Agreement, the
standard will be stopped/blocked
169
…and hopefully a solution
170
This page left intentionally blank.
171
This page left intentionally blank.
172
10 Gigabit Ethernet - 802.3ae
173
IEEE 802.3ae Objectives
• Support full-duplex operation only.
• Provide Physical Layer specifications which support link
distances of:
– At least 300 m over installed MMF
– At least 65 m over MMF
– At least 2 km over SMF
– At least 10 km over SMF
– At least 40 km over SMF
• Define two families of PHYs
– A LAN PHY, operating at a data rate of 10.000 Gb/s
– A WAN PHY, operating at a data rate compatible with the payload rate
of OC-192c/SDH VC-4-64c
174
Where does 10GE fit ?
Key Applications
Data Center
Access
2
Access
1
Legend
10 GE
1 GE
T1
Server Farm
175
Timeline
We Are Here
177
The PMDs
178
PMD Distance
179
PMD Names
• Wavelength: S=850nm L=1310nm E=1550nm
• PMD Type:
– X=WDM LAN(Wave Division Multiplexing – 4 wavelengths on 1 fiber)
– R=Serial LAN using 64B/66B coding (LAN Application)
– W=Serial WAN – SONET OC-192c compatible speed/framing
• 10GBASE-LX4 So that’s 7
• 10GBASE-SR / -LR / -ER
interface
• 10GBASE-SW/ -LW / -EW
types!
180
Types of 10Gig
Interfaces Type Encoding Wave- Fiber Distance
length Type
10GBASE-LX4 WWDM 8B/10B 1310nm MMF or 300m or
SMF 10km
LAN
182
A bit on Technical Feasibility,
183
Technical Feasibility
184
History: Gig Ethernet GBIC
• GBIC is a hot-
swappable
connector for
Gigabit Ethernet
• All layers above
GBIC are
identical
• GBIC module is
simply O/E
and E/O (laser
and photo
detector)
185
XGMII
• PHY
independent
interface
• XGMII is a 32bit
wide data bus,
split into 4 8bit
lanes, and a 4bit
control bus
• Requires 74
pins!
186
XAUI (Sounds like Zowie)
• Phy Independent
Interface
• XAUI works just like
10GBase-LX4, 8B/10B
encoding 4 XGMII 8bit
lanes to 4 10bit lanes,
serialized to 4 3.125Gbps
• Requires only 16pins!
• Allows for longer chip-to-
chip distance
• Same interface as
Infiniband and Fibre
Channel use
• Hot-swappable PHYs
likely to emerge at this
interface (like GBIC for 1
Gig Ethernet) – called
XGP (Ten Gig Pluggable)
187
188
Why do YOU care about
XAUI?
• XAUI will form the basis for your hot
plugable
• Now a word from….
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
XGP
204
Ok, but does XAUI WORK!?
205
206
207
208
209
210
211
XAUI Conclusions
212
64B/66B Coding
213
LAN PHY
• Simple Ol’ Ethernet
• Transmits/Receives MAC frames at 10.000Gbps
• Allows easy/simple speed scaling/aggregating of
10 1-Gbps links
• Supports Link Aggregation
• Can drive 2m to 40km depending on PMD
• Useful for:
– Campus backbones (connect Gig E switches together)
– Dark Fiber runs
– Computer Rooms
– SANs, etc…
214
WAN PHY
• Transmit/Receive MAC Frames at 9.29419 Gig
Why?
• SONET/SDH OC-192c Data rate compatible
• Take coded (via 64b/66b) MAC frames and place in
the payload section of SONET frame and transmit
onto SONET at OC-192c speeds (9.95328)
• For what purpose?
– Make use of the existing SONET Photonic Network
– Predominate optical infrastructure in North America, Europe, and
China
– Avoid use of costly features of SONET – Optics, Stratum Clock, and
many management features
• Likely that most 10GigE WAN phys will be nearly
indistinguishable from an OC-192c interface using
10GigE compatible payload encoding
215
216
WAN Interface Sublayer (WIS)
• MAC (and higher layers) OPERATES AT 10Gbps!!!
• Takes coded (via 64b/66b) MAC frames and places in
the payload section of SONET frame (and builds the
Path Section and Line Overhead) and transmits onto
SONET at OC-192c speeds (9.95328)
• MAC must be configured to WAN mode, which thus
increases the inter-frame gap (using
“ifsStretchRatio”) to slow the frame rate to 9.29419
BUT the MAC (XGMII/XAUI/ etc) signaling rate is still
10Gigabit
• WIS must delete IDLES (on the transmit path) and
insert IDLES (on the receive path) to rate match
between 10Gig MAC and 9.29419 Data rate
• By being able to “Turn on/off” the WIS, a PHY could
be a “UNIPHY” (both WAN AND LAN!!!) in one ‘box’
217
10G WAN Phy
218
WAN PHY
• Does use/fill-in some of the Path, Section, and
Line overhead – but need not use all of it
219
ELTE
220
A 10GigE Phy Type Joke
221
Use in today’s DWDM
222
Stay Tuned…
223
This page left intentionally blank.
224
Switched Networks
225
Topics
• Quick Background
• Spanning Tree
• VLANs
• 802.1p/QoS
• L3 Switching
• Link Aggregation
• Multiple Spanning Trees
• Rapid Reconfiguration
226
Shared Medium (Repeated
Network)
• All machines “share” the Repeaters
network
• Only one machine can talk at
any one time 5m
• Distance limitations
– At most 205m for Fast Ethernet
• Total throughput limited 100m
• Single collision domain
End Stations
227
Bridging Review
• Connects Separate shared
Networks
• Frame Translation/
Encapsulation (Token Ring
to Ethernet)
• Reduces Unicast Traffic
• Switches: Allow for
multiple conversations
228
Bridging Background
• Bridges work at
layer 2 of the OSI
Model
• Their primary
function is to relay
frames
229
Bridge Tables
• One table lists Entry MAC Addr Port active
1 0800900A2580 1 yes
MAC addresses, 2 002034987AB1 1 yes
which port they’re 3 0500A1987C00 2 yes
on, and if they’re 4 00503222A001 2 yes
active or disabled 5
6
7
8
9
10
11
12
230
Learning of addresses
The Filtering Database learns a
station’s location from the source
address on an incoming frame.
hriasmsoeuwcr iethasdoduer rcses
Port 1
ddress
e0a2rnda2Frames
et2ndab
o 2 ”P33abowith
[Link]
1
4. All future
h4edestination
destined for this MAC address Switch
ilrterciframes
e nigved 002222333344 are
address
will be forwarded ONLY out of
only forwarded on port 1
this Port. Port 4
231
The Learning Bridge
That was a bit fast and complex. Let’s review.
Every bridge has a table called a Filtering Database.
Entries in this table are updated upon receipt of frames,
the source addresses and the ports they arrive on are
learned.
Once a MAC address is associated with a port, frames
containing that destination address are only
forwarded out of that port.
In the real switches these tables vary in size, most have the
capability of holding several thousand MAC addresses.
I’ve seen one that has the capacity for more than 150,000
addresses.
232
Spanning Tree
Part of 802.1D
233
802.1D - Spanning Tree
• Configures arbitrary physical
topology into one loop-free
spanning tree
• provides fault-tolerance by
using redundant links as
backups
• configures deterministically/
reproducibly
• low overhead/bandwidth
• auto-configures
234
Auto-Configuration
• Default values allow out-of-box
configuration without user interaction
• If the user wants, though, they can control
the relative priority of bridges and ports
to be used
• The user can also control the “path cost” of
each port, so a least-cost tree can be built
according to the user’s specifications
235
Spanning Tree
Why a tree?
If you have 2 switches that are connected in parallel, it could
create a loop.
LAN Connection
A B
Incoming broadcast
frame
236
More Reasons
Spanning Tree Disables one of these connections.
How does it do
this?
237
Initial Bridge Parameters:
Bridge Priority Path Cost
B1 1 20
B2 2 15
B3 2 25
-All Ports on each bridge have the same Path Cost in this
example.
-The Max Age, Hello Time, and Forward Delay parameters
are left at their default values of 20.0, 2.0, and 15.0
respectively.
238
Initial Bridged LAN Topology
LAN A
0 B1
LAN B
B2 15 B3
25
LAN C
239
Active Bridged LAN Topology
after Bootup
LAN A
0 B1
LAN B
B2 15 B3
25
LAN C
240
VLANs
802.1Q
241
802.1Q - Standard for VLANs
• Defines a method of
establishing
VLANs
• Establishes the
Tagged Frame
• Provides a way to
maintain priority
information across
LANs
242
What are VLANs - Virtual Local Area Networks?
• Divides switch into two or more
“virtual” switches with
broadcast
separate domains
• Achieved by manual
configuration through the
switches’ management interface
• Only that switch will be
segmented
243
Multiple VLANs in One Switch
244
Why VLANs?
• Lots of broadcast traffic wastes bandwidth
– VLANs create separate broadcast domains
» Microsoft Networking
» Novell Networking
» NetBEUI
» IP RIP
» Multicast (sometimes acts like broadcast)
• VLANs can span multiple switches and therefore
create separate broadcast domains that span
multiple switches
245
Why Are VLANs Needed? Internet
Router
Legacy
Server
Switch Farm
246
Possible Solution: Routers Internet
Router
Switch
Server
Farm
Router Router
247
Possible Solution: Move
Cables or Users Internet
Router
Router
Server
Farm
Legacy Legacy
Switch Switch
248
Easier Solution: VLANs Internet
Router
802.1Q
Switch Server
Farm
802.1Q 802.1Q
Switch Switch
249
More Reasons...
• Link Multiplexing
– slower speed technologies share the high-bandwidth uplink
– multiple IP subnets on one physical link with layer 3 switching
• Security
– Without it, broadcasts are seen by everyone
– Virtual private tunnel – “VPN Like” service
• Moving end-stations to different ports
• Switching is faster and cheaper than routing, but
you still need full routing for some applications
and to connect VLANs (IP Subnets) together
250
Standards Based VLANs
251
Basic VLAN Concepts
• Port-based VLANs
– Each port on a switch is in one and only one VLAN (except trunk links)
• Tagged Frames
– VLAN ID and Priority info is inserted (4 bytes)
• Trunk Links
– Allow for multiple VLANs to cross one link
• Access Links
– The edge of the network, where legacy devices attach
• Hybrid Links
– Combo of Trunk and Access Links
• VID
– VLAN Indentifier
252
Tagged Frames
• 4 Bytes inserted
after Destination
and Source Address
• Tagged Protocol
Identifier (TPID) = 2
Bytes (x8100)
– length/type field
• Tagged Control
Information (TCI) = 2
Bytes
– contains VID
253
Trunk Link
254
Trunk Links Internet
Router
802.1Q
Switch Server
Farm
802.1Q 802.1Q
Switch Trunk Links Switch
255
Access Links
256
Access Links Internet
Router
802.1Q
Switch Server
Farm
802.1Q 802.1Q
Switch Access Links Switch
257
Hybrid Links
258
Hybrid Links Internet
Router
802.1Q
Switch Server
Farm
259
So Far So Good...
• So one might ask: “what does the Bridge
table look like?”
• Two answers:
– multiple (distinct) tables: one for each VLAN
– one mother of all tables, with a VLAN column
• They sound similar, but it turns out they are
VERY different
260
Each Table is
Multiple Tables for One VLAN
262
Asymmetric VLANs (also known as “interoperability killer”)
Legacy
Router
• Legacy router can
talk to legacy
clients with one
physical link
• Legacy clients
cannot talk to
each other
263
Asymmetric VLANs Internet
Legacy
4
Router
Legacy
802.1Q 5
Switch
Server
4
3 Farm
6
802.1Q Access Link
Switch
2 7
A Simplified Example:
George
264
Independent Learning I
• Legacy router
learns MAC
addresses from
both VLANs
• Requires 2
physical links
265
Independent VLANs Legacy Internet
Router
4
Legacy
802.1Q 5
Switch
Server
4
3 Farm
6
802.1Q Access Link
Switch
2 7
A Simplified Example:
George
266
Independent Learning II
267
Independent VLANs 802.1Q
Router
Internet
4
Legacy
802.1Q 5
Switch
Server
4
3 Farm
6
802.1Q Trunk Link
Switch
2 7
A Simplified Example:
George
268
Problems
• Can’t combine SFD and MFD switches in
one network
• Some switches only do one or the other,
and can’t be changed
• Hybrids of SFD and MFD makes this
tricky
269
Other Additions
270
GARP (yeah, I know, “the world according to”… that’s a
new one!)
271
Windows screenshot —>
GVRP Vendors
(current): Extreme,
3Com and HP
Several others are
developing working
implementations also.
• Industry
Implementation
Example
– 3Com manufactures
Network Interface
Cards that take
advantage of GVRP
– Accessed via the
Control Panel 272
How Do VLANs “Secure”? Internet
Router
802.1Q
Switch Server
Farm
802.1Q 802.1Q
Switch Broadcasts Switch
not seen by
Green Users
Legacy 802.1Q 802.1Q
802.1Q Legacy
Switch 802.1Q Switch
Switch Switch
Switch Switch
273
VLAN Security
274
This page left intentionally blank.
275
This page left intentionally blank.
276
Priorities/QoS
802.1p
277
The Problem
278
Stack View
279
Inside the Switch
This is a very simplistic view
281
Better Solution
282
Remember VLAN Tagged
Frames?
• 4 Bytes inserted
after
Destination and
Source Address
• Tagged Protocol
Identifier (TPID) =
2 Bytes (x8100)
– length/type field
• Tagged Control
Information (TCI) =
2 Bytes
– contains VID
283
Priority Tagged Frames
284
Limitations of 802.1p
285
Example
286
This page left intentionally blank.
287
This page left intentionally blank.
288
Layer-3 Switching
289
Routing vs. Switching
290
Routing in Hardware
291
Stack View
Remember this?
292
Layer-3 Switch Stack View
293
Layer-3 Switch
Implementations
• Actual implementations vary depending on
architecture
• Most have fast-forwarding engines per blade which
can forward based on IP Address
• Usually, there is one or more central CPUs to handle
exception cases, routing updates and management
• Some L3 Switches now offer advanced features:
– filtering based on address, ports, etc.
– IP multicast in the fast path
– IPX, Appletalk, etc.
– Advanced routing protocols such as BGP
294
This page left intentionally blank.
295
This page left intentionally blank.
296
MPLS over Ethernet
297
The Problem
298
The Solution
• MPLS defines a method of switching packets by
exchanging labels at each hop
• A router in the MPLS context is called a LSR (Label
Switching Router)
• A Traffic engineered MPLS tunnel is a series of label
switched hops (LSH) collectively called an LSP
(Label Switched Path)
• The Tunnel is constructed by the Ingress LSR
• LSPs created with Label Distribution Protocol (LDP)
are bounded by a pair of participating LSRs, the data
plane topology is identical to IP hop by hop
299
Shortcut Routing
Router K's Next
Hop For Routes in
ISP 1 Subscribe
EGRESS
LSR I r ISP 1
Backbone
BGP
Shortest Path to Provider Router E's Next Hop
ISPs 1, 2, 3, and K For Routes in ISP 2
4
B 1 Subscriber
nel
ut Tun
INGRESS
rt C
Sho
E BGP J ISP 2
LSR C
Incoming Transit
Traffic A
Short Cu
t Tunnel 2 Router G's Next Hop
For Routes in ISP 3
G
Forwarding Table D
F EGRESS BGP Subscriber
Destination Next Hop BG LSR H
Prefixes in ISP 1 LSP 1 P ISP 3
Prefixes in ISP Router B
2 Prefixes in LSP 2
ISP 3 Prefixes LSP 2 Subscriber M
Router G's Next Hop
in ISP 4
ISP 4 For Routes in ISP 4
300
MPLS Encoding
• MPLS uses a 32 bit “Shim Header”
– The Header is pushed onto the IP packet by the Ingress LSR
– The Header is popped off by the Egress LSR or Penultimate LSR
– Labels can be STACKED!
32 Bits
301
MPLS over Ethernet
302
Ethernet over MPLS
• Because MPLS defines an end-to-end
labeled tunnel, it’s very useful for creating
VPNs
• The ISP can create VPNs between
Enterprise/corporate regional offices
– For internal VoIP calls
– For internal data exchange
– For intranet apps
• The Carrier can create VPNs between ISP
POPs
– To simulate a leased long-haul backbone
– So the ISP can offer Enterprise VPNs
– As a backup backbone or to offload bandwidth
303
Virtual Leased Line
DA” SA” 0x8847 L2 ’ DA SA Etype Payload
L1
LER LER
CPE CPE
LSR LSR
Provider’s MPLS Backbone
304
Transparent LAN Service
Customer A
CPE
Customer B
Chicago POP
LSR
LER LER
San Francisco POP New York POP
Last Mile Last Mile
LER LER
CPE CPE
LSR LSR
CPE CPE
LER LER
306
This page left intentionally blank.
307
This page left intentionally blank.
308
Link Aggregation
802.3ad
309
Basic Concept
• Originally only one of
many separate physical
links could be used (else
loops)
• Spanning tree used to
disable all but one
• Did provide redundancy,
but not efficient use of
links
310
Enter Trunking
• Multiple physical links
joined into one logical link
• Only one MAC address
used for trunk group
• All links in same group
must be same speed and
full-duplex
311
Proprietary Methods
312
802.3ad - Link Aggregation
• Study group started in
November 1997 – Standard
completed last year (2000)
• website:
[Link]
ups/802/3/ad/
313
Benefits and Goals
• Increased Bandwidth (duh)
• Incremental bandwidth increases - smaller jumps than 10 to 100
to 1000 - not perfect scaling, but better than one link’s worth
• Failure protection/redundancy (maybe)
• Automatic configuration - if it can aggregate, it will
• Rapid reconfiguration
• Deterministic behavior - configuration will not be dependent on
the order in which events occurred
• NOTE: some of these goals are inconsistent (see later)
314
Layers Involved
315
Layers Involved (cont.)
• MAC Client is Bridge layer,
LLC, etc.
• MAC Control (optional) is
for Flow control (802.3x)
• MAC is CSMA/CD, CRC32,
Frame Encapsulation
This means Inter-Packet Gap and encapsulation is enforced by the individual link MACs
(called physical MACs), but FCS and source MAC address can be sent from the single
MAC above (e.g., if relayed)
No CSMA/CD since full-duplex required for link aggregation
Notice the split happens at the traditional software layer, so changes need only be done
in software (usually)
Flow control operates independent of the aggregation - it could pause link
aggregation control frames (this is good, me thinks)
316
Spanning Tree
• Assume redundant link on
right is a trunk group
• Spanning Tree sees one
logical link (because it
operates above the LA
sublayer)
317
Quick Overview
318
How it works (or doesn’t)
Frame 1
Frame 2 Frame 4 Frame 6 Frame 8
Each frame is NOT broken up/spread across the wires (we’re still stuck with a
normal MAC, after all)
Instead, they are each sent whole - on one wire.
Also, we CANNOT send one frame on one wire, another on a second wire, etc.
(the way shown above) Why not?
Because they could arrive out of order - that would be BAD.
319
How it really works
Frame 1 Frame 5
Frame 2 Frame 3 Frame 4
Frame 1
Frame 1
Frame 2 Frame 3 Frame 4
• So to protect against out of order delivery, the standard requires that the transmitting side
(Switch 2) keep “conversations” together on one wire.
• A conversation is a one-way sequence of frames with same dest/src address for the same
application. (i.e., anything that would be hurt by mis-ordering)
• In the case above, Switch 2 could keep all frames from station green to any other station
on the same wire. (src. addr. based) So even if green sends frames to someone else on
the left, it would still go through the same wire.
Likewise, any frames from other stations could be kept together on a separate wire. For
example if blue and yellow sent frames, each of their streams could be on a separate wire.
This is called a distribution algorithm, and is totally up to Switch 2. The standard only
requires that a conversation not be misordered.
320
Reference Topologies
• Notice that repeaters
cannot participate
• Each trunk link is one MAC
layer to one MAC layer.
• All the trunks are MAC-to-
MAC, but network A is
many MAC addresses to
many others, whereas B
and C are many to one, and
D and E are one to one.
Thus the distribution
algorithms will be different
for them: based on src.
addr., src & dest, src & dest
& type, etc.
321
LACPDU
• Link Aggregation Control (LAC) entities
send each other configuration info using
LACPDUs (LAC Protocol Data Units)
• The destination address of the LACPDU is
a well-known multicast address
• If the link partner doesn’t support LA, then
it discards them (doesn’t forward)
• Repeaters forward them like normal
frames (remember, repeaters are layer 1 - they don’t
even know what a “frame” is!) - BUT if there’s a
repeater, then the link should be half-duplex
and thus non-aggregatable (is that a word?)
326
Destination Address
323
Some Background on
Selection
• Wanted auto-config to
avoid user mistakes, as
well as be flexible
• For instance, when the
user crosses wires from
2 systems (shown left)
• By using a System ID
Key, this can be
detected and fixed
324
More Goals
325
One more set of Keys
• For manual config, and
alernative selection
algorithms, the
Aggregators need a
Key
• This way the Agg
Group MAC address
can be assigned to the
ports
• Port selects Agg with
same Key as itself
• This Key is internal -
not transmitted - only
port key and sys ID
transmitted
326
Selection Logic
• IEEE wanted differing implementations of selection, as long as the
following are kept:
– Each port assigned a Key - ports that can agg together have the same key - else
unique Key
– Each Aggregator assigned a Key and MAC address
– Ports only select Aggs with same Key
– Ports in the same LAG use the same Aggregator
– Individual ports each have their own Agg
– If a port can’t select an Agg due to rules above, then it can’t be attached to one
– MAC Client won’t see the port until it’s selected and attached
• That’s it. This leaves a lot open.
327
Default Selection Operation
• When multiple ports
in an aggregation,
select the lowest
number Aggregator
of the ports in the
aggregation
• That port may be just
selected but not
attached
328
Problems
329
This page left intentionally blank.
330
This page left intentionally blank.
331
This page left intentionally blank.
332
Multiple Spanning Trees
802.1s
333
Multiple Spanning Trees
334
Rapid Reconfiguration
802.1w
335
Rapid Reconfiguration
336
Port Based Network Access
802.1X
337
LAN Security
• Anyone can connect their notebook/PC to a
switch and use/snoop the network
• 802.1x provides a method whereby the
switch only allows the user to contact a
RADIUS server with EAP packets to be
authenticated
• Once authenticated the user will be able to
access the net
• This is NOT fool-proof, but good enough for
hotels, libraries, etc. (not good enough for
the government)
338
Resilient Packet Rings
802.17
339
Resilient Packet Rings
• In a SONET ring, half of
the fiber is never used
for real traffic – this is a
waste of possible
bandwidth – better to
use both rings
• Currently, several
vendors (Cisco, Nortel,
Lucent) have
proprietary methods
• There’s still debate
about the need for the
standard, but they’re
trying to move forward
– working group
802.17
340
RPR Applications
341
RPR’s Goal
• Reuse as much of Ethernet as possible
– Frame format (mostly)
– Physical speeds (the higher ones)
– Marketing name power (definitely)
• Connect in a ring topology like SONET &
FDDI
• Have a fast SONET-like protection fail-
over
• Apply QoS from upper layers to Layer 2
– Similar to 802.1p but for media access as well
– Possible changes/additions to IP to make use of it (yes,
that sounds incredibly stupid, but there are people
pushing for this)
• Create more work for people
342
This page left intentionally blank.
343
This page left intentionally blank.
344
Ethernet in the First Mile
802.3ah
345
EFM
346
EFM Work
• Fiber Loop
– Basic Gig Ethernet from CO to switch in the field out to
CPE, like a bigger LAN – only needs some missing
pieces
» Adds 1Gig over single-fiber (bi-directional) for
10km
– PONs (Passive Optical Networks) using passive
splitters
• Copper Loop
– At 10mbps for <750m
– Replace DSL MAC layer with Ethernet but use same
PHY (thus, copper loop)
• Specify OAM
– Remote failure, loopback, and link monitoring
347
First Mile Copper
348
PONs
349
This page left intentionally blank.
350
This page left intentionally blank.
351
This page left intentionally blank.
352
Wrap-Up
353
The Future
354
Acknowledgements
355
Websites
• IEEE website: [Link]
• IETF website: [Link]
• 10 Gigabit Ethernet Alliance: [Link]
• My email address: malleswar@[Link]
356
Acronym Expansion
• ARP – Address Resolution Protocol – the protocol used to discover a
device’s Ethernet address based on its IP address
• ATM - Asynchronous Transfer Mode – the Broadband-ISDN protocol based
on circuits and cell switched communications.
• AUI – Attachment Unit Interface
• BPDU – Bridge Protocol Data Unit
• CIR – Committed Information Rate – the bandwidth level/rate for Frame
Relay circuits which the user has agreement to transmit for
• CoS – Class of Service
• CRC – Cyclical Redundancy Check – a 4-byte value produced from a
polynomial function, used to validate the integrity of the contents of the
frame (like a parity check, or checksum)
• CSMA/CD – Carrier Sense, Multiple Access with Collision Detection – the
name and description of the half-duplex protocol used to access the
network in an Ethernet network
• CWDM – Coarse Wavelength Division Multiplexing – a method of sending
multiple independent signals on the same fiber by sending them at different
wavelengths (colors) – the Coarse version of WDM usually implies very few
wavelengths are used (6 or less) and they may not be in the same band
• DIX – Digital, Intel, and Xerox – the three companies that produced
interoperable Ethernet products before moving into an 802.3 working group
to become a standard
• DMD – Differential Modal Delay
• DOCSIS – Data over Cable System Interface Specification – the industry
standard for data communications over cable TV systems
357
Acronym Expansion
• DSCP – DiffServ Code Point – the binary value field in the IP packet to mark
a packet with a particular per-hop behavior (PHB) for QoS parameters
• DTE – Data Terminal Equipment
• DWDM – Dense Wavelength Division Multiplexing – a method of sending
multiple independent signals on the same fiber by sending them at different
wavelengths (colors) – the Dense version of WDM usually implies all of the
more than 8 wavelengths are within a fixed narrow band
• FEXT – Far End Crosstalk
• FR - Frame Relay - A packet-switched method of data communication
(similar to, but more efficient than, the original X.25 WAN protocol) provided
by telecommunications carriers and Internet service providers (ISPs). Frame
Relay can provide guaranteed bandwidth at no additional charge if the lines
are “open” during periods of low traffic. Frame Relay can run at speeds of
36 Kbps to 2 Mbps. It is now enjoying high popularity as a reasonably priced
alternative to leased line service.
• FRF – Frame Relay Forum – the industry forum that coordinates and
advances standards for Frame Relay.
• FCS – Frame Check Sequence
• GARP – Generic Attribute Registration Protocol
• GMRP – Generic Multicast Registration Protocol
• GMII – Gigabit Media Independent Interface
• GVRP – Generic VLAN Registration Protocol
358
Acronym Expansion
• IEEE – Institute for Electrical and Electronics Engineers – the standards body that
defines electrical standards, including the 802 LAN/WAN committee which defines
such protocols as Ethernet and Token Ring.
• IETF – Internet Engineering Task Force – the standards body that defines protocols for
the Internet.
• IP - Internet Protocol - The communication protocol of the Internet. (version 4 is the
current version)
• IPv6 – IP version 6 – the heir apparent for the current IP version 4 – version 6 adds a
much larger address space, security, auto-configuration, and anycasting
• IPG – Inter-Packet Gap – the idle period between packets, defined as minimally 96 bit
times
• ISDN - Integrated Services Digital Network - Allows for end-to-end digital transfer of
voice and data. ISDN, used primarily by small offices, home offices, and individual
households, combines digital switching and digital transmission and offers higher
bandwidths than current analog modems.
• ISP - Internet Service Provider - A service company that provides customers access to
the Internet.
• ITU – International Telecommunications Union – (formerly the CCITT) the standards
body that defines many international standards, including the phone system.
• LLC – Logical Link Control
• LTP – Link Test Pulse
• MAC – Media Access Control – The layer of the 802.3 stack above the Physical below
the Logical Link layer – the MAC is responsible for encapsulating/decapsulating the
packet with Ethernet fields, accessing the medium (either in half-duplex or full-duplex
mode), and padding or removing the pad on receipt.
359
Acronym Expansion
360
Acronym Expansion
361
Acronym Expansion
362
Acronym Expansion
363
MLT-3 encoding, used in 100BASE-TX to reduce signal frequency, is unnecessary in 100BASE-FX because fiber has high bandwidth capabilities, thus negating concerns about excessive frequencies. Additionally, unlike copper, fiber does not radiate electromagnetic interference, removing the need for scrambling and making the simpler coding of 4B/5B sufficient .
Port-based VLANs assign individual switch ports to specific VLANs, isolating traffic based on logical groupings rather than physical topology. This setup enables simplified management and enhanced security by limiting broadcast domains. However, it might require more complex configurations when changing network layouts and can increase management overhead as the number of VLANs grows .
In 1000Base-T, synchronization through a master/slave timing mechanism is critical for implementing adaptive filters that cancel echo and NEXT. The master uses an internal clock for transmission while the slave recovers this clock to transmit its data, ensuring that symbols are sent and received at identical rates. This alignment is essential to facilitate effective data transmission and error cancellation .
10BASE-T utilizes Manchester Encoding to maintain DC balance and enables clock recovery by having a transition at each bit-time. Between frames, Link Test Pulses (LTPs) signal the presence of a link partner. The preamble allows devices to recover the clock of a link partner, and a start frame delimiter (SFD) establishes the byte boundary after the loss of initial bits .
Spanning Tree Protocol (STP) and VLANs significantly contribute to network resilience. STP avoids broadcast storms by managing redundant paths and activating backup links when an active one fails. VLANs reduce broadcast domains, conserve bandwidth, and provide logical network segmentation, creating isolated environments for addressing organizational needs. Together, STP and VLANs allow efficient bandwidth usage, fault tolerance, and scalable network designs .
100BASE-T4 became obsolete because it relied on the Cat-3 cabling for its operation, whereas 100BASE-TX utilized the newly emerging Cat-5 cables, allowing full-duplex operation and leveraging pre-existing CDDI work. Rapid evolution in TX chip development lowered costs, prompting customers to prefer upgrading to Cat-5 cables to future-proof their networks, thereby reducing T4’s appeal .
The Physical Layer (PHY) balances objectives by considering factors such as cost, data speed, and distance, which involve trade-offs depending on the media type utilized. For instance, supporting both UTP copper and fiber optics involves selecting between low costs of copper and the long distances achievable with fiber. Each type of PHY, such as 10Base-T, 100Base-TX, and 1000Base-T, balances these factors differently to cater to different applications and installed media bases. For example, 100Base-TX capitalized on existing CDDI work, providing full duplex capability but requiring Cat-5 wiring, which had a low installed base at the time . The need for low bit error rates and high data rates further influences the choice of media, with fiber typically supporting higher speeds and longer distances, though at increased cost compared to copper . These trade-offs ensure compatibility with existing infrastructure while aiming for optimal performance and cost-effectiveness .
VLAN tagging according to the 802.1Q standard functions by inserting a 4-byte header tag into Ethernet frames after the source and destination MAC addresses. This tag includes a 2-byte Tagged Protocol Identifier (TPID) set to 8100 and a 2-byte Tag Control Information (TCI). The TCI contains the VLAN ID (VID), which specifies the VLAN to which the frame belongs, and priority bits for Quality of Service (QoS). The purpose of VLAN tagging is to enable frames to be associated with a particular VLAN, allowing multiple VLANs to share the same physical network links while maintaining separate broadcast domains, thus reducing traffic and enhancing network management and security .
10BASE-T utilizes Manchester encoding, which encodes data using a + to - transition for "0" and a - to + transition for "1" This method ensures DC balance and guarantees a transition each bit-time for clock recovery . Meanwhile, 100BASE-TX employs a more complex 4B/5B block coding system to transform 4-bit data into 5-bit symbols, creating unique codes like "I" for idle and "J" and "K" for frame start . The 125 Mbps data stream from the 4B/5B coding is then modulated using Multi-Level Transition-3 (MLT-3) coding to reduce frequency and enable transmission over Category 5 cables, which reduces the frequency of line signal to avoid attenuation issues beyond 125 MHz . Moreover, scrambling is applied before MLT-3 modulation to spread the signal energy across various frequencies and minimize radiation, a requirement due to FCC regulations . Overall, 100BASE-TX offers higher data rates and uses more advanced encoding and modulation techniques compared to 10BASE-T, which sticks to a simpler encoding mechanism suitable for its lower data rate.
The key differences between Multiple Forwarding Databases (MFD) and Single Forwarding Databases (SFD) in VLAN tables lie in their approach to managing MAC addresses for each VLAN. MFD supports independent learning with a separate bridge table for each VLAN, allowing duplicate MAC addresses across VLANs because each VLAN learns its MAC addresses independently . Conversely, SFD uses a shared bridge table for every VLAN, which supports shared learning but does not allow duplicate MAC addresses. This is because all VLANs use the same bridge table, making asymmetric VLAN configurations possible but potentially problematic for interoperability .