Skip to main content

UMass Professor Explains the Internet in 5 Levels of Difficulty

The internet is the most technically complex system humanity has ever built. Jim Kurose, Professor at UMass Amherst, has been challenged to explain the internet to 5 different people; a child, a teen, a college student, a grad student, and an expert.

Released on 11/23/2022

Transcript

Hi, I'm Jim Kurose,

I'm a professor

at the University of Massachusetts at Amherst,

and I've been challenged to describe the internet

in five levels of increasing difficulty.

The internet is the most technically complex system

that humanity has ever built.

The internet is a network of networks.

It's a platform on which all of the internet applications

that you've heard of can be built.

[bright music]

Hi, it's really, really nice to meet you.

What's your name?

My name is Skylar.

Skylar, we're here to talk about the internet,

and I bet you must use the internet a lot, right?

Yeah.

What's your conception about what the internet is?

The internet?

For me, it's just something to use when I need

like to search up something or watch videos.

The internet is, physically, these computers

that all talk to each other.

Billions of computers, in the case of the internet.

The internet allows us to do

a lot of really, really interesting,

what we call applications.

You ever think about how that video gets to you

over the internet?

Yeah, I have no idea.

Got a favorite movie?

Matilda.

Matilda. All right.

We're gonna actually build an internet.

I've got a couple of things here that I wanna show you,

or a couple of toys, actually.

Okay, let's pretend that these round balls are computers.

And the internet is something that connects them.

And right now, the internet is just one communication link.

And Matilda is sent over the internet from this computer

to your computer.

So the internet is a network for carrying information

from one computer to another.

Now this network here looks pretty simple, doesn't it?

Right? It's just one thing.

Should we add some more friends in?

Yeah.

Let's say we want to get a video from here, over to here.

How do you think that video would sort of travel

through this network?

Maybe it could go to here, to here, to here, to here.

That's right.

So that's pretty cool.

There are actually lots of different ways to actually go

through the internet to get from what we call a source,

the place that's sending the information,

to the receiver, the place that's actually gathering

the information together.

And that's something we actually call routing.

Huh, but wouldn't it just be easier

for it to go from here to here,

instead of going from here to here,

to here to here?

Yeah. So that's a really good observation.

In most pieces of the internet,

that's exactly what would happen.

We want to take what's called a shortest path.

But still, there are multiple paths.

And why do you think that might be valuable?

Maybe one way is messed up or broken.

So you go the other way.

Exactly.

So, Skylar, that was a great discussion

about what we just built.

And I wanted to talk to you about,

or ask you about maybe one other really important

part about networks.

And it's not so much the thing itself,

the physical thing,

but more about the rules about communication.

That's governed by something that are called protocols.

Are you up for one?

Yeah. Knock, knock.

Who's there?

Lettuce. Lettuce who?

Lettuce go on.

[Skylar and Jim laughing]

A knock, knock joke is an example of a protocol, right?

The computer that you are using say, makes a request,

you ask for something, you get something in return.

In the internet, there are protocols everywhere.

So that two computers that have never talked

to each other before know the rules

for talking to each other.

This global internet with billions of people using it

are just lots of smaller networks

that are all hooked together to each other.

But also, what the internet allows

are all of these what we call applications, Zoom,

video playing services,

can all run on top of the same internet.

Yeah, so there's one internet for all of 'em.

Exactly.

There's one internet and lots and lots and lots of things

that you can do on top of it.

[bright music]

So you're a student in high school, is that right?

Yes, I'm a sophomore.

Well, we're gonna be talking about computers here today,

and we're gonna be talking about the internet.

I always like to think of the internet by analogy

to say road systems for example,

where you have roads in your neighborhood.

You have state roads,

you have the Interstate Highway System.

And so the internet is a lot like that.

It's an interconnection of local roads,

local networks like the network in your house for example.

How does like all of the networks in my house connect

to all the city networks?

Wow. Great question.

Often, it's a little blue wire called an ethernet cable.

So that cable is able to bring bits of information

up into your apartment at say, a billion bits per second.

That's pretty fast, right?

Literally a wire that goes between a box in your apartment,

sometimes called a router or a modem in your apartment

that comes from an internet service provider

come into this first network and then that network connects

to another network connects to another network

connects to another network.

You could FaceTime somebody who's like in Australia.

You can talk to them at the same time,

and like you're reaching the same signals.

So how is it that it gets there so fast?

We could talk about that by analogy to a road system.

It's not just one big, super highway.

It's a lot of smaller super highways

that are all interconnected.

And those interchanges are what are called routers.

That's where the links come together.

You're talking about talking to a friend in Australia.

So oh, it's coming in from the East Coast

of the United States to this router,

and it's going out say, that routers in San Francisco,

it's going out on an underseas cable over to Australia

rather than in this direction up to Japan.

So there is an underseas cable?

The underseas cables are so cool!

They're these big cables that are laid down by switches.

They cross both the Atlantic, the Pacific, the Indian Ocean.

So the undersea cables are how the networks in Europe,

United States, Asia are all connected together.

How do you connect wirelessly?

That's really what we call the first hop.

It's like from your phone, from your tablet,

from the computer that you're on,

there's no cables coming in.

You go over a wireless connection.

Wi-Fi is the protocol that allows your computer to talk

to that first hop router over a wireless communication link.

And I was wondering how there's so many different movies

or TV shows that you can download and they're all there.

And if you just play it, it just knows what to play.

Like they're all in one spot.

Ah, you said they're all in one spot.

In fact, they're in lots of spots in Netflix.

And so most applications would like to connect you

with a server that's close to you.

Server is really just a big computer with a lot of memory,

a lot of discs that store all the Netflix movies,

and also so that you don't have to cross over

too many internet links to get from where the server is

to the TV or the device in your home.

So when I'm watching Vampire Diaries in my house,

how does it know exactly what to do

without getting scrambled up?

Ah, another great question.

There's a couple of things that could happen

inside the internet.

Information is sent in these little packets of information

from the Netflix server to your display device.

And literally, each packet that arrives says,

This is the first packet for Jenna.

This is the second.

This is the third.

This is the fifth.

This is the fourth.

And they're reordered for you.

Matter of fact, your computer will say,

using the TCP protocol to the server,

Hey, I didn't get packet four, can you resend it again?

And again, the Netflix server is very happy

to send you packet four again.

The other is the internet protocol.

If you think about sending letters

through the US Postal Service,

how you've got an address on it.

So every packet that flows from the Netflix server to you

has an address on it.

It says, This is going to Jenna.

It's going to the what's called

the Internet Protocol address of your device.

Think of all the range of devices

that are hooked up to the internet.

It's totally amazing, right?

Every single one of them has one thing in common,

and that is they speak the IP protocol,

the Internet Protocol.

That was a great question.

[upbeat music]

So tell me a little bit about yourself?

I am a senior at New York University.

I study computer science.

Have you taken any courses on the internet

or studied it at all?

I've taken Applied Internet Technology.

So we've talked about backend/frontend frameworks

and libraries, things like that.

Okay, so really at the application level?

At the application level, for sure.

I wanted to ask you a little bit about what you know

about the history of the internet.

Have you heard of ARPANET, for example?

I have not heard of ARPANET.

Okay, back into the 1960s, there was a research agency

in the United States called DARPA,

the Defense Advanced Research Projects Agency.

Actually, it was called ARPA at the time.

They wanted to build this notion

of a packet-switching network.

Not a circuit switch network like a phone network

where you get a dedicated path

and a dedicated set of bandwidth and links

from source to destination.

So what would packet switching enable?

I'm sure there's something big here, for sure.

There's a lot big, right?

And so remember, this was a Department of Defense,

was they wanted to have forms of of communication

that were very robust, that were survivable.

Packets could all find their own ways,

be routed differently through the network.

So if parts of the network failed,

you could route around failures.

Sounds like the reason

for like a request response type of structure.

So you can sort of see how the network architecture

that wasn't designed to be 100% reliable

inside the core of the network,

and had that complexity built into the edges of the network.

And to me, the really cool thing is you

put this infrastructure in place,

and then all these super creative people

think about amazing things to build on top of it.

And you see this proliferation of amazing applications.

Abstraction, I think it's the reason why everything.

Ah ha! Spoken like a real computer scientist, right?

You're a computer scientist. I'm a computer scientist.

We talk about APIs, application programming interfaces.

The API for the internet is something

called a socket.

And a socket simply says,

I can communicate if I know your internet address,

you know, 128.119.40.186,

that number is the IP address of my server,

the University of Massachusetts.

If you know that, you can write a program

anywhere in the world and send a message,

and it'll pop out at my end.

I will be remembering that.

[Jim laughs]

I've heard that there are like seven keys to the internet,

something like that.

Okay, well I don't know about the number seven,

but there's something in the internet

that's sort of similar to that.

It's called the Domain Name System.

The DNS's role is to translate names

like gaia.cs.umass.edu, or ibm.com, or facebook.com

to an IP address so that your application

can actually send a message to that name,

to that named service.

This whatever quantity of people

is able to have some form of control?

So that's a great question.

Who do you think controls the internet?

I'm pretty sure the internet is fairly decentralized.

Okay. What does that mean?

No one authority holds control

over any sort of decisions or destinations.

That's 98% true.

And if you own a network, like you're att.com,

or your verizon.com, you can do, within that network,

you can do what you want, right?

So in that sense, the internet is very decentralized,

that the control of the network is up

to whoever owns the network.

The 2% where you said there's nobody in control,

there's a a little bit of centralized control.

There's an organization called

the Internet Corporation for Assigned Names and Numbers.

Its responsibility is to handle, as the name ICANN suggests,

names and numbers.

It's that little bit of centralization,

central authority that you need.

When can we see the next tenfold increase

in in Wi-Fi speed?

In terms of tenfold speeds of increases,

depending on what device you're using right now,

it's available, all you need to do is upgrade.

So the Wi-Fi protocol's called 802.11.

And this is sometimes a source of confusion for people.

How can it be that I've got a connection

at 100 megabits per second from our TV into our router?

100 megabits per second not enough?

Packets dropping?

Where do they get dropped, do you think?

Somewhere in their travel process.

Exactly, right.

And maybe they're dropped in your apartment,

but much more likely, they're dropped because of congestion

somewhere between the Hulu or the Netflix

or the Disney server, if you're watching a video,

and your home.

So even though you've got 200 megabits per second

on that last hop, you don't have 200 megabits per second

from the server into your apartment.

I see.

I'm curious, has our conversation

sort of changed your view or sort of taught you new things

about the internet?

I think that I've sort of realized

that the internet is a technology that's dependent

upon so many other factors.

Some more in our control, some less.

[bright music]

Tell us a little bit about yourself?

I'm Caspar Lant.

I'm a PhD student at Columbia University

under Henning Schulzrinne's tutelage.

Oh, good pronunciation. [laughing]

Thank you.

I'm interested in networking, IoT,

and sort of what kind of data science you can use

with the datasets that you get from such devices.

One of the things that I designed before,

starting my PhD with Henning,

was a IoT pill dispenser, essentially,

which pairs with your smartphone,

which does facial detection

and other computer vision controls

and can basically tell who's taking

some sensitive medication

and make sure that they've taken it correctly.

We have these low-power devices

they're sort of at the edge.

Is it just connecting them in across a wireless link?

Is that the primary challenge or?

Well, I think the primary challenge is that for sure,

but then an additional challenge

is keeping everything configured in the way

that you expect it to be configured.

So for example, most IoT devices require you,

when you're configuring them for you

to enter some kind of captive login portal

where you connect to a local network

that the IoT device produces,

and then you can input your Wi-Fi SSID and password.

But then say if you were to change the password

or the name of your Wi-Fi network

or you move to a new place, then suddenly,

everything needs to be reconfigured.

'Cause that's a problem that scales linearly.

That you don't want the complexity of managing them

to go up linearly with that.

You'd like it to still stay pretty flat as you start adding.

Right, exactly.

I mean, the good thing about IoT devices

is that they tend to transmit

very, very small amounts of data.

We're used to ethernet cables

that can handle many hundreds of gigabits per second

over a wired device.

What are the typical data rates for IoT devices?

I mean, not hundreds of gigabits.

No, I mean I would imagine upper bound, KB per second,

lower bound, you could see bytes per second just on average.

But I mean, say that you have a temperature sensor running

off of your Arduino that's reporting the temperature

in your house every minute.

That's going to be far less

than kilobytes per second on average.

My sense is you're spot on,

that they'll produce over time, a lot of data.

And that a lot of IoT is about computing on that data.

That computation happened mostly at the edge,

or somehow a combination between the edge

and something happening in a far away data center.

Well, my sense is right now that all that data tends

to be centralized because IoT devices

are usually the commercial products of companies.

Do you think they'll share it?

Not without some persuasion,

but I agree that these data

have massive, massive research value.

Something I'm interested in with my research

is collaborating with people who manage

these distributed sensor devices,

and then taking advantage of those datasets

and comparing them to, say you were interested

in doing a research project on how daily rush hour traffic

impacts the acoustic landscape of New York City.

Figuring out, look, this street next to this school

is causing visible ratings above what we mandate.

And so there needs to be an intervention here.

I think for a long time,

the internet hasn't grappled with,

but now has with IoT and also with cellular networks,

generally is the question of mobility.

Do you imagine in the future that it might be possible

for mobile devices not to always have to connect

through the same provider to go from one network to another?

Definitely.

I mean, we're already seeing long range networks

like LoRa that can, first of all, provide access

over a much larger coverage area,

but then also look the same

because they're set up to the same specification,

regardless of where the individual gateway is.

[bright music]

So hey, Jen, it's great to see you again.

Good to see you, Jim.

We're in level five now.

So you're the expert-expert.

I'm a huge fan of the work that you did in RCP,

the Routing Control Platform being a precursor

to software-defined networking

and the notion that rather than having protocols

actually always compute things,

that we could compute things in data centers.

I'd be interested if you could sort of just reflect

back on that time and sort of the beginnings

of SDN and where it's come since then.

Yeah, and when we were at AT&T,

the thing we found most frustrating

is AT&T would buy routers,

and they would come pre-baked with a set of protocols

instead of knobs that you could turn if you wanted

to influence how the traffic flowed,

and a set of dials you could read to understand

what was going on inside the network.

Right, you couldn't directly do what you wanted to.

Exactly.

And so we started thinking about earlier work that was done

in the telephony network, the old telephone network.

And there, they had the same problem.

And they had the idea of having a computer running a program

tell a distributed set of telephony switches what to do.

But the idea was like, wow, it was kind of a revelation,

like what would that look like if we did that?

Not for the whole internet, but at least AT&T's part

of the internet.

So in other words, use software

instead of distributed protocols

to to tell the network what to do.

Yeah, do you see the softwarization of the internet

as a whole happening?

So, so far, it hasn't very much.

I mean, basically, software-defined networking exists,

let's say within a single provider backbone,

or a single cloud provider's network or a single campus.

There's been some work on doing it at the juncture points

between a pair of networks.

But one other trend that's happening

that makes it more possible is it used

to be that to get from one end to the internet to the other,

you have access networks getting much closer to, say Google

or Microsoft or other large cloud providers,

where even, you might only go through two networks

Right, so some people have called

that the flattening of the internet, right?

I think it used to be on average,

you would go through 10 different networks

to get from a source to a destination.

Right, exactly.

And if you take that even further,

they're starting to be more edge computing

where you might imagine you might have a cell tower

connected to a small number of routers,

connected directly to a server

that's gonna be running the application.

In that case, the entire infrastructure might be controlled

by a single party.

It's totally fascinating to me

that we have such an important global infrastructure,

and yet, the laws that that govern it tend

to be very, very local.

There are tens of thousands

of separately administered networks,

and of course, in hundreds of countries.

And the fact that it even holds together at all

is kind of a miracle.

Right, well it holds together because we have standards,

and we have protocols

that you mentioned. Exactly, protocol standards

for how the equipment talks to one another.

And increasingly, certificate authorities

that help bootstrap the secure, encrypted

and communication between end hosts.

So there are a few of these sort of centrally,

kind of agreed upon kinds of infrastructure,

but for the most part, each network runs itself.

And certainly, we've heard about some countries

that impose firewalls

that don't let certain kinds of traffic out,

or certain kinds of traffic in.

So there's no global body that is regulating that?

Not really because each country

really can have it's own laws and its own norms.

And so they can decide,

like the Great Firewall of China can decide,

they don't wanna let certain content be accessed

by the citizens inside that country.

So if a country decides they don't wanna answer a request

for a particular domain name, they say,

Hey, I don't want to let someone know the IP address

of this website.

They can decide not to let those answers be delivered

inside their country.

And so encryption plays a role in helping people

keep their privacy or prevent surveillance,

but it's not perfect.

It's often possible, still, to know a fair amount

about what people are doing, even if you can't look

inside the envelope at the letter that's written.

I mean, even you could just tell

that two people are communicating

even though the traffic itself is encrypted.

So you don't know what they're saying,

just even knowing two devices are communicating.

Exactly, and in fact, if you look at say,

the sizes of the transfers that they're doing,

you may know, hey, I'm talking to Netflix.

And by the way, this is the length of the movie I watched.

This is the size- So you can infer

or guess a lot of things.

Exactly.

You're one of the most awesome networking researchers

that I know.

I'm curious, just to pick your brain,

what do you think are some of the hot topics

in networking research?

Where do you think the field is heading?

Yeah, I'm excited about the convergence

of wireless communications, cellular networks, Wi-Fi

with networking and cloud computing.

And in particular, we're seeing in edge computing,

a convergence of all three.

Where you might have a mobile phone

or a drone or some other kind of device connecting

over the wireless medium directly to a network

that connects you directly to the server

that might run your application.

So you want the computation close

to where the endpoint is.

Exactly, and I think that what's now exciting about that

is all three of these technologies, wireless, networking

and cloud, which are normally three different communities,

three different sets of technologies,

three different sets of standards or practices,

now have to work together in close harmony

to be able to service applications that are really critical

and that that might be interacting with the physical world

in ways where safety is a potential concern.

You know, we've had cellular networks now

for 20, 30 years.

So when we hear about 5G,

what's trumpeted the most is the fact

that oh, super high bandwidth, right?

But I sense that the exciting things

are more than just the network being faster.

I agree.

It's both the high bandwidth, it's the low delay

so that you can have these applications

that interact with the physical world

and need answers in real-time.

It's about having the compute really close

so that you can integrate computation and communication.

It's about having more coverage.

Coming back again to the softwarization.

SDN and softwarization

is a maybe a little bit behind the covers,

that you wouldn't normally see it as a user going

from 3G to 4G to 5G.

You just see an increase in speed.

But yet, the way the network is now being managed again,

I think is bringing the cellular networking world

sort of into the internet world

in terms of the softwarization-

Completely agree.

I think the bringing in of compute and storage

is important too.

I think when you think just about networking,

it really is often just one part of the IT,

the information technology ecosystem.

Is there's often compute and storage as well.

And so, I think now there's an opportunity to have all

of those parts of the infrastructure work together

towards an even higher level goal.

And so I think it's a really exciting time

to be in the field 'cause now,

the plumbing is getting close to the application

in a way that it wasn't before.

[upbeat music]

So I really hope you've enjoyed this video,

and I hope you've also understood the internet

is part of the worldwide global communication fabric.

It's absolutely fascinating how it works.

[upbeat music]

Up Next