A. Preconditions for using the Internet:
1. computer--can be PC, Mac, multiuser system (e.g., UNIX-
based) which necessitates logging in
2. online connection (via cable), modem
3. protocols--divvy up physical network into discreet
locations and enable one location to send messages
to another
B. Physical components of the Internet:
1. network host--computer connected to network
2. commercial Internet access provider
3. speed of information exchange within network limited by
a. telephone line
b. your modem speed
4. your computer and the Internet usually separated by LAN
a. privately owned LANs affiliated with businesses and
campuses everywhere
b. common physical medium is Ethernet--connection either
a small, metal "T" with a cable running through it
to back of computer or a small plastic box (with
transceiver) attached to computer by modular cable
5. router
a. a computer bridging gap between whatever medium is
used for LAN and a long-distance line accessing the
Internet at large
b. insulates the two from each other
--LAN doesn't have to process outside site work
--Internet not affected by LAN crashes
6. long distance portions of network consists of dedicated
phone lines
a. differ from voice traffic phone lines in 2 key ways:
--greater capacity for carrying data
--tend to be point-to-point lines (circuit always
open between the 2 points connected by line)
b. information transfer speed comparison
--Ethernet-based LANs: 10 million bits per second
(10 Mbps); 600 screenfuls of information per
second; 19.2 kilobits per second
--voice-grade phone lines: 19,200 bps; 1 screenful
of data
--"leased" lines (dedicated phone lines between 2
points): 56-64 kbps; 3.5 screens per second
--T1 lines: widely used as major data arteries;
capacity of 1.5 Mbps; 94 screens
--T3 lines: introduced to the Internet in 1991 and
now used throughout its North American backbone
(funded by NSF); transmit data at 30x T1 rate
C. the Internet relatively free of network-wide regulation
1. all parts (individual networks) owned by someone; each
sets its own policies
2. largest player, NSF, has dev. an Acceptable Use Policy
a. applies to traffic using NSFNET facilities
b. advocates the following:
--communication between researchers
--collecting information about and applying for
grants and research contracts
--administration of research programs and academic
professional societies
c. some commercial activity unacceptable:
--advertising
--consulting for pay
--general use by "for-profit" institutions
3. commercial sector the fastest growing portion of the
Internet user community
a. many of large regional networks less restrictive
regarding commercial activities
b. an infrastructure for commercial use currently
being built alongside the academic community structure
D. Finger
1. easiest command to use
2. displays information about a user: name, home directory,
file named .plan contents
3. e.g.: finger quake@geophy.washington.edu
4. problems which might arise in trying to execute command:
a. finger can't find user (...mistyped user name)
b. finger can't find computer
--unknown host:...
--ask system administrator for help
c. your computer can't find finger
--finger: Command not found.
--telnet with computer name and no. 79
(1) if telnet connects successfully, type user name
and press Return
(2) e.g.: Login name: aurora
--if you still get "command not found" message,
confirm with computer's administrator that there
is a connection to the Internet
E. Protocols on the Internet
1. definition: mutually agreed-upon format or set of
conventions, defined for a specific purpose
2. functions:
a. choreograph the movement of messages; network protocol
b. check integrity of what was sent; transport protocol
c. "massage" data from one format into another;
application protocol
3. Internet Protocol (IP)
a. a network protocal; manages logistics of getting a
message from sending machine to receiving machine
b. packets the base message unit delivered by IP
--smaller than many of messages and files transferred
over the Internet
--transmission often requires multiple packets
4. Transmission Control Protocol (TCP); User Datagram
Protocol
a. transport protocols
b. collect related packets, order them, and verify that
none are garbled
5. there are as many application protocols as there are
applications on the Internet; e.g., Mail, telnet, ftp,
archie, gopher, WAIS, World Wide Web
6. TCP/IP frequently paired
a. protocol of choice with UNIX vendors; also available
for mac, PC, other multiuser computers
b. TCP/IP-based LAN a miniature version of the Internet
c. many of tools available on the Internet (e.g., mail,
telnet, ftp) packaged with basic networking software
of UNIX systems
7. for IP to work, there must be a means of identifying
sites that will exchange messages
a. points of attachment are arbitrary and changeable
b. under IP, each network and each computer has a fixed
address allowing another computer to reach it
--a 32 bit-number; 4 numbers separated by periods
(none of the 4 numbers can be larger than 255)
--address has 2 components:
(1) one identifies the individual computer
(2) the other identifies the network of which the
computer is a member
--assigned when LAN registers for Internet con-
nection with the Internet Network Informtion
Center (the NIC, or InterNIC)
--the NIC also gives LAN administrator a range
of addresses to be assigned to indiv. hosts
c. although many information resources are available on
the Internet via menu systems, many others identified
only by IP address
8. Domain Name System (DNS)
a. definition: conventions for naming network computer
and directory service for looking up names
b. domain a named group of network hosts
c. domain-style names a series of names separated by
periods
--Fully Qualified Domain Name (FQDN): name of a
computer and hierarchy of domains in which it is
nested (in order of increasing generality)
--can be used in place of an IP address;
e.g., caticsuf.csufresno.edu
--2 main types:
(1) organizational
--tells organizational affiliation of computer
--top-level domains:
com commercial
edu educational
gov U.S. government
int international
mil U.S. military
net Network Backbone Systems &
Information Centers
org nonprofit organizations
(2) geographical
--2-letter names representing country (lower
level domains a mixture of geographical and
non-geographical names)
(a) codes unique only within fields, not
across them
(b) e.g., well.sf.ca.us (computer named
"well" in the San Francisco area
--top-level domains include:
au Australia
at Austria
ca Canada
uk UK/Ireland
as United States
d. unlike IP address
--more user friendly
--more versatile; names can be reassigned or reused
9. Serial Line Internet Protocol (SLIP) & Point-to-Point
Protocol (PPP) make it possible for anyone with computer/
modem/phone line to connect to the Internet
a. your computer becomes an Internet host with a unique
IP address (whereas, if dialing Internet host and
logging in your account to work with the Internet,
your computer is completely unknown to the Internet)
b. SLIP is not very sophisticated; it exchanges IP
packets over a serial line with minimal error
checking and additional encoding
--little overhead
--there are problems, however, in coping with line
noise and other interruptions common on serial lines
c. PPP more versatile; in the long run, it should render
SLIP obsolete
--more robust message control
--designed to handle protocols other than IP
--however, it imposes more overhead
d. both make it possible to run applications that use the
Internet on computers that don't have a permanent
network connection; e.g., via commercial access
providers
--advantage: greater convenience than with UNIX
command line interface common for dial-up users
--disadvantages:
(1) commercial access providers charge more for
such access
(2) you'll have to find and install Internet
applications on your own
F. Software Applications
1. def.: tools enabling us to communicate with others and
access data throughout the Internet
2. some are practically universal because they are bundled
with the network portion of the UNIX operating system
a. they are likely to be included with the networking
packages of other operating systems as well
b. e.g., mail, telnet, ftp
3. other Internet tools aren't part of any operating system
a. developed within the Internet community to simplify
access to worldwide resources
b. e.g., gopher, WAIS, WWW, archie
4. conventional, stand-alone computer programs able to get
all the resources they need from local sources
a. i.e., usually 1 or more files located on a disk in the
computer where the program is run
b. there are limits, however, when they are shared among
a group of users:
--data must either be replicated on many machines, or
everyone must use same machine
--copying same data everywhere wastes disk space
--using same computer creates performance problems
and interpersonal conflicts
5. to overcome these limits network applications are built
on client/server model
a. i.e., tools distribute the work of one application
across 2 programs, a client and a server, that carry
on a dialog with each other
b. this model makes it possible for anyone with Internet
access to use most available data anonymously
c. when information needed, do following:
--start up a program (a client) and supply detail
about what you need
--client program opens a connection to a server
program that controls the information requested
--dialog between the client & server uses application
protocols which format the message for transmission
across the network
--server receives client's request, evaluates it,
finds desired information, formats the result in
application protocol and passes response to the
appropriate protocol handler
--when client receives the requested information, it
provides an interface through which you can either
view information or direct it elsewhere
d. guidelines regarding these applications:
--client programs run ad hoc, but server programs
run continuously
--client and server programs can run on same computer
--clients can maintain multiple, simultaneous
connections to a server
--clients can connect to more than one server at a
time
--servers can open connections to other servers
--clients and servers can support multiple network
protocols
6. interfacing
a. the ASCII version of a client possesses the least
extravagant hardware requirement
--particularly helpful for users
(1) without access to workstations with graphical
user interfaces (GUIs)
(2) whose only access to Internet in indirect
(through telnet or dial-up connection)
--also versions for different GUIs; e.g., Xwindows,
Macintosh, MS-Windows
b. the client program may also deliver messages when
things go wrong during the client/server dialog
--the client program isn't necessarily the source of
errors it reports; e.g., messages originating in
the server or the path between the client and server
--knowing the general structure of client/server
interaction can be helpful in figuring the next move
G. Network Discovery Tools
1. tools checking your Internet connection are utilities
packaged with software that implements the TCP/IP
protocol suite
a. all tools available if accessing via UNIX computer
b. regarding personal computers where TCP/IP an add-on
product, there are more vendors and packaging of the
utilities is less uniform
2. ping
a. queries hosts on an IP network to verify they're up
and capable of sending and receiving network packets
b. standard component of UNIX TCP/IP installations;
also widely available in commercial TCP/IP
implementations for DOS & Windows systems
c. e.g.: ping optimism
optimism is alive.
--since a Fully Qualified Domain Name not used, only
local network computer named optimism being queried
--response says optimism capable of receiving and
sending packets with IP
d. if your version issues queries continuously, don't
let it run on needlessly
--potentially wasteful of Internet resources
--can be interrupted with Ctrl-C in most cases
e. problems which might be encountered:
--ping: Command not found
Execute the command again again specifying its
full path name.
--ping: rs.internic.net:host unknown
Ping couldn't resolve computer name you provided
into an Internet address
(1) ask whether computer accessible from any
Internet host
(2) your computer may not be connected to Internet
(3) you may achieve a connection to some Internet
facilities via e mail
--ping:
no answer from rs.internic.net
Ping received no response to its query
(1) usually means computer being pinged is down
(2) repeat later
3. basic facts about your home computer stored in a file
named hosts
a. configuration file for TCP/IP protocol created when
software to manage the TCP/IP protocol is installed
b. plain text file containing IP addresses and host names
c. in /etc directory on UNIX systems
--DOS systems include it in directory containing
TCP/IP software
--Mac in System Folder
d. commands for display of file
--UNIX: cat /etc/hosts
--DOS: type /etc/hosts
e. check size before display
--UNIX: ls
--DOS: dir
f. response to display command: IP address followed by
the host's name and any aliases
--e.g., 103.214.50.59 optimism loghost
--could have information about different computers
--lines in hosts file beginning with # are comments
g. it you can't tell from the hosts file what the name &
IP address are, use hostname on UNIX computers with IP
--some Internet hosts will return a Fully Qualified
Domain Name
--if you know local comupter's name, easy to look up
its IP address in hosts file
4. also advisable to inventory network applications available
a. easiest way to see if application is available is to
try starting it
b. it commands fail, they'll do so in predictable ways:
--command not found
(1) 1st check that command is typed properly
(2) try executing command by full pathname;
e.g., /usr/ucb/telnet
(3) consult system administrator
--host unknown
--connection refused
--connection timed out
H. ftp
1. generic file transfer program
2. the Internet offers the following ways of locating files
for downloading:
a. the listing of Internet services
b. USENET newsgroups
c. archie
3. information needed to use ftp effectively:
a. name and location of file
b. type of file (must know how to interpret file names)
c. size of file (use "dir" command)
4. files for all sorts of computers available
a. plain text or binary the 2 file types which matter
most to ftp
b. some compressed to conserve space
--a table summarizing file compression software and
naming conventions by platforms is available
--file /doc/penet/compression at ftp.cso.uiuc.edu
5. naming conventions most frequently encountered on the
Internet
Category Extension Meaning _______________________
ASCII .c C Programming Language Source Code
.h C Programming Language Header File
.txt Text (ASCII) file
.uu ASCII file produced by uuencode
.bat DOS Batch file
.shar UNIX Shell Archive
.ps Post Script file
.hqx Mac Bin Hexed file thru release 4.0
Binary .EXE Executable binary file (DOS, VAX/VMS)
.COM Executable binary file (DOS)
.z Binary file created by compress
.sit Stuff it file for Macintosh
.gz file compressed with GNU gzip
.hqx Mac. Bin Hexed file release 5.0+
________________________________________________________________
6. most Internet users don't have accounts on all machines
from which they want to copy a file
a. no administrator wants to be bogged down by adding
indiv. accounts for all network users
b. sites publishing files to be copied via ftp usually
offer a special login name
--anonymous
--frequently asked to supply information; e.g., e mail
address, password
7. connecting via ftp like logging in to a remote computer
a. ftp session generally starts from the login account's
home directory
b. you can work directly with the files there or move to
another directory
c. when connecting to another system anonymously, ftp
also provides a working directory
8. once connected, you receive an ftp> prompt
a. you can then issue commands to move files between
computers
b. 5 key commands: ascii, binary, get, put, quit
c. meaning of some important commands:
--ascii assume transferred files are text (default)
--binary assume transferred files are binary (image)
--cr toggle stripping of carriage returns from
ASCII files (to preserve carriage returns,
toggle cr off
--hash print hash marks to controlling terminal
during transfer
(1) monitors ftp's progress during file
transfer
(2) # printed on your screen with each block
of data transferred
(3) during long file transfer, keeps the
Internet from automatically discon-
necting user deemed inactive too long
--prompt toggle prompting on or off during multiple
file transfer
(1) on: asked to confirm each file transfer
(2) off: transfer of multiple files without
interruption
--status display current status of all settable
options
--user set login name and password; permits
recovery from failed logins
--verbose toggle verbose messages on/off; can suppress
unwanted messages as needed
--help; ? both provide a list of all available
commands:
| cr ls prompt runique
$ delete macdef proxy send
account debug mdelete sendport status
append dir mdir put struct
ascii disconnect mget pwd sunique
bell form mkdir quit tenex
binary get mls quote trace
bye glob mode recv type
case hash mput remotehelp user
cd help mmop rename verbose
cdup image ntrans reset
close lcd open rmdir
d. choice of ascii or binary depends on file's content
and intended use
--if downloaded file will ultimately be used on a
computer with different architecture or operating
system, use binary representation
--if reading file with local machine's generic program
for displaying files (e.g., DOS type, UNIX cat), use
ascii type of transfer
9. ftp provides commands for moving around directories on a
remote computer
a. these commands are similar to directory navigation
commands for UNIX
b. notable examples:
--pwd print working (current) directory for remote
system
--cd change directory on remote system (requires
directory name)
--cdup change directory on remote system to parent
of current directory
--dir list contents of a directory on a remote
system including name, permissions, owner
and size
--ls list contents of a directory on a remote
system by name only
--mls list contents of multiple directories on a
remote system by name only
--lcd change working directory on local system
(requires directory name)
10. mls, mdir and get assume local file will hold the
command's output
a. therefore, it may be prudentto review output on
the screen rather than store it in a file
b. 2 ways to do this:
--hyphen (-): sends ftp output to screen in an
uninterrrupted stream
--| more or | pg: allows for viewing file a page at
a time
11. commands for copying files:
get move a file from remote to local
--must include name of file to be copied (source
file)
--may include a name for the destination file
--e.g.: get inet.services.txt
recv synonym for get
put move a file from local to remote
send synonym for put
mget move several files from remote to local
mput move several files from local to remote
12. on UNIX system, /tmp is directory in which any user can
create files
a. useful if you have problems with directory permissions
& ftp
b. include both path and file name (/tmp/filename) when
specifying destination file
c. avoid using it for:
--large files
--permanent storage
d. delete such files or move them to your own directory
as soon as possible
13. asterisk
a. a wildcard character
b. matches any string of characters in the file name
c. e.g.: mget README Makefile *.c *.h (*.c matches all
filenames ending in .c)
14. archie
a. its heart is a database containing directory listings
of what is available through anonymous ftp at several
hundred Internet sites
--database compiled by a program that surveys each
site monthly via ftp connection
--database then publishes on various archie servers
& is made available to query
b. example, archie server: archie.unl.edu (USA)
c. example, search: archie factbook
d. to determine whether your computer has this client,
type archie at UNIX prompt
--if you have it, prompt becomes archie>
--if not, message to that effect
I. telnet
1. used to log in to one computer on the Internet from
another
2. can navigate manually
3. differs from ftp (which is geared to transferring files)
in that its connections are general-purpose
4. many databases on the Internet are only accessible
through telnet
5. used with login name & password for the remote computer
J. Electronic Mail
1. makes possible:
a. communication with other Internet users
b. use of Internet applications possessing a mail
interface
c. communication with users on networks that are
connected to the Internet but use protocols other
than IP
d. use of some Internet services without having an
account on the Internet itself
2. the standard mailer on UNIX systems named mail
a. contents: address, subject, note proper, cc: notation
b. minimal editing capabilities:
--erase character (either Backspace or Delete) to
back up a character at a time
--use short lines; mailer won't wrap lines
--Ctrl-C to throw text away
c. Ctrl-D or . on line by itself sends message
d. to read mail, type mail & Return
--you'll see infomation regarding your version of
mail & messages in your system mailbox
--> means current message
--N means new message; number which follows is the
sequence of the message in mailbox
--these items followed by: sender's name; time message
sent; size of message; subject
--& (ampersnad) means mail command prompt
--to read current message press Return
--read any other message by typing sequence number
e. respond to current message by typing r at mailer
prompt
f. d means delete message
g. header makes message more difficult to read and
increases the space it takes up in your mailbox
--create .malrc file to make local configuration
changes in mailer's behavior
--mail arriving will include full header, but fields
suppressed with ignore command won't be displayed
when reading
--fields suppressed also won't be saved with message
if moved to a local mailbox
h. system mailbox should be kept small
--there is a risk of running out of space for new
messages
--the more messages, the harder the mailer works when
you read mail (the software usually works slower)
i. most systems automatically move mail messages from the
system mailbox to local mailbox after the message is
read
--UNIX places them in file named mbox in your home
directory
--can browse with command mail -f mbox
--you can read, reply to, delete or save just as with
the system mailbox
K. Gopher
1. technically "a distributed document delivery system"
2. an easy-to-use ftp without a wall
a. i.e., retrieve files without knowing which Internet
host contains file or what directory houses file
b. aim: provide information from all over the Internet
as seamlessly as possible
3. more than 750 gopher servers
a. many devoted to specialized areas of interest
b. e.g., at least 5 have information on plant genome
projects
4. developed in 1991 at the University of Minnesota
a. Mother Gopher still there
b. by default, clients connected there first
c. if you know resources needed are managed by a
different gopher, you can bypass Mother Gopher
5. to use, just need Internet access and gopher client
program
a. gopher a "full-screen" application; it needs to know
some features of your terminal
b. on many Internet hosts, your terminal type will be
set automatically when 1st logging in
c. if not determined automatically, system may ask for
a terminal type as part of login procedure
--if display scrambled, quit by typing Ctrl-C
--after exiting, you can change terminal type
6. can be started by typing gopher and Return
a. another test of system: /usr/local/bin/gopher
b. menu will look something like this:
Internet Gopher Information Client v1.11
Root gopher server: gopher2.tc.umn.edu
1. Information About Gopher/
2. Computer Information/
3. Discussion Groups/
4. Fun & Games/
5. Internal file server (ftp) sites/
6. Libs./
7. News
8. Other Gopher and Information Servers/
9. Phone Books/
10. Search Gopher Titles
7. some menu items not files at all, but gateways to
programs or non-gopher servers
a. menu item can represent any of a dozen different
things
b. most gopher menus include icons indicating what
type of object each item represents
c. sample menu item types with gopher identifier & icon
File 0 -none-
Directory 1 /
Binary file 9 <Bin>
Image File i <Picture>
Index Search 7 <?>
Telnet Session 8 <TEL>
L. World Wide Wb (WWW)
1. provides a hypertext interface to many Internet resources
a. hypertext a system for cross-referencing & retrieving
related documents
--use a special viewer (browser) to read
--it will highlight element(s) of a document cross-
referenced to other documents
--when a cross-reference is selected, the browser
retrieves that document to read (which may lead
to other documents)
b. documents look like pages, but contain text, video,
photos, sounds
2. more than 1500 Web sites, including record companies and
performing artists
3. start by typing www and Return
4. if connection fails or hangs, type Ctrl-C to abort
5. recommeded starting places:
a. by subject
b. by type of service (access protocol, etc.)
c. about www
6. www line-mode browser commands:
<Return> Display next page of the current document
number Follow a link (id. by number) from the
current document and retrieve the document
the link points to
Help List commands in short form
Manual Browse the online manual for www; provides
more detailed information than help command
Back Go back to the previous document read
Home Go back to the first document read
Recall List documents you have visited so far;
to select one, type Recall followed by no.
List List the links from the current document
by title or, if no title, by pathname; to
select a link from this list, type the
number by itself as above
Up, Down Scroll up or down one page in the current
document
Top, Bottom Move the browser's view to the top or the
bottom of the current document
Quit Exit the browser
Print* Print the current document, without
marking links
> file, Save (or append) the current document in
>> file* the given file, without marking its links
CD (or LCD) Change directory locally (directory* for
UNIX computers)