Understanding the World Wide Web
Understanding the World Wide Web
The building blocks of the Web are web pages which are formatted in HTML
and connected by links called "hypertext" or hyperlinks and accessed by HTTP.
These links are electronic connections that link related pieces of information so that
users can access the desired information quickly. Hypertext offers the advantage to
select a word or phrase from text and thus to access other pages that provide
additional information related to that word or phrase.
Small websites store all of their Webpages on a single server, but big websites
or organizations place their Webpages on different servers in different countries so
that when users of a country search their site they could get the information quickly
from the nearest server.
Some people use the terms 'internet' and 'World Wide Web' interchangeably.
They think they are the same thing, but it is not so. Internet is entirely different from
WWW. It is a worldwide network of devices like computers, laptops, tablets, etc.
It enables users to send emails to other users and chat with them online. For
example, when you send an email or chatting with someone online, you are using
the internet.
But, when you have opened a website like [Link] for information, you
are using the World Wide Web; a network of servers over the internet. You request
a webpage from your computer using a browser, and the server renders that page to
your browser. Your computer is called a client who runs a program (web browser),
and asks the other computer (server) for the information it needs.
The World Wide Web was invented by a British scientist, Tim Berners-Lee
in 1989. He was working at CERN at that time. Originally, it was developed by him
to fulfill the need of automated information sharing between scientists across the
world, so that they could easily share the data and results of their experiments and
studies with each other.
The Web works as per the internet's basic client-server format as shown in
the following image. The servers store and transfer web pages or information to
user's computers on the network when requested by the users. A web server is a
software program which serves the web pages requested by web users using a
browser. The computer of a user who requests documents from a server is known
as a client. Browser, which is installed on the user' computer, allows users to view
the retrieved documents.
All the websites are stored in web servers. Just as someone lives on rent in
a house, a website occupies a space in a server and remains stored in it. The server
hosts the website whenever a user requests its Webpages, and the website owner
has to pay the hosting price for the same.
ELEMENTS OF WEB
Note: Some elements does not have end tag and content, these elements are
termed as empty elements or self-closing element or void elements.
Such as:
Example
<!DOCTYPE html>
<html>
<head>
<title>WebPage</title>
</head>
<body>
<h1>This is my first web page</h1>
<h2> How it looks?</h2>
<p>It looks Nice!!!!!</p>
</body>
</html>
All the content written between body elements are visible on web page.
Void element: All the elements in HTML do not require to have start tag and
end tag, some elements does not have content and end tag such elements are known
as Void elements or empty elements. These elements are also called as unpaired tag.
For the default display and styling purpose in HTML, all the elements are
divided into two categories:
Block-level element
Inline element
Block-level element:
These are the elements, which structure main part of web page, by dividing
a page into coherent blocks.
A block-level element always start with new line and takes the full width
of web page, from left to right.
These elements can contain block-level as well as inline elements.
Example:
<!DOCTYPE html>
<html>
<head>
</head>
<body>
<div style="background-color: lightblue">This is first div</div>
<div style="background-color: lightgreen">This is second div</div>
<p style="background-color: pink">This is a block level element</p>
</body>
</html>
Output:
In the above example we have used tag, which defines a section in a web page,
and takes full width of page.
We have used style attribute which is used to styling the HTML content, and
the background color are showing that it's a block level element.
Inline elements:
<a>, <abbr>, <acronym>, <b>, <bdo>, <big>, <br>, <button>, <cite>, <code>, <dfn>,
<em>,<i>, <img>, <input>, <kbd>, <label>, <map>, <object>, <q>, <samp>, <script>,
<select>,<small>, <span>, <strong>, <sub>, <sup>, <textarea>, <time>, <tt>, <var>.
Example:
<!DOCTYPE html>
<html>
<head>
</head>
<body>
<a href="[Link] on link</a>
<span style="background-color: lightblue">this is inline element</span>
<p>This will take width of text only</p>
</body>
</html>
Output:
WEB BROWSERS
Users can find, access, and view web pages with the help of a web browser,
which is a software program. The term "browser" is frequently used to refer to a
web browser. Using languages like 'Hypertext Markup Language' (HTML) and
'Extensible Markup Language' (XML), websites on the internet and other content
are generally displayed and accessed through web browsers.
Web pages and websites transmitted via the Hypertext Transfer Protocol
(HTTP) are converted into content humans can view using browsers. They can also
display additional protocols and prefixes, such as secure 'HTTP' (HTTPS), 'File
Transfer Protocol' (FTP), 'email handling', and files (file :). Additionally, most
browsers accept third-party plug-ins to show dynamic material such as embedded
video, audio, and game content.
Distinguishes a Search Engine from a Browser
Search engines and web browsers are sometimes mistaken for one another,
yet they serve different purposes. A search engine contains searchable data about
other websites (common examples of search engines are Google, Bing, Yahoo, and
Baidu). Nonetheless, a browser must connect to a website's server and view its web
pages. These are a few examples of browsers.
Anything on the internet is accessible with a web browser, and it pulls data
from other websites and displays it on your computer or mobile device. The
Hypertext Transfer Protocol, which outlines how text, pictures, and video are
communicated on the web, is used to send the information. To ensure that users of
any browser, anywhere in the globe, can access this information, it must be shared
and presented consistently.
However, not every browser developer opts to interpret the format in the same
way. This implies to users that a website may have a varied appearance and
functionality. Web standards aim to provide uniformity among browsers so that
users can utilize and enjoy the internet.
When a web browser requests data from a server connected to the internet, it
converts that data into images and text using a program known as a rendering
engine. Web browsers read the Hypertext Markup Language (HTML) code
contained in this data to produce the images, sounds, and experiences of the
internet.
Users can navigate to other online pages or sites via hyperlinks. Every image,
video, and webpage has a distinct Universal Resource Locator (URL), commonly
called a web [Link] a browser requests data from a server, the web address
instructs the browser where to look for each item described in the HTML,
instructing the browser where it should appear on the web page.
Different Web Browsers
All web browsers do the same tasks. As a result, various web browsers have
been used over the years.
Several web browser examples and their unique features are covered below:
1. Google Chrome
The most widely used browser in the USA is Chrome, developed by internet
giant "Google" and regarded as quick, safe, and dependable by PC and smartphone
users. The Chrome Store offers various customization options in the form of helpful
extensions and apps that may be downloaded without charge.
Moreover, Chrome makes it simple to integrate with other Google services, like
Gmail.
Due to its popularity, Google has since expanded the "Chrome" brand name to
include more products such as the Chromebook, Chromebox, Chromecast, and
Chrome OS.
2. Apple Safari
On Apple phones, laptops, and other Apple devices, Safari is the default
browser. It is typically regarded as a reliable browser, and Apple's ethos is
reflected in its sleek appearance. Due to the dominance of iPhones and iPads,
Safari, initially designed for Macs, has grown to dominate the mobile industry.
In contrast to other browsers mentioned, Safari is only available from
Apple, cannot be used with Android devices, and no longer receives critical
security updates from Apple for the Windows version.
4. Mozilla Firefox
It had a user share of 32.21% as of the end of 2003 before steadily slipping
behind Google Chrome's influence. It still competes fiercely in the "desktop" market
but has a smaller market share on mobile devices, where Google Chrome and Apple
Safari are more popular.
5. Opera
Opera, made for Microsoft Windows, Android, iOS, macOS, and Linux
operating systems, is another web browser worth mentioning. It includes a few
unique characteristics and is usually considered a solid choice by many people.
Many of its original features have since been added to competing browsers. Also, it
offers a unique user interface. Opera has a usage rate of only 2.28% as of this
writing, yet it is nonetheless influential, if not widely.
In addition, the Opera Mini web browser, created especially for mobile users,
was released in 2005. Opera, the PC version, published in 1995 before the mobile
version, was created by Opera Software and supported an exemplary user interface.
Seven key elements work together to make the web browser functional. That is:
1. User Interface
When you launch a web browser, the first page you view is the user interface.
The address bar, forward/backward button, menu, bookmarking option, and a few
other options are all present on this page.
2. Browser Engine
The browser engine connects the rendering engine and the browser's user
interface. It adjusted the rendering engine to provide an output based on the input.
The production of required content for the browser and its display on the
screen is the responsibility of the rendering engine. The HTML documents are
parsed before being converted to a usable form. Every browser that we are aware of
has a different rendering engine.
Safari makes use of WebKit.
Blink is used by Chrome and Opera (fork of WebKit).
Trident is used on Internet Explorer.
Firefox uses Gecko.
The open-source rendering engine WebKit was developed for Linux, and
apple changed it to support both Mac and Windows.
4. Networking
On the internet, communication and security are handled by the network layer.
It is also utilized for HTTP queries and caches the retrieved pages to reduce network
traffic.
5. UI Backend
It is used to draw simple boxes, windows, and widgets. This is for a general-
purpose interface and is not platform-specific. Underneath all of this, UI techniques
are handled by an operating system.
6. JavaScript Interpreter
JavaScript powers all websites and web pages. These pages were all written
in JavaScript. The rendering engine uses the pages that the interpreter has translated
to display the finished product.
7. Data persistence
Cookies are an example of data that is saved locally through data persistence
or storage. The browsers allow you to store databases locally on your computer using
storage methods such as IndexedDB, WebSQL, File System, etc. User data is
managed in the same way as cookies, cache, and bookmarks are.
WEB SERVERS
Web pages are a collection of data, including images, text files, hyperlinks,
database files etc., all located on some computer (also known as server space) on
the Internet. A web server is dedicated software that runs on the server-side. When
any user requests their web browser to run any web page, the web server places all
the data materials together into an organized web page and forwards them back to
the web browser with the help of the Internet. Therefore, we can conclude that: -
This intercommunication of a web server with a web browser is done with the
help of a protocol named HTTP (Hypertext Transfer Protocol). These stored web
pages mostly use static content, containing HTML documents, images, style sheets,
text files, etc. However, web servers can serve static as well as dynamic contents.
Web Servers also assists in emailing services and storing files. Therefore it also uses
SMTP (Simple Mail Transfer Protocol) and FTP (File Transfer Protocol) protocols
to support the respective services. Web servers are mainly used in web hosting or
hosting the website's data and running web-based applications.
The hardware of the web servers are connected to the Internet that manages
the data exchange facility within different connected devices. In contrast, the
software of web server software is responsible for controlling how a user accesses
delivered files. Typically, web server management is an ideal example of the
client/server model. Therefore, it is compulsory for all computers that host websites
(whether with state or dynamic web page content) to have web server software.
The term web server can denote server hardware or server software, or in
most cases, both hardware and software might be working together.
1. On the hardware side, a web server is defined as a computer that stores
software and another website raw data, such as HTML files, images, text
documents, and JavaScript files. The hardware of the web servers are
connected to the web and supports the data exchange with different devices
connected to the Internet.
2. On the software side, a web server includes server software accessed
through website domain names. It controls how web users access the web
files and ensures the supply of website content to the end-user. The web
server contains several components, including an HTTP server.
1. First, any web user is required to type the URL of the web page in the address
bar of your web browser.
2. With the help of the URL, your web browser will fetch the IP address of
your domain name either by converting the URL via DNS (Domain Name
System) or by looking for the IP in cache memory. The IP address will
direct your browser to the web server.
3. After making the connection, the web browser will request for the web
page from the web server with the help of an HTTP request.
4. As soon as the web server receives this request, it immediately responds by
sending back the requested page or file to the web browser HTTP.
5. If the web page requested by the browser does not exist or if there occurs
some error in the process, the web server will return an error message.
6. If there occurs no error, the browser will successfully display the webpage.
Sending and receiving mails on Internet by using SMTP (Simple Mail transfer
Protocol) fetching requests for File Transfer Protocol (FTP) files; and designing,
developing, and publishing websites.
Many Web servers, even the basic one, also support the server-side scripting
technique. Server-side scripting is a web development method used to employ
scripts on a web server that produces a customized response for each user. This
technique operates on the server machine and consists of an extensive feature set,
including database access. The server-side scripting process will have various
scripting languages such ASP, PHP, Java, JavaScript, Python, ruby and many more.
This technique also enables the HTML files to be created dynamically.
A web server can be used to store, process or deliver either static or dynamic web
pages.
Let's understands the definition of static and dynamic web servers with
the help of their difference table given below:
PROXY SERVERS
The proxy server is a computer on the internet that accepts the incoming
requests from the client and forwards those requests to the destination server. It
works as a gateway between the end-user and the internet. It has its own IP address.
It separates the client system and web server from the global network.
In other words, we can say that the proxy server allows us to access any
websites with a different IP address. It plays an intermediary role between users and
targeted websites or servers. It collects and provides information related to user
requests. The most important point about a proxy server is that it does not encrypt
traffic.
1. If the requested data or page already exists in the local cache, the proxy
server itself provides the required retrieval to the client.
2. If the requested data or page does not exist in the local cache, the proxy
server forwards that request to the destination server.
3. The proxy servers transfer the replies to the client and also being cached to
[Link], it can be said that the proxy server acts as a client as well as the
server.
There are many types of proxy servers available. The two most common types
of proxy servers are forward and reverse proxy servers. The other proxy server has
its own feature and advantages. Let's discuss each in detail.
As we have discussed above, the proxy server has its own IP address and it
works as a gateway between the client and the internet. The client's computer knows
the IP address of the proxy server. When the client sends a request on the internet,
the request is re-routed to the proxy. After that, the proxy server gets the response
from the destination or targeted server/site and forwards the data from the page to
the client's browser (Chrome, Safari, etc.).
Overall, it can be said that the proxy server accesses the targeted site, on
behalf of the client, and collects all the requested information, and forwards them to
the user (client). The following figure clearly depicts the working of the proxy
server.
SEARCH ENGINES
In our search engine tutorial, we are going to discuss the following topics -
Searching content on the Internet becomes one of the most popular activities all
over the world. In the current era, the search engine is an essential part of everyone's
life because the search engine offers various popular ways to find valuable,
relevant, and informative content on the Internet.
1. Time-Saving
2. Variety of information
The search engine offers various variety of resources to obtain relevant and
valuable information from the Internet. By using a search engine, we can get
information in various fields such as education, entertainment, games, etc. The
information which we get from the search engine is in the form of blogs, pdf, ppt,
text, images, videos, and audios.
3. Precision
All search engines have the ability to provide more precise results.
4. Free Access
Mostly search engines such as Google, Bing, and Yahoo allow end-users to
search their content for free. In search engines, there is no restriction related to a
number of searches, so all end users (Students, Job seekers, IT employees, and
others) spend a lot of time to search valuable content to fulfill their requirements.
5. Advanced Search
Suppose we want to search for java tutorial on javaTpoint then type "java
site:[Link]" to get the advanced result quickly.
6. Relevance
Sometimes the search engine takes too much time to display relevant,
valuable, and informative content.
Search engines, especially Google, frequently update their algorithm,
and it is very difficult to find the algorithm in which Google runs.
It makes end-users effortless as they all time use search engines to solve
their small queries also.
1. Web Crawler
Web Crawler is also known as a search engine bot, web robot, or web spider. It plays
an essential role in search engine optimization (SEO) strategy. It is mainly a software
component that traverses on the web, then downloads and collects all the
information over the Internet.
There are the following web crawler features that can affect the search results –
Included Pages
Excluded Pages
Document Types
Frequency of Crawling
2. Database
There are the following two database variable features that can affect the search
results:
3. Search Interfaces
There are the following features Search Interfaces that affect the search results -
Operators
Phrase Searching
Truncation
4. Ranking Algorithms
The ranking algorithm is used by Google to rank web pages according to the
Google search algorithm.
There are the following ranking features that affect the search results -
1. Crawling
Crawling is the first stage in which a search engine uses web crawlers to find, visit,
and download the web pages on the WWW (World Wide Web). Crawling is
performed by software robots, known as "spiders" or "crawlers." These robots are
used to review the website content.
2. Indexing
Indexing is an online library of websites, which is used to sort, store, and organize
the content that we found during the crawling. Once a page is indexed, it appears
as a result of the most valuable and most relevant query.
The ranking is the last stage of the search engine. It is used to provide a piece of
content that will be the best answer based on the user's query. It displays the best
content at the top rank of the website.
To know more about how the search engine works click on the following link -
1. Indexing process
blocks -
i. Text acquisition
Index creation takes the output from text transformation and creates the indexes or
data searches that enable fast searching.
2. Query process
The query is the process of producing the list of documents based on a user's search
query. There are the following three tasks of the Query process -
i. User interaction
User interaction provides an interface between the users who search the content and
the search engine.
ii. Ranking
The ranking is the core component of the search engine. It takes query data from the
user interaction and generates a ranked list of data based on the retrieval model.
iii. Evaluation
Evaluation is used to measure and monitor the effectiveness and efficiency. The
evaluation result helps us to improve the ranking of the search engine.
ISP ( INTERNET SERVICE PROVIDER )
ISP stands for Internet Service Provider. It is a company that provides access
to the internet and similar services such as Website designing and virtual hosting.
For example, when you connect to the Internet, the connection between your
Internet-enabled device and the internet is executed through a specific transmission
technology that involves the transfer of information packets through an Internet
Protocol route.
DSL: DSL, which stands for 'digital subscriber line' is an advanced version of
the dial-up Internet access method. It uses high frequency to execute a connection
over the telephone network and allows the internet and the phone connection to run
on the same telephone line. This method offers an Asymmetric Digital Subscriber
(ADSL), where the upload speed is less than the download speed, and a Symmetric
Digital Subscriber Line (SDSL), which offers equal upload and download speeds.
Out of these two, ADSL is more popular among users and is popularly known as
DSL.
Wi-Fi Internet: It is the short form for "wireless fidelity," which is a wireless
networking technology that provides wireless high-speed Internet connections using
radio waves. To use the internet, you are required to be within the range of wi-fi
network. It is commonly used in public places such as hotels, airports, restaurants
to provide internet access to customers.
Internet Connection
A user can connect to the Internet either by dialing into an ISP's computer or
by directly connecting to the ISP. A dial-up connection is the one you initiate by
dialing into a modem over a telephone link whereas a direct connection is a
continuous connection to an ISP through a dedicated phone line.
The various types of Internet connections are:
Dial-up connection
ISDN
Leased Line or TI/T3 connections
DSL
Cable TV Internet connections
Satellite Internet Connections
Wireless Internet Connections
Dial-up connection
In dial-up connection, a connection is not always on. You connect only when
you want to access the services provided by the Internet and when you are finished
using it just disconnect.
The dial-up connection uses either the SLIP (Serial Line Internet Protocol) or
PPP (Point to Point Protocol) protocols. However, most of the dial up connections
are PPP protocol.
Most ISPs support modems at a speed upto 56 kbps, so these basically used
to read Emails and for Internet browsing. However, this type of connection has
problems with video connections, high speed gaming and multimedia intensive
websites. While using a dial up connection to access the Internet, you cannot talk on
the phone. Thus a dial-up connection does not allow simultaneous talking and
Internet access.
Isdn, ti and t3
ISDN was one of the first widely available truly digital services that stands
for Integrated Services Digital Network. Like a dial-up connection, it also
establishes a connection to your ISP over a phone line when required. However,
unlike dial-up connection it uses an upgraded phone line which carries the digital
signals instead of analog signals to provide fast internet access as well as regular
voice calls simultaneously. It also provides an interface for connecting other devices
such as fax machines, video phones etc.
A BRI ISDN service consists of three distinct channels on a single ISDN line:
two 64 Kbps B (Bearer) channels and one 16 Kbps D (Delta or Data) channels. The
B channels can carry digital data, digitized voice, images, and video at a speed of
64 Kbps. By combining two B channels, a data transfer rate upto 128 kbps may be
achieved. Besides this you can use each B channel for different purposes. One may
be used to talk on the phone and the other may be used to access the Internet.
However, this would temporarily limit your data transmission rate (bandwidth) to
64 Kbps as only one of the B channels is used for the Internet. The D channel is
intended for the purpose of signaling and control, messaging, and network
management. It can also be used for packet data transmission if not in use for
signaling purposes. However, it does not have the voice capability. BRI is basically
used for residential, small business and telecommuting applications that are not
particularly bandwidth intensive.
In order to connect your computer to the ISDN line, Terminal Adapters are
used which are also known as ISDN modems. These modems come with phone
jacks that allow you to plug in your telephone, computers, fax machine and video
conferencing devices and use the ISDN line just like a normal phone line.
Advantages of ISDN:
Dsl:
DSL stands for Digital Subscriber Line which has emerged as a new Internet
access technology that has brought high connection speeds to home users and
business organizations. It is one of the most common forms of broadband
connection as it provides fast Internet access over ordinary telephone lines. Several
versions of DSL technology are available for home and business use which is the
reason why it is also known as xDSL.
Some of the most common DSL are:
All these differ depending on their upload and download speeds, bits transfer
rate, levél of service, distance, and other factors. ADSL is the most popular and
widely used high speed broadband connection that allows digital information to be
sent at a very high speed over ordinary phone lines. Using this connection, the same
phone line is splitted into two separate channels, one for data and one for voice.
Thus it allows you to talk on the phone without disconnecting from the Internet. It
is called asymmetric because the download speed is different from the upload
speed. These speeds vary according to the location from the phone company.
Some of the common download/upload speeds are 256 Kbps/128 Kbps, 512
Kbps/128 Kbps, 1Mbps/256 Kbps etc. The speed varies from 128 Kbps to 8 Mbps
and uploads speed ranges from 64 Kbps to 256 Kbps.
SDSL is like ADSI. With the difference that in the former both the upload and
download speeds are the same, In order to access the Internet using this connection,
DSL modem is required for the home users and DSL router is required for
connecting network computers.
1. It is always on connection and users need not wait as in case of dialup connection.
2. It allows you to use a telephone and Internet simultaneously.
3. It is ideal for downloading video games, watching movies online,
online video conferencing etc.
4. It is faster than ISDN and dial-up connection.
5. It eliminates the performance overhead associated with standard dial-up
connection as there is no need to convert analog signal to digital signal and
digital to analog signals.
Disadvantages of DSL Connection:
Satellite Internet Connection offers yet another option for accessing the
Internet. It is the most expensive alternative for getting a high speed connection to
the Internet. These connections are aimed for those remote areas where other
Internet connections such as dial-up. ISDN, DSL cannot be used.
Satellite Internet connection can either be a one way connection or a two way
connection.
In one way connection, you can only download data but in order to upload the data
you need a dialup access through an ISP over telephone line. However, in a two way
connection, downloads and uploads are done by the satellite without a need of dialup
connection.
In order to use this connection, you need a satellite dish, a coaxial cable and
communication software. In addition, we may also need a phone line and a dial-up
Internet connection if the user has to upload also.
The download speed for these connections varies from 400 Kbps to 6 Mbps
and the upload speed is limited upto a maximum of 128 Kbps.
Advantages of Satellite Internet Connection:
With the recent advancement in wireless technology, the high speed Internet
access is not limited to the desktop but it is now accessible on laptops, PDA's,
mobiles also, Wireless Internet is the broadband Internet connection that uses the
radio frequency bands to connect to the Internet instead of using a telephone line or
a cable network. This type of connection provides a very high speed, usually more
than 10 Mbps and costs considerably less than the equivalent other wired versions.
Its speeds range from 256 Kbps to 10 Mbps. It is possible to use this type of
connection to transmit data over long distances. This type of connection is used by
business travelers and persons who are away from home as long as they are within
the network's coverage area.
Bluetooth: It is also used for Internet access although it doesn't have the
potential of wi-fi due to its limited distance. It was primarily promoted as it
eliminated the need for cables between devices such as keyboard, computers,
cellular phones, PDA's etc. The current standard allows upto 8 devices to be
connected to it.