Storage Management Techniques Explained
Storage Management Techniques Explained
Storage Management
Storage management refers to the software and processes that improve the
performance of data storage resources in a computer system.
1. primary storage
2. backup storage
3. archived storage
Backup storage holds copies of primary storage data for use in disaster
recovery.
Archived storage holds outdated or seldom-used data that must be retained for
compliance or business continuity.
2. Click System
3. Click Storage Category on the left side of the window to view the computer
storage
Under the storage section, we can click a hard drive to see what is taking up
space on the drive.
In the storage usage screen, the items are broken down into categories. We can
click any category shown to see items wherein we want to remove from the hard
drive or not.
Storage management also includes 10 processes which help businesses store more
data on existing hardware, speed up data retrieval, prevent data loss, meet
data retention requirements, and reduce IT expenses. These processes are:
1. Network virtualization
2. Replication
3. Mirroring
4. Security
5. Data Compression
6. Deduplication
7. Traffic analysis
8. Process automation
9. Storage provisioning
[Link] management
Network Virtualization
Example image of a home network with multiple computers and other network
devices all connected.
In the traditional way a business operates is by having one machine for one
application.
• One server is assigned to run an email service and that server has MS
Windows as its OS.
• 2nd server is used for running a website and it is using a Linux as its
OS.
• 3rd server is used for running a database and it is using Unix as its OS.
So, one machine is running with one application. Those 3 servers are running
3 different OS.
But instead of having 3 servers running one application each, what if just one
server could do the job just as good and do more efficiently. So, basically
one server would take the place of the three and run all the applications and
even run their different Operating Systems.
So, this is what Virtualization does.
But not only the applications, but it’s also running the different Operating
systems side by side and it is doing this all by using software and it does it
so well when the user interact with a virtual server they would interact the
same way as if they were still on multiple physical servers. They won’t be
able to tell the difference.
The software that creates and runs the Virtualization is called Hypervisor.
1. Type 1
2. Type II
A Type 1 hypervisor is installed on empty, bare metal hardware. Meaning that
there are no existing OS or any other software installed on the machine.
Type 1 hypervisors are the most common because they are used in Enterprise
Data Centers.
Data Centers are a large group of networked computer servers typically used by
organizations for the remote storage, processing, or distribution of large
amounts of data.
A data center can take on various tasks, from simple processes such as storage
for data backup to the storage and execution of basic IT processes. Some data
centers act as a connection point that bring together different colocation
environments.
Microsoft Hyper-V, codenamed Viridian, and briefly known before its release as
Windows Server Virtualization, is a native hypervisor; it can create virtual
machines on x86-64 systems running Windows.
Hyper-V was first released with Windows Server 2008, and has been available
without additional charges since Windows Server 2012 and Windows 8.
Example:
People will use a Type II hypervisor on their computer if they want to test
out new software for research purposes or to try out and test different
Operating systems.
The machine is running 2 VMs. One MS Windows and the other is Linux.
1. Oracle VM Virtualbox
2. Microsoft Virtual PC
3. VMWare Workstation
Each virtual machine can execute its own operating system, including versions
of Microsoft Windows, Linux, BSD (Berkeley Software Distribution), and MS-DOS.
It is developed and sold by VMware, Inc.
Benefits of Virtualization
Because a business won’t need as many physical machines or the power that
it takes to run the machines. It can just create a virtual machine
instead.
4. Portability
And this is because today computers and servers are so powerful that most
of the time their full potential is not being used.
The software applications that they are running are not able to utilize
the full machine’s potential.
So, the majority of the machines power is not being used. But with
virtualization, it can create virtual machines.
So, it can push a machine’s capacity to its limit and take full advantage
of a machine’s capability.
Virtual machines are just software files and those files can be backed
up and they can be uploaded to multiple machines. So, if a machine goes
down, the other machines will be there to take over.
A Virtual Local Area Network (VLAN) is any broadcast domain that is partitioned
and isolated in a computer network at the data link layer.
With the color code, we can identify that the particular VM1 lies on VLAN ID
100 and VM12 lies on VLAN ID 101.
Moving the NP from VM1 to VM12, so how this network packet moves.
In the absence of Network Virtualization, let’s see how the NP moves from VM1
to VM12.
1. The NP has to come out from the VM1 and then go to the physical network
of Server 1.
2. From the Server 1, it has to go to the uplink switch where it is installed.
On the switch, we have to configure it into VLAN ID 100 and VLAN ID 101.
If we did not configure it, the switch will not get to know to which VM
we have to forward the particular NP.
3. To move the packet from VM1 to VM12, the NP will go all the way from the
VM1 to the Server1 and from the Server1 to the Switch. The Switch will
get to know that which VLAN ID it has to pass the NP but if the VLAN ID
is different then it has to go through the router also.
But once the NP pass through the Router, it will go back to the Switch
and then going to the Server1, from the Server1 it will go to the Virtual
Switch until it landed to the VM12 virtual machine.
1. Nodes
3. Storage
A physical view of a network shows the network topology with the physical
aspects like ports, cables, racks, routers, switches, hubs, etc.
The Internet Protocol is a set of rules for communication over the internet,
such as sending mail, streaming video, or connecting to a website.
1. Network ID
2. Host ID
Example:
On a home network — [Link], the 192.168.1 is the network ID, and the final
number is the host ID.
The Host ID refers to the specific device on that network. (Usually your router
is .1, and each subsequent device gets assigned .2, .3, and so on.)
IT staff can create Virtual Private Networks (VPN) without changing hardware
configuration and provision virtualized services like SD-WAN (Software-Defined
Wide Area Network) and security faster, making it easier to connect users to
applications.
SD-WAN simplifies the deployment and management of Wide Area Network (WAN) and
provides secure, reliable connectivity for all employees on any devices
everywhere.
1. Data Centers
2. Cloud Computing or Cloud Services
3. Virtualization or Virtual Computing
Data Centers
Though Data Centers can be smaller, they are generally large facilities which
provides massive amounts of power cooling and bandwidth.
Only large organizations like Facebook or Google can afford to build their own
private data centers to provide services to their users.
Cloud Service Providers like CISCO, or Amazon Web Services (AWS) or Microsoft
Azure, offer their services out of data centers.
1. Public clouds
2. Private clouds
We can find SaaS in Office 365, Adobe Creative Cloud, or even computer gaming
software in which access to the software happens typically through a Web
browser.
PaaS is where the cloud service provider provides the platform like the Java
or .Net platform for a developer to develop an application. The application
involves providing data bases and tools to the developer so that they can
quickly develop an application.
IaaS refers to virtual computing that can be provided over the Internet on
demand.
1. Virtual servers
2. Virtualized storage
3. Virtualized networking capabilities
The central console is an OS window where users interact with the OS or with
a text-based console application by entering text input through the computer
keyboard, and by reading text output from the computer terminal.
It is the text entry and display device for system administration messages,
particularly those from the BIOS or boot loader, the kernel, from the init
(initialization) system and from the system logger.
The guest machines can be cloud based virtual machines, file shares, or physical
servers. The virtualization is done at the host level via software regardless
of the physical storage array.
Replication
1. Administering
2. Managing management servers
3. Managing storage systems
4. Managing host systems
5. Managing logical paths
6. Setting up data replication
7. Practicing disaster recovery
8. Monitoring health and status
9. Managing security
Administering
You can set up active and standby management servers, restore a lost
connection between the management servers, or complete a takeover on the
standby management server.
To replicate data among storage systems using Copy Services Manager, you
must add connections to the storage systems. After a storage system is
added, you can associate a location, modify connection properties, set
volume protection, and refresh the storage configuration for that storage
system.
You can use practice volumes to test your disaster recovery actions while
maintaining disaster recovery capability.
There are several options within Copy Services Manager for monitoring the
health and status of sessions, storage systems, host systems, and
management servers.
Managing security
• Electronic banking,
• Sensor networks,
Mirroring
• If done, we can open any drive that you want. The 2 disks content are
the same.
• If we add content, it will automatically be copied to the drive.
• We can use it as a backup. That is recommended.
• Click Next
• Select the 2nd disk to be added to the mirror and
• Click Next
• Change the volume label and check the box in perform a quick format
• Click Next
• Once done, since both are empty, we are going to add a folder, we will
notice that it automatically add into the 2nd drive.
• Just keep in mind that this is a redundant system, not a back up
system.
• So, everything copied on the 1st drive will automatically be copied on
the 2nd drive. That is how mirroring works.
Security
Operating system security (OS security) is the process of ensuring OS
integrity, confidentiality and availability.
OS security refers to specified steps or measures used to protect the OS from
threats, viruses, worms, malware or remote hacker intrusions.
Patch management is the process of ensuring that your software and operating
systems are kept up-to-date with the latest security patches and software
updates. These updates are released by software vendors to fix bugs, patch
security vulnerabilities, and improve the overall performance of your
software.
You can create a local user account (an offline account) for anyone who will
frequently use your PC. The best option in most cases, though, is for everyone
who uses your PC to have a Microsoft account. With a Microsoft account, you
can access your apps, files, and Microsoft services across your devices.
If needed, the local user account can have administrator permissions; however,
it's better to just create a local user account whenever possible.
1. Select Start > Settings > Accounts and then select Family & other
users. (In some versions of Windows, you'll see Other users.)
2. Next to Add other user, select Add account.
3. Select I don't have this person's sign-in information, and on the next
page, select Add a user without a Microsoft account.
4. Enter a user name, password, or password hint—or choose security
questions—and then select Next.
Note: If you choose an account that shows an email address or doesn't say
"Local account", then you're giving administrator permissions to a
Microsoft account, not a local account.
Data Compression
Data compression squeezes(reduces) data into a smaller size.
Data compression can decrease the amount of storage a file takes up.
To do compression, we have to encode data using fewer bits than the original
presentation.
For example, in a 2:1 compression ratio, a 20-megabyte (MB) file takes up 10
MB of space. As a result of compression, administrators spend less money and
less time on storage.
The pixel -- a word invented from picture element -- is the basic unit of
programmable color on a computer display or in a computer image. It is the
smallest unit in a digital display.
It is a tiny dot of comprises in a computer-based images.
These are the small dots we see if you put your face too close to your
television or computer screen. Each digital image is comprised of thousands or
millions of individual pixels, each with its own color.
To know where rows end, image files have metadata, which defines properties
like dimensions.
Each pixel’s color is a combination of 3 additive primary colors: red, green
and blue.
Mixing full intensity, red and green, but no blue, but no blue (it’s 0), it
results to yellow color.
The data image has 16 pixels, and each of those pixels needs 3 bytes of color
data.
That means the data images will consume 48 bytes of storage.
16 pixels x 3 bytes = 48 bytes
The data can be compressed and pack it into a smaller number of bytes than 48
bytes.
Ways to compressed data:
1. Reduce repeated or redundant information.
Run-Length Encoding takes advantage of the fact that there are often
runs of identical values in files.
To ensure that computers don’t get confused with which bytes are run lengths
and which bytes represent color, we have to be consistent on how we apply the
scheme.
3 3 3 9 9
To do this, we need a dictionary that stores the mapping from codes to data.
Example:
We can view our image as not just as a string of individual pixels, but as
little blocks of data.
For simplicity, we are going to use pixel pairs, which are 6 bytes long, but
blocks can be of any size.
• White-yellow
• Black-yellow
• Yellow-yellow
• White-white
Those are the data blocks in our dictionary we want to generate compact codes
for.
What’s is interesting is that the blocks occur at different frequencies.
4. Now, we repeat the process. This time we have 3 things to choose from.
Just like before, we select the two with the lowest frequency, put them
into a little tree and record the new total frequency of all the sub-
items.
5. This time it’s easy to select the 2 items with the lowest frequency
because there are only 2 things left to pick.
6. Then combine the figure above into a tree. Below is the final figure of
our tree and it having a very cool property; it is arranged by
frequency, with less common items lower down.
Then, how the tree gets us to a dictionary?
Yellow-Yellow is encoded as 0 .
White-yellow is encoded as 1 0 .
Black-Yellow is 1 1 0 .
White is 1 1 1.
The really cool thing about these codewords is that there’s no way to have
conflicting codes, because each path down the tree is unique.
This means our code is prefix-free, that is no code starts with another
complete code.
WY = 10
BY = 110
YY = 0
YY = 0
YY = 0
WW = 111
WY = 10
YY = 0
10 110 0 0 0 111 10 0 = 14 bits
But this data is meaningless unless we also save our code dictionary.
So, we will need to append it to the front of image data like below:
Now, including the dictionary, the image data is 30 bytes long (refer to
blocks).
That is still a significant improvement over 48 bytes.
The 2 approaches, removing redundancies and using more compact
representations, are often combined and underlie almost all lossless
compressed file formats like GIF, PNG, PDF, and ZIP files.
GIF stands for Graphics Interchange Format.
GIF is a raster file format designed for relatively basic images that appear
mainly on the internet. Each file can support up to 8 bits per pixel and can
contain 256 indexed colors. GIF files also allow images or frames to be
combined, creating basic animations.
PNG, short for Portable Network Graphics, is a popular and high-quality
graphic file format. The PNG format is both lossless and supports
transparency, making it great for webpages. You can view PNG files in almost
any graphic program, image viewer, and web browser.
Adobe PDF files—short for portable document format files—are one of the most
commonly used file types today. If you've ever downloaded a printable form or
document from the Web, such as an IRS tax form, there's a good chance it was
a PDF file. Whenever you see a file that ends with . pdf, that means it's a
PDF file.
A zip file is a file format that can contain multiple files combined and
compressed into one file. Files that are zipped have a file extension of.
zip. Since it's a type of compressed file, a zip file can be smaller in size
than the files it contains. This makes the zip file easier and faster to
download.
Both run-length encoding and dictionary coders are lossless compression
techniques.
Lossless compression technique means no information is lost. When decompress,
the original file is still there.
That’s really important for many types of files.
Like, it would be very odd if I zipped up a word document to send to you, and
when you decompressed it on your own computer, the text was different.
But there are other types of files that we can get away with little changes,
perhaps by removing unnecessary or less important information, especially
information that human perception is not good of detecting.
And this trick underlies most lossy compression techniques.
These tend to be pretty complicated, so we are going to attack this at a
conceptual level.
Let’s take sound as an example.
Your hearing is not perfect.
We can hear some frequencies of sound better than others.
And there are some we can’t hear at all, like ultrasound.
Basically, if we make a recording of music, and there’s data in the
ultrasonic frequency range, we can discard it, because we know that humans
can’t hear it.
On the other hand, humans are very sensitive to frequencies in the vocal
range, like people singing, so it’s best to preserve quality there as much as
possible.
Deep bass is somewhere in between. Humans can hear it but we are less attuned
to it.
We mostly sense it.
Lossy audio compressors take advantage of this, and encode different
frequency bands at different precisions.
Even if the result is rougher, it’s likely that users won’t perceive the
difference.
Or at least it doesn’t affect the experience.
And here comes the hate mail of audiophiles.
Audiophiles are an exceptional breed of people who are fascinated by pure
audio, motivated by sound quality and addicted to audio gadgets. Audiophiles
take their passion for music one step further. They're curious about how
songs are recorded and the science behind how sounds are reproduced.
You encounter this type of audio compression all the time. It’s one of the
reasons we sound different on a cellphone versus in person.
The audio data is being compressed, allowing more people to take calls at
once.
As the signal quality or bandwidth get worse, compression algorithms remove
more data, further reducing precision, which is why Skype calls sometimes
like robots talking.
Compared to an uncompressed audio format, like a WAV or a FLAC (there we go,
got the audiophiles back).
The full form of WAV is Waveform Audio File Format. It's used on Computing, File
Extensions in Worldwide Waveform Audio File Format (WAVE) or WAV.
FLAC is an acronym for Free Lossless Audio Codec. Files with the .flac file
extension contain audio files that are compressed using lossless audio
compression. The compression of a FLAC file is similar to the compression of
a ZIP file, making the file more manageable and saving file storage space.
Compressed audio files like, MP3s, are often 10 times smaller.
This idea of discarding or reducing precision in a manner that aligns with
human perception is called perceptual coding, and it relies on models of
human perception, which come from a field of study called Psychophysics.
This same idea is the basis of lossy compressed of image formats, most
famously JPEGs (Joint Photographic Expert Groups).
Like hearing, the human visual system is imperfect (not perfect).
We are really good in detecting sharp contrasts, like the edges of objects,
but our perceptual system is not so hot with subtle color variations.
JPEG takes advantage of this by breaking images up into blocks of 8x8 pixels,
then throwing away a lot of high-frequency spatial (relating to) data.
Example:
A dog’s picture. On the other side is a patch of 8x8 pixels.
The one on the left is 1/3 the file size of the one on the right when using
JPEG file compression.
Data Deduplication
Traditional data back-up does not provide any inherent capability to prevent
duplicate data from being backed-up.
With the growth of information and 24 by 7 application availability
requirements, back-up windows are shrinking.
Traditional back-up process back-up a lot of duplicate data.
Backing up of duplicate data significantly increased the back-up windows
size, requirements, and results in unnecessary consumption of resources such
as storage space and network bandwidth.
Data deduplication helps to reduce the storage requirement for back-up, shot
in the back of window and remove the network burden.
It also helps to store more back-ups on the disk and retain the data for a
longer period of time.
Data deduplication is a process that eliminates excessive copies of data and
significantly decreases storage capacity requirements.
It is a technique used for eliminating duplicate copies of
repeating/redundant data.
It is one of the most important technologies used for online back-up and
recovery solution.
File level deduplication is simple and fast but does not address the problem
with the duplicate content inside the files.
Example:
10MB Powerpoint presentations with different title page are not considered as
duplicate files because each file will be stored separately.
Subfile level deduplication it breaks the file into smaller chunks and then
uses specialized algorithm to detect redundant data within and across files.
Traffic Analysis
Traffic Analysis is the analysis of patterns in communications for the
purpose of gaining intelligence about a system or its users.
The key benefits of network traffic analysis
Improved visibility into devices connecting to your network (e.g. IoT
devices, healthcare visitors) meet compliance requirements.
Network traffic analysis is defined as a method of tracking network activity
to spot issues with security and operations, as well as other irregularities.
An Overview of Network Traffic Analysis
It involves:
1. Collecting, storing, and monitoring all the data traversing (moving back
and forth or sideways) your on-premises, hybrid, or multi-cloud
infrastructure.
2. Need to visualize and search this data for network planning and design.
3. Need notifications when something’s gone wrong to effectively
troubleshoot.
This is to find out what’s out there on the network. We can’t analyze and
monitor something if we don’t know it exists.
We need to identify and categorize the types of sources we can collect data
from. There are applications, desktops, servers, routers, switches, firewalls,
and more. Each of these can provide various metrics we can collect for
analysis.
Decide Methods of Identification
You’ll need to determine the best methods you can use to identify your data
sources.
You can use a manual or automated approach. The manual approach involves
shifting through topology maps and other documentation.
So, consider the automated method with application and network discovery.
Network traffic, also called data traffic, is broken down into data packets
and sent over a network before being reassembled by the receiving device or
computer.
1. north-south
2. east-west
North-south Traffic
East-west Traffic
East-west traffic refers to traffic within a data center, also known as server-
to-server traffic.
A data packet is a unit of data made into a single package that travels along
a given network path.
When data travels over a network or over the internet, it must first be broken
down into smaller batches so that larger files can be transmitted efficiently.
The network breaks down, organizes, and bundles the data into data packets so
that they can be sent reliably through the network and then opened and read by
another user in the network.
Each packet takes the best route possible to spread network traffic evenly.
1. real-time
2. non-real-time
Real-time Traffic
Non-real-time Traffic
Process Automation
Process automation uses technology to automate complex business processes. It
typically has three functions: automating processes, centralizing information,
and reducing the requirement for input from people.
Storage Provisioning
Storage provisioning is a management technique that assigns storage capacity
to servers, computers, virtual machines and other devices. It may use
automation to allocate storage space in a networked environment.
Memory Management
Memory management is the process of controlling and coordinating a computer's
main memory.
It ensures that blocks of memory space are properly managed and allocated so
the operating system (OS), applications and other running processes have the
memory they need to carry out their operations.
The three major activities of the operating system with regard to memory
management are:
1. Keeping track of which parts of memory are currently being used and by
whom.
2. Deciding which processes are to be loaded into memory when memory space
becomes available.
3. Allocating and deallocating memory space as needed.
References:
[Link]
knbcthp3w8o7#:~:text=Page%20Number(p)%20%2D%20It,a%20word%20on%20a%20page.
[Link]
[Link]
[Link]
[Link]
[Link]
[Link]
ials&docid=603502557915658263&mid=790332991A71BD4329DD790332991A71BD4329DD&vi
ew=detail&FORM=VIRE
[Link]
[Link]
il&mid=FF598E7AC79EB2308EC4FF598E7AC79EB2308EC4&FORM=VDRVRV&ru=%2Fvideos%2Fse
arch%3Fq%3Dnetwork%2520virtualizatino%2520define%26FORM%3DVDVVXX&ajaxhist=0
[Link]
=vid&sa=X&ved=2ahUKEwiKi8PL5qj-
AhXkUPUHHRtVDLMQ_AUoA3oECAEQBQ&biw=1280&bih=581&dpr=1.5#fpstate=ive&vld=cid:d
2b6453f,vid:G1lCF5VALsc
[Link]
20operating%20system%20video&oq=replication%20in%20storage%20management%20in%
20operating%20system%20video#fpstate=ive&vld=cid:abf2bd66,vid:bB_qs0ZZci0
[Link]
[Link]/free
[Link]
[Link]
security#What_Does_Operating_System_Security_Mean
[Link]
[Link]
[Link]
=lnms&tbm=vid&sa=X&ved=2ahUKEwjyjM60283-
AhWXb2wGHc5zANoQ_AUoAnoECAEQBA&biw=1280&bih=581&dpr=1.5#fpstate=ive&vld=cid:2
215189f,vid:OtDxDvCpPL4
[Link]
=lnms&tbm=vid&sa=X&ved=2ahUKEwjyjM60283-
AhWXb2wGHc5zANoQ_AUoAnoECAEQBA&biw=1280&bih=581&dpr=1.5#fpstate=ive&vld=cid:2
215189f,vid:OtDxDvCpPL4
[Link]
m=vid&sa=X&ved=2ahUKEwjKxaiT2tb-
AhVTsVYBHcsvB5UQ_AUoAnoECAIQBA&biw=1280&bih=581&dpr=1.5#fpstate=ive&vld=cid:d
af0719e,vid:bcMBO39mzWo