0% found this document useful (0 votes)
14 views4 pages

Unit 4

Cloud storage management focuses on organizing, protecting, and optimizing data in the cloud to achieve security, performance, and cost goals. It encompasses various storage types, including block, file, and object storage, while emphasizing data protection, backup, performance monitoring, and compliance. Key benefits include cost efficiency, scalability, and improved accessibility, with best practices for operational management and data classification to enhance security and compliance.

Uploaded by

knightsoul01
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
14 views4 pages

Unit 4

Cloud storage management focuses on organizing, protecting, and optimizing data in the cloud to achieve security, performance, and cost goals. It encompasses various storage types, including block, file, and object storage, while emphasizing data protection, backup, performance monitoring, and compliance. Key benefits include cost efficiency, scalability, and improved accessibility, with best practices for operational management and data classification to enhance security and compliance.

Uploaded by

knightsoul01
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

for this too in detail

Cloud storage management covers how data is organized, protected, stored, and optimized in
the cloud to meet security, performance, availability, compliance, and cost goals, using tiers,
redundancy, and managed services across distributed data centers. [1]

Core concepts
Cloud storage centralizes administration of object, block, and file services while enforcing
encryption, backup/recovery, performance monitoring, capacity planning, and compliance
policies across providers’ infrastructures. [1]
Data is distributed across global data centers with replication and virtualization for high
availability, durability, scalability, and on‑demand access via APIs, protocols, and
dashboards. [1]

Key benefits
Cost efficiency through pay‑as‑you‑go usage and provider economies of scale, avoiding
large hardware capex and reducing ops burden with managed reliability. [1]
Elastic scalability, improved accessibility from any location, and performance gains via
optimized configurations and parallel access patterns. [1]

Management focus areas


Data protection and security: encryption, access control, and integrity checks to prevent
unauthorized access or tampering of sensitive data. [1]
Backup and recovery: automated, policy‑driven backups with point‑in‑time recovery to
sustain availability during failures or disasters. [1]
Performance and reliability: monitoring IOPS/throughput/latency and tuning tiers/placement
to avoid bottlenecks. [1]
Capacity and cost: right‑sizing allocations, using lifecycle policies, and storage tiering to
curb spend while meeting SLAs. [1]
Policy and compliance: implementing retention, residency, and audit controls aligned with
regulations like GDPR/HIPAA. [1]
How storage is organized
Storage tiers: hot, cool, and archive tiers balance access frequency, latency, and cost to
match data temperature over time via lifecycle policies. [1]
Access methods: consoles, APIs/SDKs, and mounting options expose storage to apps and
users securely at scale. [1]
Data management plans: define structure, ownership, retention, and quality processes to
keep data findable and useful over its lifecycle. [1]

How data is stored


Data centers and virtualization: data lands on VMs over physical servers in provider
facilities, with virtualization pooling and abstracting underlying resources. [1]
Redundancy and replication: multiple copies across machines and regions sustain durability
and availability despite localized failures. [1]
Dynamic scaling: providers allocate additional storage and compute as demand grows
without downtime. [1]

Storage types overview


Block storage: raw, high‑performance volumes for databases and transactional workloads
that need low latency and fine‑grained control. [1]
File storage: hierarchical shares via NFS/SMB for collaborative file access, content repos,
and user directories. [1]
Object storage: objects with rich metadata for massive unstructured datasets like media,
logs, and backups with high durability and global access. [1]

Distributed file systems


DFS splits files into chunks, distributes and replicates them across nodes, with a metadata
service mapping locations for transparent client access. [1]
Benefits include horizontal scalability, high availability via replication, improved performance
through parallelism, cost efficiency, and collaboration at global scale. [1]

Cloud databases primer


Cloud DB characteristics: internet accessibility, elastic horizontal/vertical scaling,
public/private/hybrid deployment models, and DBaaS for managed operations. [1]
Benefits include lower upfront cost, agility for rapid experiments, reduced admin overhead,
and better reliability from automated HA/backups. [1]
NoSQL in the cloud
HBase: column‑oriented, Bigtable‑style store atop HDFS/object storage with real‑time
random access, horizontal scaling, and cloud‑native deployments for large sparse data and
low‑latency reads/writes. [1]
MongoDB: document model with flexible schemas; managed offerings (e.g., Atlas) provide
automated backups, global distribution, failover, and easy scaling for modern app agility. [1]
Cassandra: peer‑to‑peer, linearly scalable, multi‑datacenter replication with high write
throughput; managed services simplify operations and integrate with analytics/monitoring
stacks. [1]
DynamoDB: fully managed, serverless key‑value/document store with single‑digit
millisecond latency at any scale, automatic capacity scaling, HA/durability, and flexible
schemas for internet‑scale apps. [1]

Data classification in cloud


Levels: Public, Internal, Confidential, and Restricted, each dictating controls like encryption,
access restrictions, and monitoring rigor. [1]
Importance: mitigates risk, enforces compliance, strengthens security posture, reduces cost
by tiering/retiring low‑value data, and improves visibility/control. [1]

Big data in cloud


3 Vs: Volume, Velocity, Variety across structured, semi‑structured, and unstructured sources
necessitate specialized processing and storage approaches. [1]
Issues: security and privacy compliance, complex integration across formats/sources, data
quality/veracity, cost control at large scale, vendor lock‑in, and sustaining performance as
volumes surge. [1]

Cloud storage systems recap


Core traits: on‑demand access, elasticity, cost‑effectiveness, provider‑managed
infrastructure, and high availability/durability from redundancy. [1]
How it works: data upload over the network, off‑site storage in data centers, virtualized
distribution across nodes, API/protocol access, and replicated copies for resilience. [1]

Choosing the right storage


Use object storage for durable, scalable archives, media, logs, backups, and data lakes with
rich metadata and lifecycle tiering. [1]
Use block storage for latency‑sensitive databases, VM disks, and transactional systems
needing consistent IOPS and filesystem control. [1]
Use file storage for shared workspaces, content management, legacy apps expecting
POSIX‑like semantics, and lift‑and‑shift file servers. [1]
Operational best practices
Encrypt in transit and at rest, enforce least‑privilege IAM, and enable detailed auditing with
alerts on abnormal access patterns. [1]
Implement automated backups with immutable copies and cross‑region replication; test
restores and define RPO/RTO aligned to business needs. [1]
Use lifecycle rules to tier cold data, deduplicate/recompress where applicable, and monitor
usage, I/O, and egress to optimize spend and performance. [1]

Quick comparisons
Topic Highlights

Block: raw, low‑latency volumes for DB/VMs; File: shared hierarchical access via
Block vs File vs Object
NFS/SMB; Object: scalable, metadata‑rich storage for unstructured data [1]

DFS Chunks, replication, metadata server, parallel access, resilient scaling [1]

HBase: columnar on DFS; MongoDB: document with managed global options;


HBase vs MongoDB vs
Cassandra: peer‑to‑peer, linear scale; DynamoDB: fully managed, serverless
Cassandra vs DynamoDB
low‑latency store [1]

Exam pointers
Explain how tiering and lifecycle policies reduce cost without sacrificing SLAs, and map
workloads to storage types with trade‑offs in latency, durability, and semantics. [1]
Justify database selection using access patterns, consistency/latency needs, write
throughput, global distribution, and operational constraints (managed vs self‑hosted). [1]
A one‑page cheat sheet with diagrams for tiers, DFS architecture, and NoSQL choices can be
prepared on request, along with practice questions mapping workloads to the right
storage/database choice. [1]

1. [Link]

You might also like