SAA-C02 Storage Services

Different solutions to allow different types of migrations.
On prem, use different types, same on cloud. DIfferent benefits: costs, capacity, durability, r/w/ speeds, auditing, backup etc.
Not all data treated the same.
Blocks: low latency, like direct access storage
File: Separate files in directories, in a file system. Like NAS
Object: referenced by key, flat address space.

EBS Elastic block storage

EBS Disk volume types

HDD is for high throughput, big data, logging, larger blocks, not boot volumes.
SSD good for smaller blocks, db transactions, boot volumes

EBS Encryption

EBS Pricing

Charged per month for capacity, even if don't use capacity.

AWS EC2 instance store (Instance store volume)

Benefits:

Features:

S3 (Simple storage Service)

Definitions

Features

S3 storage classes

Lifecycle rules can move or delete data to save storage costs.

S3 Glacier

Glacier retrieval

S3 Glacier Deep Archive

Good for long term access, e.g. financial records. Only one retrieval method, slow.

S3 storage classes summary

Choose based on durability, access times, etc.

S3 Bucket properties and management features

S3 Versioning

S3 Server access logging

S3 Static website hosting

S3 Object level logging

S3 Default encryption

Only new update objects.

Object lock

S3 Tags

Use for cost allocation to help with categorisation
Tags can be e.g. project name, and environment name (e..g test, dev)

S3 Transfer acceleration

S3 Events

S3 Requestor pays

Elastic File store (EFS)

File system Differences:

To the user looks like a on-prem file system.

EFS Storage classes and options

Both same level of durability and accessibility.
Can use EFS lifecycle management to move between these storage classes. Small files and metadata don't get moves to IA.

EFS has 2 different performance modes:

EFS has 2 throughput modes:

Creating EFS

Mount to connect storage to EC2. Use either Linux NFS or EFS mount helper (preferred). Install EFS mount helper to set recommended options. Built in logging. Create a security group to allow the EC2 instances to write to the EFS.

Basically create the EFS, create a folder, mount the EFS to that folder. Like mapping, so you can access e.g. one EFS mount from 2 EC2 instances.

EFS Security

To create the EFS file system, you need 'allow' access to EFS, and EC2 access to create network interface. Need to have policies set up.

  1. Encryption at rest
    Need customer master key. AWS KMS encrypts and decrypts the AWS keys. 2 types of customer master keys: created by amazon / created externally. CMKs created first time you use them
    Another course on encryption.

  2. encryption in transit
    TLS: transport layer security Stunnel: open source app for TSL / SSL tunnelling service. Listens for encrypted data and routes to decryption.

Importing data to EFS

AWS Datasync: to transfer data from customer site.
Download Datasync agent, configure source and target.
Also good for syncing data between EFS instances.

Backup and DR strategies

Either could the be priority.

  1. Backup and restore
  1. Pilot light
  1. Warm standby
  1. Multi site: full replica

Replication considerations

Replication types

Synchronous: atomic write to both.
Asynchronous: not atomically written to multi locations. Transfers as network allows. Used by file systems, some databases, good for read only copy.

Using AWS Storage for on-prem backup and DR

Traditional

issues

data transfer mechanisms

3 direct connection types (plus snowball, )

  1. Direct connect (fastest)
  2. VPN connection
  3. Internet connection

Different times to transfer data after restore. Calculate the time.

S3 as a backup solution

Storage classes

Moving data to glacier

Has Vault locks.

Glacier retrieval

S3 regions

S3 is by default single region. Can turn on cross region replication (CRR).
Maybe do this for extra durability

S3 performance

Multipart upload should be used when object > 100Mb. Faster, more throughput, interruption recovery, easier management

S3 Security

AWS Snowball for data transfer

in and out of AWS
Physical appliance
50 or 80 Tb devices.
Dust, impact and tamper resistant.
E-link shipping label so goes to the correct place.
tracked with SNS (simple notification service)

Consider snowball for restore when your connection is slow.
Create export job in AWS management console

AWS storage gateway

Software client in your data centre to connect to amazon.
It's a VM running in your site.
Can be configured as file gateways, volume gateways, virtual tape library

Optimising Storage costs

S3 costs based on volume.
With intelligent tier, costs decrease as more data added per month

Other storage classes are fixed cost per Gb

Retrieval costs: Pay for requests (e.g. API calls) and for volume retrieval per Gb

Data transfer costs: Free with to EC2 or cloudfront, but costs when transfer to internet. Transfer acceleration (via edge locations) is more costly.

Management and replication costs: costs when use S3 inventory, analytics, Object tagging.

Batch operations: Batch for S3 operations. Priced per job and per mission operations.

S3 and glacier select: select the data you want, cheaper, but fixed costs

S3 replication: CRR cross region replication, or SRR same region replication. Costs for storage at other location, and data transfer between regions.

Versioning: implement life cycle policies on S3 to cover versions.

Cost optimisation

understand your data: profile of retrieval, data request patterns, retrieval costs, speed to get data. Make use of lifecycle controls.

EFS costs

EFS Standard storage is volume per month, no access charges.
EFS IA has cheaper storage classes (90% cheaper), but also read and write charges.
Lifecycle management moves data between these automatically. Excludes files below 128k, and metadata. configure the lifecycle management timer to e.g. 30 days.
EFS bursting throughput: caters for occasional high peaks, included.
EFS provisioned throughput: higher level, costs.

FSx

FSX for windows file system: managed windows files system on AWS. Full support for NTFS.
FSx for Lustre: for compute intensive workloads & ML
Pay for storage capacity used.

AWS Storage Gateway costs

Hardware or software on prem to connect to AWS. Types are File / volume / tape.

AWS Backup costs

Solution to backup AWS services.
Also automate backup from on prem.
Centralises management, auditing, logging, restores.
Need to create backup plans: schedule, window, vault, tags, regional copies etc.
Can use tag based policies.