Data Tiering

Data tiering is a technique for moving less frequently used data, also known as cold data, to cheaper tiers or classes of storage. The term originally referred to moving data between tiers within a single storage system, but it has since expanded to include tiering or archiving data from a storage system to other storage systems and clouds. See also cloud tiering and the choices for cloud data tiering.

Data Tiering Cuts Costs Because 70%+ of Data is Cold

As data grows, storage costs escalate. It is tempting to think the solution is more efficient storage, but the real driver of rising costs is poor data management. Over 70% of data is cold: it has not been accessed in months, yet it sits on expensive storage and consumes the same backup resources as hot data. As a result, storage costs keep rising, backups are slow, recovery is unreliable, and the sheer bulk of this data makes it difficult to take advantage of newer options such as Flash and cloud.

Data Tiering Was Initially Used within a Storage Array

Data Tiering was initially a technique used by storage systems to reduce the cost of data storage by tiering cold data within the storage array to cheaper but less performant options – for example, moving data that has not been touched in a year or more from an expensive Flash tier to a low-cost SATA disk tier.

Typical storage tiers within a storage array include:

  • Flash or SSD: A high-performance storage class but also very expensive. Flash is usually used on smaller data sets that are being actively used and require the highest performance.
  • SAS Disks: Usually the workhorse of a storage system, they are moderately good at performance but more expensive than SATA disks.
  • SATA Disks: Usually the lowest price-point for disks but not as performant as SAS disks.
  • Secondary Storage, often Object Storage: Usually a good choice for capacity storage – to store large volumes of cool data that is not as frequently accessed, at a much lower cost.
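
To make the idea concrete, here is a minimal, hypothetical sketch of an age-based placement policy along the lines of the tiers listed above. The thresholds, tier names, and the /data path are illustrative assumptions, not any vendor's actual defaults, and last-access times can be unreliable on filesystems mounted with noatime.

```python
import os
import time

# Illustrative last-access thresholds (days) mapped to the tiers above.
# These values are assumptions; real arrays use their own policies.
TIER_THRESHOLDS = [
    (30, "flash/ssd"),   # accessed in the last 30 days
    (90, "sas"),         # accessed in the last 90 days
    (365, "sata"),       # accessed in the last year
]
DEFAULT_TIER = "object/secondary"  # colder than a year

def classify_by_last_access(path: str) -> str:
    """Return the tier this simple policy would place a file on."""
    age_days = (time.time() - os.stat(path).st_atime) / 86400
    for max_age, tier in TIER_THRESHOLDS:
        if age_days <= max_age:
            return tier
    return DEFAULT_TIER

if __name__ == "__main__":
    # /data is a placeholder root; point this at the share you want to scan.
    for root, _dirs, files in os.walk("/data"):
        for name in files:
            path = os.path.join(root, name)
            print(path, "->", classify_by_last_access(path))
```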

Cloud Data Tiering is now Popular

Increasingly, customers are looking at another option – tiering or archiving data to a public cloud.

  • Public Cloud Storage: Public clouds offer a mix of object and file storage options. Object storage services such as Amazon S3 and Azure Blob Storage provide tremendous cost efficiency and all the benefits of object storage without the headaches of setup and management.

Tiering and archiving less frequently used (cold) data to public cloud storage classes has become increasingly popular, because customers can keep cold data in the cloud's lower-cost storage classes and promote it to higher-performance classes when needed. For example, data can be archived or tiered from an on-premises NAS to Amazon S3 Infrequent Access or Amazon S3 Glacier for low ongoing costs, and then promoted to Amazon EFS or Amazon FSx when you need to operate on it with higher performance.
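
As a hedged illustration of the cloud side of that workflow, the sketch below uses the AWS SDK for Python (boto3) to upload a cold file directly into the S3 Standard-IA storage class and to add a lifecycle rule that later transitions objects to Glacier. The bucket name and the 180-day transition are placeholder assumptions.

```python
import boto3

s3 = boto3.client("s3")
BUCKET = "example-cold-data-bucket"  # placeholder bucket name

def tier_file_to_s3_ia(local_path: str, key: str) -> None:
    """Upload a cold file straight into the S3 Standard-IA storage class."""
    s3.upload_file(
        local_path,
        BUCKET,
        key,
        ExtraArgs={"StorageClass": "STANDARD_IA"},
    )

def add_glacier_transition(days: int = 180) -> None:
    """Transition objects to Glacier after a configurable number of days."""
    s3.put_bucket_lifecycle_configuration(
        Bucket=BUCKET,
        LifecycleConfiguration={
            "Rules": [
                {
                    "ID": "cold-to-glacier",
                    "Status": "Enabled",
                    "Filter": {"Prefix": ""},
                    "Transitions": [
                        {"Days": days, "StorageClass": "GLACIER"}
                    ],
                }
            ]
        },
    )
```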

But in order to get this level of flexibility, and to ensure you’re not treating the cloud as just a cheap storage locker, data that is tiered to the cloud needs to be accessible natively in the cloud without requiring third-party software. This requires file-tiering, not block-tiering.

Block Tiering Creates Unnecessary Costs and Lock-In

Block-level tiering was first introduced as a technique within a storage array to make the storage box more efficient by leveraging a mix of technologies such as more expensive SAS disks as well as cheaper SATA disks.

Block tiering breaks a file into blocks: metadata blocks that contain information about the file, and data blocks that are chunks of the original file. It moves less-used cold blocks to lower, less expensive tiers, while hot blocks and metadata are typically retained on the higher, faster, and more expensive tiers.
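
The sketch below is a purely conceptual illustration (not any vendor's on-disk format) of why this matters: once a file is split into blocks and the cold blocks are shipped off, only the filesystem's private block map can reassemble the file, so the blocks on the cheap tier are unusable on their own.

```python
from dataclasses import dataclass, field

BLOCK_SIZE = 4096  # illustrative block size in bytes

@dataclass
class BlockMap:
    """Proprietary metadata held by the filesystem. Without this map,
    the blocks sitting on the cheaper tier are just opaque chunks."""
    file_name: str
    locations: list = field(default_factory=list)  # ordered (tier, block_id) pairs

def tier_cold_blocks(file_name: str, data: bytes, cold_from_block: int):
    """Split a file into blocks and push the 'cold' tail to a cheaper tier."""
    hot_tier, cheap_tier = {}, {}
    block_map = BlockMap(file_name=file_name)
    blocks = [data[i:i + BLOCK_SIZE] for i in range(0, len(data), BLOCK_SIZE)]
    for idx, block in enumerate(blocks):
        is_hot = idx < cold_from_block
        (hot_tier if is_hot else cheap_tier)[idx] = block
        block_map.locations.append(("hot" if is_hot else "cheap", idx))
    # Reassembly needs the map plus *both* tiers; an application that can
    # only see the cheap tier cannot reconstruct the original file.
    return block_map, hot_tier, cheap_tier
```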

Block tiering is a technique used within the storage operating system or filesystem and is proprietary. Storage vendors offer block tiering as a way to reduce the cost of their storage environment. Many storage vendors are now expanding block tiering to move data to the public cloud or on-premises object storage.

But because block tiering (NetApp FabricPool and Dell EMC Isilon CloudPools are examples) is implemented inside the storage operating system as a proprietary solution, it has several limitations when it comes to reuse and storage savings. First, the proprietary storage filesystem must be involved in all data access, because it retains the metadata and holds the “map” needed to reassemble the file from its blocks. This means the cold blocks moved to a lower tier or to the cloud cannot be accessed directly from the new location: the cloud has neither the metadata map nor the remaining data blocks, file context, and attributes needed to put the file together. Block tiering is therefore a proprietary approach that often forces unnecessary rehydration of the data and treats the cloud as a cheap storage locker rather than as a powerful way to use data when needed.

The only way to access that data in the cloud is to run the proprietary storage filesystem there, which adds cost. In addition, many third-party applications, such as backup software that operates at the file level, require the cold blocks to be brought back (rehydrated), which defeats the purpose of tiering to lower-cost storage and erodes the potential savings. For more details, read the white paper: Block vs. File-Level Tiering and Archiving.


File Tiering Maximizes Savings and Eliminates Lock-In

File tiering is a modern technique that uses standard protocols to move an entire file, along with its metadata, to a secondary tier or to the cloud in a non-proprietary format. File tiering is harder to build but better for customers, because it eliminates vendor lock-in and maximizes savings. Whether files carry POSIX-based Access Control Lists (ACLs) or NTFS extended attributes, all of this metadata is tiered or archived together with the file itself and stored in a non-proprietary format, so the complete file can be brought back whenever it is needed. File tiering moves not just the file but also its attributes and security permissions, and it maintains full file fidelity even when the file moves to a different storage architecture such as object storage or the cloud. As a result, applications and users can continue to use the moved file from its original location, and they can also open it natively in the secondary location or cloud without any third-party software or storage operating system.
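
By way of contrast with the block-tiering sketch above, here is a hedged sketch of the file-tiering idea using boto3: the entire file is written to object storage as a single object, and basic attributes (owner, mode, timestamps) travel with it as plain object metadata that any S3-aware tool can read. The bucket name is a placeholder, and real file-tiering products preserve richer metadata (full ACLs, extended attributes) than this sketch shows.

```python
import os
import boto3

s3 = boto3.client("s3")
BUCKET = "example-file-tier-bucket"  # placeholder bucket name

def tier_whole_file(local_path: str, key: str) -> None:
    """Copy an entire file, plus basic attributes, to object storage
    in a non-proprietary form readable by any S3 client."""
    st = os.stat(local_path)
    with open(local_path, "rb") as fh:
        s3.put_object(
            Bucket=BUCKET,
            Key=key,
            Body=fh,
            Metadata={               # stored as ordinary x-amz-meta-* headers
                "uid": str(st.st_uid),
                "gid": str(st.st_gid),
                "mode": oct(st.st_mode),
                "mtime": str(int(st.st_mtime)),
            },
        )
```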

Because file tiering maintains full file fidelity and standards-based native access at every tier, third-party applications can access the moved data without agents or proprietary software. This maximizes savings, since backup software and other third-party applications can work with the moved data without rehydrating it or bringing the file back to its original location. It also means the cloud can be used to run valuable applications, such as compliance search or big data analytics, against the trove of tiered and archived data without third-party software or additional costs.

File-tiering is an advanced technique for archiving and cloud tiering that maximizes savings and breaks vendor lock-in.

Data Tiering Can Cut 70%+ Storage and Backup Costs When Done Right

In summary, data tiering is an efficient solution to cut storage and backup costs because it tiers or archives cold, unused files to a lower-cost storage class, either on-premises or in the cloud. However, to maximize the savings, data tiering needs to be done at the file level, not block level. Block-level tiering creates lock-in and erodes much of the cost savings because it requires unnecessary rehydration of the data. File tiering maximizes savings and preserves flexibility by enabling data to be used directly in the cloud without lock-in.
