Doing More with Less - Computational Storage & Compression

Compression and decompression challenges are becoming more common throughout the data center and across industries. With data volumes ballooning, everyone is trying to save space. One area where compression and decompression play a prominent role is database management, and they are increasingly common in artificial intelligence (AI) use cases as well. Many enterprises are leveraging Microsoft’s Project Zipline to improve how they compress and decompress data. If you’re one of them, this conversation is especially relevant for you.

Compressing and decompressing files is a convenient way to minimize data volumes temporarily. But at the enterprise level, the process has become rife with bottlenecks and wasted resources. There’s a better way to do it, and computational storage can help.

The purpose of compression is to free up storage space. But any compressed file has to be decompressed before you can do anything useful with it, such as analytics. The act of compression, and especially decompression, requires significant compute resources. Compressing files into storage creates problems, and getting that data back out of storage to decompress it is an even bigger one. You can add capacity, but that doesn’t improve performance. And even if you add plenty of GPUs to boost performance, you’ll still run into bottlenecks, not to mention extra costs.
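
To make the cost concrete, here is a minimal sketch using Python’s standard zlib module to time host-side compression and decompression; the payload and timings are illustrative only, and real-world ratios depend heavily on the data.

```python
import time
import zlib

# Build a repetitive sample payload; real-world ratios vary with the data.
payload = b"timestamp,sensor_id,reading\n" * 200_000

# Compress on the host CPU and time it.
start = time.perf_counter()
compressed = zlib.compress(payload, level=6)
compress_time = time.perf_counter() - start

# Decompress on the host CPU -- this step repeats every time the
# stored data is needed again, e.g. for analytics.
start = time.perf_counter()
restored = zlib.decompress(compressed)
decompress_time = time.perf_counter() - start

assert restored == payload
print(f"ratio: {len(payload) / len(compressed):.1f}x")
print(f"compress: {compress_time:.3f}s  decompress: {decompress_time:.3f}s")
```

Every read of compressed data repeats the decompression step, so the CPU and memory cost scales with how often the data is used, not just how much of it there is.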


The problem is data movement, which is always costly and cumbersome, and compressing and decompressing files involves lots of data movement back and forth between compute and storage. Most data centers employ a traditional von Neumann architecture, a roughly 70-year-old design that still underpins nearly all general-purpose servers and has barely evolved in that time. In this architecture, data is moved between compute and storage as needed. But data has gravity: it takes resources (host memory and CPUs) and energy to move. As deployments grow, data travels increasingly long distances between nodes and local compute/memory complexes, driving up resource and energy usage, and thus costs.

Until recently, the size of typical data sets kept data movement only moderately costly. But as data sets grow and data-intensive applications such as advanced analytics, AI, machine learning (ML), genomics, and IoT gain in use, the cost and time of data movement are becoming a critical challenge. Shuttling massive amounts of data from storage to host CPU memory just to compress or decompress it is too expensive in both power consumption and time.

You can always add capacity and compute resources, but they don’t scale equally in a traditional data center architecture. Computational storage solves that problem by bringing compute resources directly to the storage. NGD’s approach to computational storage centers on In-Situ Processing, which is processing done right where the data resides. NGD offloads compression and decompression to the drive’s In-Situ processors, freeing host memory and CPU resources and returning decompressed data much faster and far more efficiently.
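
The shift is easier to see as two code paths. The sketch below contrasts a conventional host-side read-then-decompress flow with an in-situ offload; `InSituDrive`-style objects and the `submit_decompress` method are hypothetical stand-ins for illustration only, not NGD’s actual API.

```python
import zlib


def host_side_read(block_device, offset: int, length: int) -> bytes:
    """Conventional path: move compressed data across the bus,
    then spend host CPU cycles decompressing it."""
    compressed = block_device.read(offset, length)   # data movement
    return zlib.decompress(compressed)               # host CPU work


def in_situ_read(drive, offset: int, length: int) -> bytes:
    """Computational-storage path (illustrative): the drive's embedded
    processors decompress in place, so only the result crosses the bus.
    `submit_decompress` is a hypothetical method, not NGD's real interface."""
    return drive.submit_decompress(offset, length)
```

The point of the contrast is where the work lands: in the first path every byte is moved and then processed by the host, while in the second the host only receives results.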

Below are results comparing NGD’s Newport SSD with two other common SSDs in a compression use case. By leveraging computational storage via In-Situ Processing, NGD delivers much better performance and power efficiency, ultimately saving time in compressing and decompressing files.



Microsoft recently introduced its open-source Project Zipline technology to achieve much better results when compressing files, including compression ratios up to 2X better than the commonly used Zlib-L4 64KB model. NGD has worked closely with Microsoft on the project, alongside a few other partners. In this case, NGD’s computational storage technology was the only option that made it possible to add compute as easily and efficiently as capacity.
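
For context on that baseline, here is a minimal sketch of what “Zlib-L4 64KB” style compression looks like: zlib at level 4 applied independently to 64 KB blocks. The chunking scheme is an assumption about how such a baseline is typically run, not Microsoft’s exact benchmark harness, and the sample data is illustrative.

```python
import zlib

CHUNK_SIZE = 64 * 1024  # 64 KB blocks, compressed independently
LEVEL = 4               # zlib level 4 -- the "Zlib-L4" baseline


def block_compress(data: bytes) -> list[bytes]:
    """Compress each 64 KB block on its own so any block can be
    decompressed independently, at some cost in compression ratio."""
    blocks = []
    for i in range(0, len(data), CHUNK_SIZE):
        blocks.append(zlib.compress(data[i:i + CHUNK_SIZE], level=LEVEL))
    return blocks


def ratio(data: bytes) -> float:
    compressed_size = sum(len(b) for b in block_compress(data))
    return len(data) / compressed_size


if __name__ == "__main__":
    sample = b"event_id,user,action,latency_ms\n" * 500_000
    print(f"Zlib-L4 / 64KB ratio on sample data: {ratio(sample):.2f}x")
```

Project Zipline’s claim is measured against this kind of block-based zlib baseline; the ratio it achieves on real data center workloads is what the 2X figure refers to.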


Data movement is a persistent problem, and computational storage is the answer, whether you’re struggling with compressing and decompressing files, processing edge data, or supporting a CDN environment. It’s the only approach that addresses the fundamental challenge of data movement.

Stay tuned for more details on how NGD is taking part in Microsoft’s Project Zipline to make compression and decompression easier, faster, and cheaper.

Read more on the NGD Blogs as well - https://www.ngdsystems.com/blogs
