Presto (on-premise data warehouse)

Saikrishna Cheruvu

Lead Developer | Data Engineer | MLOPS | ex@ BOFA

Published: March 19, 2021

Presto, as defined by its developers: Presto is an open-source distributed SQL query engine for running interactive analytic queries against data sources of all sizes, ranging from gigabytes to petabytes.

Presto is designed to work with data of any size. It supports Hadoop (HDFS), Amazon S3, MongoDB, PostgreSQL, Teradata, and other data sources.
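As a rough illustration of how such sources are typically wired in, each one is registered with Presto as a catalog described by a small properties file. The sketch below only renders the file contents as text. The host names, database name, and credentials are made-up placeholders, and the helper `render_catalog` is hypothetical; the property keys are the ones commonly shown for the Hive and PostgreSQL connectors, so check them against the connector documentation for your Presto version.

```python
def render_catalog(properties: dict) -> str:
    """Render the body of a catalog properties file: one key=value pair per line."""
    return "\n".join(f"{key}={value}" for key, value in properties.items()) + "\n"


# Assumed example values: hosts, database and credentials are placeholders.
catalogs = {
    "hive": {
        "connector.name": "hive-hadoop2",
        "hive.metastore.uri": "thrift://metastore.example.net:9083",
    },
    "postgresql": {
        "connector.name": "postgresql",
        "connection-url": "jdbc:postgresql://db.example.net:5432/sales",
        "connection-user": "presto",
        "connection-password": "secret",
    },
}

for name, props in catalogs.items():
    # Each catalog would normally be saved as etc/catalog/<name>.properties.
    print(f"# etc/catalog/{name}.properties")
    print(render_catalog(props))
```

The file name (without the extension) becomes the catalog name used in queries, for example `hive` or `postgresql`.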

Where does Presto fit?

When data held in distributed systems such as Hadoop or S3 has to be served to reporting applications such as Tableau or MicroStrategy, a Presto cluster takes over the load of query execution. The data transfer rates, especially for Tableau extracts, are much faster than with the existing JDBC connectivity.

The specialty of Presto is that we can query the data where it is stored, without moving it to any other analytical execution system. Its architecture is purely memory-based.

Figure: Sample view of the Presto CLI

Combine data from multiple sources

A single Presto query can combine data from multiple sources.
Data can be joined across all data sources integrated into Presto.
One SQL query can join multiple data sources.
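A minimal sketch of what such a cross-source join might look like, assuming two catalogs named `hive` and `postgresql` have been configured. The schemas, tables, and columns (`web.orders`, `public.customers`, `customer_id`, `amount`, `customer_name`) are hypothetical and only illustrate the `catalog.schema.table` naming that lets one statement reach both sources. The code only builds the SQL text; it does not connect to a cluster.

```python
def qualified(catalog: str, schema: str, table: str) -> str:
    """Build a fully qualified Presto table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def build_federated_join(orders: str, customers: str) -> str:
    """Return one SQL statement that joins tables living in different catalogs."""
    return (
        "SELECT c.customer_name, SUM(o.amount) AS total_amount\n"
        f"FROM {orders} AS o\n"
        f"JOIN {customers} AS c ON o.customer_id = c.customer_id\n"
        "GROUP BY c.customer_name"
    )


# Hypothetical tables: orders stored in Hive, customers stored in PostgreSQL.
orders = qualified("hive", "web", "orders")
customers = qualified("postgresql", "public", "customers")

sql = build_federated_join(orders, customers)
print(sql)
```

The printed statement could then be submitted through the Presto CLI or any client; Presto reads each table through its own connector and performs the join itself.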
