Summary of new releases from AWS re:Invent 2022 to date

Hugh Christensen

Leadership in data at Amazon

Published: 30 November 2022

AWS re:Invent 2022: Monday, 28 November - Friday, 2 December

What can we publicly surmise from what we have seen this year at re:Invent?

Data takes centre stage. More so than in previous years, it felt like AWS addressed 'data' rather than just 'technology'. This was doubtless driven by (1) increased customer focus on extracting value from data and (2) the maturation of fundamental cloud technology building blocks.
We are heading towards a no-ETL world. ETL is a source of friction in extracting value from data. Customers do not want to shoulder this burden, as it is not value-additive to their businesses.
Customers continue to demand vertical-specific solutions to problems unique to their industries.
Serverless - it just makes so much sense! :) Know where and on what you want to compete. Double down in that area, and outsource the rest.
Big picture: Cloud is not just about reducing costs, enabling better data management, or using the latest ML techniques. It is a framework to more quickly understand your customers' problems, enable innovation, and deploy effective, scalable, and secure solutions. #customerobsession

Below is a summary of some of the cool new releases from AWS re:Invent to date.

DATA

Clean Rooms. A new service that helps customers create data clean rooms to collaborate with their business partners and generate new insights while protecting the underlying raw data. https://press.aboutamazon.com/2022/11/aws-announces-aws-clean-rooms
DataZone. A new data management service that makes it faster and easier for customers to catalog, discover, share, and govern data stored across AWS, on-premises, and third-party sources. https://press.aboutamazon.com/2022/11/aws-announces-amazon-datazone

SECURITY

Verified Permissions. A scalable, fine-grained permissions management and authorization service for custom applications. With Verified Permissions, application developers can let their end users manage permissions and share access to data. https://aws.amazon.com/about-aws/whats-new/2022/11/amazon-verified-permissions-preview/
Security Lake. Automatically centralizes security data from cloud, on-premises, and custom sources into a purpose-built data lake stored in your account. With Security Lake, you can get a more complete understanding of your security data across your entire organization. https://aws.amazon.com/security-lake/

REDSHIFT

Redshift Dynamic Data Masking. Simplifies the process of protecting sensitive data in Redshift. With dynamic data masking, customers control access to their data through SQL-based masking policies that determine how Redshift returns sensitive data to the user at query time. This makes it simple for customers to adapt to changing privacy requirements without altering the underlying data or updating SQL queries. https://aws.amazon.com/about-aws/whats-new/2022/11/amazon-redshift-support-dynamic-data-masking-preview/
Redshift integration with Apache Spark. Enables Apache Spark applications to access Redshift data from AWS analytics services such as EMR, Glue, and SageMaker. Operations such as sort, aggregate, limit, join, and scalar functions are pushed down to Redshift so that only the relevant data is moved from Redshift to the consuming Spark application. https://aws.amazon.com/redshift/features/integration-for-apache-spark/
Redshift integration with Aurora. With Aurora zero-ETL integration with Redshift, transactional data is automatically and continuously replicated seconds after it is written into Aurora and seamlessly made available in Redshift. Zero-ETL integration makes it easier to run petabyte-scale analytics on transactional data in Aurora in near real time with Redshift. https://aws.amazon.com/about-aws/whats-new/2022/11/amazon-aurora-zero-etl-integration-redshift/
Redshift real-time streaming ingestion from KDS and MSK. Enables customers to achieve low latency, measured in seconds, while ingesting hundreds of megabytes of streaming data per second into Redshift. https://aws.amazon.com/about-aws/whats-new/2022/11/amazon-redshift-real-time-streaming-ingestion-kds-msk/
Redshift now supports Multi-AZ. A Redshift Multi-AZ deployment allows you to recover from AZ failures without any user intervention. It is accessed as a single data warehouse with one endpoint and helps you maximize your data warehouse performance by automatically distributing workload processing across multiple AZs. https://press.aboutamazon.com/2022/11/aws-announces-five-new-database-and-analytics-capabilities
Redshift integration with Informatica Data Loader. https://aws.amazon.com/about-aws/whats-new/2022/11/amazon-redshift-integration-informatica-data-loader-tool-data-uploads/
Redshift data sharing now supports centralized access control with AWS Lake Formation. With the new Amazon Redshift data sharing managed by AWS Lake Formation, customers can view, modify, and audit permissions on the tables and views in Redshift datashares using Lake Formation APIs and the AWS Console, and allow the datashares to be discovered and consumed by other Redshift data warehouses. https://aws.amazon.com/about-aws/whats-new/2022/11/amazon-redshift-data-sharing-centralized-access-control-lake-formation-preview/
Redshift Auto-Copy from S3. Simplifies and automates data ingestion from S3 into Redshift by setting up copy jobs (user-defined data ingestion rules) that track S3 locations for new files and execute the configured copy statements for each detected file. https://aws.amazon.com/about-aws/whats-new/2022/11/amazon-redshift-supports-auto-copy-amazon-s3/
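
To make the dynamic data masking idea above a little more concrete, here is a minimal conceptual sketch in plain Python. It is not Redshift's API or SQL syntax: the table, the role names, the `POLICIES` mapping, and the `query` helper are all hypothetical, invented purely for illustration. The point it tries to show is that masking is applied when data is read, according to who is asking, while the stored rows stay untouched.

```python
from typing import Callable, Dict, List

# Hypothetical in-memory "table". The stored rows are never modified.
ROWS: List[Dict[str, object]] = [
    {"customer_id": 1, "email": "ana@example.com", "card_number": "4111111111111111"},
    {"customer_id": 2, "email": "raj@example.com", "card_number": "5500005555555559"},
]


def mask_card(value: str) -> str:
    """Hide everything except the last four characters."""
    return "*" * (len(value) - 4) + value[-4:]


def mask_email(value: str) -> str:
    """Keep the first character of the local part and the full domain."""
    local, _, domain = value.partition("@")
    return local[:1] + "***@" + domain


# Hypothetical policy table: column -> role -> masking function.
# A role with no entry for a column sees the raw value.
POLICIES: Dict[str, Dict[str, Callable[[str], str]]] = {
    "card_number": {"analyst": mask_card},
    "email": {"analyst": mask_email},
}


def query(rows: List[Dict[str, object]], role: str) -> List[Dict[str, object]]:
    """Return copies of the rows with the role's masking policies applied at read time."""
    result = []
    for row in rows:
        masked = {}
        for column, value in row.items():
            policy = POLICIES.get(column, {}).get(role)
            masked[column] = policy(value) if policy else value
        result.append(masked)
    return result


if __name__ == "__main__":
    print(query(ROWS, "analyst")[0])
    print(query(ROWS, "admin")[0])
```

Changing a privacy requirement in this sketch means editing `POLICIES` only; neither the stored rows nor the calling code needs to change, which mirrors the benefit described in the announcement.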

ANALYTICS

OpenSearch Serverless. Run Search and Analytics Workloads without Managing Clusters. https://aws.amazon.com/blogs/aws/preview-amazon-opensearch-serverless-run-search-and-analytics-workloads-without-managing-clusters/
Five new QuickSight capabilities. Today’s announcement expands QuickSight Q, a natural language querying capability, to support forecast and “why” questions and automate data preparation, making it easier and faster to start asking questions in natural language. Additionally, customers can now create and share paginated reports alongside interactive dashboards, quickly analyze and visualize billion-row datasets directly in QuickSight, and programmatically create and manage BI assets to accelerate migration from legacy systems. https://www.businesswire.com/news/home/20221128005874/en/AWS-Announces-Five-New-Capabilities-for-Amazon-QuickSight
Glue for Apache Spark. Native support for data lake frameworks (Apache Hudi, Apache Iceberg, Delta Lake). https://aws.amazon.com/about-aws/whats-new/2022/11/aws-glue-apache-spark-native-data-lake-frameworks-apache-hudi-iceberg-delta-lake/
Glue for Ray. Makes it easy to scale Python code to process large scale data in Glue. https://aws.amazon.com/about-aws/whats-new/2022/11/aws-glue-ray-preview/
Glue Data Quality. Glue Data Quality builds confidence in your data by ensuring high data quality. It automatically measures, monitors, and manages data quality in your data lakes and data pipelines. https://aws.amazon.com/about-aws/whats-new/2022/11/aws-glue-data-quality-preview/
Athena for Apache Spark. The streamlined, interactive, serverless experience of Athena with Spark, in addition to SQL. Athena takes care of managing the infrastructure and configuring Spark settings. Build interactive PySpark applications using a simplified notebook experience in the Athena console or through Athena APIs. Spin up Spark workloads up to 75 times faster than other serverless Spark offerings. https://aws.amazon.com/about-aws/whats-new/2022/11/amazon-athena-now-supports-apache-spark/
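
As a rough illustration of what "measuring" data quality can mean in practice, the sketch below evaluates two simple rules over a handful of records in plain Python. It does not use Glue or its rule language; the `completeness` and `is_unique` helpers, the sample `orders` data, and the 0.9 threshold are assumptions chosen only to show the general shape of rule-based checks.

```python
from typing import Any, Callable, Dict, List, Optional

Row = Dict[str, Optional[Any]]


def completeness(rows: List[Row], column: str) -> float:
    """Fraction of rows in which the column is present and not None."""
    if not rows:
        return 1.0
    present = sum(1 for row in rows if row.get(column) is not None)
    return present / len(rows)


def is_unique(rows: List[Row], column: str) -> bool:
    """True when no non-null value of the column appears more than once."""
    values = [row[column] for row in rows if row.get(column) is not None]
    return len(values) == len(set(values))


# Hypothetical sample data with one missing email and one duplicated order_id.
orders: List[Row] = [
    {"order_id": 1, "email": "a@example.com"},
    {"order_id": 2, "email": None},
    {"order_id": 2, "email": "c@example.com"},
    {"order_id": 4, "email": "d@example.com"},
]

# Hypothetical rule set: a readable name mapped to a pass/fail check.
rules: Dict[str, Callable[[List[Row]], bool]] = {
    "email completeness >= 0.9": lambda rows: completeness(rows, "email") >= 0.9,
    "order_id is unique": lambda rows: is_unique(rows, "order_id"),
}

# Evaluate every rule against the data set.
report = {name: check(orders) for name, check in rules.items()}

if __name__ == "__main__":
    for name, passed in report.items():
        print(f"{name}: {'PASS' if passed else 'FAIL'}")
```

A managed service adds the parts this sketch leaves out, such as running the checks continuously across data lakes and pipelines and tracking results over time.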

MACHINE LEARNING

SageMaker Data Wrangler integration with Amazon AppFlow. With SageMaker Data Wrangler, you can explore and import data from a variety of popular sources, such as S3, Athena, Redshift, Snowflake, Databricks, and Salesforce Customer Data Platform. Starting today, we are making it easier for customers to aggregate data for ML from over 40 third-party application data sources, including Salesforce Marketing, SAP, Google Analytics, LinkedIn, and more. https://aws.amazon.com/about-aws/whats-new/2022/11/amazon-sagemaker-data-wrangler-over-40-third-party-applications-data-sources/

VERTICAL SPECIFIC

Supply Chain. An ML-powered, cloud-based supply chain management application shaped by learnings from Amazon.com's 25+ years of supply chain excellence. https://aws.amazon.com/about-aws/whats-new/2022/11/aws-supply-chain-preview/
Omics. A purpose-built service to store, query, and analyze genomic and biological data at scale. https://aws.amazon.com/blogs/aws/introducing-amazon-omics-a-purpose-built-service-to-store-query-and-analyze-genomic-and-biological-data-at-scale/
SimSpace Weaver. Fully managed compute service for large-scale spatial simulations. https://aws.amazon.com/about-aws/whats-new/2022/11/aws-simspace-weaver-available/

OTHER

Connect. New ML-Powered Capabilities for Forecasting, Capacity Planning, Scheduling, and Agent Empowerment. https://aws.amazon.com/blogs/aws/amazon-connect-new-ml-powered-capabilities-for-forecasting-capacity-planning-scheduling-and-agent-empowerment/
