SERVERS

A key highlight from last week's re:Invent was the extension of serverless compute to a swath ofAWS analytics services, including Amazon EMR, Kinesis Data Streams, MSK (Managed Service for Kafka), and Redshift. For cloud analytics, AWS was not the first to offer serverless options, as Google Cloud BigQuery and Azure Synapse Analytics have long offered serverless options (by contrast, Snowflake's is still in preview).

Big Data

How to find out if you are involved in a data breach (and what to do next)
Fighting bias in AI starts with the data
Fair forecast? How 180 meteorologists are delivering 'good enough' weather data
Cancer therapies depend on dizzying amounts of data. Here's how it's sorted in the cloud

Serverless wasn't the only new feature announced last week. AWS also announced the preview ofautomated materialized views that treat the creation of these views much like cost-based query optimizers: it automatically generates the views based on data hot spots. Nonetheless, serverless grabbed the limelight.

AWS ups its industry ground game at re:Invent 2021

While AWS's serverless announcements could be viewed as keeping up with the Joneses, regardingAmazon Redshift , it is part of a larger narrative of the data warehousing service not only catching up but getting in a position to potentially bypass its rivals.

To recap, Amazon Redshift has long been known more as a market rather than a technology leader.

When AWS launched Redshift back in 2013, it was one of the first cloud data warehousing services. Starting with technology acquired from ParAccel, AWS profited but also paid the price for being among the first to market. Its early entry, along with the portfolio of other AWS analytics services, enabled Redshift to carve a large client roster with greater than tens of thousands of customers today.

AWS forked the acquired ParAccel technology. But from the get-go, it followed a conventional data warehousing architecture with locally attached storage. By contrast, Google Cloud BigQuery, launched back in 2010, pioneered the cloud-native, data warehouse. Nonetheless, it was the launch of Snowflake in 2014 that put the elastic cloud data warehouse on the map.

For last week's serverless announcement, the key development was the launch of RA3 instances back in 2019. They provided the long-sought elasticity with separation of computing and storage and paved the way for serverless. As it turns out, RA3 is the transformation that also allowed Redshift to do far more. Earlier this year, AWS released Advanced Query Accelerator (AQUA) for Amazon Redshift that we characterized at the time as a "generational shift" that leveraged the elasticity of the RA3 instances. It was aimed at workloads for "near-line" data sitting remotely onAmazon Redshift Managed Storage , storing hot data in SSD while using theNitro hypervisor and FPGAs to accelerate the processing of cooler data sitting on S3.

Incidentally, in our post last spring, we put serverless on our wish list for what we wanted to see next. Once in a blue moon, we occasionally get it right.

But there's more. Because RA3 instances pool much of the data in S3, that cleared the way for data sharing, which was initially released back in the spring for customers with multiple AWS accounts. At re:Invent last week, that capability was extended across multiple regions. Again, AWS wasn't first to market. For instance, Snowflake has been promoting various forms of data sharing since it started talking Data Sharehouse back in 2017 (they no longer use that term). AWS did launch a data marketplace (calledAmazon Data Exchange ) several years ago, but onlyjust extended it to Redshift .

Let's make a couple of disclaimers. First of all, don't confuse data sharing with federated queries. Redshift canremote query data sitting in RDS and Aurora databases for MySQL and PostgreSQL, and viaRedshift Spectrum , to EMR and S3. But that's quite similar to what Google already offers with BigQuery. Secondly, don't believe that AWS is abandoning provisioned instances -it will keep offering them for Redshift as well because there are customers who prefer level billing. Google eventually learned that when it subsequently introduced flat-rate slots for BigQuery.

With cloud-native architecture and serverless support, AWS has some opportunities to score some firsts. With cloud-native serverless architecture, AWS could move more analytic and AI processing in-database.

But in-database machine learning has already become table stakes for cloud data warehouses. AWS already does so withRedshift ML , where you can use SQL commands to trigger developing models in SageMaker, then bring the models in-database as a form of user-defined function (UDF) to run training and/or inference workloads. In turn, Google also provides in-database ML for BigQuery, but it is limited to specific, curated models; while Microsoft allows running of ML models within Azure Synapse Spark pools. And with Snowpark, you can use non-SQL languages to push down processing, such as ML models, as UDFs directly into the Snowflake database.

Our wish list is to bring Spark directly into Redshift. Today, you'd have to fire up a separate EMR cluster to run Spark (but at least now,it could also be triggered serverless as well). Of course, nothing is preventing AWS from breaking out Spark as a separate serverless service, just as Google Cloud recently did. But today, Azure Synapse Analytics lets you run a curated (subset) version of Spark in-database without firing up a separate cluster; we'd like to see AWS follow through.

But let's not stop there. Serverless also provides the opportunity to fire up workloads with third-party tools, especially with BI reporting and visualization. Redshift currently has integrations with its own QuickSight and with popular tools like Tableau, but you have to move data and process it in separate clusters.

So let's cut to the chase. We'd love to see AWS add a "Redshift-native" mode for third parties willing to run capabilities like ELT or visualization as containerized microservices that run directly inside Redshift RA3 compute nodes, or whatever next-generation nodes come out in future years. By comparison, Snowflake provides common APIs for third parties to access Snowflake data, but the data is processed in separate clusters. Imagine running an ELT service from Informatica or Fivetran as a microservice in a Redshift compute node. AWS could then promote Redshift as the cheapest, fastest data warehouse in the cloud.

Disclosure: AWS and Google Cloud are dbInsight clients.

AWS re:Invent

AWS ups its industry ground game at re:Invent 2021AWS CEO unveils new private 5G serviceAWS takes aim at mainframes with migration serviceAWS, CrowdStrike, and Presidio partner for ransomware mitigation kitAWS launches quartet of serverless, on-demand solutionsAWS targets auto and industrial sectors with FleetWise, TwinMakerProcessor roadmap adds Graviton3, Trainium, new instancesIoT RoboRunner aims to manage robot fleets

AWS ups its industry ground game at re:Invent 2021
AWS CEO unveils new private 5G service
AWS takes aim at mainframes with migration service
AWS, CrowdStrike, and Presidio partner for ransomware mitigation kit
AWS launches quartet of serverless, on-demand solutions
AWS targets auto and industrial sectors with FleetWise, TwinMaker
Processor roadmap adds Graviton3, Trainium, new instances
IoT RoboRunner aims to manage robot fleets

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

SERVERS

HOT NEWS

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

Huawei S5731-H24T4XC: High-Performance Intelligent Gigabit Switch

Huawei S5731-S48P4X: High-Performance PoE Switch with Flexible Power and Uplink Options

Huawei S5731 Series: Advanced Networking Solutions for Enterprises

Difference between campus switch and data center switch

Huawei S6730-H28Y4C Campus CloudEngine Switch Datasheet

S6730-H48Y6C: Unleashing Power and Flexibility for Modern Networking

CloudEngine S6730-H Series Switches Datasheet

Huawei CloudEngine Switch S6730-S24X6Q Datasheet

CloudEngine S6700 Series Switches Naming Conventions & Description

Huawei CloudEngine S6730-H24X6C Datasheet

Huawei S6730 Series Switches Datasheet

Huawei CloudEngine Switch S6730-H48X6C Datasheet

Introduction to the Huawei CloudEngine S6730-S Series Switches

Huawei S6730-H48X6CZ-V2: The Ultimate High-Speed Network Switch

Overview of the S6730-H28X6CZ-V2 Switch

Huawei CloudEngine S6730-H24X4Y4C: A High-Performance Enterprise Switch for Modern Networks

Introduction to Huawei CloudEngine S6730-H Series Switches

Comprehensive Guide to the CloudEngine S6730-H24X6C-V2: Features, Specifications, and Applications

Huawei S6730-S24X6Q: Advanced Ethernet Switch for Modern Networks

Comprehensive Guide to the S6730-H48X6C-V2 High-Performance Switch

Huawei CloudEngine S6730-H28Y4C: High-Performance Switch for Modern Networks

Overview of the S6730-H24X6C-V2

Unveiling the Huawei CloudEngine S6730 Series: Advanced Switching for Modern Networks

Huawei S6730-H48X6C: A Comprehensive Overview

Comprehensive Guide to Huawei S6730-H24X6C

Huawei Switches Visio Stencils

Serverless at re:Invent: Where should Amazon Redshift go?

Big Data

AWS re:Invent

Hot Tags : Business Big Data

Ordering Guide

Resources

About Us

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

SERVERS

HOT NEWS

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

Huawei S5731-H24T4XC: High-Performance Intelligent Gigabit Switch

Huawei S5731-S48P4X: High-Performance PoE Switch with Flexible Power and Uplink Options

Huawei S5731 Series: Advanced Networking Solutions for Enterprises

Difference between campus switch and data center switch

Huawei S6730-H28Y4C Campus CloudEngine Switch Datasheet

S6730-H48Y6C: Unleashing Power and Flexibility for Modern Networking

CloudEngine S6730-H Series Switches Datasheet

Huawei CloudEngine Switch S6730-S24X6Q Datasheet

CloudEngine S6700 Series Switches Naming Conventions & Description

Huawei CloudEngine S6730-H24X6C Datasheet

Huawei S6730 Series Switches Datasheet

Huawei CloudEngine Switch S6730-H48X6C Datasheet

Introduction to the Huawei CloudEngine S6730-S Series Switches

Huawei S6730-H48X6CZ-V2: The Ultimate High-Speed Network Switch

Overview of the S6730-H28X6CZ-V2 Switch

Huawei CloudEngine S6730-H24X4Y4C: A High-Performance Enterprise Switch for Modern Networks

​Introduction to Huawei CloudEngine S6730-H Series Switches

Comprehensive Guide to the CloudEngine S6730-H24X6C-V2: Features, Specifications, and Applications

Huawei S6730-S24X6Q: Advanced Ethernet Switch for Modern Networks

Comprehensive Guide to the S6730-H48X6C-V2 High-Performance Switch

Huawei CloudEngine S6730-H28Y4C: High-Performance Switch for Modern Networks

Overview of the S6730-H24X6C-V2

Unveiling the Huawei CloudEngine S6730 Series: Advanced Switching for Modern Networks

Huawei S6730-H48X6C: A Comprehensive Overview

Comprehensive Guide to Huawei S6730-H24X6C

Huawei Switches Visio Stencils

Serverless at re:Invent: Where should Amazon Redshift go?

Big Data

AWS re:Invent

Hot Tags : Business Big Data

Ordering Guide

Resources

About Us

Introduction to Huawei CloudEngine S6730-H Series Switches