[NEW LAUNCH] Store tabular data at scale with Amazon S3 Tables (STG367-NEW)

Introduction and Overview

Adya Kalanikrishnan, a Product Manager on the Amazon S3 team, introduces a new capability called Amazon S3 Tables.

James Borel, a Principal Applied Scientist on the Amazon SWE team, will join the session to discuss use cases and provide a demo.

The session will cover the following topics:

Introduction to the S3 Tables capability
How S3 Tables works, including new components and fundamentals
Use cases and workloads
Demos showcasing the new capability

The Evolution of Amazon S3

When Amazon S3 was launched in 2006, it was intended as storage for the internet, allowing developers to store data securely and durably.

Over the years, S3 has been used for a wide range of workloads, particularly in the areas of data analytics, machine learning, and AI.

Customers have predominantly used S3 as a tabular data store, storing data in file formats like Apache Parquet.

The use of open table formats like Apache Iceberg has provided a better framework for organizing and managing these data sets.

Challenges with Existing Iceberg Solutions

While Iceberg's advanced capabilities have provided the reliability and flexibility needed for complex analytics workloads, customers still faced some challenges:

Performance: Increased traffic and scaling demands required optimization efforts.
Security and Governance: Enforcing security policies and governing tables required knowledge of the physical storage layout.
Operational Burden: Optimizing costs by managing the lifecycle of table snapshots and data files.

Introducing Amazon S3 Tables

S3 Tables is a purpose-built solution for storing tabular data in Amazon S3, providing fully managed Apache Iceberg tables.

Key components of S3 Tables:

Table Buckets: A new bucket type for storing Iceberg format tables natively in S3.
Table Operations APIs: APIs for creating, reading, updating, and deleting tables.
Table Management APIs: APIs for managing table-level and bucket-level policies and maintenance.

S3 Tables provides:

Optimized performance with up to 10x higher transactions per second and 3x faster queries.
Simpler security controls through table-level resource policies.
Automated cost optimization through lifecycle management of table snapshots and data files.

Integrating S3 Tables with the AWS Analytics Ecosystem

S3 Tables integrates with the AWS analytics ecosystem, including services like:

AWS Glue Data Catalog: Provides a centralized index of data sets, with S3 Tables automatically registered.
Amazon Athena: Allows querying S3 Tables using SQL.
Amazon Redshift: Enables querying S3 Tables through resource links.

The integration with AWS Lake Formation allows for fine-grained access control, including the ability to control access to individual columns or rows within a table.

Conclusion and Next Steps

S3 Tables is a fully managed Iceberg service that provides the benefits of Iceberg and the scale, performance, and cost-effectiveness of Amazon S3.

The team is excited to hear feedback from customers and continue evolving the S3 Tables offering.

Attendees are encouraged to check out the available documentation and provide feedback through the session survey.

[NEW LAUNCH] Store tabular data at scale with Amazon S3 Tables (STG367-NEW)

Introduction and Overview

The Evolution of Amazon S3

Challenges with Existing Iceberg Solutions

Introducing Amazon S3 Tables

Integrating S3 Tables with the AWS Analytics Ecosystem

Accessing S3 Tables Beyond the AWS Ecosystem

Conclusion and Next Steps

Your Digital Journey deserves a great story.

Build one with us.

Headquarters

Delivery Centre

[NEW LAUNCH] Store tabular data at scale with Amazon S3 Tables (STG367-NEW)

Introduction and Overview

The Evolution of Amazon S3

Challenges with Existing Iceberg Solutions

Introducing Amazon S3 Tables

Integrating S3 Tables with the AWS Analytics Ecosystem

Accessing S3 Tables Beyond the AWS Ecosystem

Conclusion and Next Steps

Your Digital Journey deserves a great story.

Build one with us.

This website stores cookies on your computer.