[NEW LAUNCH] Store tabular data at scale with Amazon S3 Tables (STG367-NEW)

Here is a detailed summary of the video transcription in markdown format, broken into sections for better readability, and with bullet points limited to a single level depth:

Introduction and Overview

  • Adya Kalanikrishnan, a Product Manager on the Amazon S3 team, introduces a new capability called Amazon S3 Tables.
  • James Borel, a Principal Applied Scientist on the Amazon SWE team, will join the session to discuss use cases and provide a demo.
  • The session will cover the following topics:
    • Introduction to the S3 Tables capability
    • How S3 Tables works, including new components and fundamentals
    • Use cases and workloads
    • Demos showcasing the new capability

The Evolution of Amazon S3

  • When Amazon S3 was launched in 2006, it was intended as storage for the internet, allowing developers to store data securely and durably.
  • Over the years, S3 has been used for a wide range of workloads, particularly in the areas of data analytics, machine learning, and AI.
  • Customers have predominantly used S3 as a tabular data store, storing data in file formats like Apache Parquet.
  • The use of open table formats like Apache Iceberg has provided a better framework for organizing and managing these data sets.

Challenges with Existing Iceberg Solutions

  • While Iceberg's advanced capabilities have provided the reliability and flexibility needed for complex analytics workloads, customers still faced some challenges:
    • Performance: Increased traffic and scaling demands required optimization efforts.
    • Security and Governance: Enforcing security policies and governing tables required knowledge of the physical storage layout.
    • Operational Burden: Optimizing costs by managing the lifecycle of table snapshots and data files.

Introducing Amazon S3 Tables

  • S3 Tables is a purpose-built solution for storing tabular data in Amazon S3, providing fully managed Apache Iceberg tables.
  • Key components of S3 Tables:
    • Table Buckets: A new bucket type for storing Iceberg format tables natively in S3.
    • Table Operations APIs: APIs for creating, reading, updating, and deleting tables.
    • Table Management APIs: APIs for managing table-level and bucket-level policies and maintenance.
  • S3 Tables provides:
    • Optimized performance with up to 10x higher transactions per second and 3x faster queries.
    • Simpler security controls through table-level resource policies.
    • Automated cost optimization through lifecycle management of table snapshots and data files.

Integrating S3 Tables with the AWS Analytics Ecosystem

  • S3 Tables integrates with the AWS analytics ecosystem, including services like:
    • AWS Glue Data Catalog: Provides a centralized index of data sets, with S3 Tables automatically registered.
    • Amazon Athena: Allows querying S3 Tables using SQL.
    • Amazon Redshift: Enables querying S3 Tables through resource links.
  • The integration with AWS Lake Formation allows for fine-grained access control, including the ability to control access to individual columns or rows within a table.

Accessing S3 Tables Beyond the AWS Ecosystem

  • S3 Tables can be integrated with non-AWS analytics engines through the open-source Iceberg Catalog for Iceberg.
  • This allows customers to bring their S3 Tables data to the analytics engine of their choice, whether it's running on AWS or elsewhere.

Conclusion and Next Steps

  • S3 Tables is a fully managed Iceberg service that provides the benefits of Iceberg and the scale, performance, and cost-effectiveness of Amazon S3.
  • The team is excited to hear feedback from customers and continue evolving the S3 Tables offering.
  • Attendees are encouraged to check out the available documentation and provide feedback through the session survey.

Your Digital Journey deserves a great story.

Build one with us.

Cookies Icon

These cookies are used to collect information about how you interact with this website and allow us to remember you. We use this information in order to improve and customize your browsing experience and for analytics and metrics about our visitors on this website.

If you decline, your information won’t be tracked when you visit this website. A single cookie will be used in your browser to remember your preference not to be tracked.

Talk to us