Spark & Iceberg
Managed Spark & Iceberg with OpenLineage by default.
Managed Spark & Iceberg with OpenLineage by default.
The more I learn, the less I know.
Introducing Lake: write SQL with lineage captured automatically, collaborative public/private datasets, and thoughtful moderation.
Spark on EMR Serverless, write Iceberg tables in the Glue data catalog, parse/chatify JSON text for word counts, query with Athena, and monitor lineage & pipeline health in oleander.
Explore our journey building a browser-based Parquet viewer, from the initial implementation to our current DuckDB-powered solution with filtering and sorting capabilities.
A practical guide to implementing OpenLineage with Spark and Iceberg. Learn how to set up data lineage tracking in your data pipeline.
Oleander is the culmination of years of experience building data observability and metadata management tools. Learn about our journey from WeWork to OpenLineage.