Your always on-call Data Engineer
Debugging gardeners for your daily data engineering work in production.

Ollie
Gets you to root cause
Autonomously investigates incidents and helps you run production AI and data infrastructure.

Lea
Compliance ready
Audits your AI and data infrastructure and helps you stay compliant with data governance policies.
Run SQL in our lake
Run SQL with our lake. Upload your data, use public datasets, join with your data, and get observability without any configuration. Contribute back to the community by uploading public datasets for everyone to use.
Observe your stack in under 5 minutes
We also work seamlessly with the tools you already use. Our lineage events view is compatible with popular data tools like Spark, Airflow, dbt, and Flink, providing the insights you need without disrupting your workflow.

We ❤️ open source
Get started
This is the easiest way to get started. Just run any SQL:
SELECT *FROM your_table INNER JOIN public_table ON your_table.id = public_table.idWHERE your_table.id = 1LIMIT 100Latest articles
Lake is (a)live
Introducing Lake: write SQL with lineage captured automatically, collaborative public/private datasets, and thoughtful moderation.
Chat stats with Spark & OpenLineage
Spark on EMR Serverless, write Iceberg tables in the Glue data catalog, parse/chatify JSON text for word counts, query with Athena, and monitor lineage & pipeline health in oleander.
Join our mailing list
Stay updated with our latest news