
Snowflake's `Snowpark` product, which they recently announced, aims to bring Spark-like APIs to Snowflake.
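
For anyone who hasn't used Spark: here's a minimal sketch of the DataFrame-style code this implies, written in PySpark since that's the API Snowpark mirrors (Snowpark's own syntax differs, and the table and column names are made up):

```python
# Illustrative PySpark transformation of the sort Snowpark promises to run
# natively inside Snowflake. Table and column names are hypothetical.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("snowpark_style_example").getOrCreate()

orders = spark.table("raw.orders")  # hypothetical source table
daily_revenue = (
    orders
    .where(F.col("status") == "complete")
    .groupBy(F.to_date("created_at").alias("order_date"))
    .agg(F.sum("amount").alias("revenue"))
)
daily_revenue.write.mode("overwrite").saveAsTable("analytics.daily_revenue")
```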

Having a DS background, I love what the SQL-orchestration tool dbt (and its peers) has enabled: data consumers like me can rapidly create our own safe data pipelines. There's easily a 10x productivity improvement for most of my transformation pipelines vs. writing them in Python or PySpark.

But batch ML and SQL don't mix that well (even BigQuery ML is too limiting). I end up butchering dbt's value (simplicity and iteration speed): splitting the DAG into pieces and orchestrating them with Airflow so that I can wedge in the non-dbt parts (feature engineering, inference, logging, detecting stale models, ...). This isn't what the future looks like.
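
To make the pain concrete, here's a minimal sketch of the kind of Airflow DAG I end up with, assuming dbt is invoked via BashOperator and the Python steps are stubs (model selectors, task names, and the schedule are all invented):

```python
# Sketch: a dbt project broken apart so non-dbt steps can be wedged in.
# Everything here (selectors, callables, schedule) is hypothetical.
from datetime import datetime

from airflow import DAG
from airflow.operators.bash import BashOperator
from airflow.operators.python import PythonOperator


def build_features():
    ...  # pandas/PySpark feature engineering that doesn't fit in SQL


def run_inference():
    ...  # load the model, score the feature table, write predictions back


def check_model_staleness():
    ...  # compare training date / drift metrics, alert if the model is stale


with DAG(
    "ml_pipeline",
    start_date=datetime(2021, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    dbt_staging = BashOperator(
        task_id="dbt_staging", bash_command="dbt run --select staging"
    )
    features = PythonOperator(
        task_id="feature_engineering", python_callable=build_features
    )
    staleness = PythonOperator(
        task_id="staleness_check", python_callable=check_model_staleness
    )
    inference = PythonOperator(task_id="inference", python_callable=run_inference)
    dbt_marts = BashOperator(
        task_id="dbt_marts", bash_command="dbt run --select marts"
    )

    dbt_staging >> features >> inference >> dbt_marts
    staleness >> inference
```

Every one of those seams is a place where dbt's single, self-documenting DAG gets lost.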

I've tried switching to Databricks, but I don't see it as the path forward for unifying the warehouse + batch ML.

Hopefully Snowpark is a step forward :)

-------------------

Separately, https://materialize.com/ is something I'm paying attention to! Being able to implement all of my SQL-based pipelines as materialized views would be immensely valuable. They recently raised capital and they could become huge.
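
Roughly what I mean: Materialize speaks the Postgres wire protocol, so here's a sketch using psycopg2 (the connection details, source table, and columns are all invented):

```python
# Sketch: defining one SQL transformation as a Materialize materialized view
# instead of a batch dbt model. All names and connection details are hypothetical.
import psycopg2

conn = psycopg2.connect(
    "host=localhost port=6875 dbname=materialize user=materialize"
)
conn.autocommit = True

with conn.cursor() as cur:
    cur.execute("""
        CREATE MATERIALIZED VIEW daily_revenue AS
        SELECT date_trunc('day', created_at) AS order_date,
               sum(amount) AS revenue
        FROM orders
        WHERE status = 'complete'
        GROUP BY 1;
    """)

    # The view is maintained incrementally; reading it is an ordinary SELECT.
    cur.execute("SELECT * FROM daily_revenue ORDER BY order_date DESC LIMIT 7;")
    for row in cur.fetchall():
        print(row)
```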



Love that you brought up Snowflake. I've been wanting to get my hands on it to play around with and to learn more about integrating it with Salesforce.


And not something like Spark on EMR?


Well no, unfortunately.

Remember that "data is a team sport". Together, we try and make better decisions (in manual or automated ways). A DE can produce great data but it's only useful if it helps the DA/DS. There's a lot of friction there.

Most of that friction disappears with SQL-based orchestration tools (I mean specifically dbt here, but there are others). Suddenly the analyst can create the data they need! With minimal guidance from a DE.

That can be with Spark SQL (+ DeltaLake / Iceberg), or some warehouse. That's not the issue.
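
For instance, a sketch of the same kind of SQL model run through Spark SQL on Delta Lake (this assumes a Spark session already configured with the Delta extensions; the table names are invented):

```python
# Sketch: a dbt-style SQL model executed via Spark SQL over Delta Lake.
# Assumes Delta Lake is configured on the session; tables are hypothetical.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("sql_models_on_delta").getOrCreate()

spark.sql("""
    CREATE OR REPLACE TABLE analytics.daily_active_users
    USING delta
    AS
    SELECT date_trunc('day', event_time) AS event_date,
           count(DISTINCT user_id)       AS active_users
    FROM raw.events
    GROUP BY 1
""")
```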

The issue is around keeping orchestration simple when you're not just doing simple stuff anymore. Keeping that DAG logical, clear, and smooth is difficult once you include non-SQL items.

This isn't solved by Spark UDFs unfortunately :)
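
For clarity, by UDFs I mean in-query functions like the sketch below: handy for calling Python on rows inside a SQL statement, but they do nothing for the orchestration around the statement (the names here are illustrative).

```python
# Sketch of a pandas UDF: useful for row-level Python inside a query, but it
# doesn't help with the surrounding DAG of non-SQL steps. Names are made up.
import numpy as np
import pandas as pd
from pyspark.sql import SparkSession
from pyspark.sql.functions import pandas_udf

spark = SparkSession.builder.getOrCreate()


@pandas_udf("double")
def log_amount(amount: pd.Series) -> pd.Series:
    return np.log1p(amount)


spark.udf.register("log_amount", log_amount)
spark.sql("SELECT order_id, log_amount(amount) AS log_amount FROM raw.orders").show()
```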



