Second Brain: Crafted, Curated, Connected, Compounded on October 2
Dagster Data Assets and Data Orchestration

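This article explores the concept of data assets in Dagster and their role in data orchestration, analyzes the advantages of the declarative model for data management, and illustrates how data assets relate to data products.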

Data Assets are what a declarative pipeline in Dagster produces, as detailed in Software-Defined Asset. The Dagster note discusses why this approach is powerful and how the declarative model is shaping the future of data.

In this model, triggers are attached to assets rather than jobs, which makes them easier to understand and manage. For instance, instead of scheduling a daily update off the completion of a pipeline, we state the goal for the asset itself, such as refreshing the revenue data each day. Triggers can also depend on the state of an asset, such as re-materializing an asset when its upstream asset changes.
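To make the state-based trigger concrete, here is a minimal toy sketch in plain Python. It is not Dagster's API: the `Asset` and `AssetGraph` classes, the version counters, and the `orders`/`revenue` assets are all hypothetical names invented for illustration. It only shows the idea of re-materializing an asset when its upstream counterpart has changed.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Asset:
    """Hypothetical asset: a name, a compute function, and upstream asset names."""
    name: str
    compute: Callable[..., object]
    upstream: List[str] = field(default_factory=list)


class AssetGraph:
    """Toy reconciler (an assumption for illustration, not how Dagster is built)."""

    def __init__(self, assets: List[Asset]) -> None:
        # Assumes assets are passed in dependency order (upstream first).
        self.assets: Dict[str, Asset] = {a.name: a for a in assets}
        self.values: Dict[str, object] = {}
        self.versions: Dict[str, int] = {}
        # Upstream versions each asset saw when it was last materialized.
        self.seen_upstream: Dict[str, Dict[str, int]] = {}

    def materialize(self, name: str) -> None:
        asset = self.assets[name]
        inputs = [self.values[u] for u in asset.upstream]
        self.values[name] = asset.compute(*inputs)
        self.versions[name] = self.versions.get(name, 0) + 1
        self.seen_upstream[name] = {u: self.versions[u] for u in asset.upstream}

    def is_stale(self, name: str) -> bool:
        if name not in self.versions:
            return True  # never materialized
        seen = self.seen_upstream[name]
        # Stale when any upstream asset has a newer version than the one last seen.
        return any(self.versions[u] != seen[u] for u in self.assets[name].upstream)

    def reconcile(self) -> List[str]:
        """Materialize every stale asset and return the names that were refreshed."""
        refreshed = []
        for name in self.assets:
            if self.is_stale(name):
                self.materialize(name)
                refreshed.append(name)
        return refreshed


raw_orders = [120.0, 80.0]  # made-up sample data
graph = AssetGraph([
    Asset("orders", lambda: list(raw_orders)),
    Asset("revenue", lambda orders: sum(orders), upstream=["orders"]),
])

first = graph.reconcile()    # nothing exists yet, so both assets are built
second = graph.reconcile()   # nothing changed, so nothing is rebuilt
raw_orders.append(50.0)
graph.materialize("orders")  # the upstream asset changes
third = graph.reconcile()    # only the downstream asset is rebuilt
```

Note that no job is ever named: the caller only asks the graph to bring assets up to date, and the decision about what to run follows from the state of each asset.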

The concept of Data Assets, as defined in Dagster's documentation, covers both the definition of an asset and its materialization. These assets are dynamic, and the term can also be used synonymously with Data Product within the Data Mesh framework.
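The split between definition and materialization can be sketched in a few lines of plain Python. This is an illustrative assumption, not Dagster's data model: `AssetDefinition`, `Materialization`, the `daily_revenue` asset, and the sales figures are all made up for the example.

```python
from dataclasses import dataclass
from datetime import date
from typing import Callable, Dict, List

# Made-up sample data standing in for a sales source system.
SALES_BY_DAY: Dict[date, List[float]] = {
    date(2022, 4, 27): [120.0, 80.0],
    date(2022, 4, 28): [200.0, 50.0, 25.0],
}


@dataclass(frozen=True)
class AssetDefinition:
    """The declarative part: what the asset is and how to compute it."""
    key: str
    description: str
    compute: Callable[[date], float]


@dataclass(frozen=True)
class Materialization:
    """One concrete result of computing the definition for a given day."""
    asset_key: str
    day: date
    value: float


def materialize(definition: AssetDefinition, day: date) -> Materialization:
    return Materialization(definition.key, day, definition.compute(day))


daily_revenue = AssetDefinition(
    key="daily_revenue",
    description="Total revenue per calendar day",
    compute=lambda day: sum(SALES_BY_DAY[day]),
)

# One definition, many materializations over time.
history = [materialize(daily_revenue, day) for day in sorted(SALES_BY_DAY)]
```

The definition stays fixed while `history` grows, which is one way to read the statement that a data asset includes both parts.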

DJ Patil illustrates the importance of the Data Product, emphasizing that

    recommendations are also a form of product

Ascend.io pioneered this approach over a year ago, as seen in Use declarative pipelining instead of imperative and Declarative vs Imperative.

# Asset-based Data Orchestration

For an illustrative example, see Asset-Based Data Orchestration (from DATA + AI Summit 2023) - YouTube, and also refer to Software-Defined Asset.

For a visual representation, refer to Asset-Based Data Orchestration.


Origin: Introducing Software-Defined Assets | Dagster Blog, Rethinking Orchestration as Reconciliation: Software Defined Assets in Dagster | Elementl - YouTube
References:
Created: 2022-04-28
