Second Brain: Crafted, Curated, Connected, Compounded on 10月02日 21:12
语义层中的指标层:统一业务语言与数据语言的桥梁
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

文章深入探讨了语义层中的指标层(Metrics Layer),强调其作为连接业务语言与数据语言的关键组件。指标层,也被称为指标存储或无头BI,负责详细定义度量(Measures)和维度(Dimensions)。它支持模型解析和API开发,遵循DRY原则,实现一次定义,跨多平台部署。文章指出,指标层通过统一的业务领域表示,解决了数据孤岛和逻辑重复问题,尤其是在处理跨数据库引擎的Jinja SQL逻辑复杂性时,指标存储的优势尤为突出。最后,文章列举了当前该领域的热门工具和概念,并强调了Analytics Engineers在其中日益增长的重要性。

💡 指标层是语义层的核心组成部分,其主要功能在于弥合“业务语言”与“数据语言”之间的鸿沟,提供统一、一致的业务领域数据表示。它详细定义了度量(Measures)和维度(Dimensions),并通过模型解析和API实现指标逻辑的开发与部署。

✅ 指标层遵循DRY(Don't Repeat Yourself)原则,允许在一个地方定义指标逻辑,然后将其部署到各种BI工具、内部应用程序或流程中,从而避免了重复开发和维护工作,提高了效率和一致性。

🚀 指标存储(Metrics Store)或无头BI(Headless BI)是指标层的一种实现形式,它充当所有业务指标的中央存储库。这种集中化的方式尤其有助于解决跨不同数据库引擎重用Jinja SQL逻辑时遇到的复杂性和局限性问题。

🛠️ Analytics Engineers在指标层的构建和维护中扮演着日益重要的角色,负责定义和转换这些核心业务指标。同时,引入类似Schema Registry的服务可以增强数据管道和应用逻辑的健壮性和一致性。

Similarities to a Semantic Layer

The metrics layer is a key component of a Semantic Layer. Often, a basic metrics layer is integrated within a BI tool, translating its metrics specifically for that BI tool.

Also known as a metrics store or headless BI, the metrics layer encompasses a detailed specification of metrics like Measures and Dimensions. It typically involves model parsing from files, predominantly YAML, and APIs to develop and implement metric logic, often including a cache layer. This approach advocates for the DRY (Don’t Repeat Yourself) principle, allowing for single-time definition and subsequent deployment across various BI tools or internal applications and processes.

Discover more on Semantic Layer or The Rise of the Semantic Layer.


An excellent depiction illustrating the placement and role of your Metrics Layer | Emerging Architectures for Modern Data Infrastructure (a16z)

Another insightful perspective RW Down the Semantic Rabbit Hole - By JP Monteiro:

The essence of a semantic layer is to bridge the gap between “business language” and “data language”, offering a unified, consistent business domain representation in the data.

Metrics Stores” – emerging as a pivotal element in the data stack, centralizing key business metrics Data & AI Landscape 2021

# What is the Metric Store / Headless BI?

In essence, the Headless BI or Metrics Store, as referred to here, is the hub for all metrics. Imagine it as the central repository of critical business logic, code, or metadata, implemented once and following the DRY principles.

Consider it as a repository of essential business logic, adhering to the DRY principles. This could be complex calculated measures in Apache Druid, for instance, that you wish to query across various BI tools or for diverse users. Instead of duplicating these measures in each tool, they’re centrally stored in the Analytics API, facilitated by the metrics store.

The communication within the Analytics API involves returning queries from the Query Engine via GraphQL, managing metadata store and data API configurations, and handling requests from the orchestration layer.

Currently, many rely on the popular Jinja and SQL templating solution, integrating it into the chosen BI tool. However, the rise of metrics stores is gaining momentum, primarily due to the complexities and limitations of re-using Jinja SQL logic across different database engines.

Metrics Stores, as integral parts of Analytics APIs, are crucial for analytics queries. The emerging role of Analytics Engineers, primarily tasked with defining and transforming these metrics, is becoming increasingly significant.

Moreover, the integration of additional services, akin to the Confluent Schema Registry (Schema Registry) for Kafka, could offer functionalities like comparing metric definitions and data store schemas, thus enhancing the robustness and consistency of data pipelines and application logic.

# The Recent Hype Around Headless BI

The data realm is buzzing with the concepts of Metrics Store and Headless BI. Noteworthy developments in this area include tools and platforms like Cube.js, Metriql, Supergrain, Transform.co, malloy, and AirBnB’s Minerva API. The adoption of GraphQL in these platforms is also notable.

Closed Sourced examples include Veezoo, uMetric, FlexIt with dbt

Drew Banin’s keynote at dbt’s Coalesce conference, discussing “The Metric System”, is a must-watch for insights into the evolution of measurement and metrics.

Visualihttps://www.ssp.sh/brain/Pasted%20image%2020211104081349.png20211104081349.png">

Some thoughts:

Related articles:


References: Supergrain
Created: 2021-11-04

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

语义层 指标层 指标存储 无头BI 数据架构 Semantic Layer Metrics Layer Metrics Store Headless BI Data Architecture
相关文章