WhatIs.com 09月29日 10:49
数据管理巨头携手共建语义模型开放标准
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

Snowflake、Salesforce等数据管理和分析领域的领先厂商宣布成立开放语义互联(OSI)联盟,旨在开发开放源代码的语义数据建模标准。此举旨在解决企业在构建AI和分析工具时面临的数据不一致问题。语义模型通过统一的数据定义,确保元数据的一致性,从而简化数据的搜索、发现和使用。目前,各厂商的语义建模能力多为专有工具,导致数据碎片化。OSI联盟的成立将推动跨平台互操作性,加速AI和分析应用的开发部署,并优化运营效率,为企业在大规模AI应用中建立信任和信心提供支持。

💡 行业巨头联合推动开放标准:Snowflake、Salesforce等数据管理和分析领域的知名厂商共同发起成立开放语义互联(OSI)联盟,致力于开发一套开放源代码的语义数据建模标准,以应对当前企业在AI和数据分析领域面临的数据不一致性挑战。

🌐 解决数据碎片化问题:通过建立统一的语义模型,OSI联盟旨在确保数据定义和元数据的一致性,从而打破不同平台和工具之间的数据壁垒,简化数据的搜索、发现和利用,提高数据整合效率。

🚀 加速AI与分析应用发展:标准化的语义模型将极大地简化AI和分析应用的开发过程,使开发者能够更便捷地获取和使用高质量、相关性强的数据,从而加速AI模型和应用程序的部署,提升其准确性和可信度。

⚙️ 优化运营效率与信任:该标准有望减少因语义定义不一致而产生的冲突和重复工作,显著缩短数据准备时间,使数据和AI团队能够专注于创新而非故障排除,从而在AI驱动的洞察中建立更大的信任和信心。

<p>A consortium of prominent data management and analytics vendors, including Snowflake and Salesforce, on Tuesday unveiled plans to develop an open source standard for semantic data modeling.</p><div class="ad-wrapper ad-embedded"> <div id="halfpage" class="ad ad-hp"> <script>GPT.display('halfpage')</script> </div> <div id="mu-1" class="ad ad-mu"> <script>GPT.display('mu-1')</script> </div> </div> <p>While enterprises face numerous barriers when developing AI and analytics tools <a href="https://www.techtarget.com/searchbusinessanalytics/news/252495543/Expert-Agile-data-driven-decision-making-key-to-growth"&gt;to inform decisions</a>, one of the biggest is inconsistent data. Semantic modeling makes data consistent so it can be <a href="https://www.techtarget.com/searchdatamanagement/post/Successful-data-analytics-starts-with-the-discovery-process"&gt;searched, discovered and used</a> to inform models and applications.</p> <p>A semantic model is a set of common definitions of data so that data's characteristics -- its <a href="https://www.techtarget.com/whatis/definition/metadata"&gt;metadata&lt;/a&gt; -- are classified consistently whenever data is ingested or transformed. Many enterprises, however, do not have semantic data modeling frameworks, nor do many data management and analytics platforms provide semantic layers within their platforms.</p> <p>Specialists such as <a href="https://www.techtarget.com/searchbusinessanalytics/news/366556394/DBT-Labs-updates-Semantic-Layer-adds-data-mesh-enablement"&gt;DBT Labs</a> and <a href="https://www.techtarget.com/searchdatamanagement/news/366619494/Cube-semantic-layer-eases-data-access-from-Power-BI-Excel"&gt;Cube&lt;/a&gt; focus on semantic modeling, while ThoughtSpot and Google's Looker are among the platforms providing semantic layers to underpin their broader set of tools. However, each vendor's semantic modeling capabilities are proprietary tools that differ from one another. Therefore, if an organization uses more than one platform for its data management needs, its data becomes fragmented.</p> <p>In addition to Snowflake and Salesforce, Alation, Atlan, Cube, DBT Labs, Mistral AI, Sigma and ThoughtSpot are among others participating in the Open Semantic Interchange (OSI).</p> <p>The OSI aims to change that by standardizing semantic modeling. As a result, its establishment is significant, according to Stephen Catanzano, an analyst at Enterprise Strategy Group, now part of Omdia.</p> <p>"The formation of a consortium to develop a standard for defining and sharing metadata is highly important," he said. "It addresses the challenge of fragmented data semantics across tools and platforms that create major roadblocks for both human and AI-enabled analysis. This standardization should [enable] organizations to scale AI and BI with greater confidence, speed, and trust."</p> <p>Beyond inconsistent data, poor data quality, outdated systems, talent shortages and organizational culture are <a target="_blank" href="https://www.statista.com/statistics/1557024/barriers-ai-adoption/#:~:text=Barriers%20to%20AI%20adoption%20in%20business%20worldwide%202025&amp;amp;text=In%202025%2C%20the%20biggest%20barrier,of%20AI%20products%20and%20services." rel="noopener">barriers</a> to successfully developing AI and analytics tools.</p> <section class="section main-article-chapter" data-menu-title="A new standard"> <h2 class="section-title"><i class="icon" data-icon="1"></i>A new standard</h2> <p>While poor semantic modeling -- or a complete lack of semantic modeling -- has long hindered integrating data and using all available data to inform analytics and AI applications, fragmented data has taken on greater significance over the past few years.</p> <blockquote class="main-article-pullquote"> <div class="main-article-pullquote-inner"> <figure> The formation of a consortium to develop a standard for defining and sharing metadata is highly important. It addresses the challenge of fragmented data semantics across tools and platforms that create major roadblocks for both human and AI-enabled analysis. </figure> <figcaption> <strong>Stephen Catanzano</strong>Analyst, Enterprise Strategy Group </figcaption> <i class="icon" data-icon="z"></i> </div> </blockquote> <p>OpenAI's November 2022 launch of ChatGPT marked a significant improvement in generative AI (GenAI) technology. Since then, because GenAI can fuel applications that make workers better informed and more efficient, many enterprises have <a target="_blank" href="https://www.ey.com/en_us/newsroom/2025/07/ai-investments-surge-but-agentic-ai-understanding-and-adoption-lag-behind" rel="noopener">increased their investments</a> in AI development.</p> <p>GenAI, however, requires large amounts of data to be accurate.</p> <p>GenAI outputs are based on aggregations of data rather than individual data points. As a result, when there is a large quantity of high-quality data to train a GenAI model or application, it's more likely to deliver an accurate output.&nbsp;</p> <p>Now, agents are the latest evolution in AI. Unlike GenAI tools that require inputs before delivering an output, agents can act autonomously. However, similar to GenAI tools, agents require large amounts of relevant training data to properly perform their prescribed tasks.</p> <p>Semantic modeling makes it easier for developers to discover <a href="https://www.techtarget.com/searchbusinessanalytics/feature/Talend-CEO-discusses-importance-of-mining-relevant-data"&gt;the requisite volume of relevant data</a> to train agents. An open standard for semantic modeling applied to all data would further simplify data discovery and development by enabling organizations to more easily integrate data stored in different systems.</p> <p>Kevin Petrie, an analyst at BARC U.S., noted that poor data quality is the top obstacle to analytics and AI success. As a result, the plan to develop a standard for semantic modeling is significant.</p> <p>"This is a good step forward for the industry," Petrie said. "A unified semantic layer can help overcome [data quality problems], enabling AI applications to consume diverse inputs to generate rich outputs."</p> <p>Surging interest in AI development, meanwhile, provided the impetus for forming the consortium to develop an open standard for semantic modeling, according to Josh Klahr, director of analytics product management <a href="https://www.techtarget.com/searchdatamanagement/feature/Customers-pleased-with-Snowflake-plans-for-AI"&gt;at Snowflake</a>.</p> <p>"Every company has struggled with fragmented, inconsistent semantic definitions for years," he said. "But until now, the pain was largely hidden inside BI tools and analytics teams. What's changed is the explosive demand for AI and agentic analytics. Suddenly, those inconsistencies aren't just slowing down dashboards. They're undermining the accuracy and trustworthiness of AI systems."</p> <p>Snowflake's role as one of the co-leaders of the OSI stemmed from its June launch of Semantic Views, Klahr continued. In conversations with customers, Snowflake repeatedly heard that semantics were fragmented, and organizations wanted <a href="https://www.techtarget.com/searchbusinessanalytics/news/252515720/Gartner-Augmented-analytics-ecosystem-for-BI-now-key"&gt;an interoperable way</a> to integrate them across their entire data estate.</p> <p>That feedback led Snowflake into discussions with partners, which Snowflake discovered were hearing similar feedback from their customers, according to Klahr.</p> <p>"Those conversations quickly coalesced into the idea of the Open Semantic Interchange initiative, bringing together leaders from across data, AI, BI, analytics and industry verticals to create an open, vendor-neutral semantic model specification," he said. "We formed OSI as an industry initiative to address this shared problem collectively, and we're continuing to bring more partners into the initiative."</p> <p>While the OSI has not yet released a standardized semantic modeling framework, a working group has been formed to develop the open standard "quickly," according to Klahr.</p> <p>Meanwhile, Catanzano noted that the sooner the OSI can develop a universal semantic modeling framework, the more beneficial it will be for organizations <a target="_blank" href="https://www.rand.org/pubs/research_reports/RRA2680-1.html" rel="noopener">struggling to successfully develop AI models</a> and applications.</p> <p>"As AI increasingly becomes the primary way businesses leverage data, inconsistent interpretations of business metrics and metadata across different tools are causing confusion, slowing adoption and eroding trust in AI-driven insights, making standardization critical for successful AI implementation at scale," he said.</p> <p>The OSI's specific goals include the following:</p> <ul class="default-list"> <li>Improve interoperability across tools and platforms through <a target="_blank" href="https://tdwi.org/articles/2023/07/13/arch-all-importance-of-the-universal-semantic-layer-in-modern-data-analytics-and-bi.aspx" rel="noopener">a shared semantic standard</a> to make integrating and preparing data easier.</li> <li>Accelerate developing and deploying AI and analytics applications by standardizing how semantics are defined and exchanged.</li> <li>Streamline operations by reducing the need to reconcile conflicting semantic definitions and duplicate work across platforms.</li> </ul> <p>The OSI's intent to develop an open standard for semantic modeling follows the recent launches of open standards to simplify how agents are trained and interoperate.</p> <p>Model Context Protocol, created by AI vendor Anthropic and launched in November 2024, is <a target="_blank" href="https://modelcontextprotocol.io/docs/getting-started/intro" rel="noopener">an open framework</a> that addresses how agents interact with data sources, including proprietary data sources such as databases and public data sources such as large language models. Agent2Agent Protocol, developed by Google and released in April, addresses how agents autonomously interact with one another once they are deployed.</p> <p>As agents become more ubiquitous, other processes, such as integrating data from external sources and AI model evaluation, could benefit from open standards, according to Catanzano.</p> <p>"We need ways to audit AI," he said.</p></section> <section class="section main-article-chapter" data-menu-title="Looking ahead"> <h2 class="section-title"><i class="icon" data-icon="1"></i>Looking ahead</h2> <p>Once the OSI launches its standardized semantic modeling framework, whether it serves its intended purpose will depend on how it's received. To truly reduce fragmented data, it will need more than the current group of vendors that make up the consortium to buy in, according to Petrie.</p> <p>The current group represents a good start with participants from different data management segments such as <a href="https://www.techtarget.com/searchbusinessanalytics/news/252510804/Data-catalogs-fuel-increased-efficiency-speed-to-insight"&gt;data catalogs</a>, <a href="https://www.techtarget.com/searchdatamanagement/definition/data-transformation"&gt;data transformation</a> and semantic modeling. But without participation from AWS, Databricks, Google, Microsoft and other leading data management vendors, the OCI's framework could be another tool that fragments data.</p> <p>"Snowflake has assembled a solid group of supporting vendors, [but] to be successful, rather than creating another incompatible silo, OSI will also need the support of the hyperscalers such as AWS and other cloud data platforms such as Databricks," Petrie said. "It also will need to address on-premises platforms, because our research shows that one-third of AI workloads reside on-premises."</p> <p>Catanzano noted that for the consortium to succeed, it needs widespread support.</p> <p>"What makes this initiative powerful is the collaboration among industry leaders," he said. "Rather than addressing semantic standardization in silos, these companies are creating a vendor-neutral specification that marks a decisive shift away from closed, single-vendor approaches. ... But we will see if others join in or introduce something competitive to this one."</p> <p>Ultimately, however, the OSI is attempting to provide a helpful framework. If it succeeds, <a href="https://www.techtarget.com/searchenterpriseai/tip/How-the-Model-Context-Protocol-simplifies-AI-development"&gt;like MCP</a>, it will be a major step toward simplifying the development of analytics and AI applications, according to Catanzano.</p> <p>"A standard for defining and sharing metadata will … streamline operations by eliminating the weeks currently spent reconciling conflicting definitions or duplicating work across platforms," he said. "This allows data and AI teams to focus on innovation rather than troubleshooting semantic inconsistencies."</p> <p><i>Eric Avidon is a senior news writer for Informa TechTarget and a journalist with more than 25 years of experience. He covers analytics and data management.</i></p></section>

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

语义数据建模 开放标准 数据管理 AI 分析 Snowflake Salesforce OSI 互操作性 数据碎片化 Semantic Data Modeling Open Standard Data Management AI Analytics Interoperability Data Fragmentation
相关文章