MarkTechPost@AI 2024年07月26日
IBM Researchers Introduce AI-Hilbert: An Innovative Machine Learning Framework for Scientific Discovery Integrating Algebraic Geometry and Mixed-Integer Optimization
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

IBM 研究人员与帝国理工学院和三星 AI 合作,开发了一种名为 AI-Hilbert 的机器学习框架,用于科学发现。该框架结合了代数几何和混合整数优化,能够从不完整公理和噪声数据中推导出新的科学规律。AI-Hilbert 能够识别变量之间的隐式多项式关系,并利用现有理论来缩小搜索空间,从而弥补数据不足或噪声过大的问题。该框架还能够识别理论中的不一致性,并通过最佳子集选择来确定最能解释数据的假设。

📒 **将公理和定律建模为多项式:** AI-Hilbert 将科学定律和公理建模为多项式,并使用二元变量和逻辑约束来解决多项式优化问题。通过混合整数线性规划或半定规划,AI-Hilbert 能够从假设和数据中推导出已知的科学规律,例如开普勒定律和辐射引力波功率方程。

📡 **整合背景知识和实验数据:** AI-Hilbert 能够将背景知识和实验数据相结合,以发现复杂自然规律。通过将背景理论和数据转化为多项式优化问题,AI-Hilbert 能够利用半定规划求解该问题,并获得候选公式及其形式推导。

📈 **确保发现的规律在公理上是正确的:** AI-Hilbert 借鉴了大卫·希尔伯特关于平方和与非负多项式关系的研究成果,确保发现的规律在给定背景理论的情况下是公理上正确的。如果背景理论不一致,AI-Hilbert 将通过最佳子集选择来识别不一致的来源,并确定最能解释数据的假设。

📇 **克服传统方法的局限性:** AI-Hilbert 克服了传统科学发现方法的局限性,这些方法通常依赖于理论或数据。AI-Hilbert 能够将理论和数据相结合,从而在数据稀缺和理论有限的情况下实现科学发现。

📃 **未来展望:** AI-Hilbert 的未来发展方向包括将该框架扩展到非多项式环境,自动调整超参数,以及通过优化底层计算技术来提高可扩展性。

Science aims to discover concise, explanatory formulae that align with background theory and experimental data. Traditionally, scientists have derived natural laws through equation manipulation and experimental verification, but this approach could be more efficient. The Scientific Method has advanced our understanding, but the rate of discoveries and their economic impact has stagnated. This slowdown is partly due to the depletion of easily accessible scientific insights. To address this, integrating background knowledge with experimental data is essential for discovering complex natural laws. Recent advances in global optimization methods, driven by improvements in computational power and algorithms, offer promising tools for scientific discovery.

Researchers from Imperial College Business School, Samsung AI, and IBM propose a solution to scientific discovery by modeling axioms and laws as polynomials. Using binary variables and logical constraints, they solve polynomial optimization problems via mixed-integer linear or semidefinite optimization, validated with Positivstellensatz certificates. Their method can derive well-known laws like Kepler’s Law and the Radiated Gravitational Wave Power equation from hypotheses and data. This approach ensures consistency with background theory and experimental data, providing formal proofs. Unlike deep learning methods, which can produce unverifiable results, their technique guarantees scalable and reliable discovery of new scientific laws.

The study establishes fundamental definitions and notations, including scalars, vectors, matrices, and sets. Key symbols include b for scalars,  x for vectors, A for matrices, and Z for sets. Various norms and cones in the SOS optimization literature are defined. Putinar’s Positivstellensatz is introduced to derive new laws from existing ones. The AI-Hilbert aims to discover a low-complexity polynomial model q(x)=0 consistent with axioms G and H, fits experimental data, and is bounded by a degree constraint. The formulated optimization problem balances model fidelity to data and hypotheses with a hyperparameter λ.

AI-Hilbert is a paradigm for scientific discovery that identifies polynomial laws consistent with experimental data and a background knowledge base of polynomial equalities and inequalities. Inspired by David Hilbert’s work on the relationship between sum-of-squares and non-negative polynomials, AI-Hilbert ensures that discovered laws are axiomatically correct given the background theory. In cases where the background theory is inconsistent, the approach identifies the sources of inconsistency through best subset selection, determining the hypotheses that best explain the data. This methodology contrasts with current data-driven approaches, which produce spurious results in limited data settings and fail to differentiate between valid and invalid discoveries or explain their derivations.

AI-Hilbert integrates data and theory to formulate hypotheses, using the theory to reduce the search space and compensate for noisy or sparse data. In contrast, data helps address inconsistent or incomplete theories. This approach involves formulating a polynomial optimization problem from the background theory and data, reducing it to a semidefinite optimization problem, and solving it to obtain a candidate formula and its formal derivation. The method incorporates hyperparameters to control model complexity and defines a distance metric to quantify the relationship between the background theory and the discovered law. Experimental validation demonstrates AI-Hilbert’s ability to derive correct symbolic expressions from complete and consistent background theories without numerical data, handle inconsistent axioms, and outperform other methods in various test cases.

The study introduces an innovative method for scientific discovery that integrates real algebraic geometry and mixed-integer optimization to derive new scientific laws from incomplete axioms and noisy data. Unlike traditional methods relying solely on theory or data, this approach combines both, enabling discoveries in data-scarce and theory-limited contexts. The AI-Hilbert system identifies implicit polynomial relationships among variables, offering advantages in handling non-explicit representations common in science. Future directions include extending the framework to non-polynomial contexts, automating hyperparameter tuning, and improving scalability by optimizing the underlying computational techniques.


Check out the Paper and Details. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter..

Don’t Forget to join our 47k+ ML SubReddit

Find Upcoming AI Webinars here

The post IBM Researchers Introduce AI-Hilbert: An Innovative Machine Learning Framework for Scientific Discovery Integrating Algebraic Geometry and Mixed-Integer Optimization appeared first on MarkTechPost.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

AI-Hilbert 科学发现 机器学习 代数几何 混合整数优化
相关文章