Second Brain: Crafted, Curated, Connected, Compounded on 10月02日
数据工程宝库介绍
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

数据工程宝库是我第二大脑的重要组成部分,这是一个精心构建的数据工程知识网络,旨在促进数据工程领域的探索、发现和深度学习。这里汇集了1000多个相互关联的术语和概念,每个都是深入见解的入口。它如同一个数字花园,让用户可以有机地探索和连接数据工程相关的想法,揭示隐藏的关系,扩展理解,为各种水平的数据工程师提供独特的沉浸式学习体验。

🌳 数据工程宝库是一个精心构建的知识网络,包含1000多个相互关联的术语和概念,旨在促进数据工程领域的探索、发现和深度学习。

🗺️ 它像一个数字花园,用户可以有机地探索和连接数据工程相关的想法,揭示隐藏的关系,扩展理解。

📚 该宝库涵盖了数据工程的基础知识、现代数据基础设施、数据转换与处理、现代分析方法和专业数据技术等多个关键主题和概念。

🧠 它为各种水平的数据工程师提供独特的沉浸式学习体验,帮助他们更好地理解和应用数据工程实践。

Data Engineering Vault

Welcome to the Data Engineering Vault, an integral part of my larger Second Brain. This curated network of data engineering knowledge is designed to facilitate exploration, discovery, and deep learning in the field of data engineering. Here, you'll find a rich ecosystem of 1000+ interconnected terms and concepts, each serving as a gateway to deeper insights. Functioning like a Digital Garden for data engineering, this network allows you to organically explore and connect ideas.

Key Topics & Concepts

As you navigate through the concepts, you'll uncover hidden relationships, expanding your understanding and providing a unique, immersive learning experience whether you're a seasoned data engineer or just starting your journey.

Data engineering is a term that has shifted over the years from a Database Admins (DBA), ETL Developer, and Business Intelligence Specialist and merged with Software Engineers to a Data Engineer with the growth of data made his title.

It’s still not well defined, the latest book on Fundamentals of Data Engineering (Joe Reis, Matt Housley) tries and does probably best as of today; it’s getting clearer. Besides several boot camps, universities are also starting to get a degree in data engineering like Data Science did before. Let’s start by defining what data engineering is.

# What is Data Engineering

Data engineering is the less famous sibling of data science. Data science is growing like no tomorrow, as does data engineering, but much less heard. Compared to existing roles, it would be a software engineering plus business intelligence engineer including big data abilities as the Hadoop ecosystem, streaming, and computation at scale.

Business creates more reporting artifacts, but with more data that needs to be collected, cleaned, and updated near real-time, complexity is expanding daily. With that said, more programmatic skills are required, similar to software engineering. The emerging language at the moment is Python (more The Tool Language, Python) which is used in engineering with tools identical to Apache Airflow, Dagster, other Data Orchestrators, and data science with powerful libraries. Today as a BI engineer, you use SQL for almost everything except when using external data from an FTP server, for example. You would use bash and PowerShell in the nightly batch jobs. But this is no longer sufficient, and because it gets a full-time job to develop and maintain all these requirements and rules (called pipelines), data engineering is needed.

# Evolution of Data Engineering

# Getting Started with Data Engineering

Additional resources that can further enhance your understanding of data engineering. Whether you’re just starting out or looking to deepen your expertise, these resources are handpicked for their clarity, depth, and practical insights.

Essential Toolkit

Start with the Data Engineering Toolkit - a comprehensive guide to 70+ technologies and tools that form the foundation of modern data engineering practice.

# Must-Read Articles

Begin your journey with the “holy trinity” from Maxime Beauchemin, defining the essence of data engineering:

# Community and Learning

Don’t miss out on these foundational reads and thought leaders in the field:

Feel free to explore, learn, and contribute to this ever-growing field. Your journey in data engineering is just beginning.


Origin: Data Engineering, the future of Data Warehousing?
References:
Created: 2021-10-11

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

数据工程 知识库 数据基础设施 数据转换 数据分析
相关文章