Second Brain: Crafted, Curated, Connected, Compounded on 10月02日 21:14
数据分析工具中的数据透视表:重拾昔日荣光
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

数据透视表(Pivot Table)作为一种强大的数据聚合和分析工具,在2024年正经历着一场显著的复兴。从其1986年的起源,到如今被Rill Developer、Cube和DuckDB等新兴及成熟的数据分析平台重新集成和优化,数据透视表的功能正在被不断拓展和增强。这些新一代的实现不仅保留了其核心的聚合、汇总能力,还结合了现代数据处理的优势,如DuckDB的性能优势和Cube与Excel/Google Sheets的无缝集成,使得数据透视表在当今复杂的数据分析场景中依然保持着重要的地位和实用价值。

📊 **数据透视表的历史渊源与现代回归**:数据透视表由Pito Salas于1986年发明,并在1989年随Lotus Improv首次发布。尽管曾一度被视为过时技术,但2024年见证了其在Rill Developer、Cube和DuckDB等平台上的显著回归,表明其核心价值依然存在,并被现代数据工具所采纳和发展。

💡 **DuckDB对数据透视表的创新实现**:DuckDB通过其Excel风格的`pivot_table`函数,为用户提供了强大的数据透视功能,支持子合计和总计等特性。该实现不仅继承了Pandas `pivot_table`的易用性,更结合了DuckDB的内存计算能力,提供了卓越的性能和可扩展性,并内置了丰富的测试用例以确保其鲁棒性。

🚀 **Rill Developer与Cube的集成优势**:Rill Developer将数据透视表功能直接嵌入其Web UI,简化了数据探索流程。Cube则通过与Excel和Google Sheets的集成,使用户能在熟悉的电子表格环境中利用Cube的数据模型进行透视分析,极大地提升了数据分析的便捷性和效率。

🔄 **数据透视表的持续演进与重要性**:从早期的电子表格到如今的云端数据平台,数据透视表经历了漫长的发展历程。其能够将大量原始数据转化为易于理解的汇总信息,对于数据洞察和决策支持仍然至关重要。2024年的复兴表明,数据透视表作为一种基础而强大的数据分析工具,其生命力依然旺盛。

Are pivot tables making a comeback in 2024? Rill Developer incorporates pivot tables within its web UI, Cube enables pivot table functionality in Excel WITH Cube, and DuckDB has introduced its own implementation. Tweet

# What is a Pivot Table?

A pivot table is a table of values that are aggregations of groups of individual values from a more extensive table (such as from a database, spreadsheet, or business intelligence program) within one or more discrete categories.

The aggregations or summaries of the groups of the individual terms might include sums, averages, counts, or other statistics. A pivot table is the outcome of the statistical processing of tabularized raw data and can be used for decision-making.

Although pivot table is a generic term, Microsoft held a trademark on the term in the United States from 1994 to 2020.

Pito Salas is a Curaçaoan-American Cambridge, Massachusetts-based software developer. While working with Lotus’ Advanced Technology Group in 1986, Salas invented the pivot table, a “next-generation” spreadsheet concept that was released by Lotus in 1989, as Lotus Improv. Source - Wiki

# History

gantt    title Milestones in Spreadsheet & Pivot Table Software    dateFormat YYYY    axisFormat %Y        section Early Foundations    Paper Spreadsheets         :milestone, m1, 1960, 0d        section Digital Revolution & Spreadsheets    IBM Financial Planning     :milestone, m2, 1970, 0d    VisiCalc           :milestone, m3, 1979, 0d    Lotus 1-2-3        :milestone, m4, 1983, 0d    Excel for Mac       :milestone, m5, 1985, 0d        section Desktop Era    Lotus Improv (First Pivot):milestone, m7, 1989, 0d    Excel 5.0 (Pivot Tables)  :milestone, m8, 1993, 0d    QlikView       :milestone, m9, 1993, 0d    Tableau      :milestone, m10, 2003, 0d    PowerPivot       :milestone, m12, 2010, 0d        section Cloud Era    Power BI   :milestone, m13, 2015, 0d    Sigma     :milestone, m14, 2019, 0d    Cube, Omni, Rill     :milestone, m14, 2025, 0d    click m8 href "Pivot Tables introduced"    click m7 href "First pivot concept"    click m12 href "PowerPivot & Power Query"    click m13 href "Python integration"    click m14 href "Python integration"

From Rill | Why Pivot Tables Never Die.

# Examples of the Resurgence

# DuckDB

DuckDB introduces pivot tables through the Excel pivoting API. More details can be found in Creating a SQL-Only Extension for Excel-Style Pivoting in DuckDB – DuckDB:

The pivot_table function allows for Excel-style pivots, including subtotals, grand_totals, and more. It is also very similar to the Pandas pivot_table function, but with all the scalability and speed benefits of DuckDB. It contains over 250 tests, so it is intended to be useful beyond just an example!

To use the pivot table functionality in DuckDB:

12
INSTALL pivot_table FROM community;LOAD pivot_table;

2024-09-27

An example by Michael Simons on pv/bin/import_and_update.sh at main · michael-simons/pv · GitHub.

# Rill

2024-09-10:

Vimeo Video: Introducing the Rill pivot table

For more information, see Rill Developer and Rill | Introducing the Rill pivot table.

# Cube

Cube integrates with Google Sheets and Excel. The following example shows integration with Google Sheets as of

Source: How Cube Improves Shttps://www.ssp.shhttps://www.ssp.sh/brain/Pivot%20Table-20240924153827343.webp>

Artyom Keydunov on LinkedIn: Excited to announce the release of Google Sheets integration for Cube! 🎉…

# Pandas

Pivot tables are so not bad that even pandas has a pivot_table function…Bsky

Pandas Docs on Pivot Table

# Further Readings


Origin: Excel for ever, DuckDB Con in Seattle
References:
Created 2024-09-24

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

数据透视表 Pivot Table 数据分析 Data Analysis DuckDB Cube Rill Developer Excel Google Sheets 数据可视化 Data Visualization 技术复兴
相关文章