Tristan's Projects

LessWrong, August 15

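The author is an independent researcher and software developer who is actively seeking funding and collaboration opportunities, with a focus on AI alignment and related public benefit projects. The author describes several core projects in progress or being contributed to: My Self Study Journal (SSJ), which records and shares the study and application of AI alignment knowledge; the n-dimensional interactive scatter plot (NDISP), which aims to develop visualization tools to support mechanistic interpretability research; Outcome Influencing Systems (OISs), a theoretical framework concerning the risk of misaligned superintelligence (ASI); and the social media concept "Map Articulating All Talking" (MAAT), which aims to organize and simplify public discussion. The author welcomes contact from anyone, whether about funding, collaboration, mentorship, or feedback.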

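✨ **Self Study Journal (SSJ)**: The author positions this project as a way to stay motivated and to build knowledge and skill in AI alignment and related projects. It also serves as a public "progress report" that justifies the value of the research, inspires others, and acts as a point of contact for peer feedback and collaboration. The author is the sole contributor to the journal entries but welcomes any feedback on content or format.

📊 **N-dimensional interactive scatter plot (NDISP)**: This project aims to create interactive visualization tools and apply them to mechanistic interpretability research. The author believes that analysis of data distributions in high-dimensional space has broad applications across many fields. The project was inspired by the work of Mingwei Li and colleagues; the author extended it as a course project and has continued to develop it, and plans to release standalone modules and a user-friendly web app and to publish related research papers.

💡 **Outcome Influencing Systems (OISs)**: The author proposes the concept of "Outcome Influencing Systems" (OISs) to fill a gap in the discussion of existential risk from misaligned superintelligence (ASI). By developing terminology and formalism around this model, the author hopes to mitigate the problems caused by existing terminology and to promote more productive interdisciplinary research on ASI risk and social coordination problems. The author is currently looking for collaborators or for people who can point out flaws in the idea.

💬 **Map Articulating All Talking (MAAT)**: MAAT is a concept for a social media app that aims to make public discourse and the state of academic fields clearer and easier to understand by compressing multiple versions of the same discussion into a set of idea nodes. This helps avoid confusion caused by differences in terminology and reduces the time wasted searching for progress among redundant discussions. The author welcomes a capable team with the right motivations to develop the idea.

🤝 **Seeking collaboration and funding**: The author is actively seeking funding to support this research and hopes, as an independent researcher and software developer, to publish research on LessWrong and contribute software under open source licenses. The author also welcomes mentors and collaborators and is considering taking on mentees, but prefers to find suitable people at the level of specific projects.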

Published on August 15, 2025 3:46 AM GMT

This page is an index of the projects I am working on or contributing to. I plan to keep it up to date as I continue working on various things.

I am actively looking for funding to support my work on these projects, or for roles working on similar concepts. Ideally I would like funding as an independent researcher and software developer, publishing my research on LessWrong and contributing to software under open source licenses. I feel this is the best incentive structure given my focus on AI alignment and other public benefit projects. If you know of funding or roles that seem suitable, please contact me.

This page is mainly directed at people approving grants and those who would like to donate to support my work. I am also looking for mentors and collaborators, and I am potentially open to accepting mentees, though I hope to find people for those roles on a project-specific basis. Feel free to use this page to browse my projects, whoever you are!

The following are the projects I am focused on. For each one I provide a title, elevator pitch, hyperlinks, and my role and contributions.

My Self Study Journal (SSJ)

This is my project to stay motivated while improving my knowledge and skills and applying them to AI alignment and related projects. I hope it can serve as a public "progress report" justifying any funding I may receive, as well as inspiring others and serving as a point of contact for peer and mentor feedback and collaboration.

First journal post with links to subsequent posts

I am the sole contributor to my journal entries, but I welcome any feedback on the contents or format.

NDISP

The "n-dimensional interactive scatter plot" (NDISP) is a working title for my project to create interactive visualization tools and apply them to mechanistic interpretability work. Analysis of data distributions in high-dimensional space has many applications, so I believe the general core of the tools may benefit many areas.

TODO: Write main overview page for the project.

This project was inspired by the work of Mingwei Li, particularly Grand Tour and UMAP Tour, as well as my own thinking. I first extended the Grand Tour application as a student project for a data visualization class, then continued working on it as directed studies with George Tzanetakis, and then as an honours project with Teseo Schneider.
I plan to continue the project by developing and releasing standalone modules and a user-friendly web app, and by publishing papers describing the tool and the mechanistic interpretability results found using it.
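
As a rough illustration of the kind of view a grand-tour-style tool animates, the sketch below rotates a small high-dimensional point cloud in chosen coordinate planes and then projects it onto two axes for plotting. This is not the NDISP or Grand Tour code; the function names, the toy data, and the choice of simple plane rotations are all assumptions made for the example.

```python
import math
import random


def rotate_in_plane(point, i, j, angle):
    """Rotate one n-dimensional point by `angle` radians in the (i, j) coordinate plane."""
    rotated = list(point)
    c, s = math.cos(angle), math.sin(angle)
    rotated[i] = c * point[i] - s * point[j]
    rotated[j] = s * point[i] + c * point[j]
    return rotated


def project_2d(point):
    """Keep the first two coordinates, which is what a 2D scatter plot would draw."""
    return (point[0], point[1])


def tour_frame(points, angles):
    """Apply one rotation per (i, j) plane in `angles`, then project every point to 2D."""
    frame = []
    for p in points:
        for (i, j), angle in angles.items():
            p = rotate_in_plane(p, i, j, angle)
        frame.append(project_2d(p))
    return frame


# Hypothetical toy data: 5 points in 4 dimensions.
random.seed(0)
points = [[random.gauss(0, 1) for _ in range(4)] for _ in range(5)]

# Quarter turns in the (0, 2) and (1, 3) planes bring the otherwise hidden
# axes 2 and 3 into the plotted plane.
frame = tour_frame(points, {(0, 2): math.pi / 2, (1, 3): math.pi / 2})
```

Stepping the angles smoothly over time and redrawing `frame` at each step would give an animated view in which every axis eventually contributes to the picture, which is the general idea behind touring a high-dimensional data set.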

OISs

There is a paradigm missing from the discussion of ASI misalignment existential risk. The threat of ASI generalizes to the concept of "Outcome Influencing Systems" (OISs). My hope is that developing terminology and formalism around this model may mitigate the issues associated with existing terminology and aid in more productive discourse and interdisciplinary research applicable to ASI risk and social coordination issues.

LW Wikitag for Outcome Influencing Systems (OISs)

WIP document to become main LW post introducing OISs

I am currently the only contributor. I think the idea has merit, but I am still at the point where I am seeking either to find collaborators and spread the idea, or to find people who can point out enough flaws in the idea for it to be worth abandoning.

MAAT

"Map Articulating All Talking" (MAAT) is a concept for a social-media-like app that could make public discourse and the state of academic fields clearer and easier to understand. It would compress multiple versions of the same discussion into sets of idea nodes, avoiding confusion from differences in terminology and reducing the time wasted searching for progress among redundant discussions.

TODO: Write main overview page.

This is my original idea and I am interested in developing it. However, I would also be happy if a competent team with the right motivations wanted to poach the idea. If anyone is interested, please contact me.
