cs.AI updates on arXiv.org 10月09日 12:05
多模态模型预测fMRI响应:Seinfeld团队参赛成果
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

本文介绍了Seinfeld团队在Algonauts 2025挑战赛中,通过集成多模态表示的模型预测fMRI脑响应的方法。该模型结合了大型语言模型、视频编码器、音频模型和视觉语言模型,并采用堆叠回归提高预测性能,最终排名第10。

arXiv:2510.06235v1 Announce Type: cross Abstract: We present our submission to the Algonauts 2025 Challenge, where the goal is to predict fMRI brain responses to movie stimuli. Our approach integrates multimodal representations from large language models, video encoders, audio models, and vision-language models, combining both off-the-shelf and fine-tuned variants. To improve performance, we enhanced textual inputs with detailed transcripts and summaries, and we explored stimulus-tuning and fine-tuning strategies for language and vision models. Predictions from individual models were combined using stacked regression, yielding solid results. Our submission, under the team name Seinfeld, ranked 10th. We make all code and resources publicly available, contributing to ongoing efforts in developing multimodal encoding models for brain activity.

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

多模态模型 fMRI脑响应 Algonauts 2025挑战赛 Seinfeld团队 预测性能
相关文章