Trending
Articles related to "semantic alignment"
GranViT: A Fine-Grained Vision Model With Autoregressive Perception For MLLMs
cs.AI updates on arXiv.org 2025-10-27T06:26:55.000000Z
DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning
cs.AI updates on arXiv.org 2025-10-23T04:10:56.000000Z
CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment
cs.AI updates on arXiv.org 2025-10-22T04:23:06.000000Z
Region in Context: Text-condition Image editing with Human-like semantic reasoning
cs.AI updates on arXiv.org 2025-10-21T04:27:07.000000Z
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
cs.AI updates on arXiv.org 2025-10-17T04:19:18.000000Z
Bridging the Semantic Gap: Contrastive Rewards for Multilingual Text-to-SQL
cs.AI updates on arXiv.org 2025-10-17T04:11:45.000000Z
Generate Any Scene: Scene Graph Driven Data Synthesis for Visual Generation Training
cs.AI updates on arXiv.org 2025-10-13T04:14:57.000000Z
AgentDR Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents
cs.AI updates on arXiv.org 2025-10-08T04:12:42.000000Z
World-To-Image: Grounding Text-to-Image Generation with Agent-Driven World Knowledge
cs.AI updates on arXiv.org 2025-10-07T04:16:33.000000Z
A Flexible Method for Behaviorally Measuring Alignment Between Human and Artificial Intelligence Using Representational Similarity Analysis
cs.AI updates on arXiv.org 2025-10-03T04:19:03.000000Z
Object-AVEdit: An Object-level Audio-Visual Editing Model
cs.AI updates on arXiv.org 2025-10-02T04:16:56.000000Z
Seeing Through Words, Speaking Through Pixels: Deep Representational Alignment Between Vision and Language Models
cs.AI updates on arXiv.org 2025-09-26T04:22:00.000000Z
RFM-Editing: Rectified Flow Matching for Text-guided Audio Editing
cs.AI updates on arXiv.org 2025-09-18T04:48:25.000000Z
Beyond Artificial Misalignment: Detecting and Grounding Semantic-Coordinated Multimodal Manipulations
cs.AI updates on arXiv.org 2025-09-17T05:11:49.000000Z
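InfiGUI-G1, new work from a joint Zhejiang University and Hong Kong Polytechnic University team: overcoming the semantic alignment bottleneck in GUI agent grounding through adaptive exploration policy optimization
MIT Technology Review - This Week's Hot List 2025-08-25T15:10:53.000000Z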
MOVER: Multimodal Optimal Transport with Volume-based Embedding Regularization
cs.AI updates on arXiv.org 2025-08-19T04:01:29.000000Z
GenOM: Ontology Matching with Description Generation and Large Language Model
cs.AI updates on arXiv.org 2025-08-15T04:18:23.000000Z