Articles related to "Multilingual Benchmarks"
PISA-Bench: The PISA Index as a Multilingual and Multimodal Metric for the Evaluation of Vision-Language Models
cs.AI updates on arXiv.org
2025-10-30T04:14:56.000000Z
VLURes: Benchmarking VLM Visual and Linguistic Understanding in Low-Resource Languages
cs.AI updates on arXiv.org
2025-10-16T04:23:54.000000Z
Parallel Scaling Law: Unveiling Reasoning Generalization through A Cross-Linguistic Perspective
cs.AI updates on arXiv.org
2025-10-03T04:18:50.000000Z
LinguaSafe: A Comprehensive Multilingual Safety Benchmark for Large Language Models
cs.AI updates on arXiv.org
2025-08-19T04:01:32.000000Z
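2025.04.07 | Multilingual benchmarks reveal the limits of LLMs' cross-lingual generalization; new embodied-intelligence methods improve planning efficiency and adaptability.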
HuggingFace Daily AI Paper Digest (HuggingFace 每日AI论文速递)
2025-04-07T23:07:35.000000Z
MIRAGE-Bench: An Automatic Multilingual Benchmark for Retrieval-Augmented Generation Systems
MarkTechPost@AI
2024-10-26T10:21:23.000000Z