EdgeRunner 20B: A gpt-oss-20b Variant Optimized for Military Tasks

cs.AI updates on arXiv.org, October 31, 12:03

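This paper presents EdgeRunner 20B, a version of gpt-oss-20b optimized for military tasks. The model was trained on 1.6M records drawn from military documentation and websites and performs strongly on military test sets. The paper also analyzes hyperparameter settings, cost, and throughput.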

arXiv:2510.26550v1 Announce Type: new Abstract: We present EdgeRunner 20B, a fine-tuned version of gpt-oss-20b optimized for military tasks. EdgeRunner 20B was trained on 1.6M high-quality records curated from military documentation and websites. We also present four new test sets: (a) combat arms, (b) combat medic, (c) cyber operations, and (d) mil-bench-5k (general military knowledge). On these military test sets, EdgeRunner 20B matches or exceeds GPT-5 task performance with 95%+ statistical significance, except for the high reasoning setting on the combat medic test set and the low reasoning setting on the mil-bench-5k test set. Versus gpt-oss-20b, there is no statistically significant regression on general-purpose benchmarks like ARC-C, GPQA Diamond, GSM8k, IFEval, MMLU Pro, or TruthfulQA, except for GSM8k in the low reasoning setting. We also present analyses on hyperparameter settings, cost, and throughput. These findings show that small, locally-hosted models are ideal solutions for data-sensitive operations such as in the military domain, allowing for deployment in air-gapped edge devices.
