Nvidia Developer 前天 04:18
NVIDIA携手GPU MODE举办开发者内核大赛
index_new5.html
../../../zaker_core/zaker_tpl_static/wap/tpl_guoji1.html

 

NVIDIA与GPU MODE联合推出为期四部分的开发者内核大赛,旨在挑战开发者在NVIDIA Blackwell硬件上优化低级别内核的性能极限。比赛将分四个阶段发布内核问题,鼓励开发者创作接近光速的内核。参赛者可通过GPU MODE Discord获取支持和交流,并使用Popcorn CLI提交作品。大赛设有丰厚奖品,包括最新一代NVIDIA硬件,以及最终接近光速性能的大奖——Dell Pro Max with GB300。优胜者还将受邀参加GTC 2026大会。

🚀 **性能挑战与目标**:本次开发者内核大赛由NVIDIA与GPU MODE联合举办,核心目标是激励开发者挑战NVIDIA Blackwell硬件的GPU性能极限,通过优化低级别内核实现极致效率。整个赛事分为四个阶段,每个阶段都将发布一个具体的内核问题,鼓励参赛者提交接近“光速”的性能表现。

💡 **参赛方式与工具**:大赛面向个人开发者开放,鼓励开发者通过指定的页面进行注册(截止日期为2026年2月13日)。参赛者可以通过加入GPU MODE Discord社区获取最新通知、参与讨论并获得技术支持。代码提交将使用Popcorn CLI工具,并需遵循提供的设置和提交指南。

🎁 **丰厚奖项与认可**:大赛为每个内核问题设置了前三名优胜者,他们将获得最新的NVIDIA硬件奖励。此外,还将评选出一位总冠军,其内核性能最接近“光速”,将获得Dell Pro Max with GB300作为终极大奖。各问题的前两名优胜者还将受邀参加2026年3月在圣何塞举行的GTC大会特别颁奖典礼。

📊 **评分机制与资源**:每个内核问题将独立评分,基于参赛者提交的内核与基准参考内核的绝对运行时长和相对加速比进行评判。比赛将在NVIDIA GPU上进行基准测试。此外,GPU MODE YouTube频道提供丰富的学习资源,包括来自NVIDIA研究员和工程师的讲座,助力开发者提升技能。

Overview

​Join the Developer Kernel Hackathon, a four-part performance challenge hosted by NVIDIA in collaboration with GPU MODE.

​This event invites developers to push the limits of GPU performance and optimize low-level kernels for maximum efficiency on NVIDIA Blackwell hardware.

​Across four problems released throughout the hackathon, participants will compete to author kernels that approach the speed of light.

​Whether you’re a seasoned kernel developer or just eager to test your limits, this hackathon offers the chance to showcase your expertise and join a community of world-class developers.

Special thanks to our partners:
Sesterce, a high-performance GPU cloud platform, is contributing DGX B200 compute resources to support participants throughout the competition.

Dell is providing a Dell Pro Max with GB300 as the grand prize.


Schedule

​Each kernel problem will be released sequentially. Once one problem ends, another one begins.

    ​Kernel #1 - NVFP4 Batched GEMV

    ​Kernel #2 - NVFP4 GEMM

    ​Kernel #3 - NVFP4 Gated Dual GEMM

    ​Kernel #4 - NVFP4 Grouped GEMM


How to Participate

    ​Open to individuals only (no teams).

    ​Register through this page by February 13th, 2026 to be eligible to win prizes.

    ​Join the GPU MODE Discord, and head to the nvidia-competition channel for announcements, discussions, questions, and assistance.

    ​Submissions can be made using the Popcorn CLI. Follow the setup and submission instructions here:
    👉 https://github.com/gpu-mode/popcorn-cli


Prizes

​Each of the four kernel problems will have 3 winners who will receive latest generation NVIDIA hardware, with one grand prize winner across all problems for achieving performance closest to the speed of light.

​The top 2 winners of each problem will also be invited to a special awards ceremony at GTC in San Jose, March 2026.

​💥 Grand Prize:
1× Dell Pro Max with GB300 + GTC 2026 Pass – awarded to the participant whose submission (across any of the four problems) achieves performance closest to the speed of light.

​🏆 Prizes for Each Kernel Problem:
There are four kernel problems in total, and each will have its own set of winners:

​🥇 1st Place: NVIDIA DGX Spark + GTC 2026 Pass
🥈 2nd Place: NVIDIA RTX 5090 + GTC 2026 Pass
🥉 3rd Place: NVIDIA RTX 5080


Scoring and Judging

    ​There will be four independent problems, each scored separately.

    ​The top 3 submissions for each problem will win prizes.

    ​The grand prize will be awarded to the participant with the fastest kernel overall, measured by closeness to the published “speed of light” performance for that specific kernel problem.

    ​Submissions are benchmarked on NVIDIA GPUs using the GPU MODE infrastructure.

    ​Scoring is based on absolute runtime and relative speed-up against baseline reference kernels.


Additional Resources

​For learning resources, check out and subscribe to GPU MODE YouTube channel, where you can find weekly lectures from top voices in the ML community, including researchers and engineers from NVIDIA.


Terms & Conditions

​Participation in this hackathon is subject to the official terms and conditions.
🔗 View full Terms & Conditions

Fish AI Reader

Fish AI Reader

AI辅助创作,多种专业模板,深度分析,高质量内容生成。从观点提取到深度思考,FishAI为您提供全方位的创作支持。新版本引入自定义参数,让您的创作更加个性化和精准。

FishAI

FishAI

鱼阅,AI 时代的下一个智能信息助手,助你摆脱信息焦虑

联系邮箱 441953276@qq.com

相关标签

NVIDIA GPU MODE Kernel Hackathon GPU Performance Blackwell NVIDIA Blackwell AI Machine Learning GTC Developer Competition
相关文章