热点
关于我们
xx
xx
"
DINOv2
" 相关文章
CountFormer: A Transformer Framework for Learning Visual Repetition and Structure in Class-Agnostic Object Counting
cs.AI updates on arXiv.org
2025-10-29T04:23:37.000000Z
Into the Rabbit Hull: From Task-Relevant Concepts in DINO to Minkowski Geometry
cs.AI updates on arXiv.org
2025-10-13T04:12:45.000000Z
Into the Rabbit Hull: From Task-Relevant Concepts in DINO to Minkowski Geometry
cs.AI updates on arXiv.org
2025-10-13T04:12:45.000000Z
DinoAtten3D: Slice-Level Attention Aggregation of DinoV2 for 3D Brain MRI Anomaly Classification
cs.AI updates on arXiv.org
2025-09-17T05:07:59.000000Z
Is an Ultra Large Natural Image-Based Foundation Model Superior to a Retina-Specific Model for Detecting Ocular and Systemic Diseases?
cs.AI updates on arXiv.org
2025-09-05T04:45:51.000000Z
DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model
cs.AI updates on arXiv.org
2025-07-18T04:14:12.000000Z
Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder
cs.AI updates on arXiv.org
2025-07-15T04:24:26.000000Z
FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed
cs.AI updates on arXiv.org
2025-07-08T06:58:14.000000Z
This AI Paper Introduces a Novel DINOv2-LLaVA Framework: Advanced Vision-Language Model for Automated Radiology Report Generation
MarkTechPost@AI
2025-01-20T20:04:56.000000Z
Gaze-LLE: A New AI Model for Gaze Target Estimation Built on Top of a Frozen Visual Foundation Model
MarkTechPost@AI
2024-12-17T03:49:50.000000Z