DINOv2_Fishai

Articles related to "DINOv2"

CountFormer: A Transformer Framework for Learning Visual Repetition and Structure in Class-Agnostic Object Counting

cs.AI updates on arXiv.org 2025-10-29T04:23:37.000000Z

Into the Rabbit Hull: From Task-Relevant Concepts in DINO to Minkowski Geometry

cs.AI updates on arXiv.org 2025-10-13T04:12:45.000000Z

DinoAtten3D: Slice-Level Attention Aggregation of DinoV2 for 3D Brain MRI Anomaly Classification

cs.AI updates on arXiv.org 2025-09-17T05:07:59.000000Z

Is an Ultra Large Natural Image-Based Foundation Model Superior to a Retina-Specific Model for Detecting Ocular and Systemic Diseases?

cs.AI updates on arXiv.org 2025-09-05T04:45:51.000000Z

DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model

cs.AI updates on arXiv.org 2025-07-18T04:14:12.000000Z

Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder

cs.AI updates on arXiv.org 2025-07-15T04:24:26.000000Z

FastDINOv2: Frequency Based Curriculum Learning Improves Robustness and Training Speed

cs.AI updates on arXiv.org 2025-07-08T06:58:14.000000Z

This AI Paper Introduces a Novel DINOv2-LLaVA Framework: Advanced Vision-Language Model for Automated Radiology Report Generation

MarkTechPost@AI 2025-01-20T20:04:56.000000Z

Gaze-LLE: A New AI Model for Gaze Target Estimation Built on Top of a Frozen Visual Foundation Model

MarkTechPost@AI 2024-12-17T03:49:50.000000Z

Copyright © 2019 FISHAI. All Rights Reserved