Computer Vision – ECCV 2024
Springer International Publishing (Verlag)
978-3-031-72979-9 (ISBN)
The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29-October 4, 2024.
The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.
CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion.- SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers.- Learn to Memorize and to Forget: A Continual Learning Perspective of Dynamic SLAM.- Forecasting Future Videos from Novel Views via Disentangled 3D Scene Representation.- GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring.- Get Your Embedding Space in Order: Domain-Adaptive Regression for Forest Monitoring.- ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion.- CoDA: Instructive Chain-of-Domain Adaptation with Severity-Aware Visual Prompt Tuning.- Curved Diffusion: A Generative Model With Optical Geometry Control.- Mini-Splatting: Representing Scenes with a Constrained Number of Gaussians.- MeshSegmenter: Zero-Shot Mesh Segmentation via Texture Synthesis.- OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation.- Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures.- Conceptual Codebook Learning for Vision-Language Models.- LingoQA: Video Question Answering for Autonomous Driving.- AnimateMe: 4D Facial Expressions via Diffusion Models.- HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning.- LATTE3D: Large-scale Amortized Text-To-Enhanced3D Synthesis.- PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors.- Unveiling and Mitigating Memorization in Text-to-image Diffusion Models through Cross Attention.- iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning.- Context Diffusion: In-Context Aware Image Generation.- Pose Guided Fine-Grained Sign Language Video Generation.- RAP: Retrieval-Augmented Planner for Adaptive Procedure Planning in Instructional Videos.- Certifiably Robust Image Watermark.- Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery.- Online Zero-Shot Classification with CLIP.
Erscheinungsdatum | 30.10.2024 |
---|---|
Reihe/Serie | Lecture Notes in Computer Science |
Zusatzinfo | LXXXV, 481 p. 193 illus., 191 illus. in color. |
Verlagsort | Cham |
Sprache | englisch |
Maße | 155 x 235 mm |
Themenwelt | Informatik ► Grafik / Design ► Digitale Bildverarbeitung |
Mathematik / Informatik ► Informatik ► Netzwerke | |
Informatik ► Software Entwicklung ► User Interfaces (HCI) | |
Schlagworte | Artificial Intelligence • Computer Networks • Computer systems • computer vision • Education • Human-Computer Interaction (HCI) • Image Analysis • image coding • Image Processing • image reconstruction • Image Segmentation • learning • machine learning • Object recognition • pattern recognition • reconstruction • Signal Processing • Software engineering |
ISBN-10 | 3-031-72979-X / 303172979X |
ISBN-13 | 978-3-031-72979-9 / 9783031729799 |
Zustand | Neuware |
Haben Sie eine Frage zum Produkt? |
aus dem Bereich