Computer Vision – ECCV 2024

18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LXII

Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol (Herausgeber)

Buch | Softcover

LXXXV, 499 Seiten

2024
Springer International Publishing (Verlag)
978-3-031-73032-0 (ISBN)

Artikel merken

The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29-October 4, 2024.

The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.

Generating Physically Realistic and Directable Human Motions from Multi-Modal Inputs.- CoTracker: It is Better to Track Together.- SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models.- PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology.- Improving Adversarial Transferability via Model Alignment.- RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios.- ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation.- Embodied Understanding of Driving Scenarios.- Learning to Drive via Asymmetric Self-Play.- OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentation.- ViLA: Efficient Video-Language Alignment for Video Question Answering.- Factorizing Text-to-Video Generation by Explicit Image Conditioning.- MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices.- Open-Set Biometrics: Beyond Good Closed-Set Models.- UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening.- Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution.- Osmosis: RGBD Diffusion Prior for Underwater Image Restoration.- Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization.- Computing the Lipschitz constant needed for fast scene recovery from CASSI measurements.- DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields.- Flowed Time of Flight Radiance Fields.- 3D-GOI: 3D GAN Omni-Inversion for Multifaceted and Multi-object Editing.- Fast Registration of Photorealistic Avatars for VR Facial Animation.- CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings.- HiFi-Score: Fine-grained Image Description Evaluation with Hierarchical Parsing Graphs.- Image-to-Lidar Relational Distillation for Autonomous Driving Data.- Thinking Outside the BBox: Unconstrained Generative Object Compositing.

Erscheinungsdatum	01.11.2024
Reihe/Serie	Lecture Notes in Computer Science
Zusatzinfo	LXXXV, 499 p. 292 illus., 159 illus. in color.
Verlagsort	Cham
Sprache	englisch
Maße	155 x 235 mm
Themenwelt	Informatik ► Grafik / Design ► Digitale Bildverarbeitung
	Mathematik / Informatik ► Informatik ► Netzwerke
	Technik ► Elektrotechnik / Energietechnik
Schlagworte	Artificial Intelligence • Computer Networks • Computer systems • computer vision • Education • Human-Computer Interaction (HCI) • Image Analysis • image coding • Image Processing • image reconstruction • Image Segmentation • learning • machine learning • Object recognition • pattern recognition • reconstruction • Signal Processing • Software engineering
ISBN-10	3-031-73032-1 / 3031730321
ISBN-13	978-3-031-73032-0 / 9783031730320
Zustand	Neuware