Computer Vision – ECCV 2024

18th European Conference, Milan, Italy, September 29–October 4, 2024, Proceedings, Part LVII

Aleš Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol (Herausgeber)

Buch | Softcover

LXXXV, 499 Seiten

2024
Springer International Publishing (Verlag)
978-3-031-72997-3 (ISBN)

Artikel merken

The multi-volume set of LNCS books with volume numbers 15059 up to 15147 constitutes the refereed proceedings of the 18th European Conference on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29-October 4, 2024.

The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; reinforcement learning; object recognition; image classification; image processing; object detection; semantic segmentation; human pose estimation; 3d reconstruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; motion estimation.

ST-LLM: Large Language Models Are Effective Temporal Learners.- Exact Diffusion Inversion via Bidirectional Integration Approximation.- Textual Query-Driven Mask Transformer for Domain Generalized Segmentation.- EmoTalk3D: High-Fidelity Free-View Synthesis of Emotional 3D Talking Head.- Arbitrary-Scale Video Super-Resolution with Structural and Textural Priors.- Object-Centric Diffusion for Efficient Video Editing.- Single-Mask Inpainting for Voxel-based Neural Radiance Fields.- McGrids: Monte Carlo-Driven Adaptive Grids for Iso-Surface Extraction.- Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval.- Adapt2Reward: Adapting Video-Language Models to Generalizable Robotic Rewards via Failure Prompts.- Diffusion for Natural Image Matting.- Agglomerative Token Clustering.- CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection.- Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning.- ClusteringSDF: Self-Organized Neural Implicit Surfaces for 3D Decomposition.- NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition.- GIVT: Generative Infinite-Vocabulary Transformers.- Mismatch Quest: Visual and Textual Feedback for Image-Text Misalignment.- Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density.- Multi-Modal Video Dialog State Tracking in the Wild.- Factorized Diffusion: Perceptual Illusions by Noise Decomposition.- To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now.- Dissecting Dissonance: Benchmarking Large Multimodal Models Against Self-Contradictory Instructions.- StereoGlue: Joint Feature Matching and Robust Estimation.- Boosting Transferability in Vision-Language Attacks via Diversification along the Intersection Region of Adversarial Trajectory.- Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction.- Robust Zero-Shot Crowd Counting and Localization with Adaptive Resolution SAM.

Erscheinungsdatum	03.10.2024
Reihe/Serie	Lecture Notes in Computer Science
Zusatzinfo	LXXXV, 499 p. 197 illus., 188 illus. in color.
Verlagsort	Cham
Sprache	englisch
Maße	155 x 235 mm
Themenwelt	Informatik ► Grafik / Design ► Digitale Bildverarbeitung
	Mathematik / Informatik ► Informatik ► Netzwerke
	Technik ► Elektrotechnik / Energietechnik
Schlagworte	Artificial Intelligence • Computer Networks • Computer systems • computer vision • Education • Human-Computer Interaction (HCI) • Image Analysis • image coding • Image Processing • image reconstruction • Image Segmentation • learning • machine learning • Object recognition • pattern recognition • reconstruction • Signal Processing • Software engineering
ISBN-10	3-031-72997-8 / 3031729978
ISBN-13	978-3-031-72997-3 / 9783031729973
Zustand	Neuware