🔥 Paper Reading List (2024)

Paper Reading List in 2024 (Work-in-Progress)

S-VolSDF: Sparse Multi-View Stereo Regularization of Neural Implicit Surfaces
Link to Paper
GenS: Generalizable Neural Surface Reconstruction from Multi-View Images
Link to Paper
Digging into Uncertainty in Self-supervised Multi-view Stereo
Link to Paper
NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo
Link to Paper
Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation
Link to Paper
Learning Inverse Depth Regression for Multi-View Stereo with Correlation Cost Volume
Link to Paper
Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields, ICCV 2023
Link to Paper
Instant-NGP: Instant Neural Graphics Primitives with a Multiresolution Hash Encoding, SIGGRAPH 2022
Link to Paper
Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields, ICCV 2021
Link to Paper
Mip-nerf 360: Unbounded anti-aliased neural radiance fields, CVPR 2022
Link to Paper
GMFlow: Learning Optical Flow via Global Matching, CVPR 2022
Link to Paper
Iterative geometry encoding volume for stereo matching, CVPR 2023
Link to Paper
Parameterized Cost Volume for Stereo Matching, ICCV 2023
Link to Paper
High-frequency Stereo Matching Network, CVPR 2023
Link to Paper
Digging into uncertainty-based pseudo-label for robust stereo matching, TPAMI 2023
Link to Paper
Masked representation learning for domain generalized stereo matching, CVPR 2023
Link to Paper
Learning Depth Estimation for Transparent and Mirror Surfaces, ICCV 2023
Link to Paper
Efficient Multi-view Stereo by Iterative Dynamic Cost Volume, CVPR 2022
Link to Paper
MVS2D: Efficient Multi-view Stereo via Attention-Driven 2D Convolutions, CVPR 2022
Link to Paper
(DLNR) High-frequency stereo matching network, CVPR 2023
Link to Paper
(ACVNet) Attention Concatenation Volume for Accurate and Efficient Stereo Matching, CVPR 2022
Link to Paper
Learning in the Frequency Domain, CVPR 2020
Link to Paper
Fast Vision Transformers with HiLo Attention, NeurIPS 2022
Link to Paper
On the Over-Smoothing Problem of CNN Based Disparity Estimation, ICCV 2019
Link to Paper
(PDSNet) Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching, NeruIPS 2018
Link to Paper
(NP-CVP-MVSNet) Non-parametric depth distribution modeling based depth inference for multi-view stereo, CVPR 2022
Link to Paper
Itsa: An information-theoretic approach to automatic shortcut avoidance and domain generalization in stereo matching networks, CVPR 2022
Link to Paper
GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and Task-Oriented Feature, CVPR 2022
Link to Paper
Extreme Rotation Estimation using Dense Correlation Volumes, CVPR 2021
Link to Paper
The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs, 3DV 2022
Link to Paper
DynamicStereo: Consistent Dynamic Depth from Stereo Videos, CVPR 2023
Link to Paper
Deep Depth Completion of a Single RGB-D Image, CVPR 2018
Link to Paper
CompletionFormer: Depth Completion with Convolutions and Vision Transformers, CVPR 2023
Link to Paper
Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints, ICCV 2019
Link to Paper
Sparse-to-Dense: Depth Prediction from Sparse Depth Samples and a Single Image, ICRA 2018
Link to Paper
LiStereo: Generate Dense Depth Maps from LIDAR and Stereo Imagery, ICRA 2020
Link to Paper
High-precision Depth Estimation with the 3D LiDAR and Stereo Fusion, ICRA 2018
Link to Paper
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data, ArXiv
Link to Paper
Mosaic-SDF for 3D Generative Models, ArXiv
Link to Paper
SparsePose: Sparse-View Camera Pose Regression and Refinement, CVPR 2023
Link to Paper
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation, CVPR 2023
Link to Paper
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021
Link to Paper
DistractFlow: Improving Optical Flow Estimation via Realistic Distractions and Pseudo-Labeling, CVPR 2023
Link to Paper
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results, NeurIPS 2018
Link to Paper
(Flow-Supervisor) Semi-Supervised Learning of Optical Flow by Flow Supervisor, ECCV 2022
Link to Paper
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth, ArXiv 2023
Link to Paper
GraftNet: Towards Domain Generalized Stereo Matching With a Broad-Spectrum and Task-Oriented Feature, CVPR 2022
Link to Paper
(CoodConv) An intriguing failing of convolutional neural networks and the CoordConv solution, NeurIPS 2018
Link to Paper
(CODD) Temporally Consistent Online Depth Estimation in Dynamic Scenes, WACV 2023
Link to Paper
SPECTRAL NORMALIZATION FOR GENERATIVE ADVERSARIAL NETWORKS, ICLR 2018
Link to Paper
NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction, NeurIPS 2021
Link to Paper
(Semantic-NeRF) In-Place Scene Labelling and Understanding with Implicit Scene Representation, ICCV 2021, Oral
Link to Paper
VR-GS: A Physical Dynamics-Aware Interactive Gaussian Splatting System in Virtual Reality, ArXiv 2024
Link to Paper
Gaussian Splatting SLAM, CVPR 2024
Link to Paper
From Coarse to Fine: Robust Hierarchical Localization at Large Scale, CVPR 2019
Link to Paper
VINDLU: A Recipe for Effective Video-and-Language Pretraining, CVPR 2023
Link to Paper
Tag2Text: Guiding Vision-Language Model via Image Tagging, ICLR 2024
Link to Paper
(RAM) Recognize Anything: A Strong Image Tagging Model, CVPR Workshop 2024
Link to Paper
pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction, CVPR 24, Oral
Link to Paper
Unifying Flow, Stereo and Depth Estimation, TPAMI 2023
Link to Paper
Language-Assisted 3D Feature Learning for Semantic Scene Understanding, AAAI 2023
Link to Paper
(Good Blog) - What are Diffusion Models?
Link to Blog
(latent-diffusion) High-Resolution Image Synthesis with Latent Diffusion Models, CVPR 2022
Link to Paper
ELFNet: Evidential Local-global Fusion for Stereo Matching (ICCV 2023)
Link to Paper
Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers, ICCV 2021 Oral
Link to Paper
Check its Github repo for good attention visualization.
Low-rank bottleneck in multi-head attention models, PMLR 2020
Link to Paper
High-frequency Stereo Matching Network, CVPR 2023
Link to Paper
Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching, CVPR 2024
Link to Paper
MoCha-Stereo: Motif Channel Attention Network for Stereo Matching, CVPR 2024
Link to Paper
COTRACKER3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos
Link to Paper
Understanding Masked Image Modeling via Learning Occlusion Invariant Feature, CVPR 2023
Link to Paper
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions, ECCV 2024
Link to Paper
3D VISION-LANGUAGE GAUSSIAN SPLATTING
Link to Paper
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt
Link to Paper
Towards Foundation Models for 3D Vision: How Close Are We?
Link to Paper
FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models, CVPR 2024
Link to Paper
(OSEDiff) One-Step Effective Diffusion Network for Real-World Image Super-Resolution, NeurIPS 2024
Link to Paper, Link to Code
DiffRAW: Leveraging Diffusion Model to Generate DSLR-Comparable Perceptual Quality sRGB from Smartphone RAW Images, AAAI 2024
Link to Paper N/A
Patch-level Representation Learning for Self-supervised Vision Transformers, CVPR 2022
Link to Paper
Self-Supervised Representation Learning from Flow Equivariance, ICCV 2021
Link to Paper
3D Common Corruptions and Data Augmentation, CVPR 2022, Oral
Link to Paper
NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video, CVPR 2021, Oral
Link to Paper
Passthrough+: Real-time Stereoscopic View Synthesis for Mobile Mixed Reality, Proceedings of the ACM on Computer Graphics and Interactive Techniques, 2020
Link to Paper
Crafting Better Contrastive Views for Siamese Representation Learning, CVPR 2022, Oral
Link to Paper
Masked Autoencoders Are Scalable Vision Learners
Link to Paper
How to Understand Masked Autoencoders
Link to Paper
Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation, CVPR 2022
Link to Paper
Deep Homography for Efficient Stereo Image Compression, CVPR 2021
Link to Paper
Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers, ICCV 2021
Link to Paper
DeepMVS: Learning Multi-View Stereopsis, CVPR 2018
Link to Paper
Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera, CVPR 2019
Link to Paper
Depth Map Prediction from a Single Image using a Multi-Scale Deep Network, NeurIPS 2014
Link to Paper
View-Consistent 3D Editing with Gaussian Splatting, ECCV 2024
Link to Paper
Text2Scene: Text-Driven Indoor Scene Stylization with Part-Aware Details, CVPR 2023
Link to Paper
Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts, ECCV 2024
Link to Paper
HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction, BMVC 2024
Link to Paper
pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction, CVPR 2024 Oral
Link to Paper
DepthSplat: Connecting Gaussian Splatting and Depth, arXiv 2024
Link to Paper
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM, CVPR 2024
Link to Paper

Written on November 13, 2024