🔥 Paper Reading List (2024)
Paper Reading List in 2024 (Work-in-Progress)
-
S-VolSDF: Sparse Multi-View Stereo Regularization of Neural Implicit Surfaces
Link to Paper -
GenS: Generalizable Neural Surface Reconstruction from Multi-View Images
Link to Paper -
Digging into Uncertainty in Self-supervised Multi-view Stereo
Link to Paper -
NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo
Link to Paper -
Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation
Link to Paper -
Learning Inverse Depth Regression for Multi-View Stereo with Correlation Cost Volume
Link to Paper -
Zip-NeRF: Anti-Aliased Grid-Based Neural Radiance Fields, ICCV 2023
Link to Paper -
Instant-NGP: Instant Neural Graphics Primitives with a Multiresolution Hash Encoding, SIGGRAPH 2022
Link to Paper -
Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields, ICCV 2021
Link to Paper -
Mip-nerf 360: Unbounded anti-aliased neural radiance fields, CVPR 2022
Link to Paper -
GMFlow: Learning Optical Flow via Global Matching, CVPR 2022
Link to Paper -
Iterative geometry encoding volume for stereo matching, CVPR 2023
Link to Paper -
Parameterized Cost Volume for Stereo Matching, ICCV 2023
Link to Paper -
High-frequency Stereo Matching Network, CVPR 2023
Link to Paper -
Digging into uncertainty-based pseudo-label for robust stereo matching, TPAMI 2023
Link to Paper -
Masked representation learning for domain generalized stereo matching, CVPR 2023
Link to Paper -
Learning Depth Estimation for Transparent and Mirror Surfaces, ICCV 2023
Link to Paper -
Efficient Multi-view Stereo by Iterative Dynamic Cost Volume, CVPR 2022
Link to Paper -
MVS2D: Efficient Multi-view Stereo via Attention-Driven 2D Convolutions, CVPR 2022
Link to Paper -
(DLNR) High-frequency stereo matching network, CVPR 2023
Link to Paper -
(ACVNet) Attention Concatenation Volume for Accurate and Efficient Stereo Matching, CVPR 2022
Link to Paper -
Learning in the Frequency Domain, CVPR 2020
Link to Paper -
Fast Vision Transformers with HiLo Attention, NeurIPS 2022
Link to Paper -
On the Over-Smoothing Problem of CNN Based Disparity Estimation, ICCV 2019
Link to Paper -
(PDSNet) Practical Deep Stereo (PDS): Toward applications-friendly deep stereo matching, NeruIPS 2018
Link to Paper -
(NP-CVP-MVSNet) Non-parametric depth distribution modeling based depth inference for multi-view stereo, CVPR 2022
Link to Paper -
Itsa: An information-theoretic approach to automatic shortcut avoidance and domain generalization in stereo matching networks, CVPR 2022
Link to Paper -
GraftNet: Towards Domain Generalized Stereo Matching with a Broad-Spectrum and Task-Oriented Feature, CVPR 2022
Link to Paper -
Extreme Rotation Estimation using Dense Correlation Volumes, CVPR 2021
Link to Paper -
The 8-Point Algorithm as an Inductive Bias for Relative Pose Prediction by ViTs, 3DV 2022
Link to Paper -
DynamicStereo: Consistent Dynamic Depth from Stereo Videos, CVPR 2023
Link to Paper -
Deep Depth Completion of a Single RGB-D Image, CVPR 2018
Link to Paper -
CompletionFormer: Depth Completion with Convolutions and Vision Transformers, CVPR 2023
Link to Paper -
Depth Completion from Sparse LiDAR Data with Depth-Normal Constraints, ICCV 2019
Link to Paper -
Sparse-to-Dense: Depth Prediction from Sparse Depth Samples and a Single Image, ICRA 2018
Link to Paper -
LiStereo: Generate Dense Depth Maps from LIDAR and Stereo Imagery, ICRA 2020
Link to Paper -
High-precision Depth Estimation with the 3D LiDAR and Stereo Fusion, ICRA 2018
Link to Paper -
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data, ArXiv
Link to Paper -
Mosaic-SDF for 3D Generative Models, ArXiv
Link to Paper -
SparsePose: Sparse-View Camera Pose Regression and Refinement, CVPR 2023
Link to Paper -
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation, CVPR 2023
Link to Paper -
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning, CVPR 2021
Link to Paper -
DistractFlow: Improving Optical Flow Estimation via Realistic Distractions and Pseudo-Labeling, CVPR 2023
Link to Paper -
Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results, NeurIPS 2018
Link to Paper -
(Flow-Supervisor) Semi-Supervised Learning of Optical Flow by Flow Supervisor, ECCV 2022
Link to Paper -
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric Depth, ArXiv 2023
Link to Paper -
GraftNet: Towards Domain Generalized Stereo Matching With a Broad-Spectrum and Task-Oriented Feature, CVPR 2022
Link to Paper -
(CoodConv) An intriguing failing of convolutional neural networks and the CoordConv solution, NeurIPS 2018
Link to Paper -
(CODD) Temporally Consistent Online Depth Estimation in Dynamic Scenes, WACV 2023
Link to Paper -
SPECTRAL NORMALIZATION FOR GENERATIVE ADVERSARIAL NETWORKS, ICLR 2018
Link to Paper -
NeuS: Learning Neural Implicit Surfaces by Volume Rendering for Multi-view Reconstruction, NeurIPS 2021
Link to Paper -
(Semantic-NeRF) In-Place Scene Labelling and Understanding with Implicit Scene Representation, ICCV 2021, Oral
Link to Paper -
VR-GS: A Physical Dynamics-Aware Interactive Gaussian Splatting System in Virtual Reality, ArXiv 2024
Link to Paper -
Gaussian Splatting SLAM, CVPR 2024
Link to Paper -
From Coarse to Fine: Robust Hierarchical Localization at Large Scale, CVPR 2019
Link to Paper -
VINDLU: A Recipe for Effective Video-and-Language Pretraining, CVPR 2023
Link to Paper -
Tag2Text: Guiding Vision-Language Model via Image Tagging, ICLR 2024
Link to Paper -
(RAM) Recognize Anything: A Strong Image Tagging Model, CVPR Workshop 2024
Link to Paper -
pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction, CVPR 24, Oral
Link to Paper -
Unifying Flow, Stereo and Depth Estimation, TPAMI 2023
Link to Paper -
Language-Assisted 3D Feature Learning for Semantic Scene Understanding, AAAI 2023
Link to Paper -
(Good Blog) - What are Diffusion Models?
Link to Blog -
(latent-diffusion) High-Resolution Image Synthesis with Latent Diffusion Models, CVPR 2022
Link to Paper -
ELFNet: Evidential Local-global Fusion for Stereo Matching (ICCV 2023)
Link to Paper -
Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers, ICCV 2021 Oral
Link to Paper
Check its Github repo for good attention visualization. -
Low-rank bottleneck in multi-head attention models, PMLR 2020
Link to Paper -
High-frequency Stereo Matching Network, CVPR 2023
Link to Paper -
Selective-Stereo: Adaptive Frequency Information Selection for Stereo Matching, CVPR 2024
Link to Paper -
MoCha-Stereo: Motif Channel Attention Network for Stereo Matching, CVPR 2024
Link to Paper -
COTRACKER3: Simpler and Better Point Tracking by Pseudo-Labelling Real Videos
Link to Paper -
Understanding Masked Image Modeling via Learning Occlusion Invariant Feature, CVPR 2023
Link to Paper -
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions, ECCV 2024
Link to Paper -
3D VISION-LANGUAGE GAUSSIAN SPLATTING
Link to Paper -
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt
Link to Paper -
Towards Foundation Models for 3D Vision: How Close Are We?
Link to Paper -
FlowDiffuser: Advancing Optical Flow Estimation with Diffusion Models, CVPR 2024
Link to Paper -
(OSEDiff) One-Step Effective Diffusion Network for Real-World Image Super-Resolution, NeurIPS 2024
Link to Paper, Link to Code - DiffRAW: Leveraging Diffusion Model to Generate DSLR-Comparable Perceptual Quality sRGB from Smartphone RAW Images, AAAI 2024
Link to Paper N/A -
Patch-level Representation Learning for Self-supervised Vision Transformers, CVPR 2022
Link to Paper -
Self-Supervised Representation Learning from Flow Equivariance, ICCV 2021
Link to Paper -
3D Common Corruptions and Data Augmentation, CVPR 2022, Oral
Link to Paper -
NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video, CVPR 2021, Oral
Link to Paper -
Passthrough+: Real-time Stereoscopic View Synthesis for Mobile Mixed Reality, Proceedings of the ACM on Computer Graphics and Interactive Techniques, 2020
Link to Paper -
Crafting Better Contrastive Views for Siamese Representation Learning, CVPR 2022, Oral
Link to Paper -
Masked Autoencoders Are Scalable Vision Learners
Link to Paper -
How to Understand Masked Autoencoders
Link to Paper -
Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation, CVPR 2022
Link to Paper -
Deep Homography for Efficient Stereo Image Compression, CVPR 2021
Link to Paper -
Revisiting Stereo Depth Estimation From a Sequence-to-Sequence Perspective with Transformers, ICCV 2021
Link to Paper -
DeepMVS: Learning Multi-View Stereopsis, CVPR 2018
Link to Paper -
Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera, CVPR 2019
Link to Paper -
Depth Map Prediction from a Single Image using a Multi-Scale Deep Network, NeurIPS 2014
Link to Paper -
View-Consistent 3D Editing with Gaussian Splatting, ECCV 2024
Link to Paper -
Text2Scene: Text-Driven Indoor Scene Stylization with Part-Aware Details, CVPR 2023
Link to Paper -
Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts, ECCV 2024
Link to Paper -
HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction, BMVC 2024
Link to Paper -
pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction, CVPR 2024 Oral
Link to Paper -
DepthSplat: Connecting Gaussian Splatting and Depth, arXiv 2024
Link to Paper - SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM, CVPR 2024
Link to Paper