Image-based rendering: Difference between revisions

From David's Wiki
Line 24: Line 24:


===Multi-plane Image (MPI)===
===Multi-plane Image (MPI)===
Multiple perpendicular planes each with some transparency which are composited together.
* [https://arxiv.org/abs/1805.09817 Stereo Magnification (SIGGRAPH 2018)]
* [https://arxiv.org/abs/1805.09817 Stereo Magnification (SIGGRAPH 2018)]
* [https://openaccess.thecvf.com/content_CVPR_2019/html/Flynn_DeepView_View_Synthesis_With_Learned_Gradient_Descent_CVPR_2019_paper.html DeepView (CVPR 2019)]
* [https://openaccess.thecvf.com/content_CVPR_2019/html/Flynn_DeepView_View_Synthesis_With_Learned_Gradient_Descent_CVPR_2019_paper.html DeepView (CVPR 2019)]


===Layered Depth Image (LDI)===
===Layered Depth Image (LDI)===
Multiple meshes each with some transparency. Unlike MPI, these meshes are not necessarily planes but may not correspond directly to scene objects.
* [https://facebookresearch.github.io/one_shot_3d_photography/ One-shot 3D photography]
* [https://facebookresearch.github.io/one_shot_3d_photography/ One-shot 3D photography]
* Casual 3D Photography
* Casual 3D Photography


===Multi-sphere Image (MSI)===
===Multi-sphere Image (MSI)===
Similar to MPI but using spheres.
* [http://visual.cs.brown.edu/projects/matryodshka-webpage/ Matryodshka (ECCV 2020)] - Renders 6-dof video from ODS videos.
* [http://visual.cs.brown.edu/projects/matryodshka-webpage/ Matryodshka (ECCV 2020)] - Renders 6-dof video from ODS videos.



Revision as of 15:32, 1 September 2021

Image-based rendering focuses on rendering scenes from existing captured or rasterized images, typically from a new viewpoint.
Recent research allows adding new objects, performing relighting, and other AR effects.

Implicit Representations

Light Fields

Lightfields aim to capture the radiance of light rays within the scene.

Light Field Networks

This is an implicit representation similar to NeRF.
However, you directly predict colors from light rays instead of performing volume rendering.

NeRF

NeRF preprocesses unstructured light fields into a neural network (MLP) representation which predicts radiance at different points during volume rendering.

Resources

Layered Representations

Some notable people here are Noah Snavely and Richard Tucker.
Representations here vary from implicit (MPI, MSI) to explicit (LDI, Point Clouds).

Multi-plane Image (MPI)

Multiple perpendicular planes each with some transparency which are composited together.

Layered Depth Image (LDI)

Multiple meshes each with some transparency. Unlike MPI, these meshes are not necessarily planes but may not correspond directly to scene objects.

Multi-sphere Image (MSI)

Similar to MPI but using spheres.

Point Clouds

Classical Reconstruction

Reconstruction aims to recreate the 3D scene from a set of input images.
Techniques include structure from motion, multi-view stereo.
This type of reconstruction is also studied in the field of photogrammetry.