Neural Fields: Difference between revisions
No edit summary |
|||
(3 intermediate revisions by the same user not shown) | |||
Line 25: | Line 25: | ||
;MLP | ;MLP | ||
* SIREN | |||
** Proposes using sine activation to remove the spectrial bias instead of positional encoding. | |||
* [https://arxiv.org/pdf/2104.09125.pdf SAPE] | |||
** Progressively exposes additional frequencies during training based on time and space. | |||
;CNN + MLP | ;CNN + MLP | ||
Line 39: | Line 43: | ||
* Neural Sparse Voxel Fields | * Neural Sparse Voxel Fields | ||
* KiloNeRF | * KiloNeRF | ||
** Uses thousands of small voxels, each modelled by a single NeRF. Optimized using a teacher network. | |||
;Point Clouds | ;Point Clouds | ||
Line 45: | Line 50: | ||
====Feature Grids==== | ====Feature Grids==== | ||
* Plenoctrees | |||
** Convert a NeRF into a octree of spherical harmonics for fast rendering. | |||
* Plenoxels | |||
** Directly use a voxel grid of spherical harmonics to fast optimization and rendering. | |||
https://nv-tlabs.github.io/vqad/ | * Hash (Instant-NGP) | ||
** Use a hash function map voxels to features in a codebook. Disconnects grid resolution from codebook size. | |||
* [https://nv-tlabs.github.io/vqad/ Variable Bitrate Neural Fields] | |||
** Use vector quantization to compress feature grids. However, need to store an grid of indices. | |||
;Factorized Feature Grids | ;Factorized Feature Grids | ||
* TensoRF | * TensoRF | ||
===Dynamic Content=== | |||
====Deformation==== | |||
The idea here is to have an MLP which models the deformation of a canonical frame to the target frame. | |||
* [https://nerfies.github.io/ Nerfies: Deformable Neural Radiance Fields] | |||
** Windowed positional encoding | |||
* [https://www.albertpumarola.com/research/D-NeRF/index.html D-NeRF (CVPR 2020)] | |||
* [https://hypernerf.github.io/ HyperNeRF (SIGGRAPH Asia 2021)] | |||
** Allows the deformation network to output 2 additional feature values which ''slice'' the canonical NeRF. | |||
====Latent code==== | |||
* [https://neural-3d-video.github.io/ Neural 3D Video Synthesis (CVPR 2022)] | |||
** Use a latent code for each time step. | |||
* [https://arxiv.org/abs/2210.15947 NeRFPlayer] | |||
** Use a time-dependent sliding window along the feature channels in a feature grid. | |||
====Time-axis==== | |||
* [https://video-nerf.github.io/ Space-time Neural Irradiance Fields for Free-Viewpoint Video (Video-NeRF)] | |||
** Adds a bunch of regularization which allows directly inputting time to the MLP. | |||
* [https://aoliao12138.github.io/FPO/ Fourier PlenOctrees] | |||
** Apply DFT to spherical harmonics in PlenOctrees. | |||
* [https://arxiv.org/pdf/2202.06088.pdf NeuVV] | |||
** Hyperspherical Harmonics | |||
* [https://arxiv.org/abs/2301.11280 Text-To-4D Dynamic Scene Generation (2023)] | |||
** Extends the tri-plane feature grid to a six-plane feature grid ({x, y, z, t} choose 2). | |||
====Segmentation==== | |||
Segment static background and objects from dynamic background and objects | |||
* [https://arxiv.org/pdf/2104.14786.pdf Editable Free-Viewpoint Video using a Layered Neural Representation] | |||
** Create a scene by compositing one NeRF per actor. | |||
* [https://arxiv.org/abs/2303.03361 NeRFlets (2023)] | |||
* [https://arxiv.org/abs/2303.14536 SUDS: Scalable Urban Dynamic Scenes (2023)] | |||
===Generalization=== | ===Generalization=== |