Neural Fields: Difference between revisions

No edit summary
 
(3 intermediate revisions by the same user not shown)
Line 25: Line 25:


;MLP
;MLP
* SIREN
** Proposes using sine activation to remove the spectrial bias instead of positional encoding.
* [https://arxiv.org/pdf/2104.09125.pdf SAPE]
** Progressively exposes additional frequencies during training based on time and space.


;CNN + MLP
;CNN + MLP
Line 39: Line 43:
* Neural Sparse Voxel Fields
* Neural Sparse Voxel Fields
* KiloNeRF
* KiloNeRF
** Uses thousands of small voxels, each modelled by a single NeRF. Optimized using a teacher network.


;Point Clouds
;Point Clouds
Line 45: Line 50:


====Feature Grids====
====Feature Grids====
;Plenoxels
* Plenoctrees
;Plenoctrees
** Convert a NeRF into a octree of spherical harmonics for fast rendering.
;Hash (Instant-NGP)
* Plenoxels
;Vector Quantization
** Directly use a voxel grid of spherical harmonics to fast optimization and rendering.
https://nv-tlabs.github.io/vqad/
* Hash (Instant-NGP)
** Use a hash function map voxels to features in a codebook. Disconnects grid resolution from codebook size.
* [https://nv-tlabs.github.io/vqad/ Variable Bitrate Neural Fields]
** Use vector quantization to compress feature grids. However, need to store an grid of indices.


;Factorized Feature Grids
;Factorized Feature Grids
* TensoRF
* TensoRF
===Dynamic Content===
====Deformation====
The idea here is to have an MLP which models the deformation of a canonical frame to the target frame.
* [https://nerfies.github.io/ Nerfies: Deformable Neural Radiance Fields]
** Windowed positional encoding
* [https://www.albertpumarola.com/research/D-NeRF/index.html D-NeRF (CVPR 2020)]
* [https://hypernerf.github.io/ HyperNeRF (SIGGRAPH Asia 2021)]
** Allows the deformation network to output 2 additional feature values which ''slice'' the canonical NeRF.
====Latent code====
* [https://neural-3d-video.github.io/ Neural 3D Video Synthesis (CVPR 2022)]
** Use a latent code for each time step.
* [https://arxiv.org/abs/2210.15947 NeRFPlayer]
** Use a time-dependent sliding window along the feature channels in a feature grid.
====Time-axis====
* [https://video-nerf.github.io/ Space-time Neural Irradiance Fields for Free-Viewpoint Video (Video-NeRF)]
** Adds a bunch of regularization which allows directly inputting time to the MLP.
* [https://aoliao12138.github.io/FPO/ Fourier PlenOctrees]
** Apply DFT to spherical harmonics in PlenOctrees.
* [https://arxiv.org/pdf/2202.06088.pdf NeuVV]
** Hyperspherical Harmonics
* [https://arxiv.org/abs/2301.11280 Text-To-4D Dynamic Scene Generation (2023)]
** Extends the tri-plane feature grid to a six-plane feature grid ({x, y, z, t} choose 2).
====Segmentation====
Segment static background and objects from dynamic background and objects
* [https://arxiv.org/pdf/2104.14786.pdf Editable Free-Viewpoint Video using a Layered Neural Representation]
** Create a scene by compositing one NeRF per actor.
* [https://arxiv.org/abs/2303.03361 NeRFlets (2023)]
* [https://arxiv.org/abs/2303.14536 SUDS: Scalable Urban Dynamic Scenes (2023)]


===Generalization===
===Generalization===