🤖 Artificial Intelligence OpenAlex

NeRF

📅 December 17, 2021 👤 Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik et al. 📖 Communications of the ACM 📊 5,764 citations

🤖 Plain-English Summary

We present a method that achieves advanced results for synthesizing novel views of complex scenes by optimizing an underlying continuous volumetric scene function using a sparse set of input views. Because volume rendering is naturally differentiable, the only input required to optimize our representation is a set of images with known camera poses.

🔑 Key Findings

  • Our algorithm represents a scene using a fully connected (nonconvolutional) deep network, whose input is a single continuous 5D coordinate (spatial location ( x , y , z ) and viewing direction ( θ, ϕ )) and whose output is the volume density and view-dependent emitted radiance at that spatial location.
  • We synthesize views by querying 5D coordinates along camera rays and use classic volume rendering techniques to project the output colors and densities into an image.
  • Because volume rendering is naturally differentiable, the only input required to optimize our representation is a set of images with known camera poses.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Dec 17, 2021
Journal Communications of the ACM
Authors Ben Mildenhall, Pratul P. Srinivasan, Matthew Tancik, Jonathan T. Barron, Ravi Ramamoorthi
DOI 10.1145/3503250
Citations 5,764
Source OpenAlex

More 🤖 Artificial Intelligence Research