CAREER: Toward Video2Sim: Turning Real World Videos into Simulations
项目来源
项目主持人
项目受资助机构
立项年度
立项时间
项目编号
研究期限
项目级别
受资助金额
学科
学科代码
基金类别
关键词
参与者
参与机构
1.VIEW SYNTHESIS WITH SCULPTED NEURAL POINTS
- 关键词:
- Rendering (computer graphics);Implicit representation;Neural representations;New approaches;Novel techniques;Parameterized;Point-based methods;Point-based rendering;Point-clouds;View synthesis;Visual qualities
- Zuo, Yiming;Deng, Jia
- 《11th International Conference on Learning Representations, ICLR 2023》
- 2023年
- May 1, 2023 - May 5, 2023
- Kigali, Rwanda
- 会议
We address the task of view synthesis, generating novel views of a scene given a set of images as input. In many recent works such as NeRF (Mildenhall et al., 2020), the scene geometry is parameterized using neural implicit representations (i.e., MLPs). Implicit neural representations have achieved impressive visual quality but have drawbacks in computational efficiency. In this work, we propose a new approach that performs view synthesis using point clouds. It is the first point-based method that achieves better visual quality than NeRF while being 100× faster in rendering speed. Our approach builds on existing works on differentiable point-based rendering but introduces a novel technique we call "Sculpted Neural Points (SNP)", which significantly improves the robustness to errors and holes in the reconstructed point cloud. We further propose to use view-dependent point features based on spherical harmonics to capture non-Lambertian surfaces, and new designs in the point-based rendering pipeline that further boost the performance. Finally, we show that our system supports fine-grained scene editing. Code is available at https://github.com/princeton-vl/SNP. © 2023 11th International Conference on Learning Representations, ICLR 2023. All rights reserved.
...2.Infinite Photorealistic Worlds Using Procedural Generation
- 关键词:
- Optical flows;3D scenes;Dataset and evaluation;External sources;Natural phenomenon;Natural world;Object and scenes;Objects detection;Photo-realistic;Semantic segmentation;Training data
- Raistrick, Alexander;Lipson, Lahav;Ma, Zeyu;Mei, Lingjie;Wang, Mingzhe;Zuo, Yiming;Kayan, Karhan;Wen, Hongyu;Han, Beining;Wang, Yihan;Newell, Alejandro;Law, Hei;Goyal, Ankit;Yang, Kaiyu;Deng, Jia
- 《2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023》
- 2023年
- June 18, 2023 - June 22, 2023
- Vancouver, BC, Canada
- 会议
We introduce Infinigen, a procedural generator of photorealistic 3D scenes of the natural world. Infinigen is entirely procedural: every asset, from shape to texture, is generated from scratch via randomized mathematical rules, using no external source and allowing infinite variation and composition. Infinigen offers broad coverage of objects and scenes in the natural world including plants, animals, terrains, and natural phenomena such as fire, cloud, rain, and snow. Infinigen can be used to generate unlimited, diverse training data for a wide range of computer vision tasks including object detection, semantic segmentation, optical flow, and 3D reconstruction. We expect Infinigen to be a useful resource for computer vision research and beyond. Please visit infinigen.org for videos, code and pre-generated data. © 2023 IEEE.
...3.Coupled Iterative Refinement for 6D Multi-Object Pose Estimation
- 关键词:
- ;
- Lipson, Lahav;Teed, Zachary;Goyal, Ankit;Deng, Jia
- 《2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022》
- 2022年
- June 19, 2022 - June 24, 2022
- New Orleans, LA, United states
- 会议
We address the task of 6D multi-object pose: given a set of known 3D objects and an RGB or RGB-D input image, we detect and estimate the 6D pose of each object. We propose a new approach to 6D object pose estimation which consists of an end-to-end differentiable architecture that makes use of geometric knowledge. Our approach iteratively refines both pose and correspondence in a tightly coupled manner, allowing us to dynamically remove outliers to improve accuracy. We use a novel differentiable layer to perform pose refinement by solving an optimization problem we refer to as Bidirectional Depth-Augmented Perspective-N-Point (BD-PnP). Our method achieves state-of-the-art accuracy on standard 6D Object Pose benchmarks. Code is available at https://github.com/princeton-vl/Coupled-Iterative-Refinement. © 2022 IEEE.
...4.Multiview Stereo with Cascaded Epipolar RAFT
- 关键词:
- 3D modeling ; Benchmarking ; Computer vision ; Three dimensional computer graphics;3;D vision ; 3D models ; 3d;modeling ; Depthmap ; Epipolar ; Multi;view stereo ; Multi;views ; Multiresolution fusion ; New approaches ; Point;clouds
- MaZeyu;TeedZachary;DengJia
- 《17th European Conference on Computer Vision, ECCV 2022》
- 2022年
- October 23, 2022 - October 27, 2022
- Tel Aviv, Israel
- 会议
We address multiview stereo (MVS), an important 3D vision task that reconstructs a 3D model such as a dense point cloud from multiple calibrated images. We propose CER-MVS (Cascaded Epipolar RAFT Multiview Stereo), a new approach based on the RAFT (Recurrent All-Pairs Field Transforms) architecture developed for optical flow. CER-MVS introduces five new changes to RAFT: epipolar cost volumes, cost volume cascading, multiview fusion of cost volumes, dynamic supervision, and multiresolution fusion of depth maps. CER-MVS is significantly different from prior work in multiview stereo. Unlike prior work, which operates by updating a 3D cost volume, CER-MVS operates by updating a disparity field. Furthermore, we propose an adaptive thresholding method to balance the completeness and accuracy of the reconstructed point clouds. Experiments show that our approach achieves state-of-the-art performance on the DTU and Tanks-and-Temples benchmarks (both intermediate and advanced set). Code is available at https://github.com/princeton-vl/CER-MVS. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
...5.RAFT-3D:Scene Flow using Rigid-Motion Embeddings(Open Access)
- Teed, Zachary ; Deng, Jia
- 《Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition》
- 2021年
- 会议
