Explore our 4D reconstruction results of VROOM on various track mini-sectors.
Drag with left click to rotate view
Scroll to zoom in/out
Drag with right click to move view
Moving forward and backward
Moving left and right
Moving upward and downward
Abstract
We introduce VROOM, a system for reconstructing 3D
models of Formula 1 circuits using only onboard camera
footage from racecars. Leveraging video data from the
2023 Monaco Grand Prix, we address challenges in the
videos such as high-speed motion, sharp turns of perspec-
tives from a singular camera perspective. Our pipeline uti-
lizes different methods such as DROID—Slam, AnyCam,
and Monst3r and combine preprocessing techniques such
as different methods of masking, temporal chunking, and
resolution scaling to account for dynamic motion and com-
putational constraints. We show Vroom is able to partially
recover the track and vehicle trajectories in complex envi-
ronments. These findings indicate the feasibility of using
onboard video for scalable 4D reconstruction across multi-
ple agents in real-world settings.
Preprocessing Methods
In order to make our method with the long F1 videos, we implement several preprocessing methods to improve
efficiency while maintaining reconstruction quality.
Next, in addition to downsampling resolution and FPS, we also strategically chunk the video on straight segments
in the race. Since our method processes the video chunk by chunk, chunking on the straights is essential for
ensuring each turn is reconstructed with the highest accuracy possible.
(Top) First Video Chunk; (Bottom) Second Video Chunk. As we see, the two overlap in a
straight segment of the race.
Chunk-wise Point Cloud and Camera Extrinsics
We utilize and extend the monst3r repo for learning the point cloud and camera parameters per chunk.
Dynamic global point cloud and camera pose estimation
@article{yadav2024vroom,
author = {Yadav, Yajat and Bharadwaj, Varun and Korrapati, Jathin and Baranwal, Tanish},
title = {VROOM: Visual Reconstruction over Onboard Multiview},
year = {2025},
note = {Unpublished manuscript, for a class project},
institution = {University of California, Berkeley},
url = {http://varun-bharadwaj.github.io/vroom},
}
Acknowledgements:
We borrow this template from the Monst3r paper at Monst3r.
Similarly,
the interactive 4D visualization is inspired by the visualizations presented by Monst3r.