Jiyuan Wang1,2, Chunyu Lin1✉, Lei Sun2✝, Rongying Liu1, Mingxing Li2, Lang Nie3, Kang Liao4, Xiangxiang Chu2, Yao Zhao1
1Beijing Jiaotong University
2Alibaba Group
3Chongqing University of Posts and Telecommunications
4Nanyang Technological University
✉Corresponding author.
✝Project leader.
We present FE2E, a DiT-based foundation model for monocular dense geometry prediction. We pioneer the adaptation of an advanced image editing model for dense geometry prediction, revealing that editing models possess inherent structural priors beneficial for these tasks. With limited supervision (71K images), FE2E achieves significant performance improvements in zero-shot depth and normal estimation.
- [2025-09-05]: Paper released on arXiv.
This code was tested on Ubuntu 20.04, Python 3.10, and CUDA 12.1.
-
Clone the repository:
git clone https://github.com/AMAP-ML/FE2E.git cd FE2E -
Install dependencies: We recommend using
condafor environment management.Dependencies will release soon.
-
Initialize Accelerate Environment:
accelerate config
-
Prepare Training Data: Please refer to our paper for details on the training datasets. After downloading, organize the data as specified in the configuration files and update the corresponding paths.
-
Run Training Script: To train the FE2E model for joint depth and normal estimation, run the training script:
Script will release soon.
-
Place your images in a directory, for example,
assets/examples. -
Run the inference script. Our model jointly predicts depth and normals.
Script will release soon.
-
Prepare Benchmark Datasets:
- For depth estimation, download the evaluation datasets provided by Marigold:
cd datasets/eval/depth/ wget -r -np -nH --cut-dirs=4 -R "index.html*" -P . [https://share.phys.ethz.ch/~pf/bingkedata/marigold/evaluation_dataset/](https://share.phys.ethz.ch/~pf/bingkedata/marigold/evaluation_dataset/)
- For normal estimation, download the evaluation datasets provided by DSINE (
dsine_eval.zip) intodatasets/eval/normal/and unzip it.
- For depth estimation, download the evaluation datasets provided by Marigold:
-
Run Evaluation Script: Execute the evaluation script to assess the model's performance on the prepared benchmarks:
Script will release soon.
Below are the official models released. Our framework jointly estimates depth and normals in a single forward pass.
Model will release soon.
If you find our work useful in your research, please consider citing our paper:
@article{wang2025editor,
title={From Editor to Dense Geometry Estimator},
author={Wang, JiYuan and Lin, Chunyu and Sun, Lei and Liu, Rongying and Nie, Lang and Li, Mingxing and Liao, Kang and Chu, Xiangxiang and Zhao, Yao},
journal={arXiv preprint arXiv:2509.04338},
year={2025}
}
