DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning

Ruowen Zhao^*1,3, Junliang Ye^*1,3, Zhengyi Wang^*1,3,

Guangce Liu³, Yiwen Chen², Yikai Wang¹, Jun Zhu^1,3

¹Tsinghua University, ²Nanyang Technological University, ³ShengShu

(*Equal Contribution)

Demo Video

All of the meshes above are generated by DeepMesh. DeepMesh can generate high-quality meshes conditioned on the given point cloud by auto-regressive transformer.

Point-cloud Conditioned Mesh Generation

DeepMesh creates the mesh on the right from the point cloud on the left. Drag with the left mouse button to change the view, right mouse button to move the mesh.

Animation of Mesh Generation

The following video shows an animation of the mesh generation process. We generate all faces of mesh sequentially.

Abstract

Triangle meshes play a crucial role in 3D applications for efficient manipulation and rendering. While auto-regressive methods generate structured meshes by predicting discrete vertex tokens, they are often constrained by limited face counts and mesh incompleteness. To address these challenges, we propose DeepMesh, a framework that optimizes mesh generation through two key innovations: (1) an efficient pre-training strategy incorporating a novel tokenization algorithm, along with improvements in data curation and processing, and (2) the introduction of Reinforcement Learning (RL) into 3D mesh generation to achieve human preference alignment via Direct Preference Optimization (DPO). We design a scoring standard that combines human evaluation with 3D metrics to collect preference pairs for DPO, ensuring both visual appeal and geometric accuracy. Conditioned on point clouds and images, DeepMesh generates meshes with intricate details and precise topology, outperforming state-of-the-art methods in both precision and quality.

Method