Zilong Chen's CV

Summary

I am interested in multimodal generation. Previously, my primary research focused on 3D reconstruction and generation. I am a Ph.D student at Department of Computer Science and Technology, Tsinghua University, where I am advised by Prof. Huaping Liu. Before joining Tsinghua University, I completed my undergraduate studies at Xi'an Jiaotong University under the supervision of Prof. Minnan Luo, focusing on knowledge graphs and their applications in natural language processing.

Education

Tsinghua University, PhD in Computer Science

Xi'an Jiaotong University, BS in Physics

Experience

Shengshu Inc., Research intern on video and 3D generation

Publications (= Indicates Equal Contribution)

[1] MeshGen: Generating PBR Textured Mesh with Render-Enhanced Auto-Encoder and Generative Data Augmentation (https://heheyas.github.io/MeshGen)

[2] V3d: Video diffusion models are effective 3d generators (https://heheyas.github.io/V3D)

[3] Text-to-3d using gaussian splatting (https://gsgen3d.github.io/)

[4] Gaussianeditor: Swift and controllable 3d editing with gaussian splatting (https://buaacyw.github.io/gaussian-editor/)

[5] Masked space-time hash encoding for efficient dynamic scene reconstruction (https://masked-spacetime-hashing.github.io/)

[6] Video4DGen: Enhancing Video and 4D Generation through Mutual Optimization (https://vidu4d-dgs.github.io/)

[7] Vidu4d: Single generated video to high-fidelity 4d reconstruction with dynamic gaussian surfels (https://vidu4d-dgs.github.io/)

[8] Meshanything v2: Artist-created mesh generation with adjacent mesh tokenization (https://buaacyw.github.io/meshanything-v2/)

[9] Dimensionx: Create any 3d and 4d scenes from a single image with controllable video diffusion (https://chenshuo20.github.io/DimensionX/)

[10] Freeplane: Unlocking Free Lunch in Triplane-Based Sparse-View Reconstruction Models (https://freeplane3d.github.io/)

[11] Twibot-22: Towards graph-based twitter bot detection

[12] Knowledge graph augmented political perspective detection in news media

[13] Encoding heterogeneous social and political context for entity stance prediction

[14] KCD: Knowledge walks and textual cues enhanced political perspective detection in news media

[15] BIC: Twitter bot detection with text-graph interaction and semantic consistency

[16] KRACL: Contrastive learning with graph context modeling for sparse knowledge graph completion

[17] Kgap: Knowledge graph augmented political perspective detection in news media

[18] PAR: Political Actor Representation Learning with Social Context and Expert Knowledge

Projects

3D Gaussian Splatting

Segment Anything in NeRF

Awards

Huiyan Scholarship (Tsinghua University)

Track winner, BMW hackthon

Technologies