Collections
Collections including paper arxiv:2312.03704

- 3D-LLM: Injecting the 3D World into Large Language Models
  Paper • 2307.12981 • Published • 37
- Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study
  Paper • 2401.17981 • Published • 1
- SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
  Paper • 2312.02126 • Published • 2
- Relightable Gaussian Codec Avatars
  Paper • 2312.03704 • Published • 33

- aMUSEd: An Open MUSE Reproduction
  Paper • 2401.01808 • Published • 31
- From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
  Paper • 2401.01885 • Published • 28
- SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
  Paper • 2401.00604 • Published • 6
- LARP: Language-Agent Role Play for Open-World Games
  Paper • 2312.17653 • Published • 33

- SEEAvatar: Photorealistic Text-to-3D Avatar Generation with Constrained Geometry and Appearance
  Paper • 2312.08889 • Published • 15
- Towards Practical Capture of High-Fidelity Relightable Avatars
  Paper • 2309.04247 • Published • 10
- Learning Disentangled Avatars with Hybrid 3D Representations
  Paper • 2309.06441 • Published • 6
- Text-Guided Generation and Editing of Compositional 3D Avatars
  Paper • 2309.07125 • Published • 7

- Enhancing High-Resolution 3D Generation through Pixel-wise Gradient Clipping
  Paper • 2310.12474 • Published • 5
- Drivable 3D Gaussian Avatars
  Paper • 2311.08581 • Published • 47
- SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering
  Paper • 2311.12775 • Published • 28
- Diffusion360: Seamless 360 Degree Panoramic Image Generation based on Diffusion Models
  Paper • 2311.13141 • Published • 16

- EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
  Paper • 2402.17485 • Published • 195
- VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
  Paper • 2312.01841 • Published • 1
- MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
  Paper • 2311.16498 • Published • 1
- GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
  Paper • 2312.02134 • Published • 2

- Media2Face: Co-speech Facial Animation Generation With Multi-Modality Guidance
  Paper • 2401.15687 • Published • 24
- Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
  Paper • 2312.03029 • Published • 26
- DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
  Paper • 2312.13578 • Published • 29
- Splatter Image: Ultra-Fast Single-View 3D Reconstruction
  Paper • 2312.13150 • Published • 16

- DREAM-Talk: Diffusion-based Realistic Emotional Audio-driven Method for Single Image Talking Face Generation
  Paper • 2312.13578 • Published • 29
- Splatter Image: Ultra-Fast Single-View 3D Reconstruction
  Paper • 2312.13150 • Published • 16
- Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians
  Paper • 2312.03029 • Published • 26
- Relightable Gaussian Codec Avatars
  Paper • 2312.03704 • Published • 33

- MVDream: Multi-view Diffusion for 3D Generation
  Paper • 2308.16512 • Published • 105
- Learning Disentangled Avatars with Hybrid 3D Representations
  Paper • 2309.06441 • Published • 6
- Dynamic Mesh-Aware Radiance Fields
  Paper • 2309.04581 • Published • 7
- Towards Practical Capture of High-Fidelity Relightable Avatars
  Paper • 2309.04247 • Published • 10