-
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
Paper ⢠2404.02514 ⢠Published ⢠11 -
MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance
Paper ⢠2404.08252 ⢠Published ⢠6 -
Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video
Paper ⢠2404.09833 ⢠Published ⢠30 -
MeshLRM: Large Reconstruction Model for High-Quality Mesh
Paper ⢠2404.12385 ⢠Published ⢠27
Collections
Discover the best community collections!
Collections including paper arxiv:2404.12385
-
Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology
Paper ⢠2203.00585 ⢠Published ⢠2 -
Emerging Properties in Self-Supervised Vision Transformers
Paper ⢠2104.14294 ⢠Published ⢠4 -
DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
Paper ⢠2404.06903 ⢠Published ⢠21 -
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
Paper ⢠2404.07973 ⢠Published ⢠32
-
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Paper ⢠2402.08714 ⢠Published ⢠15 -
Data Engineering for Scaling Language Models to 128K Context
Paper ⢠2402.10171 ⢠Published ⢠25 -
RLVF: Learning from Verbal Feedback without Overgeneralization
Paper ⢠2402.10893 ⢠Published ⢠12 -
Coercing LLMs to do and reveal (almost) anything
Paper ⢠2402.14020 ⢠Published ⢠13
-
3D Congealing: 3D-Aware Image Alignment in the Wild
Paper ⢠2404.02125 ⢠Published ⢠10 -
SpatialTracker: Tracking Any 2D Pixels in 3D Space
Paper ⢠2404.04319 ⢠Published ⢠25 -
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
Paper ⢠2404.09956 ⢠Published ⢠12 -
MeshLRM: Large Reconstruction Model for High-Quality Mesh
Paper ⢠2404.12385 ⢠Published ⢠27
-
Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding
Paper ⢠2403.10395 ⢠Published ⢠9 -
CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
Paper ⢠2403.05034 ⢠Published ⢠22 -
FlexiDreamer: Single Image-to-3D Generation with FlexiCubes
Paper ⢠2404.00987 ⢠Published ⢠23 -
Advances in 3D Generation: A Survey
Paper ⢠2401.17807 ⢠Published ⢠19
-
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
Paper ⢠2403.01807 ⢠Published ⢠9 -
TripoSR: Fast 3D Object Reconstruction from a Single Image
Paper ⢠2403.02151 ⢠Published ⢠16 -
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Paper ⢠2403.01779 ⢠Published ⢠30 -
MagicClay: Sculpting Meshes With Generative Neural Fields
Paper ⢠2403.02460 ⢠Published ⢠8
-
3D-aware Image Generation using 2D Diffusion Models
Paper ⢠2303.17905 ⢠Published ⢠2 -
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Paper ⢠2310.08529 ⢠Published ⢠18 -
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Paper ⢠2310.16818 ⢠Published ⢠32 -
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image
Paper ⢠2312.04543 ⢠Published ⢠22
-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper ⢠2401.09416 ⢠Published ⢠11 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper ⢠2401.10171 ⢠Published ⢠14 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper ⢠2311.09217 ⢠Published ⢠22 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper ⢠2401.12979 ⢠Published ⢠9
-
Freditor: High-Fidelity and Transferable NeRF Editing by Frequency Decomposition
Paper ⢠2404.02514 ⢠Published ⢠11 -
MonoPatchNeRF: Improving Neural Radiance Fields with Patch-based Monocular Guidance
Paper ⢠2404.08252 ⢠Published ⢠6 -
Video2Game: Real-time, Interactive, Realistic and Browser-Compatible Environment from a Single Video
Paper ⢠2404.09833 ⢠Published ⢠30 -
MeshLRM: Large Reconstruction Model for High-Quality Mesh
Paper ⢠2404.12385 ⢠Published ⢠27
-
3D Congealing: 3D-Aware Image Alignment in the Wild
Paper ⢠2404.02125 ⢠Published ⢠10 -
SpatialTracker: Tracking Any 2D Pixels in 3D Space
Paper ⢠2404.04319 ⢠Published ⢠25 -
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization
Paper ⢠2404.09956 ⢠Published ⢠12 -
MeshLRM: Large Reconstruction Model for High-Quality Mesh
Paper ⢠2404.12385 ⢠Published ⢠27
-
Isotropic3D: Image-to-3D Generation Based on a Single CLIP Embedding
Paper ⢠2403.10395 ⢠Published ⢠9 -
CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model
Paper ⢠2403.05034 ⢠Published ⢠22 -
FlexiDreamer: Single Image-to-3D Generation with FlexiCubes
Paper ⢠2404.00987 ⢠Published ⢠23 -
Advances in 3D Generation: A Survey
Paper ⢠2401.17807 ⢠Published ⢠19
-
Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology
Paper ⢠2203.00585 ⢠Published ⢠2 -
Emerging Properties in Self-Supervised Vision Transformers
Paper ⢠2104.14294 ⢠Published ⢠4 -
DreamScene360: Unconstrained Text-to-3D Scene Generation with Panoramic Gaussian Splatting
Paper ⢠2404.06903 ⢠Published ⢠21 -
Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models
Paper ⢠2404.07973 ⢠Published ⢠32
-
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
Paper ⢠2403.01807 ⢠Published ⢠9 -
TripoSR: Fast 3D Object Reconstruction from a Single Image
Paper ⢠2403.02151 ⢠Published ⢠16 -
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
Paper ⢠2403.01779 ⢠Published ⢠30 -
MagicClay: Sculpting Meshes With Generative Neural Fields
Paper ⢠2403.02460 ⢠Published ⢠8
-
3D-aware Image Generation using 2D Diffusion Models
Paper ⢠2303.17905 ⢠Published ⢠2 -
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Paper ⢠2310.08529 ⢠Published ⢠18 -
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Paper ⢠2310.16818 ⢠Published ⢠32 -
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image
Paper ⢠2312.04543 ⢠Published ⢠22
-
PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models
Paper ⢠2402.08714 ⢠Published ⢠15 -
Data Engineering for Scaling Language Models to 128K Context
Paper ⢠2402.10171 ⢠Published ⢠25 -
RLVF: Learning from Verbal Feedback without Overgeneralization
Paper ⢠2402.10893 ⢠Published ⢠12 -
Coercing LLMs to do and reveal (almost) anything
Paper ⢠2402.14020 ⢠Published ⢠13
-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper ⢠2401.09416 ⢠Published ⢠11 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper ⢠2401.10171 ⢠Published ⢠14 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper ⢠2311.09217 ⢠Published ⢠22 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper ⢠2401.12979 ⢠Published ⢠9