Bob Wikipublic profile

Selected projects

From Bob Wiki, a public encyclopedia-style profile page

Bob’s projects are presented as research nodes: each one represents a technical step toward dynamic, animatable, and eventually embodied human and animal AI characters.

Project narrative

The public narrative is sequential. Bob began from the question of whether a single image could control the creation of an animatable animal. DogRecon focused on dogs as a category where canine priors could make single-image reconstruction feasible. AniGauss broadened the setting toward arbitrary animals and less restrictive input images. WildAni4D then moved from single images toward video, synthetic data, and temporally coherent 4D animal motion.

ProjectStatusContributionTechnical challenge / lesson
DogReconInternational Journal of Computer Vision, 2025Proposed an early framework for controllable animal generation and reconstruction from a single image, focused on animatable 3D Gaussian dog reconstruction guided by canine priors.Zero123-style generated views were not always reliable, so the system needed canine-prior and mask-guided mechanisms. One of the hardest parts was robust sampling: selecting between the input sample and RSW-generated samples, and also sampling among generated views to reduce failure cases.
AniGaussCV4Animals Workshop at CVPR 2026Extends the animal reconstruction direction beyond restrictive dog inputs. Whereas DogRecon required a relatively clean input where all four legs were visible, AniGauss aims to handle more general in-the-wild animal images and broader animal categories.Uses 3D Gaussian representations as an anchor representation during inference. It is a workshop-stage project and still needs further development, but it represents a step toward more general animatable animal reconstruction.
WildAni4DCVPR 2026 FindingsBuilds an end-to-end system for extracting animal motion from video, using a synthetic animal video pipeline and a model for temporally coherent 4D animal mesh reconstruction.The system currently works especially well on generated videos such as Veo3-style outputs. A key next step is improving the data and robustness so that it performs strongly on truly in-the-wild videos. The strongest demo to highlight is demo1.mov on the project page.
Human / animal avatar threadOngoing research directionConnects lessons from human avatar reconstruction to animal avatar reconstruction, while identifying where human-centric assumptions fail.Important for positioning animal reconstruction as part of a broader embodied-character stack rather than a niche visual reconstruction problem.
Personal AI research OSPublic high-level description onlyUses structured notes and AI tools to compound research taste, writing, and execution over time.Should remain high-level in public. Private agent logs, relationship context, and strategy details are intentionally excluded.

DogRecon

DogRecon is important in Bob’s research identity because it established a concrete thesis: a single image can be used to control the creation of an animatable animal representation. Its contribution was not only producing dog reconstructions, but proposing a framework where category-specific animal priors guide single-image 3D Gaussian reconstruction.

Bob considers the most meaningful part to be that the work was early in this direction. The hardest engineering and research challenge was reliability. Generated views could contain artifacts or inconsistent geometry, so the framework needed a reliable sampling strategy and canine-prior-guided constraints. The mask-guided use of a canine prior was especially satisfying because it converted an unreliable generative component into a more controllable reconstruction pipeline.

AniGauss

AniGauss can be viewed as a generalization attempt after DogRecon. DogRecon worked best when the input was relatively favorable, such as a dog image where all four legs were visible. AniGauss aims to relax this assumption: the goal is to reconstruct animatable animals from more general in-the-wild images and across a broader set of animal categories.

The project uses 3D Gaussian representations as an anchor during inference. Its public status is CV4Animals Workshop at CVPR 2026. Bob sees it as promising but still needing further development.

WildAni4D

WildAni4D moves the research thread from single-image reconstruction toward video-based 4D animal motion. Its core idea is to create synthetic animal video data and use it to build an end-to-end system that can extract temporally coherent animal motion from videos.

The current system performs especially well on generated videos, but Bob’s future goal is to refine the data and model so that it becomes strong on truly in-the-wild animal videos. The project’s strongest visual demonstration is demo1.mov on the WildAni4D project page.

Needs more detail from Bob

Add links for AniGauss if/when public, exact paper titles/authors for each project, and 1–2 representative images or videos per project.