Whalefishin / LLM_animation
A showroom for various animations generated by large language models (LLMs). Our method takes a rigged 3D model and produces novel animations specified via natural language descriptions in a matter of seconds.
☆28 Updated last year
Alternatives and similar repositories for LLM_animation
Users who are interested in LLM_animation are comparing it to the libraries listed below.
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆64 Updated 11 months ago
- Meshcapade support for Unreal Editor for Fortnite (UEFN) ☆22 Updated last year
- Code for "Re-Thinking Inverse Graphics With Large Language Models"; TMLR 2024 ☆69 Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation ☆24 Updated last year
- Synthetic data generator for image, video and 3D models ☆30 Updated last year
- ☆37 Updated last year
- Enhancement in Multimodal Representation Learning. ☆40 Updated last year
- Implementation code for our paper "Drawing2CAD: Sequence-to-Sequence Learning for CAD Generation from Vector Drawings" ☆44 Updated last month
- Resources, benchmarks for 3D generation via AI and beyond ☆69 Updated last month
- Gradio app to track objects in video and add visual effects ☆17 Updated last month
- STDFormer: Spatio Temporal Disentanglement Learning for 3D Human Mesh Recovery from Monocular Videos with Transformer ☆42 Updated last year
- Code for our SIGGRAPH 2023 paper, "Acting as Inverse Inverse Planning" ☆18 Updated 2 years ago
- NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction ☆22 Updated last year
- SATO: Stable Text-to-Motion Framework ☆115 Updated 7 months ago
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support. ☆14 Updated last year
- ☆19 Updated last year
- Generate 3D meshes from a single 2D image using TripoSR, complete with manual geometry editing and texture baking support ☆54 Updated 10 months ago
- Implementation of the premier Text to Video model from OpenAI ☆56 Updated 9 months ago
- ☆19 Updated last year
- ☆55 Updated this week
- Portal hopping with Stable Diffusion 👾 ☆22 Updated last year
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce… ☆16 Updated last week
- ☆13 Updated last year
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io ☆37 Updated this week
- Image Generation API Server - Similar to https://text-generator.io but for images ☆53 Updated last week
- A one-stop library to standardize the inference and evaluation of all the conditional video generation models. ☆49 Updated 6 months ago
- ☆18 Updated 10 months ago
- Anim-Director: Controllable Animation Video Generation with Large Models-based Multimodal Agents ☆85 Updated 2 months ago
- We present a model that can generate accurate 3D sound fields of human bodies from headset microphones and body pose as inputs. ☆88 Updated last year
- ☆16 Updated last year