fal

updated a model 5 days ago

fal/control-light

Updated 5 days ago

Paper • 2605.15193 • Published 18 days ago • 8

published a model 5 days ago

fal/control-light

Updated 5 days ago

tmeral

submitted a paper to Daily Papers 17 days ago

Aligning Latent Geometry for Spherical Flow Matching in Image Generation

updated a model about 2 months ago

fal/LTX-2.3-FlashPack

Updated Apr 14 • 30 • 6

in fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA about 2 months ago

Upload оно.jpg

#26 opened about 2 months ago by

Huliana

Paper • 2204.11824 • Published Apr 25, 2022 • 1

in fal/Qwen-Image-Edit-2511-Multiple-Angles-LoRA 3 months ago

Upload images (2).jpeg

#21 opened 3 months ago by

venix15

kaan-oktay

authored 2 papers 6 months ago

Semi-Parametric Neural Image Synthesis

Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Paper • 2511.20649 • Published Nov 25, 2025 • 51

kaanakan

authored a paper 6 months ago

Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Paper • 2511.20649 • Published Nov 25, 2025 • 51

posted an update 7 months ago

Post

8383

FlashPack: Lightning-Fast Model Loading for PyTorch

https://github.com/fal-ai/flashpack

FlashPack — a new, high-throughput file format and loading mechanism for PyTorch that makes model checkpoint I/O blazingly fast, even on systems without access to GPU Direct Storage (GDS).

With FlashPack, loading any model can be 3–6× faster than with the current state-of-the-art methods like accelerate or the standard load_state_dict() and to() flow — all wrapped in a lightweight, pure-Python package that works anywhere.

2 replies

posted an update 12 months ago

Post

1026

TryOffDiff goes multi-garment!

We're excited to share TryOffDiff v2, extending our approach to support multiple garment categories. Key updates include:
- Training on the multi-garment DressCode dataset, covering upper-body, lower-body, and dresses.
- A simplified adapter design for improved training efficiency and modularity.
- Introduction of four specialized models:
- One model per category (upper, lower, dress),
- Plus a multi-garment model capable of generating multiple garments sequentially from a single image.

*PS:* Visit us this Friday at 10:30 AM in ExHall-B for our live demo @CVPR '25!

Demo: rizavelioglu/tryoffdiff
Project page: https://rizavelioglu.github.io/tryoffdiff

1 reply

Paper • 2504.13078 • Published Apr 17, 2025

authored a paper about 1 year ago

Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-Off

posted an update about 1 year ago

Post

3791

Comparing reconstruction quality of various VAEs with an interactive demo
rizavelioglu/vae-comparison

1 reply

posted an update over 1 year ago

Post

3258

Zonos is flying up the trending tab, and for good reason - it's the most expressive and emotive open-source TTS I've used to date. I'm happy to say it's now supported in Taproot, with added long-form synthesis support and other goodies.

Try it here: https://huggingface.co/spaces/benjamin-paine/zonos-longform

Getting started with Zonos in Taproot is easy; with a working CUDA toolkit and Python/Pip installation, all you have to do is:

apt install espeak-ng
pip install taproot
taproot install speech-synthesis:zonos-transformer
taproot invoke speech-synthesis:zonos-transformer --text "Hello, world!"

See more on GitHub at https://github.com/painebenjamin/taproot/

2 replies

posted an update over 1 year ago

Post

4152

Hello HuggingFace 🤗, and happy new year! 🎆

I'm thrilled to be releasing the first iteration of a project I've been working on for quite awhile now. It's called Taproot, and it's a seamlessly scalable open-source AI/ML inference engine designed for letting developers build real-time experiences clustered across a small-to-mid-sized cluster, without the burden of hyperscale infrastructure.

Along with the server and task framework is a client library for node and the browser. And what good is a server and client without an app to go alongside it? To that end, I'm also releasing Anachrovox, a fun, real-time hands-free voice assistant that can run on mid-level devices in <12GB VRAM, with web search, weather, and other tools. It uses my real-time browser wake-word library to detect utterances of the phrase 'Hey Vox', 'Hi Vox', 'Okay Vox', 'Anachrovox' or just 'Vox' (alongside some others.)

Releasing this many things at once will definitely result in bugs, so please report them when sighted! Thank you all!

Taproot: https://github.com/painebenjamin/taproot
Taproot JS Client: https://github.com/painebenjamin/taproot.js
Anachrovox: https://github.com/painebenjamin/anachrovox

The Anachrovox Spaces are networked together, balancing load across them to keep all front-ends responsive. You only have to choose what color you like the most!

https://huggingface.co/spaces/benjamin-paine/anachrovox
https://huggingface.co/spaces/benjamin-paine/anachrovox-amber

12 replies

posted an update over 1 year ago

Post

2312

Introducing Virtual Try-Off (VTOFF), a novel task focused on generating standardized garment images from single photos of clothed individuals. Unlike traditional Virtual Try-On (VTON), which digitally dresses models, VTOFF aims to extract a canonical garment image, posing unique challenges in capturing garment shape, texture, and intricate patterns.

Try it out: rizavelioglu/tryoffdiff
Paper: TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models (2411.18350)
Project page: https://rizavelioglu.github.io/tryoffdiff

3 replies

Paper • 2411.18350 • Published Nov 27, 2024 • 28

authored a paper over 1 year ago

TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models