National media has been linking to this beautiful video of a giant paper airplane flying over the Arizona desert today, though the paper airplane flew originally in March, 2012. The 45-foot-long, ...
FrameFusion reduces the number of tokens in Large Vision-Language Models (LVLMs) by combining similarity-based merging with importance-based pruning. It achieves a 70% vision token reduction, 3.4–4.4× ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results