SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding
We introduce SlowFast-LLaVA-1.5 (abbreviated as SF-LLaVA-1.5), a family of video large language models (LLMs) offering a token-efficient solution for long-form ...