Mirasol3B

From Humanoid Robots Wiki
Revision as of 20:15, 16 May 2024 by Ben

Mirasol3B is an autoregressive multimodal model for time-aligned video and audio.

Mirasol3B
Name: Mirasol3B
Full Name: Mirasol3B: A Multimodal Autoregressive Model for Time-Aligned and Contextual Modalities
Arxiv: Link
Twitter: Twitter
Publication Date: February 2024
Authors: AJ Piergiovanni, Isaac Noble, Dahun Kim, Michael Ryoo, Victor Gomes, Anelia Angelova