System Design Course at InterviewReady: https://interviewready.io/
Facebook has multi-modal large language models.
Which allows the same vector representation for image, text, and video.
Paper link: https://arxiv.org/pdf/2305.05665
#AI #LLMs #Multimodal