Multimodal Transformers: AI Foundation Models, Part 1