MGIE by Apple

A multimodal large language model (MLLM) that can transform an image based on a specific instruction.
