The big alter from GPT-3.five is OpenAI's 4th era language product is multimodal, which means it could possibly process equally textual content, illustrations or photos and audio. This implies you are able to clearly show it pictures and it will reply to them along with a text prompt – an early illustration of this, famous from the Big apple Time