Because the AI has to do the work of translating the image into text before deciding what to do with it.

  • @S13Ni
    1220 hours ago

    So we start normalizing the use of ffmpeg to type out whatever you want to say and render it as a video of static text on a white background, to make it even more expensive?