AI with eyes is coming to an API near you – Interesting Ways to use it…

Have you started playing with ChatGPT Vision? Drop a picture of anything, and ask it a question about it and it can tell you anything you want to know about it.

This is a breakthrough, but soon it will be available on the APIs – that will truly enable some interesting ways to use it.

👉 Personal Executive Assistant for everyone – the AI can watch everything you do on your PC.

Creepy?  Sure, but when you get past that, it can act as your own PA, watching everything you do, taking notes for you and suggest things that it could do for you. It can watch you do your tasks and learn from you. In the future it will ask you if you want it to do certain things to save you time.

👉 Comparing Images – you can add up to 20 images at the same time, allowing it to compare and contrast. This could be used for website testing, plagiarism detection, understanding processes.

👉 “Help me learn this” – Show it the internet router you are trying to configure and get personalized help

👉 Extract text and diagrams – from the white boards, notes, handwritten homework


Personally, I have built an app to allow the AI to see what I do on my PC all day every day… I am looking forward to when they activate Vision on the API next month so it can start looking at what I do and ask to take over some tasks from me!

Would you let an AI Personal Assistant watch you?

Howi Avatar

Posted by

Leave a comment

AI driven learning solutions