Simon Willison /
npub13v9…w5eu
2024-08-26 16:54:18

Did you know Google’s Gemini 1.5 Pro vision LLM is trained to return bounding boxes for objects found within images?

I built this browser tool that lets you run a prompt with an image against Gemini and visualize the bounding boxes it returns.

You can try it out using your own Google Gemini API key: https://tools.simonwillison.net/gemini-bbox
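
To give a sense of what visualizing the boxes involves, here is a minimal sketch in Python. It is not the code behind the tool. It assumes the model replies with boxes written as `[ymin, xmin, ymax, xmax]` on a 0 to 1000 grid, which is the convention Google's documentation describes but which this post does not spell out. The sample reply string and the helper names `parse_boxes` and `to_pixels` are made up for illustration.

```python
import re


def parse_boxes(text: str) -> list[list[int]]:
    """Find every [a, b, c, d] group of four integers in the model's reply."""
    pattern = r"\[\s*(\d+)\s*,\s*(\d+)\s*,\s*(\d+)\s*,\s*(\d+)\s*\]"
    return [[int(n) for n in match] for match in re.findall(pattern, text)]


def to_pixels(box: list[int], width: int, height: int, scale: int = 1000):
    """Convert a box to pixel (left, top, right, bottom).

    Assumption: the box is ordered [ymin, xmin, ymax, xmax] and each value
    lies on a 0..scale grid that is independent of the image size.
    """
    ymin, xmin, ymax, xmax = box
    return (
        round(xmin * width / scale),
        round(ymin * height / scale),
        round(xmax * width / scale),
        round(ymax * height / scale),
    )


# Hypothetical reply text, not real model output.
reply = "- goat: [100, 250, 500, 750]"

# Map each box onto an 800x600 image.
boxes = [to_pixels(b, width=800, height=600) for b in parse_boxes(reply)]
print(boxes)
```

The resulting tuples are in the (left, top, right, bottom) order that most drawing APIs expect for rectangles, so they can be passed to a canvas or image library to draw the outlines.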

Author Public Key
npub13v97j0kknscwnf5pt87nsn7cxzxwfwl3dsu7ss8qsq7ukmqgwg8q84w5eu