AI Content Describer (AIContentDescriber)
- Author: Carter Temm
- Visit add-on's website/source code
Available downloads
Version | Channel | Minimum NVDA version | Last tested NVDA version | download count since last release update | Last release date | Download |
---|---|---|---|---|---|---|
2025.06.09 | stable | 2023.1 | 2024.1 | 151 | 2025-06-10 20:09:41 | AIContentDescriber 2025.06.09 (stable) |
Description
This add-on makes it possible to describe the focus object, navigator object, entire screen, or scene from the camera using popular vision capable AI language models, like Claude, Gemini, or GPT4. It also lets one understand where their face is positioned in the frame of a connected camera. Though content descriptions are quite detailed, they may not always be completely accurate or reflect real world information. Press NVDA+shift+i to pop up a menu asking how you wish to describe based on the current position, or NVDA+shift+u to describe the navigator object, or NVDA+shift+y for an image that has been copied to the clipboard such as in windows explorer, or NVDA+shift+c to ask additional questions about a description. Other keystrokes are customizable from the input gestures dialog. By default, usage of GPT4 is free, thanks to the generocity of the team at PollinationsAI. If you would like to use other models from OpenAI, head to https://platform.openai.com/account/api-keys and create an account, then create a key for interacting with the API. Then, choose the "AI content describer" category from NVDA's settings dialog -> manage models and enter your API key. The process is similar for other model providers, see add-on documentation for more information on this.