AIContentDescriber | NVDA Add-ons Directory

AI Content Describer (AIContentDescriber)

Author: Carter Temm
Visit add-on's website/source code

Available downloads

Available downloads for AIContentDescriber
Version	Channel	Minimum NVDA version	Last tested NVDA version	download count since last release update	Last release date	Download
2026.05.06	stable	2023.1	2026.1	562	2026-05-06 19:06:58	AIContentDescriber 2026.05.06 (stable)

Description

This add-on makes it possible to describe the focus object, navigator object, entire screen, or scene from the camera using popular vision capable AI language models, like Claude, Gemini, or GPT4. It also lets one understand where their face is positioned in the frame of a connected camera. Though content descriptions are quite detailed, they may not always be completely accurate or reflect real world information. Press NVDA+shift+i to pop up a menu asking how you wish to describe based on the current position, or NVDA+shift+u to describe the navigator object, or NVDA+shift+y for an image that has been copied to the clipboard such as in windows explorer, or NVDA+shift+c to ask additional questions about a description. Other keystrokes are customizable from the input gestures dialog. By default, usage of GPT4 is free, thanks to the generocity of the team at PollinationsAI. If you would like to use other models from OpenAI, head to https://platform.openai.com/account/api-keys and create an account, then create a key for interacting with the API. Then, choose the "AI content describer" category from NVDA's settings dialog -> manage models and enter your API key. The process is similar for other model providers, see add-on documentation for more information on this.