Hi there Sahil Saini
Thanks for using QandA platform
U can use Azure Cognitive Services, specifically the Custom Vision models, which can process and analyze images such as scanned textbook pages, handwritten notes, or diagrams. For text extraction from images, the Azure Computer Vision service provides the Read
API, which supports OCR to extract text from images. If you need to recognize specific objects, you can use Azure Custom Vision, where you can train a custom model to detect and understand images in the context of your chatbot.
To get started, you can create a Computer Vision resource or a Custom Vision resource in your Azure portal. After uploading your image, use the Read
API for text extraction or train a model with Custom Vision for specific image recognition tasks. The extracted information can then be fed into your AI assistant, which can process both text and image-based queries to provide more accurate and comprehensive responses.
Cognitive Services are available in many regions, Azure region availability documentation.
I know this might seem overwelming but try documenation have a good read and im sure you can get started
Computer Vision API documentation
Custom Vision API documentation.
If this helps kindly accept the answer thanks much.