Querying Models with an Image URL
You can provide images to vision models by referencing a publicly accessible HTTP URL.- cURL
- Python - OpenAI
- Python - Gravix SDK
- JavaScript
- JavaScript - Gravix SDK
Querying Models with a Base64 Encoded Image
You can also provide images by embedding them directly into the request payload as a Base64 encoded string.- cURL
- Python - OpenAI
- Python - Gravix SDK
- JavaScript
- JavaScript - Gravix SDK
Common applications for VLMs include:
- Image Captioning: Automatically generating descriptive text for images.
- Visual Question Answering (VQA): Answering questions based on the content of an image.
- Document Analysis: Extracting and interpreting information from scanned documents or forms.
- Chart Interpretation: Analyzing data visualizations like graphs and charts.
- Optical Character Recognition (OCR): Extracting printed or handwritten text from images.
- Content Moderation: Identifying and flagging inappropriate or sensitive visual content.

