Base URL
The root endpoint for all API requests is:Authentication
The Gravix Layer API uses API key authentication. Include your API key in theAuthorization header:
Rate Limits
API requests are subject to rate limiting:- Serverless: 100 requests per minute
- Dedicated: Custom limits based on your plan
- X-RateLimit-Limit: Total requests allowed per minute
- X-RateLimit-Remaining: Requests remaining in current window
- X-RateLimit-Reset: Unix timestamp when the limit resets
Response Format
All API responses use JSON format with a consistent structure: Success ResponseHTTP Status Codes
The API uses standard HTTP status codes:| Code | Description | 
|---|---|
| 200 | Success | 
| 201 | Created | 
| 400 | Bad Request | 
| 401 | Unauthorized | 
| 403 | Forbidden | 
| 404 | Not Found | 
| 429 | Rate Limited | 
| 500 | Internal Server Error | 
SDK Libraries
While you can use the REST API directly, we recommend using our official SDKs:- Python: pip install gravixlayer
- JavaScript: Coming Soon (use OpenAI-compatible libraries with baseURL)
API Sections
Our API is organized into the following sections:Chat Completions
Generate text using large language models
Completions
OpenAI-style prompt completions
Embeddings
Create vector embeddings from text
Files
Upload, manage, and retrieve files
Vectors
Vector database operations and search
Deployments
Manage dedicated model deployments
Vision
Image analysis and vision models
Getting Started
- Get your API key from the dashboard
- 
Make your first request using curl:
- Explore the endpoints in this documentation
- Build your application using our SDKs or direct API calls
Support
- Documentation Issues: GitHub Issues
- API Support: Discord Community
- Enterprise Support: Contact Sales

