Skip to main content

Serverless Models

Gravix Layer provides instant access to high-performance AI models through our serverless infrastructure. Deploy powerful language and vision models without managing any infrastructure.


Text Models

Model NameModel IDOrganizationParametersContext LengthQuantization
Llama-3.1-8B-Instructllama3.1:8b-instruct-fp16Meta-Llama8B128,000FP16

Features:

  • Optimized for instruction following and conversational AI
  • Extended context window for processing large documents
  • Fine-tuned for tool usage and function calling
  • High-performance FP16 quantization for optimal speed

Vision Models

Model NameModel IDOrganizationParametersContext LengthQuantization
Qwen2.5-VL-3B-Instructqwen2.5vl:3b-fp16Qwen3B32,768FP16

Features:

  • Multimodal processing combining text and images
  • Efficient 3B parameter architecture for fast inference
  • Advanced visual reasoning capabilities
  • Support for image analysis and description tasks

Embedding Models

Status: Coming Soon

Advanced embedding models for semantic search and vector operations are currently in development and will be available soon.


Model Selection Guide

Use CaseRecommended ModelBest For
Text GenerationLlama-3.1-8B-InstructConversational AI, content creation, code generation
Document AnalysisLlama-3.1-8B-InstructLarge document processing with 128k context
Image AnalysisQwen2.5-VL-3B-InstructVisual question answering, image description
Multimodal TasksQwen2.5-VL-3B-InstructCombined text and image processing