Large Language Models (LLMs) are at the core of Generative AI applications.
Model Type | Request Schema | Response Schema |
---|---|---|
EMBEDDING | EmbeddingRequest | EmbeddingResponse |
RERANKING | RerankingRequest | RerankingResponse |
COMPLETION | CompletionRequest | CompletionResponse |
CHAT_COMPLETION | ChatCompletionRequest | ChatCompletionResponse |
AGENT | AgentRequest | AgentResponse |
Information Architecture of Custom Models in Scale GP