4.4 KiB
Create embeddings
post /embeddings
Creates an embedding vector representing the input text.
Body Parameters
-
input: string or array of string or array of number or array of array of numberInput text to embed, encoded as a string or array of tokens. To embed multiple inputs in a single request, pass an array of strings or array of token arrays. The input must not exceed the max input tokens for the model (8192 tokens for all embedding models), cannot be an empty string, and any array must be 2048 dimensions or less. Example Python code for counting tokens. In addition to the per-input token limit, all embedding models enforce a maximum of 300,000 tokens summed across all inputs in a single request.
-
String = stringThe string that will be turned into an embedding.
-
Array = array of stringThe array of strings that will be turned into an embedding.
-
Array = array of numberThe array of integers that will be turned into an embedding.
-
Array = array of array of numberThe array of arrays containing integers that will be turned into an embedding.
-
-
model: string or EmbeddingModelID of the model to use. You can use the List models API to see all of your available models, or see our Model overview for descriptions of them.
-
string -
EmbeddingModel = "text-embedding-ada-002" or "text-embedding-3-small" or "text-embedding-3-large"-
"text-embedding-ada-002" -
"text-embedding-3-small" -
"text-embedding-3-large"
-
-
-
dimensions: optional numberThe number of dimensions the resulting output embeddings should have. Only supported in
text-embedding-3and later models. -
encoding_format: optional "float" or "base64"The format to return the embeddings in. Can be either
floatorbase64.-
"float" -
"base64"
-
-
user: optional stringA unique identifier representing your end-user, which can help OpenAI to monitor and detect abuse. Learn more.
Returns
-
CreateEmbeddingResponse object { data, model, object, usage }-
data: array of EmbeddingThe list of embeddings generated by the model.
-
embedding: array of numberThe embedding vector, which is a list of floats. The length of vector depends on the model as listed in the embedding guide.
-
index: numberThe index of the embedding in the list of embeddings.
-
object: "embedding"The object type, which is always "embedding".
"embedding"
-
-
model: stringThe name of the model used to generate the embedding.
-
object: "list"The object type, which is always "list".
"list"
-
usage: object { prompt_tokens, total_tokens }The usage information for the request.
-
prompt_tokens: numberThe number of tokens used by the prompt.
-
total_tokens: numberThe total number of tokens used by the request.
-
-
Example
curl https://api.openai.com/v1/embeddings \
-H 'Content-Type: application/json' \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-d '{
"input": "The quick brown fox jumped over the lazy dog",
"model": "text-embedding-3-small",
"encoding_format": "float",
"user": "user-1234"
}'
Response
{
"data": [
{
"embedding": [
0
],
"index": 0,
"object": "embedding"
}
],
"model": "model",
"object": "list",
"usage": {
"prompt_tokens": 0,
"total_tokens": 0
}
}
Example
curl https://api.openai.com/v1/embeddings \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"input": "The food was delicious and the waiter...",
"model": "text-embedding-ada-002",
"encoding_format": "float"
}'
Response
{
"object": "list",
"data": [
{
"object": "embedding",
"embedding": [
0.0023064255,
-0.009327292,
.... (1536 floats total for ada-002)
-0.0028842222,
],
"index": 0
}
],
"model": "text-embedding-ada-002",
"usage": {
"prompt_tokens": 8,
"total_tokens": 8
}
}