

Function Calling
Function Calling allows models to call external functions provided by the user and use the results to give a comprehensive response to the user's query. To learn more, read our blog.
We provide an OpenAI-compatible API.
Currently supported for:
Llama-3-70b
Llama-3-8b
Mixtral 8x22b
Mixtral 8x7b
Mistral 7b
Mistral 7b-v3
Example
Let's go through a simple example of requesting the weather.
This is how you set up our endpoint:

python
import openai
import json

client = openai.OpenAI(
    base_url="https://api.deepinfra.com/v1/openai",
    api_key="<Your-DeepInfra-API-Key>",
)

This is the function that we will execute whenever the model asks us to do so:

python
# Example dummy function hard coded to return the same weather
# In production, this could be your backend API or an external API
def get_current_weather(location):
    """Get the current weather in a given location"""
    print("Calling get_current_weather client side.")
    if "tokyo" in location.lower():
        return json.dumps({
            "location": "Tokyo",
            "temperature": "75",
        })
    elif "san francisco" in location.lower():
        return json.dumps({
            "location": "San Francisco",
            "temperature": "60",
        })
    elif "paris" in location.lower():
        return json.dumps({
            "location": "Paris",
            "temperature": "70",
        })
    else:
        return json.dumps({"location": location, "temperature": "unknown"})

Let's now call our DeepInfra endpoint with the tools and a user request:

python
# here is the definition of our function
tools = [{
    "type": "function",
    "function": {
        "name": "get_current_weather",
        "description": "Get the current weather in a given location",
        "parameters": {
            "type": "object",
            "properties": {
                "location": {
                    "type": "string",
                    "description": "The city and state, e.g. San Francisco, CA",
                },
            },
            "required": ["location"],
        },
    },
}]

# here is the user request
messages = [
    {
        "role": "user",
        "content": "What is the weather in San Francisco?",
    }
]

# let's send the request and print the response
response = client.chat.completions.create(
    model="mistralai/Mistral-7B-Instruct-v0.1",
    messages=messages,
    tools=tools,
    tool_choice="auto",
)
tool_calls = response.choices[0].message.tool_calls
for tool_call in tool_calls:
    print(tool_call.model_dump())
Output:

{'id': 'call_X0xYqdnoUonPJpQ6HEadxLHE', 'function': {'arguments': '{"location": "San Francisco"}', 'name': 'get_current_weather'}, 'type': 'function'}
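Note that a model is free to answer directly instead of calling a tool, in which case tool_calls is None. A minimal guard for that case, using the response from the request above:

python
message = response.choices[0].message
if message.tool_calls:
    for tool_call in message.tool_calls:
        print(tool_call.model_dump())  # the model asked us to run a tool
else:
    print(message.content)  # the model answered directly; no tool round-trip needed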

Now let's send back the function's response and see the result:

python
# extend conversation with assistant's reply
messages.append(response.choices[0].message)

for tool_call in tool_calls:
    function_name = tool_call.function.name
    if function_name == "get_current_weather":
        function_args = json.loads(tool_call.function.arguments)
        function_response = get_current_weather(
            location=function_args.get("location")
        )

        # extend conversation with function response
        messages.append({
            "tool_call_id": tool_call.id,
            "role": "tool",
            "content": function_response,
        })

# get a new response from the model where it can see the function responses
second_response = client.chat.completions.create(
    model="mistralai/Mistral-7B-Instruct-v0.1",
    messages=messages,
)

print(second_response.choices[0].message.content)

Output:

The current temperature in San Francisco, CA is 60 degrees.
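The example above hard-codes a single if branch for get_current_weather. If you expose several tools, a dispatch table keeps the routing readable; here is a minimal sketch (the available_functions mapping is our own convention, not part of the API):

python
# map tool names to the local Python functions that implement them
available_functions = {
    "get_current_weather": get_current_weather,
}

for tool_call in tool_calls:
    func = available_functions.get(tool_call.function.name)
    if func is None:
        continue  # the model named a function we never registered; skip it
    function_args = json.loads(tool_call.function.arguments)
    messages.append({
        "tool_call_id": tool_call.id,
        "role": "tool",
        "content": func(**function_args),
    })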

Tips on using function calling

Here are some tips to get the most out of function calling:
Make sure the descriptions of your functions are well written; clear descriptions make models perform better.
Use lower temperatures (< 1.0); this keeps the model from plugging arbitrary values into the parameters (see the sketch after this list).
Try not to use system messages.
Function-calling quality degrades as the number of supplied functions grows.
Keep top_p and top_k at their default values.
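For instance, the sampling advice above maps onto request parameters like this; the temperature value is illustrative, not a recommendation from DeepInfra:

python
response = client.chat.completions.create(
    model="mistralai/Mistral-7B-Instruct-v0.1",
    messages=messages,
    tools=tools,
    tool_choice="auto",
    temperature=0.3,  # illustrative value below 1.0 to keep arguments grounded
    # top_p and top_k are deliberately left at their defaults
)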
Notes
Function calling adds extra prompt tokens, and your function definitions are also counted toward usage.
Supported:
single calls
parallel calls (though quality might be lower; this is under active development)
tool_choice with only auto or none
streaming mode (see the sketch after these lists)
Not supported:
nested calls
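In streaming mode a tool call arrives in pieces: each chunk carries a delta whose tool_calls entries must be accumulated by index. A minimal sketch of that accumulation, assuming the same client, messages, and tools as above:

python
# stream a response; tool-call names and arguments arrive incrementally
stream = client.chat.completions.create(
    model="mistralai/Mistral-7B-Instruct-v0.1",
    messages=messages,
    tools=tools,
    tool_choice="auto",
    stream=True,
)

calls = {}  # index -> {"id", "name", "arguments"} accumulated across chunks
for chunk in stream:
    if not chunk.choices:
        continue  # some chunks carry no choices; skip them
    delta = chunk.choices[0].delta
    for tc in delta.tool_calls or []:
        entry = calls.setdefault(tc.index, {"id": "", "name": "", "arguments": ""})
        if tc.id:
            entry["id"] = tc.id
        if tc.function and tc.function.name:
            entry["name"] += tc.function.name
        if tc.function and tc.function.arguments:
            entry["arguments"] += tc.function.arguments

for entry in calls.values():
    print(entry)  # each entry now holds a complete name and JSON argument string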
