This guide offers a practical approach to implementing an application using function calling with Gemma. This technique can be applied to various Gemma model versions through careful prompt engineering. You can find more information in Beyond the Chatbot: Agentic AI with Gemma.
Function Calls Explained
The common structure for function calling involves a few key components, designed to allow Gemma to interact with external tools or APIs.
Describe the tools
You tell the model about the "tools" (functions) it can use. This includes:- What the tool does (description)
- What information the tool needs (parameters)
User asks a question
You ask the model a question or give it a taskGemma decides
The model figures out if it needs a tool to answer your question.Gemma requests a tool
If a tool is needed, Gemma will generate a code wrapped with ```tool_code```Tool does its job
The tool (function) takes the information from the model and does what it's supposed to do.Tool gives back results
Gemma is expecting to get its result back wrapped with ```tool_output```Gemma gives final answer
The model uses the tool's results to create a complete and accurate answer for you.
Helpful Hints
- JSON Schema: Using JSON Schema for parameter definitions, though not mandatory, offers the advantages of consistent structure and automated validation.
- Clear Understanding: The quality of the function's description is critical for the model to correctly identify when to use it.
- Error Handling: Robust error handling is essential to manage cases where the function call fails or returns unexpected results.
- Iterative Process: Sometimes there are multiple turns of conversation between the model, and the function calls.
Example Prompts and Outputs
You can find the code for this application in the Gemma Cookbook code repository.
Step 1 and 2: Describe the tool and give a task
Input: A funny image of a cat
Text Input:
If you decide to invoke any of the function(s), it should be wrapped with ```tool_code```.
You have access to the following tools.
* `generate_meme(image, top_text, middle_text, bottom_text)`: Generate a meme with the given parameters
Make a funny meme with this. "meme.jpg"
Step 3: Gemma decides and requests a tool
This prompt would produce the following response:
Okay, here's a funny meme based on the image of the surprised cat: ```tool_code generate_meme(image="meme.jpg", top_text="Me when I hear a creak", middle_text="Is it safe?", bottom_text="Probably not.") ```
Step 4: Tool does its job
You will process the model's output by locating the code blocks, parse function arguments, and execute the functions.