You can run Gemma models completely on-device with the MediaPipe LLM Inference API. The LLM Inference API acts as a wrapper for large language models, enabling you to run Gemma models on-device for common text-to-text generation tasks like information retrieval, email drafting, and document summarization.
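As a minimal sketch of what this looks like on Android: the snippet below configures the task with a Gemma model file already present on the device, then generates a response fully on-device. The model path, function name, and sampling values are illustrative placeholders, not values prescribed by this page; `LlmInference` and its options builder come from the `com.google.mediapipe:tasks-genai` package.

```kotlin
import android.content.Context
import com.google.mediapipe.tasks.genai.llminference.LlmInference
import com.google.mediapipe.tasks.genai.llminference.LlmInference.LlmInferenceOptions

// Hypothetical helper: drafts text from a prompt using an on-device Gemma model.
fun generateOnDevice(context: Context, prompt: String): String {
    // Placeholder path and sampling parameters; adjust for your model and use case.
    val options = LlmInferenceOptions.builder()
        .setModelPath("/data/local/tmp/llm/gemma-2b-it-gpu-int4.bin")
        .setMaxTokens(512)     // maximum combined input + output tokens
        .setTopK(40)           // sample from the 40 most likely tokens
        .setTemperature(0.8f)  // higher values produce more varied output
        .setRandomSeed(101)
        .build()

    // Model loading and generation both run on-device; no network calls are made.
    val llmInference = LlmInference.createFromOptions(context, options)
    return llmInference.generateResponse(prompt)
}
```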
Try the LLM Inference API with MediaPipe Studio, a web-based application for evaluating and customizing on-device models.
The LLM Inference API is available on the following platforms:

- Android
- iOS
- Web
To learn more, refer to the MediaPipe LLM Inference documentation.