Server path: /embedded-gemini | Type: Embedded | PCID required: No

Generate text using Google Gemini models. Supports multimodal analysis including images, video, audio, and documents.

Tools

| Tool | Description |
| --- | --- |
| embedded-gemini_generate | Generate text using Gemini |

embedded-gemini_generate

Generate text using a Google Gemini model. Supports multimodal analysis including images, video, audio, and documents.

Parameters:

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| model | enum | No | Model to use: gemini-3-pro-preview, gemini-3-flash-preview, gemini-2.5-pro, gemini-2.5-flash, gemini-2.0-flash. Default: gemini-2.5-flash |
| systemPrompt | string | No | System prompt to guide model behavior |
| userPrompt | string | Yes | User prompt or question |
| fileUrls | string[] | No | URLs of files to analyze. Supported formats: images (.jpg, .jpeg, .png, .webp), videos (.mp4, .webm, .mkv, .mov), documents (.pdf, .txt), audio (.mp3, .wav, .webm, .m4a, .opus, .aac, .flac) |
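
The parameter rules above can be checked on the client before a call is made. The following is a minimal sketch, not part of this server's API: `build_generate_args` is a hypothetical helper, and it assumes that a file's format can be judged from the extension in the URL path, which this page does not confirm. How the resulting arguments are sent to `embedded-gemini_generate` depends on your client and is left out.

```python
import os
from urllib.parse import urlparse

# Values copied from the parameter table.
ALLOWED_MODELS = {
    "gemini-3-pro-preview", "gemini-3-flash-preview",
    "gemini-2.5-pro", "gemini-2.5-flash", "gemini-2.0-flash",
}
SUPPORTED_EXTENSIONS = {
    ".jpg", ".jpeg", ".png", ".webp",                           # images
    ".mp4", ".webm", ".mkv", ".mov",                            # videos
    ".pdf", ".txt",                                             # documents
    ".mp3", ".wav", ".m4a", ".opus", ".aac", ".flac",           # audio (.webm listed above)
}


def build_generate_args(user_prompt, model=None, system_prompt=None, file_urls=None):
    """Hypothetical helper: build the arguments object for embedded-gemini_generate."""
    if not user_prompt:
        raise ValueError("userPrompt is required")
    args = {"userPrompt": user_prompt}

    if model is not None:
        if model not in ALLOWED_MODELS:
            raise ValueError(f"unsupported model: {model}")
        args["model"] = model  # when omitted, the documented default is gemini-2.5-flash

    if system_prompt is not None:
        args["systemPrompt"] = system_prompt

    if file_urls:
        for url in file_urls:
            # Assumption: the format is inferred from the URL path's extension.
            ext = os.path.splitext(urlparse(url).path)[1].lower()
            if ext not in SUPPORTED_EXTENSIONS:
                raise ValueError(f"unsupported file format: {url}")
        args["fileUrls"] = list(file_urls)

    return args


example_args = build_generate_args(
    "Summarize this report.",
    model="gemini-2.5-pro",
    file_urls=["https://example.com/report.pdf"],  # placeholder URL
)
```

Optional parameters are omitted from the arguments rather than sent as empty values, so the server's own defaults apply.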
Response fields:

| Field | Type | Description |
| --- | --- | --- |
| output | string or object | Generated text or structured output |
| metadata | object | Response metadata |
| metadata.model | string | Model used for generation |
| metadata.usage | object | Token usage statistics |
| metadata.usage.promptTokens | number | Number of input tokens |
| metadata.usage.completionTokens | number | Number of output tokens |
| metadata.usage.totalTokens | number | Total tokens used |
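
Because `output` may be either a string or an object, client code usually needs to handle both cases. The sketch below shows one way to read the response fields listed above. The helper names and the sample response are illustrative assumptions; the sample values are made up and are not real server output.

```python
import json


def extract_output_text(response):
    """Return `output` as text; structured output is serialized to JSON."""
    output = response["output"]
    return output if isinstance(output, str) else json.dumps(output)


def usage_summary(response):
    """Collect the model name and token counts, tolerating missing metadata."""
    metadata = response.get("metadata", {})
    usage = metadata.get("usage", {})
    return {
        "model": metadata.get("model"),
        "promptTokens": usage.get("promptTokens"),
        "completionTokens": usage.get("completionTokens"),
        "totalTokens": usage.get("totalTokens"),
    }


# Illustrative response shaped like the field table (values are invented).
sample_response = {
    "output": "The report covers third-quarter results.",
    "metadata": {
        "model": "gemini-2.5-flash",
        "usage": {"promptTokens": 120, "completionTokens": 30, "totalTokens": 150},
    },
}

text = extract_output_text(sample_response)
summary = usage_summary(sample_response)
```

The helpers read `totalTokens` directly instead of summing the other two counts, since this page does not state how the total is computed.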
Available models:

| Model | Description |
| --- | --- |
| gemini-3-pro-preview | Multimodal support, large context, advanced reasoning |
| gemini-3-flash-preview | Fast and cost-efficient with multimodal support |
| gemini-2.5-pro | Enhanced thinking and reasoning, advanced coding |
| gemini-2.5-flash | Cost-efficient, supports higher token limits (default) |
| gemini-2.0-flash | Next generation speed, thinking, realtime streaming |