What can you do with it?
Use Gemini for tasks involving text, images, audio, and video. Perfect for complex reasoning, multimodal analysis, content generation, and real-time interactions. The command provides access to Google’s advanced AI models with support for multiple file types and formats.How to use it?
Basic Command Structure
Parameters
Required:prompt
- Your instructions or questions for Gemini
-
model
- Specific Gemini model to use (defaults to gemini-2.0-flash) -
system prompt
- Override the default system prompt -
files
- File URLs to include (supports audio, images, videos, and text)
Response Format
The command returns:Examples
Basic Usage
Advanced Usage
Specific Use Case
Notes
See LLM File Type Support for detailed information about file formats supported by Gemini and other models.Supported Models
Choose the appropriate Gemini model based on your specific needs:gemini-2.5-flash
- Cost-efficient model for fast, general tasks (supports audio, images, videos, text → text)gemini-2.5-pro
- Enhanced thinking and reasoning, multimodal understanding, advanced coding (supports audio, images, videos, text → text)gemini-2.0-flash
(default) - Next generation features with speed, thinking, and realtime streaming for high-volume general tasks (supports audio, images, videos, text → text)gemini-2.0-flash-exp
- Cost-efficient experimental model for fast, general tasks (supports audio, images, videos, text → text)gemini-2.0-flash-lite
- Cost efficiency and low latency for cost-sensitive, basic tasks (supports audio, images, videos, text → text)gemini-1.5-pro
- Complex reasoning with long context for complex document analysis (supports audio, images, videos, text → text)gemini-1.5-flash
- Volume-optimized for lower-cost, high-frequency tasks (supports audio, images, videos, text → text)