Text-to-Speech Integration
Turn written responses into spoken ones for a more immersive experience.
Vision Capabilities
Image and video Understanding.
Image Generation
Audio Understanding and Use Cases
Transcription and translation.
Reasoning with AI model