Skip to content

[Feat]: request text + image + audio to text or audio sample for Gemini 2.0 Multimodal Live API #1545

Open
@shenshaoyong

Description

@shenshaoyong

Is your feature request related to a problem? Please describe.

No response

Describe the solution you'd like

Hi, including these samples(Text-to-text generation / Text-to-audio generation / Text-to-audio conversation) , more samples(text + image + audio to text or audio) are also needed. Could you add these samples in the near future? thanks.
https://github.com/GoogleCloudPlatform/generative-ai/blob/main/gemini/multimodal-live-api/intro_multimodal_live_api_genai_sdk.ipynb

Describe alternatives you've considered

No response

Additional context

No response

Code of Conduct

  • I agree to follow this project's Code of Conduct

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions