Multimodal Gemini 1.5 Flash API

Multimodal app that does math and generates strategic insights.

What it does

The app runs the Gemini 1.5 Flash model and showcases its multimodal features. It generates a report via function calling, analyzes a PDF file and performs math calculations on it, analyzes a price table in an image, processes audio from the Apple Q2 2024 earnings report, and analyzes a marketing video. All of these outputs then serve as input to Gemini 1.5 Flash for an overall strategic analysis of the financial and marketing strategies, which helps decision makers make better decisions.
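The final step described above, feeding every per-input result back into the model, can be sketched roughly as follows. This is a minimal illustration and not the app's actual code: the `ModalityFinding` class, the `build_strategy_prompt` function, and the placeholder summaries are all hypothetical names and values introduced here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class ModalityFinding:
    """One intermediate result (hypothetical container, not from the app)."""
    source: str   # which input produced it, e.g. "PDF" or "Video"
    summary: str  # text the model returned for that input


def build_strategy_prompt(findings: List[ModalityFinding]) -> str:
    """Join the per-input findings into one prompt for the final analysis."""
    # Skip inputs that produced no text so the prompt has no empty sections.
    sections = [
        f"## {f.source}\n{f.summary.strip()}"
        for f in findings
        if f.summary.strip()
    ]
    if not sections:
        raise ValueError("no findings to analyze")
    header = (
        "Using the findings below, write an overall strategic analysis "
        "of the financial and marketing strategies."
    )
    return header + "\n\n" + "\n\n".join(sections)


# Placeholder summaries only; real text would come from earlier model calls.
findings = [
    ModalityFinding("Report", "placeholder: report generated via function calling"),
    ModalityFinding("PDF", "placeholder: calculations made on the PDF file"),
    ModalityFinding("Image", "placeholder: reading of the price table"),
    ModalityFinding("Audio", "placeholder: notes on the earnings call audio"),
    ModalityFinding("Video", "placeholder: notes on the marketing video"),
]
prompt = build_strategy_prompt(findings)
print(prompt)
```

The resulting string would then be sent to Gemini 1.5 Flash as an ordinary text prompt. Keeping one labeled section per input makes it easier to trace which source a given conclusion came from.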

Built with

  • Streamlit
  • Alpha Vantage API
  • Cloud Run
  • Artifact Registry
  • Secret Manager
  • Vertex AI
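
As a rough sketch of how function calling might connect to the Alpha Vantage API listed above: the model returns a function name plus arguments, and the app routes that call to local code. This page does not say which Alpha Vantage endpoint the app uses, so the `GLOBAL_QUOTE` function, the tool name `get_stock_quote_url`, and the shape of the `call` dictionary are assumptions made for illustration. The sketch only builds the request URL and makes no network request.

```python
from urllib.parse import urlencode

ALPHA_VANTAGE_URL = "https://www.alphavantage.co/query"


def build_quote_url(symbol: str, api_key: str) -> str:
    """Build an Alpha Vantage quote URL (endpoint choice is an assumption)."""
    params = {"function": "GLOBAL_QUOTE", "symbol": symbol, "apikey": api_key}
    return f"{ALPHA_VANTAGE_URL}?{urlencode(params)}"


# Hypothetical registry mapping tool names the model may request to local code.
TOOLS = {"get_stock_quote_url": build_quote_url}


def dispatch(call: dict, api_key: str) -> str:
    """Route a model-requested function call to the matching local function."""
    name = call["name"]
    if name not in TOOLS:
        raise KeyError(f"unknown tool: {name}")
    # The model supplies only the arguments; the key stays on the app side.
    return TOOLS[name](api_key=api_key, **call["args"])


# Assumed shape of a function call as parsed from a model response.
example_call = {"name": "get_stock_quote_url", "args": {"symbol": "AAPL"}}
url = dispatch(example_call, api_key="demo")
print(url)
```

In a deployment like the one listed here, the API key would presumably be read from Secret Manager instead of being passed as a literal, though this page does not confirm that detail.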

Team

By RubensZimbres, from Brazil