Google has unveiled Gemini 2.0 Flash, the latest advancement in its AI technology. Unlike its predecessor, Gemini 1.5 Flash, which could only generate text, the new Gemini 2.0 Flash can natively produce images and audio alongside text.
This enhanced version integrates seamlessly with third-party apps and services, including Google Search and the ability to execute code, broadening its application scope. The experimental release is currently available through the Gemini API, AI Studio, and Vertex AI, starting Wednesday.
Initially, the audio and image generation features are accessible to early access partners, with a wider rollout planned for January. According to Google, Gemini 2.0 Flash operates twice as fast as the Gemini 1.5 Pro model on certain benchmarks. It also boasts significant improvements in coding capabilities and image analysis, making it a more versatile tool for developers and users alike.
To ensure authenticity, Google is implementing its SynthID technology to watermark all audio and images generated by Gemini 2.0 Flash. On platforms that support SynthID, these outputs will be clearly marked as synthetic, promoting transparency and trust in AI-generated content.
Reference(s):
cgtn.com