Google announced the launch of the AI model Gemini. Gemini is Google's most powerful and versatile AI model to date, capable of simultaneously understanding, manipulating, and combining different types of information such as text, code, audio, images, and videos.
State-of-the-art Performance
Google built Gemini as a natively multimodal AI model from the ground up. Much as humans use their five senses together to perceive the world, Gemini can comprehensively and seamlessly understand, manipulate, and combine different types of information such as text, code, audio, images, and video.
According to Google, this approach is more effective than the common alternative of building separate text or voice models and connecting them afterward to produce results.
Google tested the Gemini models rigorously and evaluated their performance across a wide range of tasks. From natural image, audio, and video understanding to mathematical reasoning, Gemini Ultra exceeds current state-of-the-art results on 30 of the 32 academic benchmarks widely used in the development of large language models (LLMs).
Beyond Humanity
Gemini Ultra, the highest tier, scored 90.0% on MMLU (Massive Multitask Language Understanding), making it the first model to outperform human experts on that benchmark.
Three Versions
Gemini is Google's most adaptable model to date, capable of running efficiently on everything from data centers to mobile devices. Its capabilities are expected to substantially improve how developers and enterprise customers build and scale with AI.
Google has optimized Gemini 1.0, the first version, for three sizes:
- Gemini Ultra: the largest and most capable model, for highly complex tasks.
- Gemini Pro: the best model for scaling across a wide range of tasks.
- Gemini Nano: the most efficient model, for on-device tasks.
Participation of Bard
Google's AI chatbot Bard has started using a fine-tuned version of Gemini Pro for more advanced reasoning, planning, understanding, and more. This marks the biggest upgrade since Bard's launch. It will be available in English in more than 170 countries and regions, with plans to expand to other modalities and to support new languages and locations in the near future.
No Internet Needed
Google has also brought Gemini to Pixel. The Pixel 8 Pro is the first smartphone to run Gemini Nano, which powers new features such as 'Summarize' in the Recorder app. The feature works without an internet connection, using Gemini on the phone to summarize recorded meetings. Google is also launching Smart Reply in Gboard, starting with WhatsApp, with more applications to follow next year.
Google and Alphabet CEO Sundar Pichai stated,
"That’s what excites me: the chance to make AI helpful for everyone, everywhere in the world."