What is Google Gemini 1.0: Google has fully revealed its next-generation Gemini 1.0 generative AI model, which is now available through the Bard conversational chatbot.
The Gemini 1.0 model, which was initially previewed at Google I/O in May, is more powerful than the existing technology and potentially better equipped to go up against OpenAI’s ChatGPT model used by Microsoft.
Google says Gemini is its “largest and most capable AI model” and it’ll be responsible for powering everything from Bard to the Google Pixel range of smartphones. It’s available within Bard from today.
The company says the key to Gemini is its “multimodal” design. That means it can “generalise and seamlessly grasp, work across and combine different types of information including text, code, audio, image and video.” Previous models achieved much more limited capabilities by stitching separate models together. Gemini is natively multimodal.
You can see the new capabilities in an incredibly impressive looking demonstration below. It’s fast, reactive, conversational and terrifyingly smart.
Three different sizes for Gemini
Google says Gemini 1.0 is the first version of the model, as the numbering convention would denote. There are tiers built for different purposes. All of them benefit from the multimodal design and their purposes are detailed below.
Nano, for example, will be great for on-device AI and will soon be available on the Pixel 8 Pro, while Ultra is geared for extreme use cases like data centres. Pro is the happy middle ground, and it powers Bard from today.
- Gemini Ultra — our largest and most capable model for highly complex tasks.
- Gemini Pro — our best model for scaling across a wide range of tasks.
- Gemini Nano — our most efficient model for on-device tasks.
Super-human performance
Google says Gemini’s performance has been rigorously tested on tasks ranging from natural image, audio and video understanding to mathematical reasoning. Gemini Ultra beats incumbent models on 30 out of 32 academic benchmarks for large language models.
Google also says Gemini Ultra is the first model to outperform human experts on MMLU (massive multitask language understanding), a benchmark that spans 57 subjects. Gemini Ultra scored 90.0% on that test, while GPT-4 scored 86.4%. That’s a key takeaway.
How and why the next generation was built
Google says it has approached the training of Gemini differently to previous multimodal models. Previously, the components for different types of input were trained separately and put together afterwards. That meant such models were good at describing images, for instance, but struggled with more complex reasoning.
Google says Gemini was pre-trained from the start to be natively multimodal.
“Then we fine-tuned it with additional multimodal data to further refine its effectiveness,” the company says in the blog post. “This helps Gemini seamlessly grasp and reason about all kinds of inputs from the ground up, far better than existing multimodal models — and its capabilities are state of the art in nearly every domain.”
Availability
As we mentioned, Gemini Pro is available to use within Google Bard from today, while Gemini Nano will launch on the Pixel 8 Pro next year.
Google adds: “We’re also bringing Gemini to Pixel. Pixel 8 Pro is the first smartphone engineered to run Gemini Nano, which is powering new features like Summarize in the Recorder app and rolling out in Smart Reply in Gboard, starting with WhatsApp — with more messaging apps coming next year.”