Artificial intelligence advancements are rolling out constantly and dominating the social media and the business worlds. Everyone is aware of ChatGPT and the power it holds for both business and personal usage, and big tech is taking notice. Google has thrown its hat into the AI arena, and the major corporation has moved forward with some powerful tools. Most recently, we've seen the launch of Google Gemini, a collection of multimodal LLMs (Large Language Models) making a promising name for itself in the AI world. Here's what you need to know.
What is Google Gemini?
Google Gemini is similar to OpenAl's GPT in the sense that it too belongs to a family of AI models. However, one key difference with Google Gemini that sets it apart from the likes of Open AI is its ability to both understand and generate text while simultaneously doing the same with videos, code, images, and audio. For instance, you could upload a photo, present Google Gemini with a prompt, and it would begin working on the photo as requested.
This new and powerful artificial intelligence model can complete the most complex requested tasks, including math and physics problems, and can understand and generate various programming languages. Gemini is available through Google Pixel 8 and Google Bard; however, we expect it will generally be available through other Google services as it rolls out.
"Gemini is the result of large-scale collaborative efforts by teams across Google, including our colleagues at Google Research," Dennis Hassabis, the co-founder and CEO of Google DeepMind said of Gemini. "It was built from the ground up to be multimodal, which means it can generalize and seamlessly understand, operate across, and combine different types of information including text, code, audio, image, and video."
Who created Gemini?
Google and Alphabet, Google's parent company, were responsible for creating the company's most advanced AI model to-date. Google DeepMind also contributed to the creation of Gemini, although its contributions were not as prominent as Google and Alphabet's.
The different versions of Gemini
Google released Gemini as a flexible program that can run on Google data centers, mobile devices, and everything in between. However, Google wasn't able to accomplish this by only releasing one size. So, they released three different versions of Google Gemini. The three sizes include Gemini Pro, Gemini Ultra, and Gemini Nano.
Gemini Pro, the pinnacle of Gemini's capabilities, operates within Google's data centers. It powers the latest iteration of Google's AI chatbot, Bard. This version is tailored for efficiency and speed, delivering complex responses with remarkable promptness.
On the other hand, Gemini Nano is designed for mobile devices, particularly the Google Pixel 8. It is crafted to execute AI tasks independently, without relying on external servers, ensuring seamless performance on smartphones.
Lastly, Gemini Ultra, although limited in availability, stands out as Google's premier model. Acknowledged as its most advanced variant, it represents the epitome of Gemini's technological prowess, setting the standard for excellence in AI models.
Gemini apps and Gemini models
Previously, Gemini apps on the web and mobile were called Bard. And although these two things may seem similar (and it may be confusing to differentiate between them), they are separate and distinct. Now, the apps serve as an interface from which you can access Gemini models. Think of them as a conduit between the two platforms.