Gadgetonix: Google Gemini

Thursday, December 7, 2023

Google Gemini

On December 6, Alphabet, Google's parent company, introduced Gemini, its largest and most versatile AI model to date, to compete with OpenAI's GPT-4 and Meta's Llama 2 in the emerging field of artificial intelligence (AI). Gemini is the first AI model released since Alphabet merged its AI research divisions, DeepMind and Google Brain, into a single unit named Google DeepMind, led by DeepMind CEO Demis Hassabis.

Gemini was built from the ground up to be "multimodal," meaning it can understand and process several types of information (text, code, audio, image, and video) at the same time. The model comes in three variants: Ultra (for highly complex tasks), Pro (for a broad range of tasks), and Nano (for on-device tasks). Alphabet CEO Sundar Pichai described Gemini as one of the company's biggest science and engineering efforts, and said it reflects the vision set out when Google DeepMind was formed.

Gemini Pro will be accessible to developers through the Gemini API in Google AI Studio and Google Cloud Vertex AI starting December 13. Gemini Nano will be available to Android developers via AICore, a system capability introduced in Android 14, initially on Pixel 8 Pro devices. Gemini Ultra is currently limited to early experimentation by select users, developers, partners, and safety experts, with broader availability to developers and enterprises expected in early 2024.

Gemini will be integrated across Google's products. Starting December 6, Bard uses a fine-tuned version of Gemini Pro for more advanced reasoning, planning, and understanding, and Gemini Nano will power new features on Pixel 8 Pro smartphones.

In terms of performance, Google says Gemini Ultra exceeds current state-of-the-art results on 30 of the 32 widely used academic benchmarks in large language model (LLM) research. It also outperforms human experts on the MMLU (massive multitask language understanding) benchmark, which covers 57 subjects. In testing ahead of its public launch, Gemini Pro outperformed GPT-3.5 on six of eight benchmarks.

Hassabis highlighted Gemini's flexibility: the model can run efficiently on everything from data centers to mobile devices, which broadens how developers and enterprises can build with AI. Gemini's multimodal reasoning lets it extract insights from large volumes of documents, understand nuanced information, answer questions on complex topics, and generate high-quality code in popular programming languages. Google says these capabilities come with added protections based on its safety policies and AI principles, and that it is working with external experts and partners to stress-test the model against potential risks.

Monday, December 4, 2023

Google Gemini launch date rumour

During the Google I/O 2023 event in May, Google unveiled its upcoming AI model, named 'Gemini'. The launch was originally slated for next week, but recent reports suggest that Google may postpone the release to January 2024. Sources cited by The Information indicate that Gemini struggled with non-English queries, prompting Google CEO Sundar Pichai to delay the launch. The report emphasizes that language support is important if Gemini is to match or surpass OpenAI’s GPT-4, and notes that Google has met this standard in some respects. Pichai had previously said the company was focused on delivering Gemini 1.0 promptly while finalizing the full version of the next-generation AI model.

When it introduced Gemini at Google I/O, Google positioned it as its most capable model, building on PaLM 2 and developed by the Brain Team and DeepMind. Described as 'multimodal,' Gemini is expected to handle diverse content types, including images and text, while incorporating memory and planning capabilities. Gemini may combine several AI models, and Google plans to use it internally across its products and offer it to developers and cloud customers in various sizes and capabilities. Pichai emphasized that Gemini would be 'highly efficient with tools and API integrations.'

The forthcoming model is anticipated to power Google’s existing AI-driven products such as Search, Google Assistant, and the popular chatbot Bard.