Google has just announced (but not released) Gemini 1.5, an update to its flagship language model — the model used in the chatbot once known as Bard, but synergistically renamed Gemini a week ago.
The big claim with this release is "a breakthrough in long-context understanding across modalities." It's also meant to be a step-up in terms of efficiency, having been built on an architecture type known as "Mixture-of-Experts (MoE)," meaning performance supposedly akin to Gemini 1.0, but relying on fewer electricity-hungry GPUs cranking away to achieve it.
That first big claim about multi-modal "long-context" understanding is as jargony as it sounds, but Google Deepmind co-founder posted a demo on X meant to show what this means in practice.
This Tweet is currently unavailable. It might be loading or has been removed.
Smartly utilizing a large piece of public-domain text that won't rankle any copyright sticklers — in this case a 402-page transcript of the NASA mission that landed on the moon — the LLM is able to narrow its focus to what the user needs ("context") in spite of the prompt being absolutely gigantic ("long"), so apparently that's what "long-context" means.
In the demo, Gemini 1.5 is able to pick out three amusing moments from the novel-length piece of text. It's also able to spot the event in the transcript that matches a picture of a lunar boot print — the part where, y'know, Neil Armstrong walks on the moon — which elucidates what "multimodal" is intended to mean in this context: an image recognition model working hand-in-hand with the LLM.
SEE ALSO: Samsung Galaxy S24 series comes with Google's Gemini AI modelThis upgrade is part of an ongoing effort to keep Google in the AI conversation after OpenAI ate everyone's AI lunch in 2022 by releasing ChatGPT. Late last year, Google began seriously hyping changes to come with Bard and the model powering it, which remains an also-ran large language model, more known for being shoehorned into popular Google and Android products than for being used like ChatGPT to solve day-to-day problems and blow minds at cocktail parties. In particular, a December 2023 research paper touted a version of Gemini that had exceeded the performance of OpenAI's GPT-4 model in certain cases, and become the first LLM to get a passing score on a specific AI test of "Multitask Language Understanding " or MLU.
Among other assertions about Gemini 1.5, Google says the new model can crunch large datasets with impressive accuracy, and — in a somewhat more eyebrow raising claim — perform well at reasoningacross all sorts of data types. Reasoning is the most famous weakness among most LLMs.
According to CEO Sundar Pichai, Google is releasing Gemini 1.5 to a limited group. "We’re excited to offer a limited preview of this experimental feature to developers and enterprise customers," Pichai wrote in Google's blog post.
The broader base of Gemini users will be the ultimate judges of Google's performance claims when they're actually permitted to try out Gemini 1.5 as part of an officially-released product. Google's most powerful model, Gemini Ultra was released a week ago, so it may be a while, and it's probably safe to assume that Gemini 1.5, will one day be part of Google's new premium — in other words "paid" — package of Workspace products called Google One AI Premium.
Copyright © 2023 Powered by
Google announces Gemini 1.5, a flashy upgrade to its flagship AI model-额手相庆网
sitemap
文章
9596
浏览
5954
获赞
9763
Surreal memes deserve their own internet dimension
In November, the surreal meme creator @surrealvault decided to shutter her merch shop, which had exi'Pay with Google' wants to take the forms out of online shopping
Google is about to make it way easier for you to shop online. The search giant's new Pay with GoogleNintendo Switch update adds support for GameCube controllers
If you're one of the many people who loveGameCube controllers, we have some good news: GameCube contBest laptop deal: Get an M2 MacBook Air for $699 at Best Buy
SAVE $300:As of March 7, get an Apple MacBook Air (M2, 16GB RAM, 256GB SSD) for $699 at Best Buy, doSeth Rogen has been casually posting safety warnings on rappers' Instagram photos
Seth Rogen has everybody's best interests at heart.He enjoys a good time, sure -- who doesn't? -- buAriana Grande sets sexist trolls ablaze with 5 fire tweets
Ariana Grande is out here courageously taking steps to upend the patriarchy; sexists, take note.TheBest laptop deal: Get the Samsung Galaxy Book4 Edge for 21% off at Amazon
SAVE $193.31: Get the Samsung Galaxy Book4 Edge for $706.68 at Amazon, down from its usual price ofBest Apple deal: Save $50 on AirPods Pro 2
SAVE $50:As of March 5, AirPods Pro 2 are on sale for $199 at Amazon. That's 20% off their list pricAloe Bud is a self
"Self-care" is a difficult term these days. Divorced from its original activist meaning and co-optedGoogle slashes its ridiculously
Google took a page out of Apple's book and killed the headphone jack with its new Pixel 2 phones &mdAll the reasons you should be creeped out by Amazon Key
Amazon has built up a lot of trust over the years as it's evolved from digital storefront to ecommerAirbnb CEO teases updates for 2017 on Twitter
Airbnb CEO Brian Chesky is spending his holiday working. On Twitter on Christmas evening he asked thUber Direct and Connect will deliver toilet paper, meds, and more
Since people in quarantine don't need many Uber rides, the company is now delivering goods.Uber annoRobotaxis and autonomous cars are still scary to most Americans
A convincing majority of Americans remain “afraid” to ride in self-driving vehicles, acciPhone Upgrade Program members can now be pre
The iPhone X will finally be available for pre-order on Oct. 27 — but if you're part of Apple'