AI

Google Released The Gemini 1.5 

Alphabet Inc.'s Google unveiled a new version of its powerful Gemini artificial intelligence model that it says can process more text and video than competitors' products. Gemini 1.5 has many improvements. The Gemini 1.5 Pro, which will power many of Google's services, beats the Gemini 1.0 Pro by 87% in tests, putting it roughly on par with the high-end Gemini 1.0 Ultra. When creating a new model, the increasingly popular “Mixture of Experts” (MoE) approach is used, which implies that when sending a request, only part of the overall model is launched, and not the whole. This approach should make the model faster for the user and more efficient for Google.

Apple Introduced Keyframer AI: It Turns Static Pictures Into Animated Ones

Apple researchers have created Keyframer, a generative AI test app that allows users to describe an image and how it should animate. Animation requires a more complex set of parameters, including scene duration and coordination of object movement, which are not easily captured in a single task description, so alternative means, including command clarification, may be required.

NVIDIA Reveals Chat With RTX

NVIDIA has introduced its own analogue of the ChatGPT chatbot. A new product called Chat with RTX has become available on the official website of the video card manufacturer. Chat with RTX can process YouTube videos - just enter the URL to receive a summary of the content in text form from the chatbot. Chat with RTX allows you to search video transcripts. According to experts, searching videos takes a matter of seconds. At the same time, there were cases when the chatbot, for some unknown reason, used the content of another video instead of the requested one to search. This clearly indicates errors in the early demo.

Google Bard Is Officially Renamed Gemini

The American corporation Google has announced a major rebranding of its chatbot Bard, which is now called Gemini, similar to the name that has the set of artificial intelligence models that underlie the chatbot The Gemini mobile app will likely be the most accessible option for exploring Google's AI bot capabilities. Once installed on an Android device, the Gemini AI bot can, among other things, replace the Google Assistant voice assistant. “I think this is an important step towards creating a true AI assistant,” said Sissie Hsiao, head of development at Bard (now Gemini). She also added that the company's voice assistant is "more useful than ever."

OpenAI Adds Watermarks To DALL-E 3

OpenAI's DALL-E 3 image generator will automatically add watermarks to image metadata to comply with Content Provenance and Authenticity (C2PA) Coalition standards. Metadata will be used to tag images created by artificial intelligence on the ChatGPT website and when connecting to the API for the DALL-E 3 model, OpenAI said.

“Google Maps” Gets AI To Help You Find Interesting Places

Google has found an interesting use for large language models (LLMs) in the Google Maps service. AI analyzes information about more than 250 million places, including ratings, photos and reviews from 300 million responses - all data is used to search for places that match even unusual user requests made in any form.