


Google releases multi-modal Bard assistant: another milestone towards the era of interactive AI
At a new product launch conference a few days ago, Google officially released the new generation of Android flagship phone Pixel 8/Pro series, equipped with Tensor G3 chip. This chip can run more complex ML (machine learning) models, providing new models for new phones. A number of AI enhancements have been added, such as reading web pages to users in different languages and "more natural" voices, and virtual assistants speaking more naturally.
Google pointed out that Pixel 8 Pro is the first phone to run Google’s basic large model directly on the device, which requires 150 times the calculation of the largest ML model on Pixel 7.
At the same time, Google announced the launch of "Assistant with Bard" for Android and iOS devices, which combines the mobile phone's personal assistant function with generative AI. Users can use text, voice or images to Interact with the Bard assistant - in other words, it's multi-modal.
When a user asks "What important emails have I missed this week?", Bard Assistant will provide the following services: First, it will list the key points and specific content of each important email and provide links to the corresponding emails. Secondly, it can also help users extract active addresses and display them in Google Maps
In addition, DeepMind co-founder Mustafa Suleyman said in a recent interview that
the current stage of generative AI is only a transitional technical stage, and will next enter the era of interactive AI, AI will be based on user For different task needs, arrange for other software and or contact real people to complete the work.
He believes that the first wave of artificial intelligence mainly focused on classification - deep learning shows that humans can train artificial intelligence to classify input data such as images, videos, audios, and languages. Humanity is currently in the second wave of "generative artificial intelligence", which is "enter data and generate new data." The third wave in the future will belong to "interactive artificial intelligence". "Conversation is the interactive interface of the future." Users not only click buttons and type text, but directly talk to artificial intelligence. By then, interactive artificial intelligence will Able to take action independentlyTianfeng Securities pointed out that
The importance of scenarios in the C-end AI application landing stage is highlighted. Chat robots, AI companions and content production tool scenarios are the first to be implemented. The development speed and commercialization progress of AI applications in these scenarios may exceed expected.
According to analysts’ predictions, the iteration of artificial intelligence and the catalytic effect of later events will continue to accelerate. In the second half of the year, the iteration speed of applications and models of overseas giant companies will be significantly improved, and the capabilities of general chatbots are expected to be further enhanced. This may lead to an improvement in user experience and further increase the number of usersIn addition, Huajin Securities added that the shift of large models from general to vertical scenarios is more of an exploration of commercialization and is the driving force for large models to move from training to inference.
With the development and improvement of vertical large models, the application of large models is the key to opening up greater room for growth. Edge computing is a clear and huge incremental market. It has now reached the industry implementation stage. Cloud computing companies, telecom operators, equipment manufacturers, CDN companies, etc. are all actively promoting the implementation of the industry. The rewritten content is: Source: Financial Associated Press
The above is the detailed content of Google releases multi-modal Bard assistant: another milestone towards the era of interactive AI. For more information, please follow other related articles on the PHP Chinese website!

Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

Video Face Swap
Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

Notepad++7.3.1
Easy-to-use and free code editor

SublimeText3 Chinese version
Chinese version, very easy to use

Zend Studio 13.0.1
Powerful PHP integrated development environment

Dreamweaver CS6
Visual web development tools

SublimeText3 Mac version
God-level code editing software (SublimeText3)

Hot Topics



Vibe coding is reshaping the world of software development by letting us create applications using natural language instead of endless lines of code. Inspired by visionaries like Andrej Karpathy, this innovative approach lets dev

February 2025 has been yet another game-changing month for generative AI, bringing us some of the most anticipated model upgrades and groundbreaking new features. From xAI’s Grok 3 and Anthropic’s Claude 3.7 Sonnet, to OpenAI’s G

YOLO (You Only Look Once) has been a leading real-time object detection framework, with each iteration improving upon the previous versions. The latest version YOLO v12 introduces advancements that significantly enhance accuracy

The article reviews top AI art generators, discussing their features, suitability for creative projects, and value. It highlights Midjourney as the best value for professionals and recommends DALL-E 2 for high-quality, customizable art.

ChatGPT 4 is currently available and widely used, demonstrating significant improvements in understanding context and generating coherent responses compared to its predecessors like ChatGPT 3.5. Future developments may include more personalized interactions and real-time data processing capabilities, further enhancing its potential for various applications.

The article compares top AI chatbots like ChatGPT, Gemini, and Claude, focusing on their unique features, customization options, and performance in natural language processing and reliability.

Mistral OCR: Revolutionizing Retrieval-Augmented Generation with Multimodal Document Understanding Retrieval-Augmented Generation (RAG) systems have significantly advanced AI capabilities, enabling access to vast data stores for more informed respons

The article discusses top AI writing assistants like Grammarly, Jasper, Copy.ai, Writesonic, and Rytr, focusing on their unique features for content creation. It argues that Jasper excels in SEO optimization, while AI tools help maintain tone consist
