Deeply cultivate AI voice multi-modal technology to achieve localized intelligent interactive experience-AI-php.cn

Home

Technology peripherals

Deeply cultivate AI voice multi-modal technology to achieve localized intelligent interactive experience

王林

Sep 17, 2023 pm 01:21 PM

Sound transmission ai voice Multimodal technology.

With the development of 5G and artificial intelligence technology, intelligent voice has penetrated into people's daily lives with various intelligent terminal products, bringing more convenience and possibilities. As a provider of smart terminal products and mobile Internet services in emerging markets, Transsion focuses on continuous innovation in the field of artificial intelligence, continuously promotes the research and application of AI voice technology, explores more localized user scenario requirements, and brings full-scenario intelligence to users in emerging markets. interactive experience.

At present, Transsion has formed its own underlying AI voice technology capabilities in speech recognition, semantic understanding, speech synthesis, natural language processing, knowledge graphs, etc., has built advantages in small language voice data, and has developed in multilingual voice Major breakthroughs have been made in assistants, digital humans, and voice forgery detection technology. Since the beginning of this year, Transsion's AI technology department has continued to achieve results, winning great results in the ICASSP 2023 SLU Spoken Language Understanding Challenge and the IJCAI 2023 ADD Voice Deep Forgery Detection International Challenge, and published the Digital Human Multi-Model at the international multimedia flagship academic conference ICME 2023. Academic papers related to dynamic interaction.

Building a multilingual voice assistant for local voice interactive content ecosystem

Voice assistant is one of the standard applications of smartphones. Its core technology is voice interaction and natural language understanding, aiming to help users perform target tasks more quickly and efficiently. Faced with the demand for local voice interaction in emerging markets, TRANSSION has been deeply involved in multi-lingual voice assistant technology for a long time, focusing on understanding the needs of local users and forming technical solutions. It has accumulated profound technical capabilities and practical experience in the process of exploration and research and development.

At the top international conference ICASSP in 2023, Transsion AI Technology Department achieved great success in the SLU (Spoken Language Understanding) Challenge. With their excellent performance in speech recognition and semantic understanding, they won first place in the offline voice assistant sub-track with an accuracy of 71.97%. Their entry paper "A Two-Stage System for Spoken Language Understanding" was also included in the IEEE Institute of Electrical and Electronics Engineers

Deeply cultivate AI voice multi-modal technology to achieve localized intelligent interactive experience

Colleagues from Transsion’s AI technology department shared research results at ICASSP 2023

Currently, voice assistants are mainly oriented to mainstream languages, but have less coverage of niche languages, specific groups of people and other subdivided areas. Targeting the local accents and minority languages of users in emerging markets such as Africa and South Asia, TRANSSION has built a localized low-cost, high-quality corpus data production system based on massive mobile phone user resources to solve the problem of lack of corpus and data scarcity in minority languages. . On this basis, Transsion develops multilingual voice assistants that can adapt to the language and cultural characteristics of local users in emerging markets, helping local users more conveniently use local languages to interact with their mobile phones via voice. Currently, Transsion's multilingual voice assistant technology supports voice interaction and natural language understanding capabilities in English, French, Hausa, Arabic, Swahili and other languages, covering contact calls, APP quick launch, music playback, More than 100 usage scenarios such as WhatsApp messaging and chatting

In order to meet the needs of local users in life services, Transsion's multilingual AI voice assistant technology will continue to be applied to more life, travel, study and work scenarios to build a cross-language AI content service Ecosystem enables intelligent voice services to penetrate into all aspects of local life and benefit more people who speak small languages

Deeply cultivate AI voice multi-modal technology to achieve localized intelligent interactive experience

AI digital human technology empowers Transsion’s multi-scenario business

With the accelerated development of interactive intelligence technology, digital humans are moving from technological innovation to industrial application, playing a role in entertainment, education, medical and other fields. Transsion actively embraces AI development opportunities, deploys digital human technology in advance, and has established complete full-link technology and engineering self-research capabilities. Transsion's digital human system includes 2D real people and 3D realistic digital humans. It has data resources based on multilingual speech recognition, speech synthesis, voice awakening, natural language understanding and digital human capabilities. It can be used in multilingual voice dialogue, human design and Appearance, intelligent scene interaction and other areas have formed their own localized characteristics and industry leadership. In January this year, Transsion’s digital human system received the authoritative standard certification in the digital human field issued by the China Academy of Information and Communications Technology. This is also the only digital human system from a Chinese mobile phone manufacturer that has passed the evaluation of China Academy of Information and Communications Technology and is based on "interactive dialogue".

In order to improve the simulation effect of virtual images and synthesize realistic and expressive digital human videos, Transsion AI Technology Department independently developed end-to-end technology. In the process of optimizing the quality of digital human video generation, it proposed based on the Unet network A new technical framework densely-connected Unet structure is developed, and the encoder structure of CLIP is introduced to use text semantic information to improve the animation effect of digital human mouths. At the same time, this technology proposes a probability density map of face key point technology, which increases the modal information of the model network and improves the quality of model generation. This technological breakthrough can make the facial image of digital people more realistic and delicate, while improving the consistency of voice and lip shape. Its generation effect has reached an academically leading level. The related academic paper "CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation" was successfully accepted by the international multimedia flagship academic conference ICME 2023 (IEEE International Conference on Multimedia and Expo).

Deeply cultivate AI voice multi-modal technology to achieve localized intelligent interactive experience

Currently, Transsion Digital Human System has been widely used in multiple business scenarios. It is not only used as a smart shopping guide in overseas mobile phone stores to provide users with a reference for purchasing mobile phones, but can also provide smart voice assistant functions for various smart terminal products to enhance user experience. In the future, Transsion will further utilize "AI digital human" technology to empower businesses in a variety of scenarios, actively explore new business forms such as digital human voice assistants and customer service systems, and bring a new intelligent interactive experience to users

Continue to build the underlying technical capabilities of AI voice

With the rapid development of AI technology today, algorithm-generated audio and audio forgery can be used to confuse fake audio with real audio. It is very difficult for ordinary users to distinguish audio authenticity from fake audio. In order to maintain the credibility of information and ensure social security, voice forgery detection technology has become crucial and has become a new research direction in the field of artificial intelligence. Transsion focuses on the business scenarios of smart terminal products and is guided by local user needs, continuously extending the underlying technical capabilities of AI voice, deploying new technology fields, and making major breakthroughs in voice forgery detection technology.

The Second Audio Deepfake Detection Challenge ADD (The Second Audio Deepfake Detection Challenge) "Tampering Area" organized by Transsion AI Technology Department at IJCAI 2023 (The 32nd International Joint Conference on Artificial Intelligence) Won second place in the Manipulation Region Location track. During the competition, Transsion's AI technology department independently developed innovative AI model algorithms and technologies that can accurately identify and locate speech tampering in audio, thereby effectively ensuring the originality and authenticity of digital audio and building a foundation for AI applications and information security. Provide new ideas. Relevant academic papers have been successfully published at this IJCAI 2023 Workshop on Deepfake Audio Detection and Analysis (DADA 2023) conference.

Deeply cultivate AI voice multi-modal technology to achieve localized intelligent interactive experience

In the next step, Transsion's AI technology department will continue to explore the application of voice deep forgery detection technology on Transsion's smart terminal products, such as call fraud checks to protect user privacy and security, etc., and continuously improve user experience.

In the future, Transsion will continue to work hard in the field of AI voice multi-modal technology, focusing on the core business needs of "mobile phone Internet services home appliances and digital accessories", combined with deep insights into emerging markets and local consumers, to provide users with Smart life experiences that meet their needs form a localized AI content service ecosystem that continues to meet multi-lingual, multi-scenario, personalized, and intelligent application needs.

The above is the detailed content of Deeply cultivate AI voice multi-modal technology to achieve localized intelligent interactive experience. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress images for free

Clothoff.io

AI clothes remover

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

R.E.P.O. Best Graphic Settings

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

Assassin's Creed Shadows: Seashell Riddle Solution

2 weeks ago By DDD

R.E.P.O. How to Fix Audio if You Can't Hear Anyone

4 weeks ago By 尊渡假赌尊渡假赌尊渡假赌

WWE 2K25: How To Unlock Everything In MyRise

1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Hot Topics

Where is the login entrance for gmail email?

7514

CakePHP Tutorial

1378

What is the format of the account name of steam

win11 activation key permanent

nyt connections hints and answers

Related knowledge

Transsion responds to Qualcomm's patent infringement lawsuit in India: The agreement has been signed and fulfilled Jul 18, 2024 pm 03:03 PM

According to news on July 13, it was recently reported that Qualcomm is suing Transsion Holdings Group in the Delhi High Court in India, accusing the latter of infringing four of its non-standard essential patents. Transsion responded that it had signed a 5G standard patent license agreement with Qualcomm and was fulfilling the agreement. Transsion said that its sales network covers more than 70 countries in emerging markets such as Africa and South Asia. In some countries, some patent holders do not own or only own a small number of patents. However, it requires a globally unified rate and appeals for excessive licensing fees, which does not take into account the differences in economic development levels in different regions, the fact that it has no patents or only a small number of patents in specific regions or markets, and the existing cases provide different fees in different regions. rate and other factors. This practice does not fully comply with the principles of fairness, reasonableness and non-discrimination. Sound transmission

'The King of Mobile Phones in Africa'! Transsion ranks fourth in the world in terms of shipments in the first quarter of 2024, increasing by 86% May 02, 2024 pm 12:43 PM

According to news on May 2, the analysis agency Canalys recently released global smartphone market data for the first quarter of 2024. In this quarter, the global smartphone market increased by 10% year-on-year to 296.2 million units. Data shows that the top five mobile phone manufacturers in the first quarter were Samsung, Apple, Xiaomi, Transsion and OPPO. Among them, Transsion, known as the "King of African Mobile Phones", performed well. In the first quarter, Transsion mobile phone shipments reached 28.6 million units, with a market share of 10%, achieving strong growth of 86%. The financial report shows that Transsion's operating income in 2023 was 62.295 billion yuan, a year-on-year increase of 33.69%, and its net profit was 5.537 billion yuan, a year-on-year increase of 122.93%. In the main business, Transsion mobile phone revenue is 573

Tecno Phantom V2 Fold/Flip foldable screen phone exposed, equipped with Dimensity 9000+/Dimensity 8050 processor Apr 14, 2024 pm 09:07 PM

According to news on April 14, Tecno’s first foldable screen mobile phone PhantomVFold was launched in April last year, equipped with Dimensity 9000+ processor. Now the successor model of this phone has been revealed. Recently, two new Transsion smartphones have passed European EEC certification, with model numbers AE10 and AE11, and are expected to be Phantom V2 Fold and V2 Flip respectively. For reference, the previous generation models are AD10 and AD11. According to inquiries, these two new phones have also appeared on the Geekbench5.4.6 Android version AArch64 of the benchmarking platform. Among them, the AE10 model scored 1283 points in single-core and 3974 points in multi-core; the AE11 model scored 832 points in single-core and 3 in multi-core.

Transsion Infinix announces that NOTE 30 series mobile phones will be equipped with advanced voice assistant based on ChatGPT Jun 03, 2023 pm 05:30 PM

According to news on June 3, Transsion Infinix plans to introduce a new voice assistant on its NOTE30 series mobile phones, which is developed based on advanced ChatGPT technology. This movement has attracted widespread attention, because ChatGPT, as an intelligent system that can conduct continuous conversations and answer various questions, is considered to have achieved a completely different human-computer interaction experience from the past. Some people even compared it to Iron Man Jarvis in the movie. Transsion Infinix is a domestic mobile phone manufacturer focusing on overseas markets. Although it is less well-known in the domestic market, it enjoys a high reputation in places such as India and Africa, and is known as "Africa's No. 1 Brother". Transsion Holdings is its parent company and owns multiple mobile phone brands.

Transsion Holdings, the 'King of Africa”, will have revenue of nearly 62.3 billion yuan in 2023, and its African smartphone market share will exceed 40% Apr 23, 2024 pm 04:40 PM

According to news from this site on April 23, Shenzhen Transsion Holdings Co., Ltd. released its 2023 annual report today. Data shows that in 2023, the company's overall mobile phone shipments will be approximately 194 million units. Citing IDC data statistics, the report said that in 2023, its global mobile phone market share will be 14.0%, ranking third among global mobile phone brand manufacturers, among which smartphones will have a global smartphone market share of 8.1%, ranking fifth. In terms of revenue, this site summarizes as follows: In 2023, the company achieved operating income of 62,294,876,800 yuan (nearly 62.3 billion yuan), an increase of 33.69% over the same period last year; operating profit was 6,746,584,700 yuan (nearly 6.75 billion yuan), an increase of 33.69% over the same period last year; Growth of 122.50%; profit

Transsion responds to patent litigation: It has signed a 5G standard patent license with Qualcomm and is implementing the agreement Jul 17, 2024 pm 02:28 PM

On the evening of July 12, according to foreign media IPfray, Qualcomm is suing Transsion Holdings Group in the Delhi High Court in India for infringement of four non-standard basic patents. In response to this matter, Transsion stated that it has signed a 5G standard patent license agreement with Qualcomm and is implementing the agreement, and will continue to conduct patent negotiations with third parties to determine reasonable licensing fees. Qualcomm had not commented as of press time. ▲Transsion’s e-sports mobile phone Infinix GT20Pro said its sales network covers more than 70 countries in emerging markets such as Africa and South Asia. In these countries, some patent holders do not own or only own a small number of patents, but they demand a uniform global rate and excessive licensing fees, without taking into account the differences in economic development levels in different regions and their special needs.

The 'King of Africa' lives up to his name! Transsion Holdings' annual revenue in 2023 is nearly 62.3 billion: Africa's smartphone market share exceeds 40% Apr 23, 2024 pm 07:00 PM

According to news on April 23, Transsion Holdings released its 2023 annual report today. It shows that in 2023, the company will achieve operating income of 62.295 billion yuan, a year-on-year increase of 33.69%; net profit of 5.537 billion yuan, a year-on-year increase of 122.93%; basic earnings per share is 6.88 yuan. In terms of products, among the company's main businesses in 2023, mobile phone revenue will be 57.348 billion yuan, a year-on-year increase of 34.88%, accounting for 92.06% of operating revenue, and the overall mobile phone shipments will be approximately 194 million units. The report quoted IDC statistics and said that in 2023, its global mobile phone market share will be 14.0%, ranking third among global mobile phone brand manufacturers. Among them, smartphones will have a global smartphone market share of 8.1%, ranking fifth.

TECNO becomes the only mobile phone brand sponsor of the 34th African Cup of Nations Sep 27, 2023 pm 11:13 PM

TECNO, a technology brand under Transsion Holdings, announced that it has become the official sponsor of the 34th African Cup of Nations and is the only official mobile phone sponsor of this African Cup. The official signing ceremony was held in Singapore on September 21. Ha, Vice President of Transsion Holdings, Le, Guo Lei, general manager of the TECNO business unit, Duan Shengxiao, general manager of the African region, Véron Mosengo-Omba, secretary-general of the Confederation of African Football (CAF) and other senior officials attended the signing ceremony. Transsion said that with the strong traffic from the Africa Cup of Nations, TECNO will open up online and offline contact points, fully expose TECNO's entire range of products, and communicate in depth about TECNO's brand upgrade. At the same time, TECNO will also support ordinary people's football dreams through cooperation with football celebrities, and assume social responsibility.

See all articles