


Alibaba DAMO Academy wins SemEval best paper award, helping AI better understand human language
July 19: SemEval-2022, the world's largest semantic evaluation competition, recently announced that this year's only Best System Paper Award goes to researchers from Alibaba DAMO Academy and partner institutions. They designed a knowledge-enhanced named entity recognition (NER) system covering 11 languages, including Chinese and English. It accurately identifies key entity information such as the names of people, places, institutions, and works, which improves AI's ability to understand human language.
SemEval (Semantic Evaluation) is an authoritative international competition in the field of natural language processing. It has a history of more than 20 years and is organized by SIGLEX, the Special Interest Group on the Lexicon of the Association for Computational Linguistics (ACL). It aims to enable AI to analyze and understand the meaning contained in human language.
SemEval gives two best paper awards: the Best Task Paper Award and the Best System Paper Award. Put simply, the first rewards posing a problem and the second rewards solving one. A joint research team from Alibaba DAMO Academy, ShanghaiTech University, Zhejiang University, and Singapore University of Technology and Design won this year's Best System Paper Award. Their paper, which stood out from 221 candidate papers, is titled "DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition".
SemEval-2022 Best System Paper
The winning team took part in one of the 12 tasks of SemEval-2022: Multilingual Complex Named Entity Recognition. Named entity recognition (NER) is a basic task in natural language processing. It refers to identifying entity words with specific meanings in text, mainly names of people, places, and organizations, and other proper nouns.
The task requires researchers to design a system that accurately identifies entities in 11 languages, including Chinese and English, and that copes with sentences mixing several languages as well as memes, abbreviations, and colloquialisms. For example, given "In 2016, she guest-starred in the HBO TV series Game of Thrones," the AI needs to recognize and understand the abbreviated organization name "HBO" and the title of the work "Game of Thrones."
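To make the expected output concrete, the example sentence above can be written as a sequence of BIO tags, a common way to represent NER results, and then decoded into entity spans. This is only an illustrative sketch: the tokenization, the label names `CORP` (corporation) and `CW` (creative work), and the helper `bio_to_spans` are assumptions made for this example, not the winning system's code.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Collect (entity text, label) pairs from a BIO-tagged token sequence."""
    spans: List[Tuple[str, str]] = []
    current: List[str] = []
    label: Optional[str] = None
    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A new entity starts; close the one in progress, if any.
            if current:
                spans.append((" ".join(current), label))
            current, label = [token], tag[2:]
        elif tag.startswith("I-") and label == tag[2:]:
            # Continuation of the entity currently being built.
            current.append(token)
        else:
            # "O" tag (or an inconsistent "I-" tag): close any open entity.
            if current:
                spans.append((" ".join(current), label))
            current, label = [], None
    if current:
        spans.append((" ".join(current), label))
    return spans


# Hypothetical tokenization and tags for the example sentence.
tokens = ["In", "2016", ",", "she", "guest-starred", "in", "the",
          "HBO", "TV", "series", "Game", "of", "Thrones", "."]
tags = ["O", "O", "O", "O", "O", "O", "O",
        "B-CORP", "O", "O", "B-CW", "I-CW", "I-CW", "O"]

entities = bio_to_spans(tokens, tags)
print(entities)  # [('HBO', 'CORP'), ('Game of Thrones', 'CW')]
```

A real system predicts the tag sequence with a trained model; the decoding step shown here only turns those predictions into readable entities.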
DAMO Academy's system ranked first in overall score
The paper proposes a new knowledge-enhanced multilingual named entity recognition system. It took 10 first places among the 13 tracks of the task and ranked first in total score, a substantial improvement over the previous level in the field.
Generally speaking, because words can be ambiguous, people can only understand them accurately from context, and the same is true for AI. The strength of the new system is that it lets AI understand complex entity words even when the text itself provides little context. According to the researchers, the system introduces additional external knowledge to build a multilingual general knowledge base, and expands the contextual information of the text through interactive retrieval to resolve ambiguity. Combined with multi-stage fine-tuning, this allows it to identify entity information accurately.
How the DAMO Academy system works
According to reports, this award-winning research has broad application prospects in fields such as translation, search, and human-computer dialogue. Currently, DAMO Academy's machine translation system provides translation services in 214 languages and translates hundreds of millions of words every day for 2 million small and medium-sized businesses in China, helping Chinese products go global. The latest "Cloud AI Developer Service Key Capabilities Report" by Gartner, an authoritative international research organization, states that Alibaba's language AI ranks second in the world, the best result ever achieved by a Chinese company.
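The idea of expanding a short text with retrieved knowledge can be sketched as follows. This is a toy illustration under stated assumptions, not the team's implementation: the three-sentence `KNOWLEDGE_BASE`, the word-overlap scoring, the `[SEP]` separator, and the function names `retrieve` and `augment` are all hypothetical stand-ins for a real knowledge base and retriever.

```python
import re
from typing import List, Set

# Toy stand-in for an external knowledge base (assumption for illustration).
KNOWLEDGE_BASE = [
    "Game of Thrones is a fantasy drama television series produced by HBO.",
    "HBO is an American pay television network.",
    "The Thames is a river that flows through southern England.",
]


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and return its set of word tokens."""
    return set(re.findall(r"\w+", text.lower()))


def retrieve(query: str, passages: List[str], top_k: int = 2) -> List[str]:
    """Return up to top_k passages that share at least one word with the query."""
    query_tokens = tokenize(query)
    scored = [(len(query_tokens & tokenize(p)), p) for p in passages]
    ranked = sorted(scored, key=lambda pair: pair[0], reverse=True)
    return [p for score, p in ranked[:top_k] if score > 0]


def augment(query: str, passages: List[str], sep: str = " [SEP] ") -> str:
    """Append the retrieved passages to the query as extra context."""
    return sep.join([query] + retrieve(query, passages))


query = "she guest-starred in Game of Thrones"
augmented = augment(query, KNOWLEDGE_BASE)
print(augmented)
```

The augmented text, rather than the bare sentence, would then be passed to the NER model, so that an otherwise ambiguous phrase arrives together with a description that identifies it as a television series.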